parpool failed to start
13 ビュー (過去 30 日間)
古いコメントを表示
I'm trying to start a parallel pool in Matlab 2020b, but the pool repeatedly fails to start. Can anyone offer any guidance? Here are the messages I get:
VALIDATION REPORT
Profile: LocalProfile1
Scheduler Type: Local
Stage: Cluster connection test (parcluster)
Status: Passed
Start Time: Wed Sep 07 15:08:24 PDT 2022
Finish Time: Wed Sep 07 15:08:24 PDT 2022
Running Duration: 0 min 0 sec
Description:
Error Report:
Command Line Output:
Debug Log:
Stage: Job test (createJob)
Status: Passed
Start Time: Wed Sep 07 15:08:24 PDT 2022
Finish Time: Wed Sep 07 15:08:59 PDT 2022
Running Duration: 0 min 35 sec
Description:
Error Report:
Command Line Output:
Debug Log:
Stage: SPMD job test (createCommunicatingJob)
Status: Passed
Start Time: Wed Sep 07 15:08:59 PDT 2022
Finish Time: Wed Sep 07 15:10:11 PDT 2022
Running Duration: 1 min 12 sec
Description: Job ran with 64 workers.
Error Report:
Command Line Output:
Debug Log:
Stage: Pool job test (createCommunicatingJob)
Status: Passed
Start Time: Wed Sep 07 15:10:11 PDT 2022
Finish Time: Wed Sep 07 15:10:58 PDT 2022
Running Duration: 0 min 47 sec
Description: Job ran with 64 workers.
Error Report:
Command Line Output:
Debug Log:
Stage: Parallel pool test (parpool)
Status: Failed
Start Time: Wed Sep 07 15:10:58 PDT 2022
Finish Time: Wed Sep 07 15:16:02 PDT 2022
Running Duration: 5 min 5 sec
Description: Failed to initialize the interactive session.
Error Report: Failed to initialize the interactive session.
Caused by:
Error using parallel.internal.pool.InteractiveClient>iThrowIfBadParallelJobStatus (line 789)
The interactive communicating job failed with no message.
Command Line Output:
Debug Log:
Error using parpool (line 139)
Parallel pool failed to start with the following error. For more detailed information, validate the profile 'LocalProfile1' in the Cluster
Profile Manager.
Error using parallel.internal.pool.InteractiveClient>iThrowWithCause (line 678)
Failed to initialize the interactive session.
Error in parallel.internal.pool.InteractiveClient/start (line 376)
iThrowWithCause( 'parallel:convenience:FailedToInitializeInteractiveSession', err );
Error in parallel.internal.pool.AbstractClusterPool>iStartClient (line 826)
spmdInitialized = client.start(poolType , numWorkers, cluster, ...
Error in parallel.internal.pool.AbstractClusterPool.hBuildPool (line 596)
iStartClient(client, 'pool', cluster, guiMode, supportRestart, argsList);
Error in parallel.internal.types.ValidationStages>iOpenPoolForCluster (line 399)
aPool = parallel.internal.pool.AbstractClusterPool.hBuildPool('Cluster', cluster, 'NumWorkers', numWorkers);
Error in parallel.internal.types.ValidationStages>@()iOpenPoolForCluster(runInfo)
Error in parallel.internal.types.ValidationStages>iCallWithNoHotlinks (line 311)
[varargout{1:nargout}] = fcn();
Error in parallel.internal.types.ValidationStages>iRunParpoolStage (line 226)
[commandWindowOutput, aPool] = evalc(iWrapForEvalc(openPoolFcn));
Error in parallel.internal.types.ValidationStages/run (line 55)
[eventData, runInfo] = obj.RunFunction(obj, runInfo);
Error in parallel.internal.validator.Validator/runValidationSuite (line 191)
[eventData, stageRunInfo] = currentStage.run(stageRunInfo);
Error in parallel.internal.validator.Validator/validate (line 103)
status = obj.runValidationSuite(profileName, suite);
Error in parallel.internal.ui.AbstractValidationManager/validate (line 36)
obj.Validator.validate(profileName, validationSuite);
Error in parallel.internal.ui.ValidationManager.validateProfile (line 36)
parallel.internal.ui.ValidationManager.getOrCreateInstance().validate(profileName, suite);
Caused by:
Error using parallel.internal.pool.InteractiveClient>iThrowIfBadParallelJobStatus (line 789)
The interactive communicating job failed with no message.
4 件のコメント
Raymond Norris
2022 年 9 月 8 日
Let's try one thing to get some more diagnostics and then turn this over to Techinical Support (support@mathworks.com).
pctconfig('preservejobs',true);
local = parcluster("local");
pool = local.parpool(64);
<... wait for error ...>
local.getDebugLog(local.Jobs(end))
It would also be interesting to see if you have issues with <64 workers (i.e., try 32 and/or 48 workers).
回答 (0 件)
参考
カテゴリ
Help Center および File Exchange で Parallel Computing Fundamentals についてさらに検索
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!