Error using parpool inside a SC with MCR v98 (R2020a), with SLURM as the job scheduler
Hello all,
I was running a compiled standalone application that uses Parallel Computing Toolbox with MCR v98 (R2020a) inside a SC, and it ran normally, that is, I got the results I wanted. After some other tests, and without modifying anything in the compiled standalone app, I now get this error output file:
Parallel pool failed to start with the following error.
Error in StackCurrentF/OpenParPool (line 551)
Error in StackCurrentF (line 87)
Caused by:
Error using parallel.internal.pool.InteractiveClient>iThrowWithCause (line 670)
Failed to locate and destroy old interactive jobs.
Error using parallel.Cluster/findJob (line 74)
Unknown type: concurrentconcurrent.
parallel:cluster:PoolCreateFailed
So there is no parallel computation. This happens even when I run a small interactive job with srun that only opens the pool, waits, and then closes it.
What can be the problem?
Any insights or past experiences with similar problems would be a great help.
Thank you!
1 Comment
Edric Ellis
2 Apr 2024
I suggest contacting MathWorks support who should be able to help resolve this.
Accepted Answer
R
8 May 2024
I previously encountered this error when the local job storage location was being accessed simultaneously by multiple jobs or users. I resolved it by implementing the solution provided in the following MATLAB Answer:
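The linked answer is not reproduced in this thread. As a minimal sketch of one common way to keep concurrent jobs from sharing a job storage location, the pool-opening code could point the cluster object at a folder unique to each SLURM job before calling parpool. This assumes the default 'local' profile is the one in use, that SLURM_JOB_ID is set in the job environment, and that the temporary folder is writable on the compute node; the folder naming scheme is hypothetical. For a compiled standalone app, this would need to go into the source (for example in the function that opens the pool) before recompiling.

```matlab
% Sketch only: give each SLURM job its own job storage folder
% so that concurrent jobs/users do not touch the same location.
jobId = getenv('SLURM_JOB_ID');      % set by SLURM inside a job; empty otherwise
if isempty(jobId)
    storageDir = tempname;           % fall back to a unique temporary folder name
else
    % Hypothetical naming scheme: one folder per SLURM job ID
    storageDir = fullfile(tempdir, ['matlab_jobs_' jobId]);
end
if ~exist(storageDir, 'dir')
    mkdir(storageDir);
end

c = parcluster('local');             % assumption: default local profile
c.JobStorageLocation = storageDir;   % redirect this job's pool files
pool = parpool(c);                   % open the pool on the modified cluster object

% ... parallel work goes here ...

delete(pool);                        % close the pool
rmdir(storageDir, 's');              % remove the per-job folder afterwards
```

If the default storage location already holds stale or corrupted job files from earlier runs, they may also need to be cleared separately; that is a guess based on the "Failed to locate and destroy old interactive jobs" message, not something confirmed in this thread.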