Multiple Parallel Jobs on Server

Shumao Zhang, 10 February 2022
Answered: Edric Ellis, 11 February 2022
Hi,
Let's say I have a script "main.m" which uses the Parallel Computing Toolbox for parallelization. I have a server (Mac) with 20+ CPU cores, and would like to run multiple "main.m" jobs at the same time for a parameter search. Each "main.m" job only needs 4 cores. I opened multiple instances of MATLAB and ran one job in each instance. However, I found that each job slows down significantly (compared to running just a single job). I guess it might be because some cores are used by more than one job. Suppose 4 * (number of jobs) < 20; what would be a suggested way to achieve this? Is there a way to ensure that each job uses different cores?
As a remark, "main.m" is a script and is not a function.
Thanks!
2 Comments
Benjamin Thompson, 10 February 2022
I don't think you can assign jobs to cores like that. Have you tried reducing the size of your parallel pool to maybe 5 instead of 20?
Shumao Zhang, 10 February 2022
Thanks for the reply. However, each job only needs 4 cores, so the size of my parallel pool is 4.
I need to, for example, run two jobs at the same time, which means ideally I just need 8 cores, assigned to these two jobs without sharing.
When I tried opening two MATLAB instances, it seemed that some cores were shared by the two jobs, so each job slowed down significantly.


Answers (1)

Edric Ellis, 11 February 2022
You say that the "main.m" script uses 4 cores to do parallel computing. I presume this means that it contains a parfor loop. One way you could launch multiple different copies of "main.m" is by using the batch command. Each time you call batch, you get back a parallel.Job object which lets you see how the computation is going and retrieve the results at the end. By using the 'Pool' parameter to the batch command, you can give the job a parallel pool for running the parfor loops. You might do something like this:
for idx = 1:3
    param = idx; % param is a variable read by your script "main.m"
    job(idx) = batch('main', 'Pool', 4); % Launches "main.m" with a parallel pool of 4 workers
end
% Now that all jobs are launched, we can wait for them to complete and
% collect the results
for idx = 1:3
    wait(job(idx));
    results{idx} = load(job(idx)); % This will retrieve variables created by "main.m"
end
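One thing that may be worth checking before launching is whether the cluster profile has enough workers for all the jobs to run at once. As far as I know, a batch job started with 'Pool', N occupies N + 1 workers (one runs the script itself, the other N form its pool), so jobs that do not fit wait in the queue. The sketch below is only an illustration: poolSize and numJobs are example values, and it assumes the default cluster profile is the local machine.

```matlab
% Sketch only: poolSize and numJobs are illustrative values, not from the post.
poolSize = 4;                     % workers in each job's parallel pool
numJobs  = 3;                     % copies of "main.m" you intend to launch

c = parcluster();                 % default cluster profile (assumed to be local)
workersPerJob = poolSize + 1;     % one extra worker runs the script itself
maxConcurrent = floor(c.NumWorkers / workersPerJob);

fprintf('Profile has %d workers: about %d of %d jobs can run at once.\n', ...
    c.NumWorkers, maxConcurrent, numJobs);

% List the jobs currently known to this cluster and their states
% (for example queued, running, or finished).
jobs = c.Jobs;
for k = 1:numel(jobs)
    fprintf('Job %d: %s\n', jobs(k).ID, jobs(k).State);
end
```

If maxConcurrent comes out smaller than the number of jobs you want running together, the remaining jobs should simply start once earlier ones finish.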

Release

R2021b
