Parallelization of SVD on research clusters

Question

Kamil Dylewicz, 2 April 2022


Direct link to this question

https://jp.mathworks.com/matlabcentral/answers/1687014-parallelization-of-svd-on-research-clusters

Commented: Steven Lord, 2 April 2022

Hello MATLAB Community,

Currently, I am trying to perform a singular value decomposition of big datasets in MATLAB using the svd() command. However, I run into memory problems when forming and storing the matrices, since the datasets are large (full flow fields of CFD simulations).

Luckily, I do have access to a research cluster with multiple nodes (~200 GB of memory each, or ~500 GB in the case of hi-mem nodes). I'd like to use 2 nodes (or more in the future), i.e. 2x40 processors, to perform the SVD.

I am not certain how to parallelize the SVD operation so that it distributes the workload over 80 workers, using all the memory of 2 nodes if needed. It is also important that the solution scales, so that in the future I can increase the number of nodes for bigger problems.

Can anyone in the community help me achieve this goal? Are there any resources (couldn't find any so far) by MathWorks on how to do this?

Many thanks in advance,

Kamil

1 Comment

Steven Lord, 2 April 2022


How sparsely populated are your matrices? If they are sparsely populated enough, try creating them as sparse matrices and using svds instead of svd to obtain some of the singular values (largest magnitude, smallest magnitude, etc.)

F = eye(10000); % 10k-by-10k full double matrix
S = speye(10000); % 10k-by-10k sparse double matrix, 1/10000 of the elements are nonzero

Look at the difference in memory consumption caused by not storing the (many) 0 values in S.

whos F S
  Name          Size                   Bytes  Class     Attributes

  F         10000x10000            800000000  double              
  S         10000x10000               240008  double    sparse    
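
The byte counts above can be reproduced by hand, and svds can then be called on a sparse matrix to get only a few singular values. The following is a minimal sketch: the test matrix (a sparse matrix with known singular values, so the result can be checked), its size, and k = 6 are illustrative assumptions, not values from the question.

```matlab
% Storage check for the whos output above (double precision, 64-bit MATLAB).
n = 10000;
fullBytes   = 8 * n^2;                  % 8 bytes per stored element
% Sparse storage: 8-byte value + 8-byte row index per nonzero,
% plus n+1 column pointers of 8 bytes each. speye(n) has n nonzeros.
sparseBytes = 16 * n + 8 * (n + 1);

% Hypothetical sparse test matrix with known singular values 1, 1/2, 1/4, ...
m = 2000;  p = 500;
trueSigma = 0.5 .^ (0:p-1);
A = sparse(1:p, 1:p, trueSigma, m, p);  % m-by-p with p nonzeros

k = 6;                                  % number of singular triplets wanted
[U, S, V] = svds(A, k);                 % k largest singular values by default
sigma = diag(S);

% Residual of the truncated factorization; should be near round-off.
res = norm(A*V - U*S, 'fro');
fprintf('full: %d bytes, sparse: %d bytes, residual: %.2e\n', ...
        fullBytes, sparseBytes, res);
```

Whether this route helps depends on how sparse the real flow-field data is; dense snapshot matrices gain nothing from sparse storage.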


Answer 1

Riccardo Scorretti, 2 April 2022


Direct link to this answer

https://jp.mathworks.com/matlabcentral/answers/1687014-parallelization-of-svd-on-research-clusters#answer_932989

Hi Kamil, perhaps this answer will help you:

https://fr.mathworks.com/matlabcentral/answers/247506-undefined-function-svd-for-distributed-matrix


Answer 2

Raymond Norris, 2 April 2022


Direct link to this answer

https://jp.mathworks.com/matlabcentral/answers/1687014-parallelization-of-svd-on-research-clusters#answer_933159


Building off of Riccardo's suggestion to look at Edric's post: Edric suggests that you build the distributed array directly on the workers. In his example, he shows

D = rand(1000, 'distributed');

If, however, you're not using rand (or another helper function such as ones or zeros), then you'll want to create the distributed array using the codistributed constructor, assembling it from variant arrays (local parts that differ on each worker). The advantage of codistributed arrays is that you can devise your own scheme for how the distributed array is laid out.

Sounds like you have 2 nodes (for now), each with 40 cores. Something to consider is the performance of 80 workers vs just 2 workers, but with one worker per node. In most schedulers, you can tailor the job submission. For example

% Create your pool of workers
cluster = parcluster('myScheduler');
cluster.AdditionalProperties.ProcPerNode = 1;
cluster.AdditionalProperties.ExclusiveNode = true;
pool = cluster.parpool(2);

The AdditionalProperties fields above are pseudocode; they would need to be added to, and handled by, your cluster object. For information on adding properties, contact Technical Support (support@mathworks.com). For the sake of discussion, I'm also assuming you're not using MJS; otherwise, getting workers to run on two distinct nodes would be a bit different.

Next, build up the local parts of the matrix and then calculate the SVD. In this case, I am using rand to generate the data, but this might be data read from images, etc.

spmd
    % Build scheme of distributed array A
    N = 1000;
    j = getCurrentJob;
    % Each worker contributes one column, so A is N^2-by-(number of workers),
    % i.e. N^2-by-2 for a two-worker pool
    workers = numel(j.Tasks);
    globalSize = [N^2, workers];
    % Distribute along dimension 2 (columns) with the default partition
    codistr = codistributor1d(2, codistributor1d.unsetPartition, globalSize);
    % Create the local part.  Using rand, but this might be data read from
    % an image file.
    local_a = rand(N);
    % Reshape to be a column vector
    local_a = local_a(:);
    % Stitch local parts together to create distributed array A
    A = codistributed.build(local_a, codistr);

    % Calculate the economy-size SVD.  A full SVD of this tall matrix would
    % request an N^2-by-N^2 U, which does not fit in memory.
    [U, S, V] = svd(A, 'econ');
end
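
The example above stores one N^2-element column per worker, so A is very tall and narrow. For such a matrix, a full SVD asks for an N^2-by-N^2 U, whereas the economy-size form keeps U the same size as A. The sketch below works through that arithmetic and then shows one way to bring only the small results back to the client. It assumes Parallel Computing Toolbox, an open pool (a local pool is enough to try it), and a release in which svd accepts 'econ' and norm accepts 'fro' for distributed arrays; the 2000-by-8 stand-in matrix is purely illustrative.

```matlab
% Memory needed for U when A is m-by-n with m >> n (double precision).
N = 1000;  m = N^2;  n = 2;
fullU_TB = 8 * m^2 / 1e12;     % full SVD: U is m-by-m
econU_MB = 8 * m * n / 1e6;    % economy-size SVD: U is m-by-n

% Small stand-in for the snapshot matrix, created directly on the workers.
A = rand(2000, 8, 'distributed');
[U, S, V] = svd(A, 'econ');    % factors remain distributed across the pool

% Bring only small results back to the client.
sigma  = gather(diag(S));      % the 8 singular values
relErr = gather(norm(A - U*S*V', 'fro') / norm(A, 'fro'));
fprintf('full U: %.0f TB, econ U: %.0f MB, relative error: %.1e\n', ...
        fullU_TB, econU_MB, relErr);
```

Calling gather on U itself would copy the whole factor into client memory, so for large problems it is usually better to keep U distributed and gather only reduced quantities such as the singular values.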
