Trying to include learning rate and momentum in sgdmupdate function under multiple GPUs

1 回表示 (過去 30 日間)

hyeonjin kim 2022 年 3 月 1 日

0
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/1660885-trying-to-include-learning-rate-and-momentum-in-sgdmupdate-function-under-multiple-gpus

コメント済み: hyeonjin kim 2022 年 4 月 1 日

I am working on modifying the example given in

"https://www.mathworks.com/help/deeplearning/ug/train-network-in-parallel-with-custom-training-loop.html?searchHighlight=sgdmupdate%20multiple%20gpu&s_tid=srchtitle_sgdmupdate%2520multiple%2520gpu_3".

The modification is that I am trying to incorporate learning rate and momentum into the sgdmupdate function.

That is,

[dlnet.Learnables,workerVelocity] = sgdmupdate(dlnet.Learnables,workerGradients,workerVelocity, learning rate, momentum);

instead of

[dlnet.Learnables,workerVelocity] = sgdmupdate(dlnet.Learnables,workerGradients,workerVelocity); (line 108 in the example).

However, that modification results in the following error in the case of using 4 GPUs (no error with a single GPU)

Would there be anyone who could help me on this ?

Thanks very much !!!

3 件のコメント
1 件の古いコメントを表示1 件の古いコメントを非表示

Joss Knight 2022 年 3 月 10 日

Also make sure all your gradients are finite with something like assert(all(cellfun(@(x)all(isfinite(x(:))),workerGradients))). Sometimes when you mess with the learn rate you can get NaNs and infinities in the gradients.

hyeonjin kim 2022 年 4 月 1 日

Dear Joss,

First of all, I am sorry to get back to you this late.

Thanks very much for your help. I will try to fix the problem according to your comments once our workstation with GPUs becomes available again.

Thanks very much !

サインインしてコメントする。

サインインしてこの質問に回答する。

回答 (0 件)

サインインしてこの質問に回答する。

カテゴリ

Parallel Computing Parallel Computing Toolbox GPU Computing

Help Center および File Exchange で GPU Computing についてさらに検索

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by

Trying to include learning rate and momentum in sgdmupdate function under multiple GPUs

3 件のコメント
1 件の古いコメントを表示1 件の古いコメントを非表示

回答 (0 件)

参考

カテゴリ

タグ

Community Treasure Hunt

Trying to include learning rate and momentum in sgdmupdate function under multiple GPUs

3 件のコメント 1 件の古いコメントを表示1 件の古いコメントを非表示

回答 (0 件)

参考

カテゴリ

タグ

Community Treasure Hunt

3 件のコメント
1 件の古いコメントを表示1 件の古いコメントを非表示