Reinforcement Learning experience buffer length and parallelisation toolbox

Question

Tech Logg Ding 2020 年 12 月 2 日

0
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/673448-reinforcement-learning-experience-buffer-length-and-parallelisation-toolbox

編集済み: Emmanouil Tzorakoleftherakis 2020 年 12 月 3 日

When parallelisation is used when training a DDPG agent with the following settings:

trainOpts.UseParallel = true;
trainOpts.ParallelizationOptions.Mode = 'async';
trainOpts.ParallelizationOptions.StepsUntilDataIsSent = -1;
trainOpts.ParallelizationOptions.DataToSendFromWorkers = 'Experiences';

Does the the parallel simulations have their own experience buffer? This could take up more memory hence I am hoping that only one experience buffer is stored to update the critic network.

From the documentations, it seems like there will only be one experience buffer as the experiences are sent back to the host.

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

Answer 1

Emmanouil Tzorakoleftherakis 2020 年 12 月 3 日

0
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/673448-reinforcement-learning-experience-buffer-length-and-parallelisation-toolbox#answer_564503

編集済み: Emmanouil Tzorakoleftherakis 2020 年 12 月 3 日

Hello,

There is one big experience buffer on the host, the size of which you determine as usual in your agent options. Each worker has a much smaller buffer to collect experiences until you reach "StepsUntilDataIsSent".

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

Reinforcement Learning experience buffer length and parallelisation toolbox

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

Reinforcement Learning experience buffer length and parallelisation toolbox

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示