DDPG Not getting any reward? Using the example from MathWorks

Jun, October 27, 2020
I am using the water pump example from MathWorks for DDPG reinforcement learning, but I changed the pump model slightly. Now I get no reward when I run the program: it is always zero. I changed the reward to always be 10, to check whether the agent simply wasn't learning, but the reported reward is still 0. Does anyone have any ideas? This is my first post, so sorry if I am not submitting it correctly. Thanks!
2 Comments
Walter Roberson, October 27, 2020
It would help if you could attach your files so that people can replicate the problem.
Jun, October 27, 2020
Sorry, here you go. I am trying to have the agent change a resistor in a voltage divider network, as a starting point for a more advanced circuit. I am not an expert in reinforcement learning, so I have been adapting the example provided by MATLAB.


Answers (1)

Emmanouil Tzorakoleftherakis, November 11, 2020
Hi Jun,
The IsDone input to the agent block is always true, so every episode ends prematurely. This is why you don't see any change in the reward. Change the conditions that set the IsDone flag so that it becomes true only when an episode should actually end, or set the flag to false, and training will proceed.
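To illustrate the effect of a termination flag that is always true, here is a minimal toy sketch in Python. It is not Simulink or Reinforcement Learning Toolbox code: the function `run_episode`, the constant reward of 10 per step, and the 100-step limit are all assumptions made for illustration. It also does not reproduce the exact zero shown in the training plot, only the fact that the episode stops before anything meaningful can accumulate.

```python
from typing import Callable, Tuple


def run_episode(
    is_done: Callable[[int], bool],
    reward_per_step: float = 10.0,  # hypothetical constant reward
    max_steps: int = 100,           # hypothetical episode length limit
) -> Tuple[int, float]:
    """Run one toy episode and return (steps taken, total reward)."""
    total = 0.0
    steps = 0
    for step in range(1, max_steps + 1):
        total += reward_per_step
        steps = step
        # The episode stops as soon as the termination flag is true.
        if is_done(step):
            break
    return steps, total


# Flag hard-wired to true: the episode ends on the very first step.
print(run_episode(lambda step: True))

# Flag tied to a real stopping condition (here: step 50 reached).
print(run_episode(lambda step: step >= 50))

# Flag always false: the episode runs until the step limit.
print(run_episode(lambda step: False))
```

In this toy loop, the first call always stops after a single step no matter what the agent does, so the episode return can never change from one episode to the next. The other two calls let the return build up over many steps, which is what training needs in order to show progress.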
