DDPG Not getting any reward? Using the example from MathWorks

I am using the water pump example from Mathworks for the DDPG Reinforcement example, but I changed the pump model slightly and I am not getting any reward when I run the program, it is always zero. I changed the reward to always be 10 just to see if maybe it just wasnt learning, but it still only shows a reward of 0. Does anyone have any ideas? This is my first post so sorry if I am not submitting it correctly. Thanks!

2 件のコメント

Walter Roberson
Walter Roberson 2020 年 10 月 27 日
It would help if you could attach your files so people could replicate the problem
Jun
Jun 2020 年 10 月 27 日
Sorry, here you go. I am trying to have it change a resistor for a voltage divider network as a start for a more advanced circuit. But I am not an expert at reinforced learning so I have been adjusting the example provided by Matlab.

サインインしてコメントする。

回答 (1 件)

Emmanouil Tzorakoleftherakis
Emmanouil Tzorakoleftherakis 2020 年 11 月 11 日

0 投票

Hi Jun,
The IsDone input to the agent block is always true, so all episodes end prematurely. This is why you don't see any change in the reward. Change the conditions that set the IsDone flag to be true or set it to be false and the training will resume.

カテゴリ

ヘルプ センター および File ExchangeReinforcement Learning Toolbox についてさらに検索

質問済み:

Jun
2020 年 10 月 27 日

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by