How to train DDPG episode reward more better?

2 ビュー (過去 30 日間)

hunson yang 2020 年 2 月 26 日

1
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/507677-how-to-train-ddpg-episode-reward-more-better

コメント済み: Guoge Tan 2020 年 5 月 25 日

I'm training a DDPG agent from the Reinforcement Learning toolbox. But as you can see, my episode reward never change. I try so many way to fix this problem. Like change the netwoek, Gradient Threshold, Learning Rate. But the result will be the same. I check my reward funtion, if the situation is eligible I will give it some reward or penalty. But its reward is always be same.

Is my condtion have some problem? Or my results are not input into the model? I dont have anyway to do.

2 件のコメント
なしを表示なしを非表示

Emmanouil Tzorakoleftherakis 2020 年 2 月 28 日

How did you set the IsDone flag? This may lead to premature episode termination

Guoge Tan 2020 年 5 月 25 日

Hi, sorry to bother you, but I'd like to ask if your problem is solved or not? I‘m working on a path planning problem using the Reinforcement Learning toolbox on MATLAB R2020a and I also encountered a problem similar to yours.