How to train DDPG episode reward more better?
2 ビュー (過去 30 日間)
古いコメントを表示
I'm training a DDPG agent from the Reinforcement Learning toolbox. But as you can see, my episode reward never change. I try so many way to fix this problem. Like change the netwoek, Gradient Threshold, Learning Rate. But the result will be the same. I check my reward funtion, if the situation is eligible I will give it some reward or penalty. But its reward is always be same.
data:image/s3,"s3://crabby-images/0dc67/0dc672ee5c56eeac4cf8adec6d528072921ea198" alt=""
Is my condtion have some problem? Or my results are not input into the model? I dont have anyway to do.
2 件のコメント
Emmanouil Tzorakoleftherakis
2020 年 2 月 28 日
How did you set the IsDone flag? This may lead to premature episode termination
Guoge Tan
2020 年 5 月 25 日
Hi, sorry to bother you, but I'd like to ask if your problem is solved or not? I‘m working on a path planning problem using the Reinforcement Learning toolbox on MATLAB R2020a and I also encountered a problem similar to yours.data:image/s3,"s3://crabby-images/73e28/73e2856651a2a7ba809407021391250c2ab29cf0" alt=""
data:image/s3,"s3://crabby-images/73e28/73e2856651a2a7ba809407021391250c2ab29cf0" alt=""
回答 (0 件)
参考
カテゴリ
Help Center および File Exchange で Training and Simulation についてさらに検索
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!