I see a zero mean reward for the first agent in multi-agent RL Toolbox
4 ビュー (過去 30 日間)
古いコメントを表示
Hello, I have extended the PPO Coverage coverage path planning example of the Matlab for 5 agents. I can see now that always, I have a reward for the first agent, and the problem is always, I see a zero mean reward in the toolbox for the first agent like the following image which is not correct. Do you have any idea what is happening there?
![](https://www.mathworks.com/matlabcentral/answers/uploaded_files/1479366/image.png)
0 件のコメント
回答 (0 件)
参考
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!