agent doesn't take different actions to different states
5 ビュー (過去 30 日間)
古いコメントを表示
Hello everyone,
I have two issues:
- I wasn't able to set up the environment so that the agent takes 24 different actions over the course of a day, meaning the agent takes one action every hour. As a workaround, I decided to train agents by the hour.
- The second issue, which is the reason for my question, arises after training the agent. When I test the efficiency of its decision-making and run the simulation part of the RL Toolbox, I notice that the agent always takes the same action regardless of the state of the environment. This leads me to believe that the training process determines the best action for a set of states, which is not what I want. I want the agent to take the correct action for different states. I've been analyzing my environment code but can't figure out why the agent behaves this way.
Thank you in advance.
Bryan
3 件のコメント
Alan
2024 年 7 月 4 日
編集済み: Alan
2024 年 7 月 4 日
Hi Bryan,
Could you describe your environment a bit more? The following is some information I would like to know:
- What happens in each step of the episode? Does a step span an hour or 24h?
- How have you modeled your reward function? Does it incentivize the agent well?
- What agent are you using?
It would be great if you can share the environment file and the train script as well.
Regards.
回答 (0 件)
参考
カテゴリ
Help Center および File Exchange で Agents についてさらに検索
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!