How do I define a continuous reward function for RL environment?
古いコメントを表示
I am trying to follow the double integrator example for giving a continuous reward function. When I used the custom template, and defined the reward using the QR cost function, I get an error stating that the reward should be a scalar value. Where can I find the property of reward and change it to accept vector values?
3 件のコメント
Emmanouil Tzorakoleftherakis
2020 年 10 月 12 日
Not sure why you want the reward to be scalar. Typically, rewards are treated as cost functions - they output a scalar value. If you have more than one states, you can turn it into a scalar using e.g. an l2 norm for example/some distance metric.
Prashanth Chivkula
2020 年 10 月 12 日
Emmanouil Tzorakoleftherakis
2020 年 10 月 12 日
That's right
採用された回答
その他の回答 (0 件)
カテゴリ
ヘルプ センター および File Exchange で Environments についてさらに検索
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!