photo

Francisco Serra


Last seen: 2ヶ月 前 2024 年からアクティブ

Followers: 0   Following: 0

統計

Feeds

表示方法

質問


Why is my DDPG agent converging to a state where it gets continuous penalization, while having a state it can go with 0 penalization?
I am training a Reinforcement Learning DDPG agent to drive a vehicle to a reference. The vehicle dynamics are: x_dot = v*cos(...

5ヶ月 前 | 1 件の回答 | 0

1

回答