Resume training of a DQN agent: how to prevent epsilon from being reset to its maximum value?


Cecilia S., 9 June 2021


Commented: Cecilia S., 22 June 2021

When I want to resume training an agent, I simply load it and set the "resetexperiencebuffer" option to false, but this does not prevent exploration (which depends on epsilon) from starting anew. Is there any way to make the agent resume from the exact point where it left off, without manually setting the epsilon value?


Accepted Answer

Emmanouil Tzorakoleftherakis, 22 June 2021


Hello,

This is currently not possible, but it is a great enhancement idea. I have informed the developers about your request and it will be considered for a future release.
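
Until such an option exists, the manual route mentioned in the question seems to be the only one: work out the value epsilon had reached when training stopped, and assign it to the loaded agent before resuming. Below is a minimal, language-neutral sketch of that arithmetic in Python. It assumes the common multiplicative schedule, epsilon <- epsilon * (1 - decay) once per training step with a floor at a minimum value; the function name `decayed_epsilon` and all the numbers are hypothetical, and you would need to check the schedule your own agent options actually use.

```python
def decayed_epsilon(epsilon0: float, decay: float, epsilon_min: float, steps: int) -> float:
    """Estimate epsilon after `steps` training steps.

    Assumption (not taken from the thread): epsilon is multiplied by
    (1 - decay) once per step and never drops below epsilon_min.
    """
    if steps < 0:
        raise ValueError("steps must be non-negative")
    if not 0.0 <= decay <= 1.0:
        raise ValueError("decay must lie in [0, 1]")
    # Closed form of applying eps <- eps * (1 - decay) `steps` times, then flooring.
    return max(epsilon_min, epsilon0 * (1.0 - decay) ** steps)


if __name__ == "__main__":
    # Hypothetical values: start at 1.0, decay 0.005 per step, floor 0.01,
    # and 400 agent steps completed before training was interrupted.
    print(decayed_epsilon(1.0, 0.005, 0.01, 400))
```

The result would then be set as the starting epsilon in the loaded agent's exploration options before calling train again. This only requires knowing the total number of agent steps taken so far, which can usually be recovered from the saved training statistics.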