Reinforcement Learning Toolbox - When does the algorithm train?
Hans-Joachim Steinort
17 Sep 2019
Commented: Hans-Joachim Steinort
26 Sep 2019
I am currently using the Reinforcement Learning Toolbox with a DQN agent embedded in a long-running process simulation.
The maximum step count is currently 8000 steps per episode.
Unfortunately, the documentation seems a little ambiguous to me, so here is my question:
Does the train function of the Reinforcement Learning Toolbox update the agent at the end of an episode, or during the episode once the step count exceeds the mini-batch size (as in the baseline algorithms)?
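To make the two possibilities concrete, here is a minimal Python sketch that only counts how many gradient updates each schedule would produce in one 8000-step episode. It is not toolbox code and says nothing about what the toolbox actually does. The function name `count_updates`, the schedule labels, the mini-batch size of 64, and the rule "update once the buffer holds at least one mini-batch" are all assumptions made for illustration.

```python
def count_updates(steps_per_episode, minibatch_size, schedule):
    """Count gradient updates in one episode under a hypothetical schedule.

    schedule:
        "per_step"    -> one update after every step, once the replay buffer
                         holds at least `minibatch_size` transitions
        "per_episode" -> a single update after the episode has finished
    """
    buffer_size = 0
    updates = 0
    for _ in range(steps_per_episode):
        buffer_size += 1  # store the transition from this step
        if schedule == "per_step" and buffer_size >= minibatch_size:
            updates += 1  # learn during the episode
    if schedule == "per_episode" and buffer_size >= minibatch_size:
        updates += 1  # learn only once, at the end of the episode
    return updates


# 8000 steps per episode is from my setup; 64 is an assumed mini-batch size.
per_step = count_updates(8000, 64, "per_step")
per_episode = count_updates(8000, 64, "per_episode")
print(per_step, per_episode)
```

With episodes this long, the two schedules differ by several thousand updates per episode, which is why the distinction matters for my simulation.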
Thank you in advance.
Accepted Answer