フィルターのクリア

Use saved reinforcement learning DDPG agent

5 ビュー (過去 30 日間)
Sayak Mukherjee
Sayak Mukherjee 2020 年 9 月 26 日
I have saved DDPG agent using the optiopn
rlTrainingOptions.SaveAgentValue = 3000
During the simulations number of agents are saved that have episode value greater than 3000. However when I am trying to use the exact same agent for simulation using the command:
simOptions = rlSimulationOptions('MaxSteps',maxSteps);
experience = sim(env,saved_agent,simOptions);
But i an not getting the exact same response as I got during the training. My variance is 0.5 and my variance decay rate is 1e-4. How to replicate the behavior that I got during training using the same agent

回答 (1 件)

Emmanouil Tzorakoleftherakis
Emmanouil Tzorakoleftherakis 2020 年 9 月 29 日
Hello,
Please see my response here. In short, the behavior you see during training and after training are not expexted to match 100%.

カテゴリ

Help Center および File ExchangeTraining and Simulation についてさらに検索

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by