Pausing reinforcement learning by forcing

Question

SHromaneko, January 2, 2024

Direct link to this question: https://jp.mathworks.com/matlabcentral/answers/2065726-pausing-reinforcement-learning-by-forcing

Edited: SHromaneko, January 10, 2024

I'm running reinforcement learning, and sometimes training does not seem to be going well and I want to stop it.

If I stop the agent training started with the code below using "Stop Training", Simulink freezes and stops responding.

Repeatedly pressing the Escape key does not help either.

How can I stop the training properly?

Is there something wrong with the code?

% Observation specification
numObs = 9;
obsInfo = rlNumericSpec([numObs 1]);
obsInfo.Name = "observations";

% Open the Simulink model
mdl = "kineticmodel_wIAVFBC_IncS2Ea_NH3FBC";
open_system(mdl)

% Action specification (one action, bounded to [0, 1])
numAct = 1;
actInfo = rlNumericSpec([numAct 1],LowerLimit=0,UpperLimit=1);
actInfo.Name = "NH3";

% Create the environment from the RL Agent block
blk = mdl + "/RL agent/RL Agent";
env = rlSimulinkEnv(mdl,blk,obsInfo,actInfo);

% createDDPGAgent is a helper function that is not shown in the question.
Ts = 1;
agent = createDDPGAgent(numObs,obsInfo,numAct,actInfo,Ts);

maxEpisodes = 2000;
Tf = 1240*3;
maxSteps = floor(Tf/Ts);

trainOpts = rlTrainingOptions(...
    MaxEpisodes=maxEpisodes,...
    MaxStepsPerEpisode=maxSteps,...
    ScoreAveragingWindowLength=250,...
    Verbose=false,...
    Plots="training-progress",...
    StopTrainingCriteria="EpisodeCount",...
    StopTrainingValue=maxEpisodes,...
    SaveAgentCriteria="EpisodeCount",...
    SaveAgentValue=maxEpisodes);

doTraining = true;
if doTraining
    % Train the agent.
    trainingStats = train(agent,env,trainOpts);
else
    % Load a pretrained agent for the selected agent type.
    % AgentSelection is not defined in the code shown.
    if strcmp(AgentSelection,"DDPG")
        load("rlWalkingBipedRobotDDPG.mat","agent")
    else
        load("rlWalkingBipedRobotTD3.mat","agent")
    end
end


Answer 1

Emmanouil Tzorakoleftherakis, January 9, 2024

Direct link to this answer: https://jp.mathworks.com/matlabcentral/answers/2065726-pausing-reinforcement-learning-by-forcing#answer_1386366

The proper way to stop it would be through the Episode Manager (top right of the window). Does this not work for you?


SHromaneko, January 10, 2024

Edited: SHromaneko, January 10, 2024

It was just a bug. I reinstalled and that fixed it.

Thanks a lot.
