How to Use the Reinforcement Learning Toolbox to Draw Observations While Training?

岩滨 黄 on 30 Sep 2022
Hi!
How can I use the Reinforcement Learning Toolbox to plot observations while training? Here is my code:
ObservationInfo = rlNumericSpec([12 1]);
% Initialize Action settings
ActionInfo = rlNumericSpec([6 1], ...
'LowerLimit', [-1; -1; -1; -1; -1; -1], ...
'UpperLimit', [1; 1; 1; 1; 1; 1]);
% Environment
env = rlFunctionEnv(ObservationInfo,ActionInfo,'myStepFunction','myResetFunction');
% Sample time
Ts = 0.02;
%% Deep Neural Network Options
% Define the critic network
statePath = [
imageInputLayer([12 1 1],'Normalization','none','Name','observation')
fullyConnectedLayer(400,'Name','CriticStateFC1')
reluLayer('Name', 'Criticrelu1')
fullyConnectedLayer(300,'Name','CriticStateFC2')];
actionPath = [
imageInputLayer([6 1 1],'Normalization','none','Name','action')
fullyConnectedLayer(300,'Name','CriticActionFC1')];
commonPath = [
additionLayer(2,'Name','add')
reluLayer('Name','CriticCommonRelu')
fullyConnectedLayer(1,'Name','CriticOutput')];
criticNetwork = layerGraph();
criticNetwork = addLayers(criticNetwork,statePath);
criticNetwork = addLayers(criticNetwork,actionPath);
criticNetwork = addLayers(criticNetwork,commonPath);
criticNetwork = connectLayers(criticNetwork,'CriticStateFC2','add/in1');
criticNetwork = connectLayers(criticNetwork,'CriticActionFC1','add/in2');
criticOpts = rlRepresentationOptions('LearnRate',1e-03,'GradientThreshold',1);
critic = rlQValueRepresentation(criticNetwork,ObservationInfo,ActionInfo,...
'Observation',{'observation'},'Action',{'action'},criticOpts);
% Define the actor network
actorNetwork = [
imageInputLayer([12 1 1],'Normalization','none','Name','observation')
fullyConnectedLayer(400,'Name','ActorFC1')
reluLayer('Name','ActorRelu1')
fullyConnectedLayer(300,'Name','ActorFC2')
reluLayer('Name','ActorRelu2')
fullyConnectedLayer(6,'Name','ActorFC3')
tanhLayer('Name','ActorTanh')
scalingLayer('Name','ActorScaling','Scale',max(ActionInfo.UpperLimit))];
actorOpts = rlRepresentationOptions('LearnRate',1e-04,'GradientThreshold',1);
actor = rlDeterministicActorRepresentation(actorNetwork,ObservationInfo,ActionInfo,'Observation',{'observation'},'Action',{'ActorScaling'},actorOpts);
%% Set Agent and DDPG Options
agentOpts = rlDDPGAgentOptions(...
'SampleTime',Ts,...
'TargetSmoothFactor',1e-3,...
'ExperienceBufferLength',1e5,...
'DiscountFactor',0.99,...
'MiniBatchSize',128);
agentOpts.NoiseOptions.Variance = 0.6;
agentOpts.NoiseOptions.VarianceDecayRate = 1e-5;
agent = rlDDPGAgent(actor,critic,agentOpts);
%% Set Training Options
maxepisodes = 100;
trainOpts = rlTrainingOptions(...
'MaxEpisodes',maxepisodes,...
'MaxStepsPerEpisode',1000,...
'ScoreAveragingWindowLength',50,...
'Verbose',false,...
'Plots','training-progress',...
'StopTrainingCriteria','AverageReward',...
'StopTrainingValue',0,...
'SaveAgentCriteria','EpisodeReward',...
'SaveAgentValue',0);
%% Training
% Train the DDPG agent on the environment.
trainingStats = train(agent,env,trainOpts);
I would be grateful if you could help me!

Answers (1)

Emmanouil Tzorakoleftherakis on 25 Jan 2023
You can use the information on plotting and visualization from this page to plot and visualize data during training.
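For example, because the environment is created with rlFunctionEnv and a custom 'myStepFunction', one option that needs no extra toolbox features is to update a live plot inside that step function, which runs once per agent step. Below is a minimal sketch; the dummy dynamics (random observations, placeholder reward) stand in for your own environment code, and only the first observation channel is plotted:
function [NextObs,Reward,IsDone,LoggedSignals] = myStepFunction(Action,LoggedSignals)
% Placeholder dynamics -- replace with your own environment update.
NextObs = rand(12,1);
Reward  = -norm(Action);
IsDone  = false;
% Live plot of the first observation channel while training.
persistent obsLine stepCount
if isempty(obsLine) || ~isvalid(obsLine)
    figure('Name','Observations during training');
    obsLine = animatedline('Marker','.');
    xlabel('Agent step'); ylabel('Observation(1)');
    stepCount = 0;
end
stepCount = stepCount + 1;
addpoints(obsLine,stepCount,NextObs(1));
drawnow limitrate % throttle redraws so plotting does not noticeably slow training
end
The persistent variables keep the figure handle and step counter alive across calls, so the curve accumulates over episodes; if you close the figure it is recreated and the counter restarts.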
3 Comments
Harold on 31 Mar 2025
Hello @Emmanouil Tzorakoleftherakis, I'm sorry, but I don't see any information on this page about plotting and visualization techniques during training. Could you please provide the page again, or point to the specific section where this information is located? That context would help me follow the suggestion.
Emmanouil Tzorakoleftherakis on 31 Mar 2025 (edited 31 Mar 2025)
Updated the link above.
