Scaling layer usage for action output


Yihao Wan on 9 June 2023


https://jp.mathworks.com/matlabcentral/answers/1980699-scaling-layer-usage-for-action-output

Commented: Yihao Wan on 20 June 2023

Hello, I am using a tanhLayer as the output activation of the actor network, and my action space is [0, 10]. To map the tanh output onto that range, I followed this answer and added a scalingLayer after it.

However, the action values I get are saturated. Looking at that answer again, shouldn't the scaling layer be:

scalingLayer('Scale',(actionInfo.UpperLimit-actionInfo.LowerLimit)/2,'Bias',(actionInfo.UpperLimit+actionInfo.LowerLimit)/2)
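As a quick check of that expression, here is a short derivation. It assumes scalingLayer applies y = Scale.*x + Bias elementwise to the tanh output x in [-1, 1]. The symbols s (scale), b (bias), L (lower limit) and U (upper limit) are shorthand introduced only for this sketch. Requiring the two endpoints of the tanh range to land on the action limits gives:

```latex
\begin{aligned}
s\cdot(-1) + b &= L, \qquad s\cdot(+1) + b = U \\
\Rightarrow\quad s &= \frac{U - L}{2}, \qquad b = \frac{U + L}{2}
\end{aligned}
```

For L = 0 and U = 10 this gives s = 5 and b = 5.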

Thanks for your help.

Here is the code:

% numObservations and obsInfo are assumed to be defined earlier (not shown here)
numActions = 2;
actInfo = rlNumericSpec([numActions 1],'LowerLimit',0,'UpperLimit',10);
actorNetwork = [
    featureInputLayer(numObservations,'Normalization','none','Name','State')
    fullyConnectedLayer(32, 'Name','actorFC1')
    reluLayer('Name','relu1')
    fullyConnectedLayer(16, 'Name','actorFC2')
    reluLayer('Name','relu2')
    fullyConnectedLayer(numActions,'Name','Action')
    tanhLayer('Name','tanh3')
    scalingLayer('Scale',actInfo.UpperLimit-actInfo.LowerLimit,'Bias',(actInfo.UpperLimit-actInfo.LowerLimit)/2)
    ];
actordlNet = dlnetwork(actorNetwork);
actor = rlContinuousDeterministicActor(actordlNet,obsInfo,actInfo);
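
To see what the two parameter choices do to the output range, the following is a minimal sketch in plain MATLAB arithmetic (no toolbox calls). It assumes scalingLayer computes Scale.*x + Bias on the tanh output, and the variable names are illustrative only, not part of the original code.

```matlab
% Plain-arithmetic sketch; assumes scalingLayer computes y = Scale.*x + Bias
% on the tanh output x, which lies in [-1, 1].
lowerLimit = 0;        % actInfo.LowerLimit in the code above
upperLimit = 10;       % actInfo.UpperLimit in the code above
tanhRange  = [-1 1];   % extreme values the tanh output can approach

% Scale and bias as written in the actor network above
scaleUsed = upperLimit - lowerLimit;        % 10
biasUsed  = (upperLimit - lowerLimit)/2;    % 5
rangeUsed = scaleUsed .* tanhRange + biasUsed;

% Scale and bias from the proposed expression
scaleProposed = (upperLimit - lowerLimit)/2;   % 5
biasProposed  = (upperLimit + lowerLimit)/2;   % 5
rangeProposed = scaleProposed .* tanhRange + biasProposed;

fprintf('As written: [%g, %g]\n', rangeUsed);      % As written: [-5, 15]
fprintf('Proposed:   [%g, %g]\n', rangeProposed);  % Proposed:   [0, 10]
```

Under that assumption, the layer as written can produce values anywhere in [-5, 15], so a sizeable part of its output lies outside the [0, 10] action space. That may be why the actions end up pinned at the limits, while the proposed parameters map the tanh range exactly onto [0, 10].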