DDPG Agent: Noise settings without any visible consequence

1 view (last 30 days)
Tobias Michl on 25 Jul 2022
Despite the noise settings listed below, my RL agent's output sticks to the action limits for many consecutive steps (hundreds to thousands).
My understanding of the sequence order is:
  1. The actor receives the observation as input.
  2. The actor's tanh output layer produces an action in the range [-1, 1].
  3. Noise is added to the actor output.
  4. The agent outputs the actor output plus the additive noise.
Did I get this wrong? What am I missing?
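For what it's worth, the four steps above can be sketched in plain Python as follows. This is a simplified illustration, not the actual toolbox internals; the linear actor weights W are a hypothetical stand-in for a trained network:

```python
import numpy as np

def actor(observation, W):
    # Steps 1-2: actor maps the observation through a tanh output layer to [-1, 1]
    return np.tanh(W @ observation)

def agent_step(observation, W, noise, lower=-1.0, upper=1.0):
    action = actor(observation, W)       # tanh output in [-1, 1]
    noisy = action + noise               # step 3: additive exploration noise
    return np.clip(noisy, lower, upper)  # step 4: clipped to the action limits

rng = np.random.default_rng(0)
W = rng.standard_normal((2, 4))           # 2 actions, 4 observations
obs = rng.standard_normal(4)
noise = 0.89443 * rng.standard_normal(2)  # std from the settings below
print(agent_step(obs, W, noise))
```

Note that with clipping in step 4, any noise sample large enough to push the sum past the limits produces a saturated action.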
I'm using:
  • DDPG Agent
  • actor output layer: tanh --> resulting action space: [-1, 1]
  • Agent sample time: Ts = 0.0005;
  • agentOptions.NoiseOptions.StandardDeviation = 0.89443;
  • actionInfo = rlNumericSpec([2 1], 'LowerLimit', [-1; -1], 'UpperLimit', [1; 1]);
  • used Ornstein-Uhlenbeck
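For reference, a common Euler discretization of an Ornstein-Uhlenbeck noise process with the settings above looks like the sketch below (plain Python, not the toolbox code). The mean-attraction constant of 0.15 is an assumed value here, so check your own agentOptions.NoiseOptions:

```python
import numpy as np

def ou_noise(n_steps, std=0.89443, ts=0.0005, theta=0.15, mu=0.0, seed=0):
    # Euler discretization of an Ornstein-Uhlenbeck process:
    #   x[k+1] = x[k] + theta*(mu - x[k])*ts + std*sqrt(ts)*N(0, 1)
    # The long-run (stationary) standard deviation is roughly
    # std / sqrt(2*theta), independent of the sample time ts.
    rng = np.random.default_rng(seed)
    x = np.zeros(n_steps)
    for k in range(n_steps - 1):
        x[k + 1] = (x[k] + theta * (mu - x[k]) * ts
                    + std * np.sqrt(ts) * rng.standard_normal())
    return x

x = ou_noise(200_000)  # 100 s of simulated time at ts = 0.0005
print(f"empirical noise std over the run: {x.std():.2f}")
print(f"theoretical stationary std: {0.89443 / np.sqrt(2 * 0.15):.2f}")
```

Under these assumptions the noise does not stay small: although each step's increment is scaled by sqrt(ts), the process accumulates toward a stationary spread well above 1, which is large relative to a [-1, 1] action range.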
Besides, if I set rlDDPGAgent('UseExplorationPolicy', true), does the agent then use a Gaussian distribution instead of the Ornstein-Uhlenbeck process?

1 Answer

Emmanouil Tzorakoleftherakis on 26 Jan 2023
Your standard deviation is very high compared to the action range you have set ([-1, 1]). As a result, when the noise is added to the tanh output, the sum almost always falls outside those limits and gets clipped to them. I would use a smaller standard deviation.
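A quick numerical check of this effect (a Python sketch using plain Gaussian noise for simplicity, just to show the scale of the problem): with a standard deviation of 0.89443, a large share of noisy actions falls outside [-1, 1] and is clipped, while a smaller value rarely saturates.

```python
import numpy as np

rng = np.random.default_rng(1)
# Plausible tanh-layer outputs (a stand-in for a trained actor's actions)
actions = np.tanh(rng.standard_normal(100_000))

clip_frac = {}
for std in (0.89443, 0.1):
    noisy = actions + std * rng.standard_normal(actions.size)
    clip_frac[std] = np.mean((noisy <= -1.0) | (noisy >= 1.0))
    print(f"std = {std}: {clip_frac[std]:.1%} of noisy actions "
          f"hit the [-1, 1] limits")
```

A rule of thumb sometimes suggested is to pick the noise so that its effective spread is a modest fraction of the action range, rather than comparable to the full range.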
