question about external action of DDPG

3 ビュー (過去 30 日間)

古いコメントを表示

Zicheng 2023 年 9 月 7 日

0
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/2018076-question-about-external-action-of-ddpg

回答済み: Emmanouil Tzorakoleftherakis 2023 年 9 月 25 日

採用された回答: Emmanouil Tzorakoleftherakis

Is anyone know the loss function of the Q-network when I set external action=1 during training process?(DDPG)

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

採用された回答

Emmanouil Tzorakoleftherakis 2023 年 9 月 25 日

0
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/2018076-question-about-external-action-of-ddpg#answer_1318002

The loss function does not change. What happens is that the experience buffer is populated with the action from the external signal and the respective observations/reward.

Help Center および File Exchange で Deep Learning Toolbox についてさらに検索

製品

Reinforcement Learning Toolbox

リリース

R2021b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by

question about external action of DDPG

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

question about external action of DDPG

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示