How can I access the action output of the actor network in DDPG during training?

Maha Mosalam on 2 Dec 2021
Answered: Yash on 24 Dec 2024
I want to access the action output of the actor network in DDPG during training, because I want to replace it with an action optimized by a separate function (a force function) in order to accelerate training and improve the actor's learning efficiency. Any help with this would be appreciated.

Answers (1)

Yash on 24 Dec 2024
You can use the getAction function, which returns the action from an agent, actor, or policy object given environment observations. You can write a custom loss function that directly uses getAction and dlgradient within it, and then use dlfeval and dlaccelerate with your custom loss function. For examples, see Train Reinforcement Learning Policy Using Custom Training Loop and Custom Training Loop with Simulink Action Noise.
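
For illustration, here is a minimal sketch of both ideas. It assumes agent is an rlDDPGAgent and obsInfo is its observation specification; numObs, numAct, batchSize, learnRate, and actorLoss are hypothetical names, and the random refBatch stands in for the action produced by your separate optimization routine:

% Sketch 1: query the actor's current action for an observation.
actor = getActor(agent);              % extract the actor from the DDPG agent
obs   = {rand(obsInfo.Dimension)};    % placeholder observation
act   = getAction(actor, obs);        % cell array holding the action

% Sketch 2: gradient of a custom loss on the actor's dlnetwork.
actorNet = getModel(actor);                          % actor's underlying dlnetwork
obsBatch = dlarray(rand(numObs, batchSize), 'CB');   % batch of observations
refBatch = dlarray(rand(numAct, batchSize), 'CB');   % externally optimized actions
accFcn   = dlaccelerate(@actorLoss);                 % accelerate the custom loss
[loss, grad] = dlfeval(accFcn, actorNet, obsBatch, refBatch);
actorNet = dlupdate(@(w,g) w - learnRate*g, actorNet, grad);  % simple gradient step
actor    = setModel(actor, actorNet);                % push updated weights back

function [loss, grad] = actorLoss(net, obsBatch, refBatch)
    actBatch = forward(net, obsBatch);        % actor's action for the batch
    loss = mse(actBatch, refBatch);           % pull output toward the optimized action
    grad = dlgradient(loss, net.Learnables);  % gradients w.r.t. actor weights
end

In practice you would replace refBatch with the output of your force/optimization function at each iteration, and the plain gradient step with an optimizer update such as adamupdate.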
