question about external action of DDPG

1 回表示 (過去 30 日間)
Zicheng
Zicheng 2023 年 9 月 7 日
Is anyone know the loss function of the Q-network when I set external action=1 during training process?(DDPG)

採用された回答

Emmanouil Tzorakoleftherakis
Emmanouil Tzorakoleftherakis 2023 年 9 月 25 日
The loss function does not change. What happens is that the experience buffer is populated with the action from the external signal and the respective observations/reward.

その他の回答 (0 件)

カテゴリ

Help Center および File ExchangeReinforcement Learning についてさらに検索

製品


リリース

R2021b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by