The definition of the Target update frequency in Reinforcement Learning Designer.
A DDPG agent has four networks: the online policy, the target policy, the online Q, and the target Q.
In Reinforcement Learning Designer, the [Target update frequency] setting applies to the target policy and the target Q.
Is the update frequency of the online policy and the online Q the same as the [Target update frequency]?
Accepted Answer
UDAYA PEDDIRAJU
March 12, 2024
Hi Xian,
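No, the update frequency of the online policy and online Q networks is not the same as the Target Update Frequency. In DDPG, the online networks are updated by gradient steps on sampled mini-batches at each learning step. The Target Update Frequency applies only to the target policy and target Q networks: it sets how many learning steps pass between target updates, and each target update moves the target weights toward the current online weights, either fully or partially depending on the smoothing factor. Updating the targets more slowly than the online networks helps keep learning stable.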
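To make the difference between the two schedules concrete, here is a minimal Python sketch. It is not the toolbox's implementation: scalars stand in for network weights, the gradient step is a placeholder, and the names `simulate_updates`, `target_update_frequency`, and `smooth_factor` are hypothetical, chosen only for illustration. It assumes the online networks are updated at every learning step and the targets only every `target_update_frequency` steps.

```python
def simulate_updates(num_steps, target_update_frequency, smooth_factor):
    """Count online and target updates over num_steps learning steps.

    Scalars stand in for network weights. All names here are illustrative
    assumptions, not part of any toolbox API.
    """
    online = 0.0  # stands in for the online policy / online Q weights
    target = 0.0  # stands in for the target policy / target Q weights
    online_updates = 0
    target_updates = 0

    for step in range(1, num_steps + 1):
        # Online networks: updated at every learning step.
        # Adding 1.0 is a placeholder for a real gradient step.
        online += 1.0
        online_updates += 1

        # Target networks: updated only every target_update_frequency steps,
        # by blending the target weights toward the online weights.
        if step % target_update_frequency == 0:
            target = smooth_factor * online + (1.0 - smooth_factor) * target
            target_updates += 1

    return online_updates, target_updates, online, target


if __name__ == "__main__":
    # Periodic hard copy: targets refreshed every 4 steps.
    print(simulate_updates(12, 4, 1.0))
    # Smoothed update at every step: targets lag behind the online weights.
    print(simulate_updates(3, 1, 0.5))
```

With a frequency of 4 and a smooth factor of 1, the online weights change 12 times in 12 steps while the targets are copied only 3 times. With a frequency of 1 and a smooth factor below 1, the targets are touched every step but still trail the online weights, which is the other common way of keeping them slow-moving.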