number of look ahead steps in DDPG Agent Options

1 回表示 (過去 30 日間)

ALOK RANJAN SWAIN 2020 年 2 月 21 日

0
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/506744-number-of-look-ahead-steps-in-ddpg-agent-options

コメント済み: Dingshan Sun 2022 年 9 月 1 日

I want to know how does the parameter "NumStepsToLookAhead" in rlDDPGAgentOptions from reinforcement learning toolboxof matlab 2019b works?

Whether the look ahead is done on target networks? (like modification in critic objective, from {r+gamma*Qt - Q} to {r+ sum(gamma**i*Qt) -Q}
Or the look ahead is done on reward sampling itself? ( like changing reward "r" from each sample to "r+gamma*r_t+gamma**2*r_t+1+...

Any help is highly appreciated.

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

回答 (1 件)

Anh Tran 2020 年 3 月 1 日

1
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/506744-number-of-look-ahead-steps-in-ddpg-agent-options#answer_417996

I am not sure what does reward sampling mean. "NumStepsToLookAhead" in rlDDPGAgentOptions changes the critic's target values in step 5 of DDPG training algorithm.

Assume g is the discount factor, the critic target will be as followed

4 件のコメント
2 件の古いコメントを表示2 件の古いコメントを非表示

ALOK RANJAN SWAIN 2020 年 3 月 4 日

Thanks for your help.??

Dingshan Sun 2022 年 9 月 1 日

Could you give a hint how R_t,R_t_1,,R_t+2,...,R_t+n-1 can be obtained in an online off-policy algorithm? Especially for DRL methods that use an experience replay?

サインインしてコメントする。

サインインしてこの質問に回答する。

カテゴリ

Control Systems Reinforcement Learning Toolbox Environments

Help Center および File Exchange で Environments についてさらに検索

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by

number of look ahead steps in DDPG Agent Options

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

回答 (1 件)

4 件のコメント
2 件の古いコメントを表示2 件の古いコメントを非表示

参考

カテゴリ

タグ

Community Treasure Hunt

number of look ahead steps in DDPG Agent Options

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

回答 (1 件)

4 件のコメント 2 件の古いコメントを表示2 件の古いコメントを非表示

参考

カテゴリ

タグ

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

4 件のコメント
2 件の古いコメントを表示2 件の古いコメントを非表示