Soft Actor Critic deploy mean path only

Question

Tech Logg Ding, May 6, 2021

Direct link to this question:

https://jp.mathworks.com/matlabcentral/answers/823645-soft-actor-critic-deploy-mean-path-only

Edited: Tech Logg Ding, May 13, 2021

Hi, I'm wondering whether there is a way to deploy only the mean path of a SAC agent after it has been trained. This would give more stable actions once training is finished, since the deployed policy would no longer sample around the mean.

Should I extract the network weights manually, build a new network, and then keep only the output path for the mean?


Answer 1 (Accepted Answer)

Emmanouil Tzorakoleftherakis, May 13, 2021

Direct link to this answer:

https://jp.mathworks.com/matlabcentral/answers/823645-soft-actor-critic-deploy-mean-path-only#answer_698953

Hello,

Please take a look at this option here, which was added in R2021a to allow exactly the behavior you mentioned.

Hope this helps.
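
As a rough illustration of what a deterministic-deployment switch of this kind does, here is a minimal Python sketch. It is not the Reinforcement Learning Toolbox API: the class `ToyGaussianPolicy`, its weights, and the `deterministic` flag are all hypothetical, and the tanh squashing is assumed as the usual SAC convention. With the flag off, the action is sampled around the mean; with it on, only the mean path is used.

```python
import math
import random


class ToyGaussianPolicy:
    """Hypothetical 1-D SAC-style actor with a mean path and a std path."""

    def __init__(self, w_mean=0.8, w_log_std=-0.5, seed=0):
        self.w_mean = w_mean        # weight of the mean path (made up)
        self.w_log_std = w_log_std  # weight of the log-std path (made up)
        self.rng = random.Random(seed)

    def act(self, obs, deterministic=False):
        mean = self.w_mean * obs
        if deterministic:
            # Deployment: use only the mean path, no sampling.
            return math.tanh(mean)
        # Training-style exploration: sample around the mean, then squash.
        std = math.exp(self.w_log_std * abs(obs))
        return math.tanh(mean + std * self.rng.gauss(0.0, 1.0))


policy = ToyGaussianPolicy()
obs = 0.5
sampled = [policy.act(obs) for _ in range(5)]
mean_only = [policy.act(obs, deterministic=True) for _ in range(5)]
print("sampled:  ", sampled)
print("mean-only:", mean_only)
```

The sampled actions differ from call to call for the same observation, while the mean-only actions are identical, which is the stability the question is after.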

Tech Logg Ding, May 13, 2021

Edited: Tech Logg Ding, May 13, 2021

Thank you for the reply. That setting works. I've also tried the roundabout way of extracting the actor neural network and modifying it so that it has only the mean path. I then deploy that actor network in the simulation to act as a controller. Both methods work!
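
The roundabout route described above can be sketched in plain Python, assuming a simple actor layout: one shared layer feeding a mean head and a standard-deviation head. The weights, the layer names, and the helper `extract_mean_path` are hypothetical and only stand in for whatever the trained actor actually contains; the tanh squashing is likewise an assumption.

```python
import math

# Hypothetical trained actor weights: a shared layer feeding two output paths.
actor = {
    "shared": {"w": [0.5, -0.3], "b": 0.1},  # 2-D observation -> 1-D hidden
    "mean": {"w": 1.2, "b": 0.0},            # hidden -> action mean
    "std": {"w": 0.7, "b": -1.0},            # hidden -> std (before softplus)
}


def shared_forward(params, obs):
    """Shared layer: weighted sum followed by ReLU."""
    z = sum(w * x for w, x in zip(params["w"], obs)) + params["b"]
    return max(z, 0.0)


def full_actor(actor, obs):
    """Original two-headed actor: returns (mean, std)."""
    h = shared_forward(actor["shared"], obs)
    mean = actor["mean"]["w"] * h + actor["mean"]["b"]
    std = math.log1p(math.exp(actor["std"]["w"] * h + actor["std"]["b"]))
    return mean, std


def extract_mean_path(actor):
    """Keep only the layers on the mean path; return a deterministic controller."""
    kept = {"shared": dict(actor["shared"]), "mean": dict(actor["mean"])}

    def controller(obs):
        h = shared_forward(kept["shared"], obs)
        return math.tanh(kept["mean"]["w"] * h + kept["mean"]["b"])

    return controller


controller = extract_mean_path(actor)
obs = [1.0, 0.5]
mean, std = full_actor(actor, obs)
print("controller action:  ", controller(obs))
print("tanh of actor mean: ", math.tanh(mean))
```

The extracted controller never touches the standard-deviation head, so its output matches the squashed mean of the full actor for the same observation.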
