Deep Q-network reinforcement learning

Question

Using the RL agent provided by MATLAB, how can I change the action set after each step or episode? I need this because, in my problem, an action cannot be repeated.

Answer 1

Emmanouil Tzorakoleftherakis, 15 September 2020

Hello,

The functionality to customize the action space is not yet available. A couple of workarounds:

1) Add a penalty to the reward signal every time a repeated action is selected. Make sure to include the previously selected action in the observation, so the agent has the information it needs to avoid repeating it. This may work, but if the number of possible actions is small, it may interfere with exploration.

2) Use a custom agent following the template guidelines here and here. You can subclass the provided DQN agent and set exploration and action selection as needed for your application.
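
To make the two workarounds concrete, here is a minimal, library-neutral sketch in Python. It is not Reinforcement Learning Toolbox code and does not use any MATLAB API; the names `shaped_reward`, `masked_epsilon_greedy`, and `REPEAT_PENALTY` are hypothetical and chosen only for illustration. The first function shows the reward penalty from option 1, and the second shows one way a custom agent's action selection (option 2) could skip actions that have already been used.

```python
import random
from typing import Optional, Sequence, Set

REPEAT_PENALTY = 1.0  # hypothetical penalty size; would need tuning per problem


def shaped_reward(base_reward: float, action: int,
                  previous_action: Optional[int]) -> float:
    """Option 1: subtract a penalty when the agent repeats its last action."""
    if previous_action is not None and action == previous_action:
        return base_reward - REPEAT_PENALTY
    return base_reward


def masked_epsilon_greedy(q_values: Sequence[float], used: Set[int],
                          epsilon: float, rng: random.Random) -> int:
    """Option 2: epsilon-greedy selection restricted to unused actions."""
    allowed = [a for a in range(len(q_values)) if a not in used]
    if not allowed:
        raise ValueError("no actions left to choose from")
    if rng.random() < epsilon:
        # Explore, but only among the actions that are still allowed.
        return rng.choice(allowed)
    # Exploit: pick the allowed action with the highest Q-value.
    return max(allowed, key=lambda a: q_values[a])


# Usage: with epsilon = 0 the choice is purely greedy over the allowed set.
rng = random.Random(0)
q = [0.2, 0.9, 0.5]
used: Set[int] = set()
first = masked_epsilon_greedy(q, used, epsilon=0.0, rng=rng)
used.add(first)
second = masked_epsilon_greedy(q, used, epsilon=0.0, rng=rng)
```

If actions are masked like this, a full implementation would probably also need to apply the same mask when taking the maximum over next-state Q-values in the learning target; that part is not shown here.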

2 Comments

Zhengcheng Dong, 18 September 2020

Thank you for your answer and helpful suggestions. We tried your first method, and it seems to converge very slowly. We are now trying to create a custom agent. Thank you again.

Emmanouil Tzorakoleftherakis, 18 September 2020

Happy to help. Here is another page that just went live with the R2020b release that should be helpful.
