Customized Action Selection in RL DQN

ches on 11 Jan 2021
Edited: ches on 20 Jan 2021
Hi,
I would like to ask if the latest Reinforcement Learning (RL) toolbox version supports customized action selection.
I'm currently using a DQN agent, and the action at each time step is selected following the epsilon-greedy algorithm. However, I would like to feed probabilities into the action selection so that certain actions are more likely to be chosen. Is this possible using the RL Toolbox?
Thank you!

Answers (1)

Emmanouil Tzorakoleftherakis on 16 Jan 2021
Edited: Emmanouil Tzorakoleftherakis on 16 Jan 2021
Hello,
I believe this is not possible yet. A potential workaround (although not state dependent) would be to emulate a probability distribution by listing the actions you want to favor multiple times when creating your action space with rlFiniteSetSpec, but I haven't tested that. So something like:
actInfo = rlFiniteSetSpec([-2 0 2 2 2])
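A minimal sketch of that workaround, assuming rlFiniteSetSpec accepts repeated elements (untested, as noted above). During the random branch of epsilon-greedy, a uniform draw over the five entries would pick 2 with probability 3/5 and -2, 0 with 1/5 each; the greedy branch is unaffected. The observation spec and the default-agent constructor call below are illustrative assumptions:
obsInfo = rlNumericSpec([4 1]);          % placeholder observation spec (assumption)
actInfo = rlFiniteSetSpec([-2 0 2 2 2]); % duplicated entries bias random exploration toward 2
agent = rlDQNAgent(obsInfo, actInfo);    % DQN agent with default networks (assumed available in this release)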
1 Comment
ches on 20 Jan 2021
Edited: ches on 20 Jan 2021
Hello,
Thank you for the information.
I'm currently trying to improve exploration during training, so I'm looking for ways to do that besides adjusting the epsilon parameters of the epsilon-greedy algorithm.
In line with that, may I also ask if the following are possible in the latest RL Toolbox?
- Setting optimistic initial values
- Other exploration strategies (such as Boltzmann; see the sketch after this comment)
Thanks!
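For reference, Boltzmann (softmax) exploration does not appear to be a built-in option for the DQN agent in this release, but the idea is simple to sketch standalone. The Q-values and temperature below are made-up illustrative values, not anything taken from the toolbox:
q = [1.2 0.4 2.0];                   % hypothetical Q-values for three actions
tau = 0.5;                           % temperature: lower values approach greedy selection
p = exp(q/tau) ./ sum(exp(q/tau));   % softmax probabilities over the actions
a = find(rand <= cumsum(p), 1);      % sample an action index from p
A higher temperature flattens p toward a uniform distribution, while tau approaching zero recovers greedy selection.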


Release

R2020b
