Reinforcement Learning: Multiple Unique Discrete Actions

Question

Huzaifah Shamim 2020 年 7 月 27 日

0
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/571297-reinforcement-learning-multiple-unique-discrete-actions

コメント済み: Huzaifah Shamim 2020 年 9 月 7 日

THis question has been answered before here, but I wanted to make a variation of this. Lets say we wanted to make the following modification. Once one action value has been selected, the second possible action value can not be the same. So if we had:

a = [1,2,3,4,5,6,7,8,9, 10];
b = [1,2,3,4,5,6,7,8,9, 10];
[A,B]   = meshgrid(a,b);
actions = reshape(cat(2,A',B'),[],2);
actionInfo = num2cell(actions,2);

I basically want to get rid of the cell arrays that have the same values such as [1,1], [2,2], [3,3].....[10,10]. Is there an easy way to do this?

Clarification: But I do want to have values such as [1,2] and [2,1]

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

Answer 1

Uday Pradhan 2020 年 9 月 7 日

1
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/571297-reinforcement-learning-multiple-unique-discrete-actions#answer_490864

MATLAB Online で開く

Hi Huzaifa,

According to my understanding, you would like to remove the action pairs which have the same values like [1 1],[2 2],...[10 10]. To do this, you can create a separate action matrix which do not have such arrays at all (10 in total to be excluded).

newActions = actions(actions(:,1)~=actions(:,2),:);
actionInfo = num2cell(newActions,2);

This will give you a cell array which do not contain pairs of type [x x]. Hope this helps!

1 件のコメント
-1 件の古いコメントを表示-1 件の古いコメントを非表示

Huzaifah Shamim 2020 年 9 月 7 日

Thank you so much!

サインインしてコメントする。

Reinforcement Learning: Multiple Unique Discrete Actions

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

1 件のコメント
-1 件の古いコメントを表示-1 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

Reinforcement Learning: Multiple Unique Discrete Actions

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

1 件のコメント -1 件の古いコメントを表示-1 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

1 件のコメント
-1 件の古いコメントを表示-1 件の古いコメントを非表示