How do I define a continuous reward function for an RL environment?

Prashanth Chivkula

5 Oct 2020

Updated 12 Oct 2020

I am trying to follow the double integrator example to define a continuous reward function. When I used the custom environment template and defined the reward using the QR cost function, I got an error stating that the reward must be a scalar value. Where can I find the reward property and change it so that it accepts vector values?

3 Comments
Prashanth Chivkula on 12 Oct 2020

Yes, I did that, thank you. Just to confirm: the output of the cost function will always be a scalar value, right? So in the continuous double integrator example there are two states, but the reward output at each step is a scalar value, right?

Emmanouil Tzorakoleftherakis on 12 Oct 2020

That's right
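To illustrate the point confirmed above, here is a minimal sketch of a QR-style cost evaluated for a two-state system. The weights `Q` and `R`, the state `x`, and the action `u` are made-up values for illustration, not the ones used in the double integrator example. The element-wise variant is only a guess at one way a vector-valued reward can arise, since the original code from the question is not shown.

```matlab
% Hypothetical values for illustration; not taken from the toolbox example.
Q = diag([10 1]);   % 2-by-2 state weight (position, velocity)
R = 0.01;           % 1-by-1 action weight (single force input)

x = [0.5; -0.2];    % state as a 2-by-1 column vector
u = 1.5;            % scalar action

% The quadratic forms x'*Q*x and u'*R*u are each 1-by-1,
% so the reward is a scalar even though there are two states.
reward = -(x'*Q*x + u'*R*u);

% By contrast, an element-wise product keeps one entry per state.
% This is 2-by-1 and would not be a valid reward.
perState = -(diag(Q) .* x.^2);

disp(size(reward))    % 1 1
disp(size(perState))  % 2 1
```

If the state is stored as a row vector, the transposes need to be swapped (`x*Q*x'`) to keep the result 1-by-1.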

Accepted Answer

Priysha LNU on 8 Oct 2020

Here is an excerpt from the documentation:

To guide the learning process, reinforcement learning uses a scalar reward signal generated from the environment.

For detailed information on defining reward signals, including discrete and continuous rewards, please refer to this documentation link.
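As a rough sketch of the distinction between continuous and discrete rewards, the snippet below defines one of each as an anonymous function and sums them. All weights, thresholds, and bonus values are hypothetical and chosen only for illustration. Either way, the value returned at each step is a single scalar.

```matlab
% Hypothetical weights and thresholds, chosen only for illustration.
Q   = diag([10 1]);  % state weight
R   = 0.01;          % action weight
tol = 0.05;          % "close enough to the origin" threshold
lim = 5;             % position limit

% Continuous reward: varies smoothly with the state x and action u.
continuousReward = @(x,u) -(x'*Q*x + u'*R*u);

% Discrete reward: a fixed bonus or penalty triggered by an event.
discreteReward = @(x) 10*(norm(x) < tol) - 1*(abs(x(1)) > lim);

x = [0.5; -0.2];
u = 1.5;

% The two can be combined; the sum is still a scalar.
r = continuousReward(x,u) + discreteReward(x);
disp(isscalar(r))
```

In a custom environment, an expression like this would sit inside the step function, where the reward is computed from the current state and action.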

Products

Reinforcement Learning Toolbox

Release

R2020a
