フィルターのクリア

Custom environment in Deep reinforcement learning

3 ビュー (過去 30 日間)
Bha Pr
Bha Pr 2020 年 4 月 1 日
コメント済み: SULAKSHNA DEVI 2020 年 5 月 13 日
I am currently trying to buid to a custom environment for the implementation of deep reinforcement learning. My considered environment has 4 states low, med, high, severe represented by 1,2,3,4 respectively and the actions to be taken are 1,2,3 and rewards are decided on the basis of context like temperature, pressure,humidity which varies with time. So how i can define my reward that changes with time in mystepfunction?

回答 (1 件)

Ari Biswas
Ari Biswas 2020 年 4 月 20 日
One way to solve this is by introducing a property to keep track of elapsed time in your custom MATLAB environment. You can use this property to compute rewards and increment this as needed in the step function.
  1 件のコメント
SULAKSHNA DEVI
SULAKSHNA DEVI 2020 年 5 月 13 日
The property here refers to function. Can you please provide explanation on this

サインインしてコメントする。

カテゴリ

Help Center および File ExchangeEnvironments についてさらに検索

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by