How to train RL-DQN agent with varying environment?

Question

Praveen Kumar Nambisan T M 2021 年 6 月 23 日

0
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/863085-how-to-train-rl-dqn-agent-with-varying-environment

編集済み: Jillian Eunice Oliveros 2021 年 10 月 25 日

RL HEV.pdf

The question is related to reinforcement learning based energy management in hybrid electic vehicle (HEV). I am considering DQN-RL for this work. The actions are the control variable for the energy management system which controls the fuel-rate.

In this case, my environment is an HEV with particular driving profile (UDDS). The objective is to train the agent for the energy management system to achieve the final fuel target (desired fuel) at the end of the drivecycle. However, I want to train a single agent for multiple drive profile to achieve the same target in all the cases.

The problem formulation is similar to the paper: Reference paper

I could train the agent for one driving profile, how to train the same agent for multiple profiles?

Note: The reference paper could help to clarify the exact problem. They have trained the agent for 5 driving profile to achieve same desired SOC.

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

Answer 1

Emmanouil Tzorakoleftherakis 2021 年 6 月 24 日

2
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/863085-how-to-train-rl-dqn-agent-with-varying-environment#answer_732465

What you are describing is actually pretty standard process to create robust policies. To change the driving profiles, you can use the reset function in your MATLAB/Simulink environment definition.

A simple example is here (take a look at the Reset function at the bottom).

1 件のコメント
-1 件の古いコメントを表示-1 件の古いコメントを非表示

Jillian Eunice Oliveros 2021 年 10 月 25 日

編集済み: Jillian Eunice Oliveros 2021 年 10 月 25 日

@Emmanouil Tzorakoleftherakis Hello sir. I have a question regarding the Reinforcement Learning toolbox found at this link: https://www.mathworks.com/matlabcentral/answers/1570073-reinforcement-learning-toolbox-how-to-implement-markov-decision-process-mdp-environment-and-dqn. It would be great if you can take a look on it. Thanks!

サインインしてコメントする。

How to train RL-DQN agent with varying environment?

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

1 件のコメント
-1 件の古いコメントを表示-1 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

How to train RL-DQN agent with varying environment?

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

1 件のコメント -1 件の古いコメントを表示-1 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

1 件のコメント
-1 件の古いコメントを表示-1 件の古いコメントを非表示