The question is related to reinforcement learning based energy management in hybrid electic vehicle (HEV). I am considering DQN-RL for this work. The actions are the control variable for the energy management system which controls the fuel-rate.
In this case, my environment is an HEV with particular driving profile (UDDS). The objective is to train the agent for the energy management system to achieve the final fuel target (desired fuel) at the end of the drivecycle. However, I want to train a single agent for multiple drive profile to achieve the same target in all the cases.
I could train the agent for one driving profile, how to train the same agent for multiple profiles?
Note: The reference paper could help to clarify the exact problem. They have trained the agent for 5 driving profile to achieve same desired SOC.