How to train model-based reinforcement learning agents
Create and train model-based policy optimization (MBPO) agents. An MBPO agent uses neural networks to internally approximate the environment. Because the agent can reuse this internal model to generate additional experience, it can achieve greater sample efficiency than a typical model-free agent.
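To make the sample-efficiency idea concrete, here is a minimal sketch in plain Python. It is not the toolbox API and not a full MBPO implementation. It assumes a hypothetical one-dimensional environment (`true_step`) and, for brevity, fits a linear least-squares model in place of the neural networks an MBPO agent would use. All function names are illustrative assumptions.

```python
import random

def true_step(state, action):
    # Hypothetical toy environment (an assumption for illustration):
    # noiseless linear 1-D dynamics.
    return 0.9 * state + 0.5 * action

def collect_real_transitions(num_steps, rng):
    # Interact with the real environment using random actions.
    transitions = []
    state = 1.0
    for _ in range(num_steps):
        action = rng.uniform(-1.0, 1.0)
        next_state = true_step(state, action)
        transitions.append((state, action, next_state))
        state = next_state
    return transitions

def fit_linear_model(transitions):
    # Least-squares fit of next_state = w_s * state + w_a * action,
    # solved with the 2x2 normal equations. This stands in for the
    # neural network environment model (a simplifying assumption).
    ss = sum(s * s for s, a, n in transitions)
    sa = sum(s * a for s, a, n in transitions)
    aa = sum(a * a for s, a, n in transitions)
    sn = sum(s * n for s, a, n in transitions)
    an = sum(a * n for s, a, n in transitions)
    det = ss * aa - sa * sa
    w_s = (sn * aa - sa * an) / det
    w_a = (ss * an - sa * sn) / det
    return w_s, w_a

def generate_model_rollouts(model, start_states, horizon, rng):
    # Branch short rollouts from previously visited states using the
    # learned model only, with no further real environment steps.
    w_s, w_a = model
    synthetic = []
    for state in start_states:
        for _ in range(horizon):
            action = rng.uniform(-1.0, 1.0)
            next_state = w_s * state + w_a * action
            synthetic.append((state, action, next_state))
            state = next_state
    return synthetic

rng = random.Random(0)
real = collect_real_transitions(20, rng)
model = fit_linear_model(real)
start_states = [s for s, _, _ in real]
synthetic = generate_model_rollouts(model, start_states, horizon=5, rng=rng)

print("real transitions:", len(real))
print("synthetic transitions:", len(synthetic))
print("learned model (w_s, w_a):", model)
```

In this sketch, 20 real environment steps yield 100 model-generated transitions that a policy learner could also train on, which is the sense in which a reusable internal model can improve sample efficiency. A real agent would additionally have to cope with model error, for example by keeping rollouts short.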