環境

強化学習環境のダイナミクスと出力をモデル化する

強化学習のシナリオでは、エージェントがやり取りをする世界を環境によってモデル化します。

Reinforcement Learning Toolbox™ は、さまざまなベンチマーク環境を実装する事前定義済みオブジェクトを提供します。環境ダイナミクスのカスタム関数を使用したり、既存の環境テンプレートクラスを変更したり、Simulink^® モデルを使用したりして、独自の環境を作成することもできます。

強化学習環境の概要については、Reinforcement Learning Environmentsを参照してください。

関数

すべて展開する

環境インターフェイス

`rlFiniteSetSpec`	有限集合アクションまたは観測チャネルのための仕様オブジェクトの作成
`rlNumericSpec`	数値アクションまたは観測チャネルのための仕様オブジェクトの作成
`getActionInfo`	Obtain action data specifications from reinforcement learning environment, agent, or experience buffer
`getObservationInfo`	Obtain observation data specifications from reinforcement learning environment, agent, or experience buffer
`validateEnvironment`	Validate custom reinforcement learning environment
`bus2RLSpec`	Create reinforcement learning data specifications for elements of a Simulink bus

グリッドワールド環境と MDP 環境

`createGridWorld`	強化学習用の 2 次元グリッドワールドの作成
`createMDP`	マルコフ決定過程モデルの作成
`rlMDPEnv`	強化学習のためのマルコフ決定過程環境の作成

事前定義済みの環境

rlPredefinedEnv 事前定義済みの強化学習環境の作成

報酬の計算

`generateRewardFunction`	Generate a reward function from control specifications to train a reinforcement learning agent (R2021b 以降)
`exteriorPenalty`	Exterior penalty value for a point with respect to a bounded region (R2021b 以降)
`hyperbolicPenalty`	Hyperbolic penalty value for a point with respect to a bounded region (R2021b 以降)
`barrierPenalty`	Logarithmic barrier penalty value for a point with respect to a bounded region (R2021b 以降)

カスタム環境

`rlFunctionEnv`	リセット関数とステップ関数を使用したカスタム強化学習環境の作成
`rlMultiAgentFunctionEnv`	Create custom multiagent reinforcement learning environment (R2023b 以降)
`rlTurnBasedFunctionEnv`	Create custom turn-based multiagent reinforcement learning environment (R2023b 以降)
`rlCreateEnvTemplate`	カスタム強化学習環境テンプレートの作成
`rlSimulinkEnv`	既にエージェントと環境を含む Simulink モデルからの環境オブジェクトの作成
`createIntegratedEnv`	Create environment object from a Simulink environment model that does not contain an agent block
`SimulinkEnvWithAgent`	Reinforcement learning environment with a dynamic model implemented in Simulink
`bus2RLSpec`	Create reinforcement learning data specifications for elements of a Simulink bus
`validateEnvironment`	Validate custom reinforcement learning environment

ニューラルネットワーク環境

`rlNeuralNetworkEnvironment`	Environment model with deep neural network transition models (R2022a 以降)
`rlContinuousDeterministicTransitionFunction`	Deterministic transition function approximator object for neural network-based environment (R2022a 以降)
`rlContinuousGaussianTransitionFunction`	Stochastic Gaussian transition function approximator object for neural network-based environment (R2022a 以降)
`rlContinuousDeterministicRewardFunction`	Deterministic reward function approximator object for neural network-based environment (R2022a 以降)
`rlContinuousGaussianRewardFunction`	Stochastic Gaussian reward function approximator object for neural network-based environment (R2022a 以降)
`rlIsDoneFunction`	Is-done function approximator object for neural network-based environment (R2022a 以降)
`predict`	Predict next observation, next reward, or episode termination given observation and action input data (R2022a 以降)
`evaluate`	Evaluate function approximator object given observation (or observation-action) input data (R2022a 以降)
`accelerate`	(Not recommended) Option to accelerate computation of gradient for approximator object based on neural network (R2022a 以降)

環境の設定、リセット、クリーンアップ

`reset`	Reset environment, agent, experience buffer, or policy object (R2022a 以降)
`setup`	Set up reinforcement learning environment or initialize data logger object (R2022a 以降)
`cleanup`	Clean up reinforcement learning environment or data logger object (R2022a 以降)

ブロック

RL Agent

強化学習エージェント

トピック

強化学習環境の概要

Reinforcement Learning Environments
Model environment dynamics using a MATLAB^® object that generates rewards and observations in response to agents actions.

グリッドワールド環境

Load Predefined Grid World Environments
Load grid world environments in which the actions, observations, and rewards are already defined.
Create Custom Grid World Environments
Create custom grid world environments by defining your own grid size, rewards and obstacles.

事前定義済み制御システム環境

Load Predefined Control System Environments
Load predefined environments used as benchmarks for control systems design.

カスタム MATLAB 環境

Define Reward and Observation Signals in Custom Environments
Create a reward signal that measures how successfully the agent actions are achieving a goal.
ステップ関数とリセット関数を使用したカスタム環境の作成
カスタムのステップ関数とリセット関数を提供して、強化学習環境を作成する。
Create Custom Environment from Class Template
Create a custom reinforcement learning environment by modifying a template environment class.

カスタム Simulink 環境

Define Reward and Observation Signals in Custom Environments
Create a reward signal that measures how successfully the agent actions are achieving a goal.
Create Custom Simulink Environments
Create a custom environment using a Simulink model that generates rewards and observations in response to agents actions.
Create and Simulate the Same Environment in both MATLAB and Simulink
Understand differences between reinforcement learning loops implemented in MATLAB and Simulink.
貯水タンクの強化学習環境モデル
タンクの水位コントローラーの代わりに、RL Agent ブロックを含む強化学習 Simulink 環境を作成する。

強化学習デザイナーでの環境の読み込み

強化学習デザイナーでの MATLAB 環境の読み込み
強化学習デザイナーアプリで MATLAB 環境を読み込む。
Load Simulink Environments in Reinforcement Learning Designer
Load a Simulink environment in the reinforcement designer app.

環境

関数