Deployment of a RL Agent

Question

Matt 2021 年 7 月 2 日

1
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/870608-deployment-of-a-rl-agent

回答済み: Hariprasad Ravishankar 2022 年 12 月 26 日

Hello!

I'm using Matlab 2021a to design a reinforcement learning agent. I am in the process of training a series of agents and I am (hopefully) nearing deployment. However, I have a few questions that I have not been able to find direct answers to.

First, can I deploy a RL agent in Simulink Real-Time? I have a Speedgoat available and I have done significant work to close the hardware loop around my agent in Matlab, but I haven't seen anything that says in a straightforward manner that I can deploy it in Simulink. I saw here that you can deploy CUDA code in Simulink but I don't really understand the process. I'm not sure if it applies to this situation or if there is a more straigtforward way to deploy an agent in Simulink.

Second, if I cannot deploy a RL agent in Simulink, what other platforms can I deploy a RL agent in? I would prefer to deploy the agent in Matlab or Simulink since I've done a lot of the hardware work in Matlab, but I'm not sure what to do if Simulink isn't an option. I know that I can hyptothetically generate C and CUDA code, but I'm nout sure what the next step is.

And lastly, I clearly am lacking some vital knowlege on this front - it turns out I'm an electrical engineer, not a computer scientist! What resources should I use to further my understanding of this? I'd rather understand what I need to do than ask others for help. I don't really know anything about CUDA or deploying programs on actual hardware and I'm not sure where to look... and it turns out that there's a pretty steep learning curve.

Thanks!

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

Answer 1

Drew Davis 2021 年 8 月 2 日

0
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/870608-deployment-of-a-rl-agent#answer_759102

MATLAB Online で開く

Hi Matthew

The RL toolbox has a feature: generatePolicyFunction - which will generate a static MATLAB policy function from an RL agent object. This function can be placed inside a MATLAB Function block for code generation.

Note 1, the Speedgoat platform does not support compiling against any third party deep learning libraries, so the model parameter DLTargetLibrary is restricted to be "none". This limits the which layers can be used for deployment. You can check which built-in layers are supported via the following command:

coder.getDeepLearningLayers("TargetLibrary", "none")

Note 2, custom layers (such as the RL scalingLayer) are supported by the "none" target library.

Hope this helps

Drew

2 件のコメント
なしを表示なしを非表示

Drew Davis 2021 年 8 月 5 日

MATLAB Online で開く

Correction - the RL scalingLayer is not supported in 21a. Note:

supportedLayers = coder.getDeepLearningLayers

Will list ALL shipping layers that are supported by code generation.

Filip Poloczek 2022 年 3 月 6 日

Hi,

I'm trying to use an trained agent on a hardware. The training went well and I have an action space between -9 and 9, but if I use generatePolicyFunction() then the resulting evaluatePolicy() returns only values between -1 and 1. Do you have a solution for this?

サインインしてコメントする。

Answer 2

Hariprasad Ravishankar 2022 年 12 月 26 日

0
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/870608-deployment-of-a-rl-agent#answer_1136452

It looks like this video might be helpful

https://www.mathworks.com/videos/reinforcement-learning-agent-deployment-real-time-testing-on-a-speedgoat-machine-1638925352260.html

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

Deployment of a RL Agent

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

2 件のコメント
なしを表示なしを非表示

その他の回答 (1 件)

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

Deployment of a RL Agent

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

2 件のコメント なしを表示なしを非表示

その他の回答 (1 件)

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

2 件のコメント
なしを表示なしを非表示

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示