How do we know that the PI controller can be modeled using a single neuron?
1 回表示 (過去 30 日間)
古いコメントを表示
In this tutorial (https://www.mathworks.com/help/reinforcement-learning/ug/tune-pi-controller-using-td3.html) one fully connected layer is used to model the PI controller. This translates to 2 inputs to and one output from a neuron. I am not sure what activation function is used.
How do we know that this problem doesn't need more neurons? If the answer is that the underlying model is linear then why have a non-linear activation function?
0 件のコメント
回答 (1 件)
Emmanouil Tzorakoleftherakis
2023 年 9 月 28 日
The network used to model the PI controller is exactly this one
actorNet = [
featureInputLayer(numObs)
fullyConnectedPILayer(initialGain,'ActOutLyr')
];
The fullyConnectedPILayer layer is a custom layer created to imitate a PI controller. If you open it, you will see that it performs the operation of a fully connected layer with two weights. Those two weights represent Ki and Kp so you don't need any more neurons for this problem.
As you can see, there is no use of any activation layer here because we intend the actor network to behave exactly like a PI controller. If you work out the input output math, you will see that the output of the actor network is equal to u= [∫e dt e]∗[Ki Kp] where [∫e dt e] are the two observation inputs. This should be familiar as it's essentially a PI controller.
Hope this helps
0 件のコメント
参考
カテゴリ
Help Center および File Exchange で Sequence and Numeric Feature Data Workflows についてさらに検索
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!