回答済み
How to implement LSTM layers in MATLAB's DDPG agent
Hello, You can use lstm layers directly in both actors and critics and the built-in DDPG agent will handle the rest. Take a loo...

約3年前 | 0

| 採用済み

回答済み
Error in creating a custom environment in deep reinforcement learning code
The links below provide more info on how to create custome environments in MATLAB. https://www.mathworks.com/help/reinforcement...

約3年前 | 1

| 採用済み

回答済み
Resume training for PPO agent
PPO does not use an experience buffer so you should be fine loading the saved agent to resume training. If you are using advanta...

約3年前 | 1

| 採用済み

回答済み
How to use model predictive control and quadprog() in MicroAutobox III
Hi, You can use quadprog for simulation and code generation as a solver in Model Predictive Control Toolbox as of R2021b releas...

約3年前 | 0

回答済み
Model predictive control (MPC) for MIMO system on real hardware in simulink
Unfortunately, you cannot use Simulink models as prediction models for MPC design. The alternative would be to use data-driven m...

約3年前 | 0

回答済み
Adaptive model predictive controller
Have you seen this example?

約3年前 | 0

回答済み
Independently working multiple reinforcement learning agents
Centralized learning makes learning and exploration more efficient because the agents share things like experiences. If agents p...

約3年前 | 0

| 採用済み

回答済み
Problem with Using codegen commands to generate C++ code on NLMPC Code Generation Tutorial
You did not specify what kind of error you were seeing? In my case, doing the following worked: func = 'nlmpcmoveCodeGeneration...

約3年前 | 0

回答済み
RL agent does not learn properly
Some comments: 1) 150 episodes is really not much, you need to let the training continue for a bit longer 2) There is no guara...

約3年前 | 0

| 採用済み

回答済み
Action of the RL agents actions change when deployed in a different enviornment
A couple of suggestions/comments: 1) You mentioned env1 and env2 are different - why are you expecting to see the same results?...

約3年前 | 0

回答済み
How to specify a nonlinear mpc controller for continuous time delay differential equation state function?
You can basically add states to help model the delays. So your new discretized state vector would be [x(k) y(k) x(k-1) y(k-1) .....

約3年前 | 1

| 採用済み

回答済み
Plotting states while doing RL training
We recently added a mechanism that allows you to log any information you find helpful during training. Please take a look at thi...

約3年前 | 0

| 採用済み

回答済み
Inquiry about Neural Network Structure for Lane Keeping Assist Example
For this example, we did not rely on any papers/external sources, the development team put together this architecture when they ...

約3年前 | 0

| 採用済み

回答済み
Although I adjusted the Noise Options DDPG actions are always equal to the maximum and minimum value.
At first glance I don't see anything wrong. A couple of suggestions: 1) Try reducing the noise variance further, until you see ...

約3年前 | 0

| 採用済み

回答済み
How to log signal data from simulink to matlab with higher time interval to avoid high data storage?
If you are using R2022b, please take a look at this page. We recently added enhanced logging capabilities in Reinforcement Learn...

約3年前 | 1

回答済み
How to input action in reinforcement learning template environment?
Easiest thing you can do is add a break point and display what "action" variable is. It's obviously not a cell array so you cann...

約3年前 | 0

| 採用済み

回答済み
How to set the state with different variables in properties?
Hi Yang, We have an example in Reinforcement Learning Toolbox that does training based on nonhomogeneous observations, and spec...

約3年前 | 0

| 採用済み

回答済み
Receiving only one joint angle instead of a cycle of values necessary for walking during simulation?
Hello, There are several open questions here: 1) If you want to use imitation learning, you need to have input output data. In...

約3年前 | 1

| 採用済み

回答済み
Terminal Weights to nlmpc
For nonlinear mpc, the easiest way to do that is to use the multistage formulation and block. Then you can set constraints/cost ...

約3年前 | 0

| 採用済み

回答済み
How to put vector as a element in rlNumericSpec?
You could do something along the lines of: ObservationInfo(1) = rlNumericSpec([1 1]); ObservationInfo(1).Name = 'scalar'; Obs...

約3年前 | 0

| 採用済み

回答済み
Discretisation of a non-linear LTI system
If you have the dynamics in symbolic form, you need to turn it into a form that can be directly consumed by Model Predictive Con...

約3年前 | 0

| 採用済み

回答済み
How to import single input and three output data (of simulink model) stored in workspace, using system identification app (present in apps option) in matlab ?
Have you looked at this example which trains a state space model and then uses it for MPC design?

約3年前 | 0

回答済み
How to save multiple trained RL Agents?
You can really do whatever makes sense to you. Either save them separately or in the same mat file as follows: save('Agents.mat...

約3年前 | 0

回答済み
I'm getting the following error while doing the state update in mpc
Looks like the error is quite descriptive here, please check the dimensions of A, x, u, and B1 maybe by using a break point to s...

約3年前 | 0

| 採用済み

回答済み
How to optimize a parameter using Nonlinear model predictive controller
Looks like you are referring to parameters defined inside the prediction model/state function of the MPC controller. You can mak...

約3年前 | 0

回答済み
Problems importing Farama Gymnasiums (previously Open AI gym) continuous environments in MATLAB to use RL toolbox
Hi Alberto, In the post you are mentioning, I recommended a 3rd party tool to use OpenAI Gym with Reinforcement Learning Toolbo...

約3年前 | 0

| 採用済み

回答済み
how to choose the alternate cost function in MPC command line window? and can we know which cost function MPC block is considering?
By default, MPC controller will use the standard cost. If you want to use the alternate cost, you can see how to do it in this e...

3年以上前 | 0

| 採用済み

回答済み
Can we equate or un-equate the two MV's of the MPC controller in command line?
Hello, A couple of points first: 1) I am assuming your MVs are continuous (if they are discrete, what you are asking is not su...

3年以上前 | 0

| 採用済み

回答済み
Reinforcement Learning agent converges to a suboptimal policy
Hello, In your question you mention a graph but it has not been attached? It sounds like the agent you trained has converged t...

3年以上前 | 1

回答済み
reinforcement learning line tracer with Simulink
The closest I can think of are these examples in Reinforcement Learning Toolbox: https://www.mathworks.com/help/reinforcement-l...

3年以上前 | 0

| 採用済み