Takeshi Takahashi

MathWorks

Last seen: 3 days ago | Active since 2021

Followers: 0 Following: 0

I am a developer for Reinforcement Learning Toolbox at MathWorks.

My areas of interest are Reinforcement Learning, Deep Learning, and Robotics.

Feeds

Answered
PPO algorithm training problem in Reinforcement Learning Toolbox
When N is smaller than ExperienceHorizon and N is also smaller than MiniBatchSize, the PPO agent uses N experiences to update i...

About 3 years ago | 0

| Accepted

Answered
Creating an actorLossFunction for ContinuousDeterministicActor
Please take a look at this example for rlContinuousDeterministicActor if you want to use it in a custom training loop. rlDiscre...

About 4 years ago | 0

| Accepted

Answered
Why does Soft actor critic have Entropy terms instead of Log probability?
Reinforcement Learning Toolbox also uses the log of the probability density to approximate the differential entropy.

About 5 years ago | 0

| Accepted

Answered
ExperienceBuffer has 0 Length when i load a saved agent and continue training in reinforcement training
Length 0 means there isn't any experience in this buffer. I think it didn't save the experience buffer due to this bug. Please s...

About 5 years ago | 0

| Accepted

Answered
How does RL algorithm work with RNNs?
Hi, rlDDPGAgent with RNN first randomly samples B sequences (trajectories) from the experience buffer, where B is MiniBatchSize...

More than 5 years ago | 0

| Accepted