Community Profile

photo

Takeshi Takahashi

MathWorks

Last seen: 20日 前 2021 以来アクティブ

Statistics

  • Knowledgeable Level 2
  • First Answer

バッジを表示

Content Feed

表示方法

回答済み
Why does Soft actor critic have Entropy terms instead of Log probability?
RL toolbox also uses the log of the probability density to approximate the differential entropy.

3ヶ月 前 | 0

| 採用済み

回答済み
ExperienceBuffer has 0 Length when i load a saved agent and continue training in reinforcement training
Length 0 means there isn't any experience in this buffer. I think it didn't save the experience buffer due to this bug. Please s...

5ヶ月 前 | 0

| 採用済み

回答済み
How does RL algorithm work with RNNs?
Hi, rlDDPGAgent with RNN first randomly samples B sequences (trajectories) from the experience buffer, where B is MiniBatchSize...

7ヶ月 前 | 0

| 採用済み