Live Monitoring of Critic Predictions in the RL Toolbox
Is it possible to monitor the Q-value predictions of any critic-based RL approach using the RL Toolbox? For example, with a multi-output DQN agent the internal deep neural network has to be evaluated at every step to score all possible discrete actions given the current state sample. Hence, somewhere internally there must be a Q-value prediction for every available discrete action, and these predictions are then compared to find the optimal action.
However, after spending some time with the R2020a documentation, I was not able to find a way to access these internal Q-value predictions at each time step. In particular, it would be nice if the Simulink-based RL Agent block could expose these predictions for further processing and monitoring during both training and deployment.
Does anybody have a useful hint on how to retrieve the Q-value estimates during learning?
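What I can do so far is probe the critic offline using the toolbox's getCritic and getValue functions; a minimal sketch (assuming agent is an initialized rlDQNAgent and obs is a single observation sample):

critic = getCritic(agent);          % extract the critic representation from the agent
qValues = getValue(critic, {obs});  % vector of Q-values, one entry per discrete action
[qMax, actIdx] = max(qValues);      % greedy action index and its predicted value

This works for inspecting the critic between episodes, but I haven't found a way to get the same Q-value vector streamed out of the RL Agent block at every simulation step.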
Answers (0)