
Why is the final ValidationLoss of my model always worse than the best value it obtained during training?

9 views (last 30 days)
Hello!
This is the very first time I've posted a question on the forum: I hope it fits the requirements for a good question, and I'm open to any feedback on its format.
So, I'm training a deep learning model. My network is essentially a 3D rendition of Inception-ResNet-v2 with a custom MAE regression output layer that I copied from this page of the MATLAB docs, with some minor adjustments to the size values to make it work with 3D data.
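For reference, the forwardLoss of that custom layer is essentially the MAE computation from the docs example; a minimal sketch (my 3D version only adjusts the size/dimension values):

% Custom MAE regression output layer, along the lines of the MATLAB
% documentation example (sketch; the 3D version adjusts the dimension indices).
classdef maeRegressionLayer < nnet.layer.RegressionLayer
    methods
        function layer = maeRegressionLayer(name)
            layer.Name = name;
            layer.Description = 'Mean absolute error';
        end
        function loss = forwardLoss(layer, Y, T)
            % Mean absolute error between predictions Y and targets T,
            % averaged over responses (dim 3) and observations (dim 4).
            R = size(Y, 3);
            meanAbsoluteError = sum(abs(Y - T), 3) / R;
            N = size(Y, 4);
            loss = sum(meanAbsoluteError) / N;
        end
    end
end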
During training, the network reaches a certain ValidationLoss, and I've set the option 'OutputNetwork' to 'best-validation-loss' so that the trainNetwork function returns the best model as its final output.
Once training is complete, I verify this value by using the returned net to predict responses for the exact same Validation Set that was used during the training phase, and I always get a worse result. Say the best ValidationLoss during training was around 3.8; the MAE after using the returned network to predict responses on the Validation Set will be around 4.2 instead.
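For reference, a minimal sketch of what I'm doing (the variable names here are placeholders, not my actual code):

% Train, returning the checkpoint with the best validation loss.
options = trainingOptions('adam', ...
    'ValidationData', {XValidation, YValidation}, ...
    'OutputNetwork', 'best-validation-loss', ...
    'Plots', 'training-progress');
net = trainNetwork(XTrain, YTrain, lgraph, options);

% Post-training check on the exact same Validation Set.
YPred = predict(net, XValidation);
mae = mean(abs(YPred(:) - YValidation(:)))  % comes out around 4.2, not 3.8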
Is there a specific reason for this? The network is supposed to be the exact same one that obtained the best ValidationLoss (which in turn should be the best MAE, due to the custom regression layer), and the data is definitely the exact same Validation Set that was used during training, so I can't understand why the performance differs so much.

1 Answer

Gagan Agarwal on 21 December 2023
Hi Alfredo,
I understand that you are trying to work out why the best ValidationLoss reported during training differs from the MAE you compute afterwards on the same Validation Set.
Here are a few potential reasons and considerations:
  1. Overfitting: if the validation set is small or not well randomized, the best validation loss observed during training can be an optimistic outlier rather than a stable estimate of the model's error, so a later re-evaluation tends to look worse.
  2. Evaluation During Training: sometimes the model is evaluated on the validation set while it is still in training mode, which can lead to an optimistic validation loss due to factors like dropout still being active.
  3. Randomness and State Reset: deep learning frameworks often involve randomness. If the state of the random number generator changes between the training phase and the evaluation phase, results can differ; see the sketch below.
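As a quick diagnostic for points 2 and 3, you can fix the RNG state and recompute the MAE with predict, which runs the network in inference mode (variable names such as net, XValidation, and YValidation are assumptions about your code):

% Reset the RNG so evaluation is reproducible, then recompute the
% validation MAE with an inference-mode forward pass (dropout inactive).
rng(0);
YPred = predict(net, XValidation);
mae = mean(abs(YPred(:) - YValidation(:)));
fprintf('Recomputed validation MAE: %.4f\n', mae);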
I hope it helps!
