Why my test accuracy higher than validation accuracy?

Question

0 投票

I am using classification learner app. I get test accuracy higher than validation accuracy. For example 94.61% Accuracy (Validation), 94.81% Accuracy (Test). I'm sure I've splitted the train and test sets correctly. Why is test accuracy higher? How can i solve this? I would be grateful if you help.

4 件のコメント
2 件の古いコメントを表示 2 件の古いコメントを非表示

the cyclist 2023 年 4 月 29 日

@John D'Errico, I assume that @Deren is breaking out into three datasets:

training -- to fit the model
validation -- to tune hyperparameters
test -- to evaluate the final model choice

(This oversimplified, for brevity.)

Typically, training performace > validation performance > test performace.

(Again, oversimplified for brevity.)

So, his result is slightly more surprising than the two-stage method you describe. (I expect he did not train on the test set, as you are describing.)

See my answer for my take on the whole thing, which is effectively the same as your broader point, which is that the difference is small and not surprising.

John D'Errico 2023 年 4 月 29 日

Ok. That makes sense. Regardless, the difference is tiny, and could easily have been the other way.

サインインしてコメントする。

サインインしてこの質問に回答する。

Follow Question

Answer 1

the cyclist 2023 年 4 月 29 日

編集済み: the cyclist 2023 年 4 月 29 日

1 投票

There is no mystery here. Although in general a classifier will perform a little less well on the test set, sampling error can lead to a "lucky" test set, and you end up classifying it better.

Think of it like this. Suppose your validation accuracy is 95%, and the true accuracy of your model is really only 93%. It is still the case that you could perform better on any given randomly drawn test case. You could even get 100% accuracy in the test set.

There is a nice, fairly comprehensive discussion of these points by Jason Brownlee. Quoting from Kuhn and Johnson (from that article): "The uncertainty of the test set can be considerably large to the point where different test sets may produce very different results."

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

サインインしてコメントする。

Why my test accuracy higher than validation accuracy?

4 件のコメント
2 件の古いコメントを表示 2 件の古いコメントを非表示

採用された回答

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

その他の回答 (0 件)

カテゴリ

製品

リリース

タグ

Community Treasure Hunt

Why my test accuracy higher than validation accuracy?

4 件のコメント 2 件の古いコメントを表示 2 件の古いコメントを非表示

採用された回答

0 件のコメント -2 件の古いコメントを表示 -2 件の古いコメントを非表示

その他の回答 (0 件)

カテゴリ

製品

リリース

タグ

参考

Community Treasure Hunt

4 件のコメント
2 件の古いコメントを表示 2 件の古いコメントを非表示

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示