Prior probability in binary fitcsvm to take into account different class proportions in training and test sets

Question

Alexis Moscoso Rial 2017 年 10 月 6 日

0
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/360047-prior-probability-in-binary-fitcsvm-to-take-into-account-different-class-proportions-in-training-and

コメント済み: heng ma 2022 年 6 月 13 日

Hello,

I am working in a binary classification problem using svm. Due to unavoidable reasons, my training and test sets have different class proportions, (roughly 1:3 vs 1:5). I would like to know whether the introduction of the corresponding test prior probabilities in the option 'Prior' when training fitcsvm is going to take into account this difference when predicting in the test set.

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

Answer 1

Carl 2017 年 10 月 10 日

0
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/360047-prior-probability-in-binary-fitcsvm-to-take-into-account-different-class-proportions-in-training-and#answer_285156

編集済み: Carl 2017 年 10 月 10 日

Hi Alexis. Specifying a value for 'Prior' will affect the training process for the SVM, which will then make a difference in how it predicts for the test set. In any case, the values for 'Prior' shouldn't necessarily be the prior probabilities of your test set, but rather, the realistic class prior probabilities.

It can be problematic when the real prior probabilities differ significantly from the prior probabilities in your training set. If your training set is representative of the population, then you shouldn't have to provide anything for 'Prior'.

This is a more general problem known as class imbalance, or imbalanced data sets. You can see the Answers post below for previous suggestions on how to account for this problem:

https://www.mathworks.com/matlabcentral/answers/11549-leraning-classification-with-most-training-samples-in-one-category

2 件のコメント
なしを表示なしを非表示

Alexis Moscoso Rial 2017 年 10 月 17 日

In that case I'm going to switch the prior probabilities to those of the test set, which are the realistic ones. Thank you very much.

heng ma 2022 年 6 月 13 日

wow! Thank you very much. I also meet this problem. trian data set prior is 1:7,trianing accuary is around 87.5%(which means can not separate well), but using this trianing result, test data set prior is 1:1, accuary is around 90% which is wrong.

サインインしてコメントする。

Prior probability in binary fitcsvm to take into account different class proportions in training and test sets

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

2 件のコメント
なしを表示なしを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

Community Treasure Hunt

Prior probability in binary fitcsvm to take into account different class proportions in training and test sets

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

2 件のコメント なしを表示なしを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

2 件のコメント
なしを表示なしを非表示