Create k-fold Cross Validation with Undersampling for highly imbalanced Dataset

Question

Dario Walter 2020 年 8 月 4 日

0
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/574828-create-k-fold-cross-validation-with-undersampling-for-highly-imbalanced-dataset

回答済み: Anshika Chaurasia 2020 年 8 月 14 日

Dear Community,

I am not sure how to implement the following requirement. When I use undersampling for my supervised Machine Learning Algorithm, how can I assure that the k-fold corresponds to the distribution of the original dataset. The performace metric (e.g. PR AUC) shall refer to the original distribution and not to the distribution of the undersampled set.

It does not make sense to solely perform k-fold cross validation on the entire undersampled dataset.

Your help is highly appreciated!

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

Answer 1

Anshika Chaurasia 2020 年 8 月 14 日

1
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/574828-create-k-fold-cross-validation-with-undersampling-for-highly-imbalanced-dataset#answer_479856

MATLAB Online で開く

Hi Dario,

It is my understanding that you want k-folds (cross-validation) to preserve the imbalanced distribution of original dataset. The solution is stratified k-fold cross-validation.

Use cvpartition function and refer to cvpartition documentation for more information.

c = cvpartition(group,'KFold',k,'Stratify',stratifyOption)

You can also try following file exchange documents as a drop-in replacement to cvpartition:

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

Create k-fold Cross Validation with Undersampling for highly imbalanced Dataset

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

回答 (1 件)

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

Create k-fold Cross Validation with Undersampling for highly imbalanced Dataset

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

回答 (1 件)

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示