Creating numerical variables from categorical variables in an unbalanced dataset
Grigorios Kyritsakas
12 Feb 2020
Commented: Grigorios Kyritsakas
17 Feb 2020
Hello there,
I would like to apply the Random Forest method to a highly unbalanced dataset that includes both numerical and categorical variables. To improve my classification results, I thought I would create synthetic datasets with the SMOTE and ADASYN algorithms before running the classification. However, both methods work only with numerical variables, so I would like to ask if you have any suggestions on how to transform my categorical variables into numerical ones.
Many thanks in advance for your help.
Accepted Answer
Lei Hou
14 Feb 2020
Hi Grigorios,
You can do something like the following.
catVar = categorical(["a" "b" "c" "b" "a"]);
numValue = [0.1 3 100]; % The order of numbers refers to the order of categories returned by categories(catVar)
numVar = numValue(catVar)
I hope this solution is helpful to you.
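The mapping above assigns one hand-picked number to each category. If no meaningful numbers are available, two common alternatives are integer category codes and one-hot (dummy) encoding. The following is a minimal sketch, not part of the original answer. It assumes the Statistics and Machine Learning Toolbox is installed for dummyvar, and the variable names codes and onehot are illustrative only.

```matlab
% Same example categorical variable as in the answer above
catVar = categorical(["a" "b" "c" "b" "a"]);

% Option 1: integer codes, numbered in the order returned by categories(catVar)
codes = double(catVar);

% Option 2: one-hot (dummy) encoding, one 0/1 column per category.
% dummyvar is assumed to be available (Statistics and Machine Learning Toolbox);
% catVar(:) reshapes the row vector into a column so each row is one observation.
onehot = dummyvar(catVar(:));
```

Because SMOTE and ADASYN create synthetic points by interpolating between neighbouring observations, integer codes impose an ordering and distance between categories that may not exist in the data. One-hot columns avoid that ordering, but interpolated values can be fractional and may need to be mapped back to a valid category afterwards.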