How does tree bagger handle NaN values

2020 2 月 7

1 回答

2020 12 月 27 に更新

3 ビュー (30 日間)

サインインしてこの質問に回答する。

Follow Question

サインインしてこの質問に回答する。

Follow Question

古いコメントを表示

0 投票

In building a random forest classifier I have some features with a large amount of NaN values, but it is not clear to me how Tree Bagger handles these NaNs. I've seen quite a bit of documentation of how that is handled in other high level programming languages, but I don't see explicitly how this is done in Matlab. Can anyone point me in the right direction so I can understand the default settings for this or user specified settings?

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

Follow Question

回答 (1 件)

Puru Kathuria 2020 年 12 月 27 日

0 投票

General rules that are followed while NaN or missing values are encountered:

Rule1: The algorithm simply discards the data points where all the features have NaN values and does not use them while training.

Rule 2: If a data point have a few NaN feature values then the algorithm will find the split on the basis of valid values first.

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

カテゴリ

ヘルプセンターおよび File Exchange で Descriptive Statistics についてさらに検索

製品

Statistics and Machine Learning Toolbox

リリース

R2017b

タグ

2020 年 2 月 7 日

2020 年 12 月 27 日

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Translated by