Classification with a huge dataset

I'm trying to run classification on a huge dataset containing data from 6 persons for training, and with just 1 person's data I'm already getting this error: "Requested 248376x39305 (9.1GB) array exceeds maximum array size preference." I'm trying Bagged Tree and Neural Network classifiers, and I'd like to ask how I can do this. Is it possible to train these classifiers on portions of the dataset (i.e., continue training a saved classification model)?

9 Comments

Greg Heath on 7 Nov 2016
Please explain how 248376 x 39305 constitutes a 1 person data set
[ I N ] = size(input)
[ O N ] = size(target)
Thanks,
Greg
Mindaugas Vaiciunas on 7 Nov 2016
Edited: Walter Roberson, 7 Nov 2016
Input matrix size 248376 x 765
Target matrix size 248376 x 1
Then, when I try to build a TreeBagger model, it creates a 248376 x 39305 matrix. P.S. As you can see, one frame has 765 features.
Walter Roberson on 7 Nov 2016
Please show your Tree Bagging code. https://www.mathworks.com/help/stats/treebagger.html does not return matrices.
Mindaugas Vaiciunas on 7 Nov 2016
Right, it doesn't return matrices because it can't even start, due to the error above about running out of RAM. The code is simple:
Mdl = TreeBagger(50,Features,FeaturesTarget);
So I'm thinking about decomposing all the data into smaller files, but I don't know how to train the classifier again and again on those portions of data. I need something that lets me update a classifier with new data without retraining the entire thing from scratch.
Walter Roberson on 7 Nov 2016
Have you considered reducing the number of trees?
Mindaugas Vaiciunas on 8 Nov 2016
Reducing the number of trees doesn't help. I have tried splitting the data across two different models, making them compact, and combining them; at first glance this helps, but I can't reach a high recognition rate. I think I need an "online" algorithm that can continue training a saved model on new data.
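For reference, the compact-and-combine approach described here can be sketched as follows. This is a minimal sketch assuming the Statistics and Machine Learning Toolbox; `Features1`/`Target1` and `Features2`/`Target2` are hypothetical variables holding the two chunks of data:

```matlab
% Train a separate ensemble on each chunk of the data.
Mdl1 = TreeBagger(25, Features1, Target1);
Mdl2 = TreeBagger(25, Features2, Target2);

% Compact the ensembles to drop the stored training data, then
% append the trees of the second ensemble to the first.
C1 = compact(Mdl1);
C2 = compact(Mdl2);
C  = combine(C1, C2);   % CompactTreeBagger with 50 trees total

% Predict with the combined compact ensemble.
labels = predict(C, FeaturesNew);
```

Note that each sub-ensemble only ever sees its own chunk, so its trees are bagged from less data than a single ensemble trained on everything; this is one reason the combined model's recognition rate can lag.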
Greg Heath on 9 Nov 2016
Edited: Greg Heath, 9 Nov 2016
I still don't get it
39305/765
ans =
51.3791
Regardless, I think you should use dimensionality reduction via feature extraction.
Hope this helps,
Greg
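One common way to do the dimensionality reduction Greg suggests is PCA, keeping only the components that explain most of the variance. A sketch assuming the Statistics and Machine Learning Toolbox `pca` function; the 95% threshold is an arbitrary choice:

```matlab
% Project the N x 765 feature matrix onto its principal components.
[coeff, score, ~, ~, explained] = pca(Features);

% Keep just enough components to explain 95% of the variance.
k = find(cumsum(explained) >= 95, 1);
ReducedFeatures = score(:, 1:k);

% Train on the reduced matrix. New data must get the same projection:
%   NewReduced = (NewFeatures - mean(Features)) * coeff(:, 1:k);
Mdl = TreeBagger(50, ReducedFeatures, FeaturesTarget);
```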
Mindaugas Vaiciunas on 9 Nov 2016
One solution is to average some of the features for dimensionality reduction, but that may affect the recognition rate.
Greg Heath on 10 Nov 2016
Of course it will affect it. However, the way to choose is to set a limit on the loss of accuracy.


Answers (1)

Walter Roberson on 7 Nov 2016


Add more memory (RAM) to your computer. Then check or adjust Preferences -> MATLAB -> Workspace -> MATLAB array size limit.
Or, you could set the division ratios so that a much smaller fraction is used for training and validation, with most of it left for test. This effectively uses only a small subset of the data, but a different small subset each time it trains.
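For the neural-network route, the division ratios can be set on the network object before training. A sketch assuming the Deep Learning Toolbox `patternnet`; the ratios and hidden-layer size are arbitrary, and `Features`/`FeaturesTarget` are the matrices described earlier in the thread:

```matlab
% patternnet expects one column per sample, so transpose the
% N x 765 feature matrix and one-hot encode the class labels.
X = Features';                       % 765 x N
T = full(ind2vec(FeaturesTarget'));  % classes x N one-hot targets

net = patternnet(10);                % 10 hidden units (arbitrary)
net.divideFcn = 'dividerand';        % random division on each run
net.divideParam.trainRatio = 0.10;   % train on only 10% of samples
net.divideParam.valRatio   = 0.10;
net.divideParam.testRatio  = 0.80;   % most data held out as test

net = train(net, X, T);
```

Because `dividerand` picks a fresh random split every time `train` is called, each training run effectively uses a different small subset of the data, as described above.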

6 Comments

Mindaugas Vaiciunas on 7 Nov 2016
More memory is not a solution for this; it would need around 36 GB of RAM to hold all the training data. With division ratios, would I be able to keep training the same saved model with small portions of the data again and again?
Walter Roberson on 7 Nov 2016
Amazon Web Services, among other providers, make available machines with more than 36 Gb of RAM. If you had that much RAM your program would run; therefore adding RAM is a solution for the problem.
Mindaugas Vaiciunas on 8 Nov 2016
This project isn't commercial; it's for a university master's degree, so adding RAM is not a solution for me, but thanks for the answer.
Walter Roberson on 8 Nov 2016
https://www.mathworks.com/products/parallel-computing/matlab-parallel-cloud/ 16 workers, 60 Gigabytes, $US 4.32 per hour educational pricing, including compute services.
Or if you provide your own EC2 instance, https://www.mathworks.com/products/parallel-computing/parallel-computing-on-the-cloud/distriben-ec2.html $0.07 per worker per hour for the software licensing from MATLAB. For example you could do https://aws.amazon.com/ec2/pricing/on-demand/ m4.4xlarge, 16 cores, 64 gigabytes, $US 0.958 per hour for the EC2 service. Between that and the $0.07 per worker from Mathworks it would come in less than $US2.50 per hour. About the price of a Starbucks "Grande" coffee.
Remember, your time is not really "free". At the very least you need to take into account "opportunity costs" -- like an hour spent fighting a memory issue is an hour you could have been working on a minimum wage job.
Mindaugas Vaiciunas on 9 Nov 2016
Thanks for the advice; I'll keep it in mind if there is no other solution.
Walter Roberson on 9 Nov 2016
Let me put it this way:
  • You do not wish to reduce the number of trees or the data because doing so might decrease the recognition rate.
  • We do not have a magic low-memory implementation of the TreeBagger available.
  • You do not have enough memory on your system to run the classification using the existing software
Your choices would seem to be:
  • write the classifier yourself, somehow not using as much memory; or
  • obtain more memory for your own system; or
  • obtain use of a system with more memory



Asked: 6 Nov 2016
Last comment: 10 Nov 2016
