How to efficiently integrate big data without using memory / (How to create big data)
古いコメントを表示
- in a study i will produce large arrays.
- Each array will have at least 500 MB size.
- Each array will have the same number of rows.
- the total size of dataset will be approximately 20 GB or over.
- Somehow I have to create a single variable/array which includes all data and size of 20 GB.
matfile seems a good solution. However when the size of file increases, it gets slower. How can i handle this problem?
9 件のコメント
blaat
2015 年 8 月 18 日
How you store and access big data is strongly dependent on what you need to do with it. Do you need all arrays at the same time? Do you need specific values from these arrays at the same time? Can you somehow partition your calculations to a subset of the data?
Without more information it is very difficult to give advice on your problem.
Mehmet OZC
2015 年 8 月 18 日
Mehmet OZC
2015 年 8 月 18 日
blaat
2015 年 8 月 18 日
If can process the arrays separately, perhaps it would be more convenient to keep them as separated files. Or is there another reason you want a single, large file?
If a single file is required, I would advise against storing everything in a single variable. As far as I know, there is no way of reading only part of an array from a .mat-file, so the file will require 20 GB of memory to load.
Steven Lord
2015 年 8 月 18 日
Why do you need to write them to a single file? Why not put each in its own file; that way if something were to happen to one of the files you wouldn't lose all of your data?
Mehmet OZC
2015 年 8 月 18 日
編集済み: Mehmet OZC
2015 年 8 月 18 日
blaat
2015 年 8 月 18 日
So, if I understand correctly, your problem is this: you want to write 20 GB of data to a single variable in a .mat-file, but it's getting unworkably slow? Or doesn't it work at all?
Mehmet OZC
2015 年 8 月 18 日
編集済み: Mehmet OZC
2015 年 8 月 18 日
Walter Roberson
2015 年 8 月 18 日
I wonder if compression is leading to slowdowns? I do not know whether -v7.3 with matfile uses compression; see discussion http://www.mathworks.com/matlabcentral/answers/15521-matlab-function-save-and-v7-3 and http://www.mathworks.com/matlabcentral/answers/137592-compress-only-selected-variables-when-saving-to-mat
採用された回答
その他の回答 (0 件)
カテゴリ
ヘルプ センター および File Exchange で Standard File Formats についてさらに検索
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!