Reading HDFS from Matlab - what toolboxes do I need?

6 ビュー (過去 30 日間)
Anna Dunblad
Anna Dunblad 2017 年 9 月 8 日
回答済み: Chad Greene 2017 年 9 月 11 日
We're planning to implement Hadoop at my work, and I need a way to retreive the data from the Hadoop clusters in the data lake and get it into Matlab. What toolboxes do I need for this? Note that I'm only reading the data from HDFS-files.
Additionally, would I need other toolboxes to be able to read data?

回答 (2 件)

Brandon Eidson
Brandon Eidson 2017 年 9 月 11 日
Hadoop Sequence Files can be read directly in base MATLAB.
If you want to do "mapreduce" on a Hadoop cluster, then you need to have licenses for the Parallel Computer Toolbox and MATLAB Distributed Computer Server.  Documentation on how to Configure a Hadoop cluster and run "mapreduce" on it is linked to below.

Chad Greene
Chad Greene 2017 年 9 月 11 日
The h5read function has come standard since Matlab release 2011a, and requires no special toolboxes.

カテゴリ

Help Center および File ExchangeDeploy Tall Arrays to a Spark Enabled Hadoop Cluster についてさらに検索

タグ

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by