MATLAB clustering technique with textual data

Hi, I am trying to figure out the best way to cluster numeric data (stock returns) that is described by text labels. For instance, say I have stock returns for 10 sectors that I'd like to cluster into 3 distinct groups. My first thought was to use the k-means clustering algorithm from the Statistics and Machine Learning Toolbox; however, it doesn't accept textual information as a descriptor.
Please advise.
Example data set
Industry, Return
Financials,2%
Consumer Disc,3%
Consumer Staples,4.5%
Energy,1%
Health Care,1.5%
Industrials,2.2%
Info Tech,3.7%
Materials,4.8%
Telecom,-2%
Utilities,-1%

1 Comment

Brian on 16 Dec 2016
Any ideas on this from statistical experts?


Answers (1)

mizuki on 20 Dec 2016

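Convert the textual data to a categorical array. That reduces each text label to one of a fixed set of categories, which is easier to carry alongside the numeric data.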
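As a rough illustration of that suggestion, here is a minimal sketch using the example data from the question. It assumes the sector names act only as labels and that the clustering itself runs on the numeric returns; the variable names (industry, ret, idx, T) and the choice of 10 replicates are mine, not part of the original answer.

```matlab
% Example data from the question; returns are in percent.
industry = categorical({'Financials'; 'Consumer Disc'; 'Consumer Staples'; ...
    'Energy'; 'Health Care'; 'Industrials'; 'Info Tech'; 'Materials'; ...
    'Telecom'; 'Utilities'});
ret = [2; 3; 4.5; 1; 1.5; 2.2; 3.7; 4.8; -2; -1];

% kmeans starts from random centroids, so fix the seed for repeatable runs.
rng(1);

% Cluster the numeric returns into 3 groups. 'Replicates' reruns the
% algorithm from several starts and keeps the best solution.
[idx, C] = kmeans(ret, 3, 'Replicates', 10);

% Put the categorical labels back next to the cluster assignments.
T = table(industry, ret, idx, ...
    'VariableNames', {'Industry', 'Return', 'Cluster'});
T = sortrows(T, 'Cluster');
disp(T)
```

The cluster numbers themselves are arbitrary; only the grouping matters. If a text field were shared across several rows and you wanted it to influence the clustering, one option might be to expand the categorical array into indicator columns with dummyvar and append them to the numeric input, but with one unique name per row, as here, that would add nothing.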


Asked: 12 Dec 2016

Answered: 20 Dec 2016
