clustering the data

Question

FIR 2011 年 11 月 17 日

0
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/21485-clustering-the-data

i have 100 gene ,cluster the 100 data into 20 groups using the K-means method

so i will lhave in first column the cluster number(1 to100) 2nd column-total no of genes(ex 3)(for cluster no 1) genes included (9,7,8)(for cluster no 1)

i have to do it for all 20 rows

please help

2 件のコメント
なしを表示なしを非表示

Amith Kamath 2011 年 11 月 17 日

if you are clustering into 20 groups, then the cluster number for each gene will be one of 1 to 20, and not 1 to 100. Also, how many samples do you have, meaning if each gene is a column, how many rows do you have? You'll have to run kmeans columnwise then, as far as I know!

FIR 2011 年 11 月 19 日

yes amith gene will be one of 1 to 20,I have one 100 rows and one column

please help

サインインしてコメントする。

サインインしてこの質問に回答する。

Answer 1

Amith Kamath 2011 年 11 月 19 日

0
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/21485-clustering-the-data#answer_28539

MATLAB Online で開く

OK, so from what I understand, you are clustering the genes, and not the samples, which means that you need to transpose your sample-gene matrix, and then run kmeans. If you then type in:

IDX = kmeans(genedata',20); %where genedata is a matrix containing genes in rows, and samples in columns,

IDX will contain 100 elements, which contains numbers from 1 to 20, corresponding to which cluster the gene is in.

for i = 1:20
genenum(i) = sum(IDX == i);
end

and genenum will contain the number of genes in each cluster. You could alternatively take the histogram of IDX, and the resulting array will contain the same info as genenum.

If you'd want to include the genes included in each cluster as well, you can't store it in the same matrix as the numbers of elements are not the same in each 'cell' of the matrix, and hence I advise using cell variables instead. Hope this answers your questions!

2 件のコメント
なしを表示なしを非表示

FIR 2011 年 11 月 19 日

yes amith i git it but please answer include the genes included in each cluster..i need it badly..trying these for long time but could not get...

my genedata is =[1:1:100]

please help

Amith Kamath 2011 年 11 月 21 日

I don't really know how your code will work if the genedata is just a set of 100 numbers! Whatever I answered was for a matrix of size 100x100, where each sample has a value for each gene, and there are (arbitrarily) 100 samples. Assuming the above is true, you should extract the indices of IDX that are 1, which are the gene indices in each cluster.

サインインしてコメントする。

clustering the data

2 件のコメント
なしを表示なしを非表示

採用された回答

2 件のコメント
なしを表示なしを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

Community Treasure Hunt

clustering the data

2 件のコメント なしを表示なしを非表示

採用された回答

2 件のコメント なしを表示なしを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

Community Treasure Hunt

2 件のコメント
なしを表示なしを非表示

2 件のコメント
なしを表示なしを非表示