Creating equal lists by minimizing variance on two factors

3 ビュー (過去 30 日間)
Samantha Gustafson
Samantha Gustafson 2023 年 1 月 24 日
編集済み: John D'Errico 2023 年 1 月 24 日
We have a database of 773 unique words that vary based on two numerical dimensions. We would like to create 24 lists of 25 words each, with each list relatively equal on each of the two numerical dimensions. Is there a matlab function(s) that can handle this?

回答 (1 件)

John D'Errico
John D'Errico 2023 年 1 月 24 日
編集済み: John D'Errico 2023 年 1 月 24 日
Why should there be such a function?
My guess would be you want to use a tool like kmeans, clustering your data in 2 dimensions, into 24 separate clusters. Don't expect the solution to be unique, with nice well behaved uniformly sized clusters.
If you really, desperately want 24 clusters of size 25 in each cluster, you will probably need to start writing some code of your own. But don't expect that to be easy, or result in a nice stable solution either. I'd start with k-means, and learn how to use it well, or you might want to do some serious reading about clustering techniques to write your own.


Find more on Statistics and Machine Learning Toolbox in Help Center and File Exchange




Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by