Choosing bin size to include an equal number of data points

12 Dec 2018
Image Analyst
13 Dec 2018
I have the 'time' at which a certain event happens in the first column and the 'probability of the event happening' in the second column. If I want to plot the data so that each bin contains an equal number of visits, binned by the time of visit, how should I decide the bin size?

  3 Comments

Steven Lord
12 Dec 2018
How do you want to handle the case where this is impossible? If you have four bins in which to collect the following eight data points, what would you choose for the bins?
time = [1 1 1 1 1 2 3 4];
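To see why this example is awkward, here is a minimal sketch that counts how often each distinct value occurs. Only `time` comes from the example above; `nBins`, `target`, `reps`, and `couldBeEqual` are illustrative names. Identical values always fall into the same bin, so the largest repeat count is a lower bound on the size of the fullest bin. The check is a necessary condition only, not a full feasibility test.

```matlab
time = [1 1 1 1 1 2 3 4];
nBins = 4;

% Ideal number of points per bin if the split were perfectly even.
target = numel(time) / nBins;

% Count the occurrences of each distinct value.
[vals, ~, idx] = unique(time);
reps = accumarray(idx(:), 1).';

% If any single value repeats more often than the target,
% equal counts cannot be achieved with any choice of edges.
couldBeEqual = max(reps) <= target;
```

Here the value 1 occurs five times while the target is two points per bin, so no choice of four bin edges gives equal counts.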
Hari krishnan
13 Dec 2018
I am sorry, I don't know.
Image Analyst
13 Dec 2018
Do the bins all have to be the same width, or can you have variable-width bins? If the bins must be equal width, is it OK to ignore empty bins and just try to find the bin width such that the standard deviation of the counts in the occupied bins is minimized? That is pretty easy, and variable-width bins are also fairly easy.
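Both options could be sketched roughly as follows. The sample `data` matrix, the choice of four bins, and all variable names are assumptions made for illustration, not values from the question. The variable-width part places an edge midway between neighbouring sorted times so that each bin gets about the same number of points. The equal-width part tries a few bin counts and keeps the one whose occupied bins have the smallest standard deviation of counts.

```matlab
% Hypothetical data: column 1 = time, column 2 = probability.
data = [0.5 0.10; 1.2 0.20; 1.9 0.15; 2.4 0.30; ...
        3.8 0.25; 4.1 0.40; 6.0 0.35; 9.5 0.50];
t = data(:, 1);
p = data(:, 2);
nBins = 4;   % assumed number of bins

% --- Variable-width bins with (nearly) equal counts ---
ts  = sort(t(:)).';                       % sorted times as a row vector
n   = numel(ts);
cut = round((1:nBins-1) * n / nBins);     % index of the last point in each bin
edges = [ts(1), (ts(cut) + ts(cut + 1)) / 2, ts(end)];
edges = unique(edges);                    % repeated times can collapse edges
counts = histcounts(t, edges);            % points per bin
bin    = discretize(t, edges);            % bin index of every point
meanP  = accumarray(bin, p, [numel(edges) - 1, 1], @mean);  % mean probability per bin

% --- Equal-width bins: minimize the spread of counts in occupied bins ---
% The candidate range is capped at nBins because the criterion is trivially
% zero once every occupied bin holds a single point.
best = struct('nBins', NaN, 'spread', Inf);
for k = 2:nBins
    eqEdges = linspace(min(t), max(t), k + 1);
    c = histcounts(t, eqEdges);
    s = std(c(c > 0));                    % ignore empty bins
    if s < best.spread
        best = struct('nBins', k, 'spread', s);
    end
end
```

With repeated time values the variable-width edges may merge, in which case fewer than `nBins` bins come back and the counts are no longer equal. `bar(meanP)` or `histogram(t, edges)` would then plot the result.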

