Issues with splitting a matrix into many smaller matrix
1 回表示 (過去 30 日間)
古いコメントを表示
I am trying to group my origional matrix (size 11250 x 2) into many smaller matricies, grouped by the value in the first column.
I then want to take the mean and standard deviation of these values to report on the data that I have recieved.
PIN
PIN_SORT = sort(PIN.', 2).'
n = 1
for i = -4.5200:0.04: 0.7200
M{n} = PIN_SORT(find(PIN_SORT(:,1)== i),:);
%disp(M{'i'})
%disp(i)
n = n+1
end
avg{n} = mean(M{n})
stddev{n} = std(M{n})
The code does what I want it to do, for certain values, and will leave an empty spot in the array for the rest.
The two comments were attepmts at trouble shooting, but alas to no success.
Is there a glaringly obvious flaw in my approach or in my code, or is there an oversight on my behalf.
1 件のコメント
Stephen23
2023 年 8 月 11 日
"Is there a glaringly obvious flaw in my approach or in my code, or is there an oversight on my behalf."
This is worth reading as well:
採用された回答
Walter Roberson
2023 年 8 月 11 日
for i = -4.5200:0.04: 0.7200
The i are floating point values.
M{n} = PIN_SORT(find(PIN_SORT(:,1)== i),:);
You are comparing PIN_SORT(:,1) to a floating point value for bit-for-bit equality. Most values will not be found.
MATLAB does not calculate in decimal. The value -4.5200 cannot be exactly represented in binary double precision. The value 0.04 cannot be exactly represented in binary double precision. The not-exactly-0.04 will be repeatedly added to the original not-exiactly -4.5200 with the representation error for 0.04 accumulating each time.
You should not compare floating point numbers for equality -- except for cases where the values being compared are drawn from the same source. For example, it is valid to test
PIN_SORT(:,1) == min(PIN_SORT(:,1))
because min() will pull out a bit-for-bit copy of one of the values, and you can be sure that bit-for-bit testing will work in that case (provided the array is not all NaN)
1 件のコメント
Walter Roberson
2023 年 8 月 11 日
T1 = -4.5200:0.04: 0.7200;
T2 = (-452:4:72)/100;
mean(T1 == T2)
Only about 30% of the entries in -4.5200:0.04:0.7200 will be bit-for-bit identical to what you would get if you calculated the corresponding integer divided by 100. The cumulative error on the 0.04 is a problem
その他の回答 (1 件)
KSSV
2023 年 8 月 11 日
You need to load the data from csv file into MATLAB first.
T = readtable('https://in.mathworks.com/matlabcentral/answers/uploaded_files/1455237/PIN.CSV');
PIN = T.(4) % pick your column from table T
参考
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!