huge differences in single vs double precision math

Question

0 投票

I am calculating a sum of squares in 32-bit FP precision (for comparison with a GPU algorithm, which isn't relevant here).

Here is the code:

Y=single((0:499).^2);
sum(Y)
ans =
   41541684
sum(double(Y))
ans = 
   41541750

The (correct) double answer is off by 66! The largest value, 499^2 = 249001, is nowhere near any FP limits.

This is R2013A on OS X 10.9.

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

Follow Question

Answer 1

John D'Errico 2014 年 8 月 7 日

MATLAB Online で開く

4 投票

What you don't understand is that single precision has a 23 bit mantissa. While there are 32 total bits stored in a single, don't forget that one of those bits is a sign bit, which leaves 8 bits to store an exponent in a biased form. So you cannot store an INTEGER larger than 2^24-1 in a single, if you wish to do so without error.

The sum you formed was larger than that limit, so you should expect an error.

log2(41541750)
ans =
     25.308

It is time for you to start reading about floating point arithmetic.

Wiki floating point article

What Every Computer Scientist Should Know About Floating Point Arithmetic

Computers are not all powerful, except for those in the movies/tv.

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

サインインしてコメントする。

huge differences in single vs double precision math

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

回答 (1 件)

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

カテゴリ

製品

タグ

Community Treasure Hunt

huge differences in single vs double precision math

0 件のコメント -2 件の古いコメントを表示 -2 件の古いコメントを非表示

回答 (1 件)

0 件のコメント -2 件の古いコメントを表示 -2 件の古いコメントを非表示

カテゴリ

製品

タグ

参考

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示