Large discrepency between CPU and GPU with fftn and ifftn?

Question

Nathan Zechar 2020 年 5 月 23 日

1
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/530898-large-discrepency-between-cpu-and-gpu-with-fftn-and-ifftn

回答済み: Andrea Picciau 2020 年 5 月 26 日

Hello,

I am using a GPU to compute multipul 3D FFTs. I've come across a problem where the output I am recieving from the GPU FFT is vastly different in terms of shape and values. I am unsure why this is the case and if there is any way to mitigate this problem.

I have included a small portion of code where data is generated, and the fftn and ifftn are applied to this data. Below the code are surface plots which show the difference in output. Is there a method that can be applied to where the output from the GPU FFT will be closer to the output of the CPU FFT?

N = 128;
x = (pi/4).*(0:N-1);
z = (3*pi/4).*(0:N-1);
tol = 1.0e-8;
kx  = [tol 1:N/2 -N/2+1:-1].*2.*pi.*(1i)./N;
[X,Y,Z]  = ndgrid(x,x,z);
X  = cos(2.*pi.*X./(32*pi));
Y  = cos(2.*pi.*Y./(32*pi));
Z  = exp(-(Z-(48*pi)).^2./100./(96*pi));
A  = 4.*Z.*(Y+1i.*X);
%% What the output should look like
A0        = A;
A0        = fftn(A0);
A0        = A0.*kx';
A0(1,:,:) = 0;
A0        = ifftn(A0);
A0        = real(A0);
A01       = squeeze(A0(:,:,64));
subplot(1,2,1)
surf(A01)
title('CPU')
%% GPU output is signigicantly different
A1        = gpuArray(A);
A1        = fftn(A1);
A1        = A1.*kx';
A1(1,:,:) = 0;
A1        = ifftn(A1);
A1        = real(A1);
A11       = squeeze(A1(:,:,64));
subplot(1,2,2)
surf(A11)
title('GPU')
%% Compares output of both
max(abs(A0-A1),[],'all')

2 件のコメント
なしを表示なしを非表示

David Goodmanson 2020 年 5 月 23 日

Hi Nathan,

in your definition of A11, could you explain why you have VM on the right hand side in place of A1?

Nathan Zechar 2020 年 5 月 25 日

Hey David, I appoligoze. It is now edited.

This is just a portion of a much larger code. I still had variables in my workspace when I tried to run it for the previous unedited example. It should produce the same results as shown in the graph.

I am using an RTX 2070 Max-Q if graphics cards make a difference.

サインインしてコメントする。

サインインしてこの質問に回答する。

Answer 1

Andrea Picciau 2020 年 5 月 26 日

1
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/530898-large-discrepency-between-cpu-and-gpu-with-fftn-and-ifftn#answer_438908

Hi Nathan,

The results differ for two related reasons:

GPUs and CPUs have quite different architectures,
the CPU and GPU algorithms are slightly different.

On my GPU, I'm getting a max abs error of 1.6584e-15, which is within the expected tolerance and our error analysis.

As a rule of thumb, you should (almost always) expect a discrepancy between the GPU and the CPU that goes from 1e-14 to 1e-15 for computations in double precision.

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

Large discrepency between CPU and GPU with fftn and ifftn?

2 件のコメント
なしを表示なしを非表示

採用された回答

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

Large discrepency between CPU and GPU with fftn and ifftn?

2 件のコメント なしを表示なしを非表示

採用された回答

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

2 件のコメント
なしを表示なしを非表示

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示