How to speed up left divide on GPU?
Hello,
I have two 4-D gpuArrays of the same size, f (NxMxLxK) and f1 (NxMxLxK), and I need to left-divide each column of f1 by the corresponding column of f. The code below does this, but it has become the bottleneck of my algorithm and takes about 95% of the runtime:
beta2= arrayfun(@(n) f(:,n)\f1(:,n), 1:numel(f)/size(f,1));
The result beta2 is a vector. Is there a way to speed up this code? I assume the slowdown comes from the for loop inside arrayfun, which moves data between the CPU and the GPU on every iteration.
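For reference, one way the per-column left divide could be vectorized is sketched below. It relies on the fact that, for a nonzero column vector a, a\b is the least-squares scalar (a'*b)/(a'*a), so the whole computation reduces to element-wise products and column sums. This is only a sketch under assumptions: the sizes and random data are made up for illustration, the arrays are ordinary CPU arrays so the snippet runs without a GPU (wrapping the inputs in gpuArray should work the same way, since only reshape, conj, abs, sum and element-wise operators are used), and every column of f is assumed to be nonzero. A column of all zeros would give NaN here, whereas the backslash version returns 0 with a rank-deficiency warning.

```matlab
% Stand-in data; in the question f and f1 are gpuArrays.
% The sizes are arbitrary and chosen only for illustration.
N = 5; M = 4; L = 3; K = 2;
f  = rand(N, M, L, K);
f1 = rand(N, M, L, K);

% Flatten the trailing dimensions so that each column is one
% independent least-squares problem.
A = reshape(f,  N, []);
B = reshape(f1, N, []);

% For a nonzero column a, a\b equals (a'*b)/(a'*a).
% conj and abs keep the formula valid for complex data as well.
beta2_vec = sum(conj(A) .* B, 1) ./ sum(abs(A).^2, 1);

% Reference result: the loop-based version from the question.
beta2_ref = arrayfun(@(n) f(:,n) \ f1(:,n), 1:numel(f)/size(f,1));

% Largest discrepancy between the two versions
% (expected to be at rounding-error level).
maxDiff = max(abs(beta2_vec - beta2_ref));
```

Because this version issues a handful of large array operations instead of one small solve per column, it may avoid most of the per-iteration overhead, but that should be confirmed by timing it on the actual data.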