Efficient Vectorization of For Loop
1 回表示 (過去 30 日間)
古いコメントを表示
Shreyas Bharadwaj
2024 年 4 月 15 日
コメント済み: Shreyas Bharadwaj
2024 年 4 月 16 日
Hi,
I have three matrices
and C and am trying to compute a fourth matrix Min the following way:
data:image/s3,"s3://crabby-images/5838d/5838d9365e2032baf2b272c4dbe3eb6790719238" alt=""
for p = 1:N
for q = 1:N
M(p,q) = 2 * sum(A(:,q) .* conj(B(:,p)) .* C(:,q));
end
end
All matrices are
. I am trying to compute this for N = 750 or so and the computation is extremely slow. I cannot find any obvious way to vectorize the code. Any help would be very much appreciated.
data:image/s3,"s3://crabby-images/d1dd1/d1dd19ab9e085582f0e2472268d7a1d883480f8c" alt=""
Thanks.
0 件のコメント
採用された回答
Bruno Luong
2024 年 4 月 15 日
編集済み: Bruno Luong
2024 年 4 月 15 日
Not tested but the sign reading tell me
M = 2*B' * (A.*C);
4 件のコメント
James Tursa
2024 年 4 月 15 日
編集済み: James Tursa
2024 年 4 月 15 日
I would guess that having 2*B' at the front will force MATLAB to physically compute the conjugate transpose of B first. However, if you segregate the 2* operation as 2 * (B' * (A.*C)), the B' would not need to be physically formed to do the conjugate transpose matrix multiply since this will be handled by flags passed into the BLAS routine. Maybe a bit faster? E.g.,
A = rand(5000); B = rand(5000); C = rand(5000);
timeit(@()2*B' * (A.*C))
timeit(@()2*(B' * (A.*C)))
その他の回答 (0 件)
参考
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!