I'm trying to optimize my code, and one place where it could really improve is replacing a for loop with a single vectorized (SIMD-style) operation.
In short, I have two variables: A (16 x 16 x 3 x 100) and B (16 x 16 x 3 x 7).
What I want is an elementwise multiplication that expands across both the 100 and the 7 dimensions, with the results concatenated into a variable C of size 16 x 16 x 3 x 700.
If I do A .* B(:, :, :, 1), it automatically expands along the 100 dimension, but then I need a 7-iteration for loop that steps through the 4th dimension of B and concatenates the results.
How do I do this without a for loop?
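
For reference, here is a minimal sketch of one loop-free way to get this result. It is not necessarily the answer posted in this thread. It assumes implicit expansion is available (MATLAB R2016b or later; on older releases `bsxfun(@times, A, B5)` should work in place of `A .* B5`). The random test data and the name `B5` are made up for illustration. The idea is to move the 7 slices of B into a 5th dimension so they no longer clash with the 100 slices of A, multiply, and then fold dimensions 4 and 5 together.

```matlab
% Example data with the sizes from the question (random values, illustration only)
A = rand(16, 16, 3, 100);
B = rand(16, 16, 3, 7);

% Move B's 4th dimension into the 5th: 16 x 16 x 3 x 1 x 7
B5 = reshape(B, size(B,1), size(B,2), size(B,3), 1, size(B,4));

% Implicit expansion along dims 4 and 5 gives 16 x 16 x 3 x 100 x 7
C = A .* B5;

% Merge dims 4 and 5 into one: 16 x 16 x 3 x 700
C = reshape(C, size(A,1), size(A,2), size(A,3), []);
```

Because MATLAB stores arrays in column-major order, the slices come out in the same order as concatenating `A .* B(:,:,:,k)` along dimension 4 for k = 1 to 7, that is, `C(:,:,:,(k-1)*100 + i)` equals `A(:,:,:,i) .* B(:,:,:,k)`.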


Matt J on 24 Jun 2022
Edited: Matt J on 24 Jun 2022
AlexRD on 25 Jun 2022
Thanks, that does it!


