Why is sum considerably slower that adding each individual element together, when using large loops?

10 ビュー (過去 30 日間)

Jack Palmer 2023 年 8 月 31 日

1
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/2015156-why-is-sum-considerably-slower-that-adding-each-individual-element-together-when-using-large-loops

編集済み: Bruno Luong 2023 年 9 月 1 日

I have some code that includes loops that run at least 1e7 times (often up to 1e9 times) and I am trying to improve its performance.

My slowest step, somewhat surprisingly, is indexing into a variable and summing those numbers. I need to do this index and sum every time within the loop as the variable changes within the loop too.

The code below is a simplification of the problem.

Method 1

a = 1:16;
idx = [1 6 11 16];
N = 1e7;
A0 = zeros(1,N);
tic
for k = 1:N
  A = sum(a(idx));
  a = a + 0.001;
  A0(k) = A;
end
toc
Elapsed time is 5.948268 seconds.

Method 2

If I instead replace the first line of the loop with a nested loop that does an iterative addition, then I get a 10x improvement in performance. However, I am not too keen on using another loop, as in my actual code, I have another couple of loops too, so my code will begin to get even messier.

a = 1:16;
B0 = zeros(1,N);
tic
for k = 1:N
  B = 0;
  for j = 1:numel(idx)
    B = B + a(idx(j));
  end
  a = a + 0.001;
  B0(k) = B;
end
toc
Elapsed time is 0.659663 seconds.

Method 3

I get a smaller further improvement if I spell out the individual components of a instead and remove the loop.

a = 1:16;
C0 = zeros(1,N);
tic
for k = 1:N
  C = a(1)+a(6)+a(11)+a(16);
  a = a + 0.001;
  C0(k) = C;
end
toc
Elapsed time is 0.456529 seconds.

However, I need to keep the code as general as impossible - the index idx is dependent on the initial input, so this 3rd option is not possible.

My question is, is there a faster way than method 1, but that is also cleaner / more concise that method 2?

Curiously, the differences in time between Methods 1 and Methods 2 and 3 get progressively worse as N is increased.

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

採用された回答

Matt J 2023 年 8 月 31 日

1
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/2015156-why-is-sum-considerably-slower-that-adding-each-individual-element-together-when-using-large-loops#answer_1297726

編集済み: Matt J 2023 年 8 月 31 日

MATLAB Online で開く

Yes, unfortunately, vector indexing is quite an expensive operation. If idx is constant, as in your example, than you should do as below. If idx is not constant, then the way to optimize the code will depend on how idx varies.

a = 1:16;
idx = [1 6 11 16];
N = 1e7;
A0 = zeros(1,N);
b=a(idx);
tic
for k = 1:N
  A = sum(b);
  b = b + 0.001;
  A0(k) = A;
end
toc
Elapsed time is 0.532537 seconds.
a(idx)=b;

12 件のコメント
10 件の古いコメントを表示10 件の古いコメントを非表示

Jack Palmer 2023 年 8 月 31 日

MATLAB Online で開く

Thank you Matt!

I think that was a poor example on my part, sorry. I was trying to keep it simplified... Below is the loop I have which I am trying to speed up. I can't see a way of doing this without a loop at the moment.

I've used rand to generate initial variables of the same size as the ones in my full script, so rand will probably cause the signal to decay to 0 or inf before n = N, but in reality in my script this is not the case.

I'm using A to 'normalise' B which then alters A through some propagator C. D then converts the vector A into a single number that is the signal. Everything outside of the loop can be treated as a constant.

A = rand(16,1);
idx = [1 6 11 16];
B0 = rand(16,1)*0.35;
C = (rand(16,16)-0.5)*1e-1;
D = rand(1,16);
N = 1e7;
signal = zeros(N,1);
tic
for n = 1:N
  Aidx = A(idx)
  Asum = sum(Aidx);
  % Asum = A(1) + A(6) + A(11) + A(16); % Elapsed time = 3 seconds
  B = B0*Asum;
  A = B + C*(A-B);
  signal(n) = D*A;
end
toc

The above code timed out above 55 seconds, whereas replacing it with the commeted out line makes it take 3 seconds. If you can spot a way of removing the loop that would solve all my issues.

Thanks!

Matt J 2023 年 8 月 31 日

MATLAB Online で開く

signal1(n:end) = sign(signal1(n-1))*Inf;

Bruno Luong 2023 年 9 月 1 日

編集済み: Bruno Luong 2023 年 9 月 1 日

MATLAB Online で開く

In my previous code, some extreme floating rouding around realin and reamax can defeat the test.

This modified code looks more robust. It must be something related to denormalization numbers that I'm sure the correct to deal with https://www.mathworks.com/help/fixedpoint/ug/floating-point-numbers.html#f32213

I also do the matrix power differently for the sake of variation

A = rand(16,1);
idx = [1 6 11 16];
B0 = rand(16,1)*0.35;
C = (rand(16,16)-0.5)*1e-1;
D = rand(1,16);
N = 1e7;
A0 = A;
%Method 3
tic;
signal1 = zeros(N,1);
m=numel(A);
s=accumarray(idx(:),1,[m,1]);
Q=B0*s.';
Q=Q+C*(eye(m)-Q);
d = eig(Q,'vector');
d = abs(d);
check0 = all(d < 1);
checkinf = any(d > 1);
QnA = A0;
for n = 1:N
    QnA=Q*QnA;
    DQnA = D*QnA;
    signal1(n) = DQnA;
    if check0 && abs(DQnA) < realmin
        break
    end
    if checkinf && abs(DQnA) > realmax
         signal1(n+1:end) = sign(DQnA)*Inf;
         break
    end
end
toc
Elapsed time is 0.014308 seconds.

サインインしてコメントする。

その他の回答 (0 件)

サインインしてこの質問に回答する。

カテゴリ

AI and Statistics Statistics and Machine Learning Toolbox Regression Generalized Linear Models Regularization

Help Center および File Exchange で Regularization についてさらに検索

製品

MATLAB

リリース

R2022b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by

Why is sum considerably slower that adding each individual element together, when using large loops?

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

12 件のコメント
10 件の古いコメントを表示10 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

Why is sum considerably slower that adding each individual element together, when using large loops?

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

12 件のコメント 10 件の古いコメントを表示10 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

製品

リリース

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

12 件のコメント
10 件の古いコメントを表示10 件の古いコメントを非表示