How to speed up matrix update zeros by using previous value?

Question

Mantas Vaitonis 2018 年 11 月 22 日

0
リンク

この質問への直接リンク

https://jp.mathworks.com/matlabcentral/answers/431311-how-to-speed-up-matrix-update-zeros-by-using-previous-value

編集済み: Bruno Luong 2018 年 11 月 22 日

採用された回答: Jan

MATLAB Online で開く

Hello,

I am looking for the way to update my matrix zero values by using previous value. For example if c matrix is:

1 5 2 3 9 0
0 5 0 0 7 0
0 0 1 7 8 9
2 1 0 0 0 7

The result should look like this:

1 5 2 3 9 9
1 5 2 3 7 9
1 5 1 7 8 9
2 1 1 7 8 7

Now I use the following code:

c(c == 0) = NaN;
c = fillmissing(c,'previous');
c = fillmissing(c,'nearest',1);

However I use quite big matrixes like 10000000x150 and it takes very long to process, maybe somebody knows of a way to speed up this?

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

Answer 1

Jan 2018 年 11 月 22 日

1
リンク

この回答への直接リンク

https://jp.mathworks.com/matlabcentral/answers/431311-how-to-speed-up-matrix-update-zeros-by-using-previous-value#answer_348353

編集済み: Jan 2018 年 11 月 22 日

MATLAB Online で開く

Start with:

c = fillmissing(c, 'previous', 'EndValues', 'nearest');

Another idea is to process the matrix column-wise:

c = [0 1 5 2 3 9 0
     1 0 5 0 0 7 0
     0 0 0 1 7 8 9
     7 2 1 0 0 0 7];
for col = 1:size(c, 2)
    x = c(:, col);         % Get current column
    f = (x ~= 0);          % Find non-zero elements
    m = cumsum(f);         % Cumulative sum of logical indices
    ind1 = find(f, 1);     % Fill initial zeros:
    if ~isempty(ind1)
      m(1:ind1) = 1;
    end
    xf = x(f);             % Vector of non-zero elements
    c(:, col) = xf(m);     % Replace column
end

This avoids the useless replacing of 0 to NaN and the creation of the huge temporary matrices.

This code fails, if the column contains zeros only.

A C-mex file could avoid the creation of f, m and xf. Do you have a C-compiler?

6 件のコメント
4 件の古いコメントを表示4 件の古いコメントを非表示

Jan 2018 年 11 月 22 日

MATLAB Online で開く

@Bruno: Are you sure? After the first c(:, 1) = xf(m), this function has its own copy of c . Is there really a need to further deep copies?

I cannot run it currently, but it should be easy to test using:

format debug
c = [0 1 5 2 3 9 0
     1 0 5 0 0 7 0
     0 0 0 1 7 8 9
     7 2 1 0 0 0 7];
c
for col = 1:size(c, 2)
    x = c(:, col);         % Get current column
    f = (x ~= 0);          % Find non-zero elements
    m = cumsum(f);         % Cumulative sum of logical indices
    ind1 = find(f, 1);     % Fill initial zeros:
    if ~isempty(ind1)
      m(1:ind1) = 1;
    end
    xf = x(f);             % Vector of non-zero elements
    c(:, col) = xf(m);     % Replace column
    c
end

If this does really performs a deep copy of c in each iteration, this would be a waste of time. Then:

function R = FillMyGaps(c)
ncol = size(c, 2);
Result = cell(1, ncol)
for col = 1:ncol
    x = c(:, col);         % Get current column
    f = (x ~= 0);          % Find non-zero elements
    m = cumsum(f);           % Cumulative sum of logical indices
    ind1 = find(f, 1);     % Fill initial zeros:
    if ~isempty(ind1)
      m(1:ind1) = 1;
    end
    xf = x(f);             % Vector of non-zero elements
    Result{col} = xf(m);   % Replace column
end
R = cat(2, Result{:});
% Or maybe better: <https://www.mathworks.com/matlabcentral/fileexchange/28916-cell2vec>
% R = reshape(Cell2Vec(Result), [], ncol);

Jan 2018 年 11 月 22 日

Your interjections are always welcome, because they provide deeper insights very frequently.

I'm still disappointed, that xf=x(f); y=xf(m) cannot be abbreviated. Such a "cumulated indexing" should work in one step.

Nevertheless, in a C-mex, the creation of the vectors x, f, m and xf can be avoided. @Mantas Vaitonis: If this is time-critical, install a C-compiler and ask for the posting of the small function.

Bruno Luong 2018 年 11 月 22 日

編集済み: Bruno Luong 2018 年 11 月 22 日

MATLAB Online で開く

There are shorter code using INTERP1 but both are slighly slower than your code on my benchmark. Also both can work on multicolumns so one can tune up the chunk size depending on the amount of memory available.

for col = 1:size(c, 2)
    x = c(:, col);
    f = (x ~= 0); 
    x(~f) = interp1(find(f),x(f),find(~f),'previous','extrap');
    c(:, col) = x;  
end

Or

for col = 1:size(c, 2)
    x = c(:, col);
    f = (x ~= 0);
    c(:, col) =  interp1(find(f),x(f),1:length(f),'previous','extrap');
end

サインインしてコメントする。

How to speed up matrix update zeros by using previous value?

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

6 件のコメント
4 件の古いコメントを表示4 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

Community Treasure Hunt

How to speed up matrix update zeros by using previous value?

0 件のコメント -2 件の古いコメントを表示-2 件の古いコメントを非表示

採用された回答

6 件のコメント 4 件の古いコメントを表示4 件の古いコメントを非表示

その他の回答 (0 件)

参考

カテゴリ

タグ

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示-2 件の古いコメントを非表示

6 件のコメント
4 件の古いコメントを表示4 件の古いコメントを非表示