unique across multiple cells or arrays

Question

1 vote

Let's say I got a bunch of arrays in a cell array that are not the same size:

a = {[1 2 2], [2 3 6 3], [4 3 5 6 7]}

I want to remove all duplicates across the cells: both the values in a cell that also appear in the other cells and the duplicates within each cell.

Example output:

b = {[1 2], [3 6], [4 5 7]}

I would like to have this vectorized. I can simply loop but this is very slow:

others = [];
for i = 1:length(a)
    b{i} = setdiff(unique(a{i}), others);
    others = [others, b{i}];
end

I'm thinking of putting it all into a single array to call unique:

b = unique([a{:}])
% now b = [1 2 3 4 5 6 7]

But I don't think there's any way to put it back into its respective cells, since the order is lost and I don't know where the boundaries between cells are.
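A minimal sketch of how the boundaries could be kept, assuming `repelem` is available (R2015a or later): record, for every element of the concatenation, the index of the cell it came from. The variable names `lens`, `owner`, `first` and `src` are made up for this illustration.

```matlab
a = {[1 2 2], [2 3 6 3], [4 3 5 6 7]};

lens  = cellfun(@numel, a);           % number of elements per cell: [3 4 5]
owner = repelem(1:numel(a), lens);    % source cell of each concatenated element
v     = [a{:}];                       % concatenation of all cells

[u, first] = unique(v, 'first');      % unique values and index of first occurrence
src = owner(first);                   % cell in which each unique value first occurs
```

Here `src(k)` is the cell that `u(k)` would be assigned to, so the grouping information is not lost by concatenating.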

4 comments (2 older comments not shown)

per isakson on 8 Aug 2020

Thanks for sharing. I cannot think of anything simpler.

Matt J on 8 Aug 2020

The process you describe does not have a well defined output. For the example you gave,

a = {[1 2 2], [2 3 6 3], [4 3 5 6 7]}

this would also satisfy the requirements:

b = {[1], [2 3], [4 5 6 7]}
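To illustrate the point, here is a small check, under my own reading of the requirements (no value repeated anywhere in the output, every distinct input value kept, each output cell drawn from the matching input cell). The helper `isValid` is hypothetical, not something from the thread.

```matlab
a  = {[1 2 2], [2 3 6 3], [4 3 5 6 7]};
b1 = {[1 2], [3 6], [4 5 7]};     % output from the question
b2 = {1, [2 3], [4 5 6 7]};       % alternative output given above

% Hypothetical validity check for a candidate output b
isValid = @(b) numel([b{:}]) == numel(unique([b{:}])) ...
            && isequal(unique([b{:}]), unique([a{:}])) ...
            && all(cellfun(@(bi, ai) all(ismember(bi, ai)), b, a));

ok1 = isValid(b1);
ok2 = isValid(b2);
```

Both candidates pass, so an extra rule is needed to pick one. The loop in the question implicitly uses "keep each value in the first cell where it occurs", which selects `b1`.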

Answer 1 (accepted answer)

Stephen23 on 8 Aug 2020

Edited: Stephen23 on 8 Aug 2020

1 vote

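>> a = {[1,2,2],[2,3,6,3],[4,3,5,6,7]};
>> [u,x] = unique([a{:}],'first'); % u: unique values, x: index of first occurrence; use 'last' to assign each value to the last cell in which it occurs.
>> [~,y] = sort(x); % order the unique values by where they occur in the concatenation
>> n = cumsum([1,cellfun(@numel,a)]); % start position of each cell within the concatenation
>> z = sum(bsxfun(@ge,x(y),n),2); % index of the cell that each occurrence falls in
>> c = accumarray(z,u(y),[],@(m){m}); % group the values by cell index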
>> c{:}
ans =
     1
     2
ans =
     3
     6
ans =
     4
     5
     7
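For readers following along, the same steps with the intermediate values written out for the example input. This is a sketch rather than a change to the method: the only deviations are `x(:)` (to force a column regardless of MATLAB version) and the transposes, which are my additions so that each cell comes out as a row vector like the `b` in the question.

```matlab
a = {[1,2,2],[2,3,6,3],[4,3,5,6,7]};

[u,x]  = unique([a{:}],'first');      % u = 1:7, x = [1 2 5 8 10 6 12] (as a column)
[xs,y] = sort(x(:));                  % xs = [1 2 5 6 8 10 12], y = [1 2 3 6 4 5 7]
n = cumsum([1,cellfun(@numel,a)]);    % n = [1 4 8 13]: start position of each cell
z = sum(bsxfun(@ge,xs,n),2);          % z = [1 1 2 2 3 3 3]: owning cell per value

% Transposes added so that every cell holds a row vector
c = accumarray(z,u(y).',[],@(m){m.'});
```

With this input `c` is a 3-by-1 cell array holding `[1 2]`, `[3 6]` and `[4 5 7]`.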

3 comments (1 older comment not shown)

Stephen23 on 8 Aug 2020

Edited: Stephen23 on 8 Aug 2020

@the cyclist: well spotted!

The behavior you describe is caused by the presence/lack of indexing into that cell: if accumarray is not told to put anything in that cell, then that cell simply won't be created. That occurs for any number of trailing cells which only contain duplicates of prior numbers, not just the last cell.

It can be resolved quite easily by telling accumarray the required output array size:

c = accumarray(z,u(y),size(a),@(m){m});
%                     ^^^^^^^ output size
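A small sketch of the effect, using a made-up input whose last cell contains only duplicates. Note that the size is written here as `[numel(a),1]`, a column shape chosen for this illustration, rather than `size(a)`.

```matlab
a = {[1 2 2], [2 3], [1 3]};          % last cell holds only duplicates

[u,x]  = unique([a{:}],'first');
[xs,y] = sort(x(:));
n = cumsum([1,cellfun(@numel,a)]);
z = sum(bsxfun(@ge,xs,n),2);          % z = [1 1 2]: nothing is assigned to cell 3

cShort = accumarray(z,u(y).',[],@(m){m});              % size inferred from max(z)
cFull  = accumarray(z,u(y).',[numel(a),1],@(m){m});    % size given explicitly
```

`cShort` has two cells, while `cFull` has three, the last of which is empty.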

Alexander Winter on 8 Aug 2020

Edited: Alexander Winter on 8 Aug 2020

In my case the empty array is not desirable (I'm discarding them afterwards) so your initial solution is better for me. Thanks!
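One caveat worth sketching: without the size argument only trailing empty cells are dropped, because `accumarray` still sizes its output from the largest index. A cell in the middle that contains only duplicates therefore still shows up as an empty cell, so the discarding step remains necessary. The input below is made up for this illustration.

```matlab
a = {[1 2], [2 1], 3};                % middle cell holds only duplicates

[u,x]  = unique([a{:}],'first');
[xs,y] = sort(x(:));
n = cumsum([1,cellfun(@numel,a)]);
z = sum(bsxfun(@ge,xs,n),2);          % z = [1 1 3]: nothing is assigned to cell 2

cRaw = accumarray(z,u(y).',[],@(m){m});   % 3-by-1 cell, cRaw{2} is empty

c = cRaw;
c(cellfun(@isempty,c)) = [];          % discard the empty cells afterwards
```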
