Why doesn't concatLayer in Deep Learning Toolbox concatenate the 'T' dimension?

Question

John Smith on 13 Mar 2023

Direct link to this question:

https://jp.mathworks.com/matlabcentral/answers/1927735-why-doesn-t-concatlayer-in-deep-learning-toolbox-concatenate-the-t-dimension

Commented: Artem Lensky on 19 Aug 2023

Hello,

While implementing a Vision Transformer (ViT) in MATLAB, I found that concatLayer does not concatenate over the T dimension. This is needed to concatenate the class token with the patch tokens, since the natural representation is CBT, with C corresponding to features, B to batch, and T to the token within a batch (this is also the canonical representation in the attention function).
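For concreteness, the layout described above could look like the following sketch. The sizes and variable names are made up for illustration and are not part of the original question.

```matlab
% Illustrative sizes only (assumptions for this sketch).
numFeatures = 8;   % C: features per token
batchSize   = 2;   % B: observations in the batch
numPatches  = 4;   % T: patch tokens per observation

% Patch tokens and a single class token, both labeled "CBT".
patchTokens = dlarray(rand(numFeatures,batchSize,numPatches,"single"),"CBT");
classToken  = dlarray(rand(numFeatures,batchSize,1,"single"),"CBT");

% The desired operation: concatenate along dimension 3, which is T here.
tokens = cat(3,classToken,patchTokens);

% tokens keeps the "CBT" labels and has size
% [numFeatures batchSize numPatches+1].
```

The result has one more token along T than the patch sequence, which is the behavior the question is asking the layer to provide.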

It's possible to work around this by relabeling the data as, e.g., SCB, but then other problems pop up that also need workarounds.

Thanks


Answer 1

Ben on 14 Mar 2023

Direct link to this answer:

https://jp.mathworks.com/matlabcentral/answers/1927735-why-doesn-t-concatlayer-in-deep-learning-toolbox-concatenate-the-t-dimension#answer_1192820

You can create a layer that concatenates on the T dimension with functionLayer:

% Concatenate two inputs along dimension 3, which is T for CBT data.
sequenceCatLayer = functionLayer(@(x,y) cat(3,x,y));

This works in a dlnetwork to concatenate two CBT dlarray objects.
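As a rough sketch of how such a layer might be wired into a dlnetwork with two sequence inputs: the layer names ("cls", "patches", "catT") and all sizes below are assumptions chosen for illustration, not something taken from the original answer.

```matlab
% Sketch only: layer names and sizes are illustrative assumptions.
numFeatures = 8;   % C
batchSize   = 2;   % B
numPatches  = 4;   % T of the patch sequence

% First branch: the class token input feeds the first input of the
% function layer.
layers = [
    sequenceInputLayer(numFeatures,Name="cls")
    functionLayer(@(x,y) cat(3,x,y),Name="catT")];
lgraph = layerGraph(layers);

% Second branch: the patch tokens feed the second input ("in2").
lgraph = addLayers(lgraph,sequenceInputLayer(numFeatures,Name="patches"));
lgraph = connectLayers(lgraph,"patches","catT/in2");

net = dlnetwork(lgraph);

classToken  = dlarray(rand(numFeatures,batchSize,1,"single"),"CBT");
patchTokens = dlarray(rand(numFeatures,batchSize,numPatches,"single"),"CBT");

% Inputs are passed in the order listed in net.InputNames.
tokens = predict(net,classToken,patchTokens);
```

It is worth checking net.InputNames before calling predict, since the argument order decides which sequence ends up first along T.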

Since you're concatenating the class token, it might also be worth creating a custom layer that has the class token embedding as a Learnable property and performs the concatenation in its predict method.
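A minimal sketch of what such a custom layer might look like is below. The class name classTokenLayer, the property name ClassToken, and the initialization scale are all hypothetical choices for illustration. The sketch also assumes the layer receives sequence data laid out as C-by-B-by-T, and it would need to be saved as classTokenLayer.m on the MATLAB path.

```matlab
classdef classTokenLayer < nnet.layer.Layer
    % classTokenLayer  Hypothetical layer that prepends a learnable class
    % token to sequence data laid out as C-by-B-by-T.

    properties (Learnable)
        % Class token embedding, numFeatures-by-1.
        ClassToken
    end

    methods
        function layer = classTokenLayer(numFeatures,args)
            arguments
                numFeatures (1,1) double {mustBePositive,mustBeInteger}
                args.Name (1,1) string = "cls_token"
            end
            layer.Name = args.Name;
            layer.Description = "Prepend learnable class token along T";
            % Small random initialization (an arbitrary choice here).
            layer.ClassToken = 0.02*randn(numFeatures,1,"single");
        end

        function Z = predict(layer,X)
            % X is assumed to be C-by-B-by-T.
            batchSize = size(X,2);
            % Replicate the token across the batch: C-by-B (one time step).
            token = repmat(layer.ClassToken,[1 batchSize]);
            % Concatenate along dimension 3 (T), class token first.
            Z = cat(3,token,X);
        end
    end
end
```

Called directly, for example with layer = classTokenLayer(8) and an 8-by-2-by-4 input X, predict(layer,X) should return an 8-by-2-by-5 array whose first time step holds the class token for every observation. Compared with the functionLayer approach, the token lives inside the layer and is trained with the rest of the network, so it does not need its own input branch.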

3 comments

Catalytic on 23 Mar 2023

Edited: Catalytic on 23 Mar 2023

@John Smith - Since Ben's answer yielded a solution for you, you should hit the Accept this Answer button, and likewise with other answers you might not have accepted.

Artem Lensky on 19 Aug 2023

Are there any plans to make concatenationLayer support concatenation along the T dimension?
