Why does layerNormalizationLayer in Deep Learning Toolbox include the T dimension in the per-sample statistics?
Hello,
While implementing a ViT transformer in Matlab, I found that layerNormalizationLayer includes the T dimension in the statistics calculated for each sample in the batch. This is problematic when implementing a transformer, since tokens correspond to the T dimension and reference implementations compute the statistics separately for each token.
Thanks
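For reference, here is a minimal sketch of the difference being described, using a plain array of size [C B T] to stand in for a "CBT" sequence (the sizes and variable names are illustrative):

```matlab
% Illustrative sizes: C channels, B observations, T tokens (time steps).
C = 8; B = 4; T = 16;
X = randn(C,B,T);

% layerNormalizationLayer pools the statistics over C and T for each
% observation in the batch:
muPooled = mean(X,[1 3]);   % one mean per observation, size [1 B 1]

% A transformer reference implementation normalizes each token separately,
% pooling over C only:
muPerToken = mean(X,1);     % one mean per token, size [1 B T]
```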
Accepted Answer
Other Answers (1)
Matt J
13 Mar 2023
Perhaps you can fold your T dimension into the C dimension and use a groupNormalizationLayer instead, with the groups defined so that different T belong to different groups.
7 Comments
John Smith
13 Mar 2023
I don't see why that has to make it painful. Why couldn't you adopt a modular structure in your code like below? You could also make a reusable custom layer of your own, as we've discussed in earlier threads.
numTimes=2000;
GN=groupNormalizerTimeIndep(numTimes);
layers=[layer1,layer2,GN,layer3,layer4,GN,layer5,... ]
net = trainNetwork(sequences,layers);
function normalizerLayers=groupNormalizerTimeIndep(numTimes)
% Wrap groupNormalizationLayer between two reshapes: fold the T dimension
% into C on the way in, normalize with one group per time step, then
% restore the original layout on the way out.
pre=functionLayer(@reshapeForw);
nlayer=groupNormalizationLayer(numTimes);   % one group per time step
post=functionLayer(@(z)reshapeBack(z,numTimes));
normalizerLayers=[pre,nlayer,post];
end
function Xr=reshapeForw(X)
% Fold time into channels: [H W C T B] -> [H W C*T B].
[H,W,C,T,B]=size(X);
Xr=reshape(X,H,W,C*T,B);
end
function X=reshapeBack(Xr,T)
% Undo the fold: [H W C*T B] -> [H W C T B].
[H,W,~,B]=size(Xr);
X=reshape(Xr,H,W,[],T,B);
end
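A quick sanity check (sizes are illustrative) that the reshape puts all channels of one time step into one contiguous block of folded channels, so that groupNormalizationLayer with numTimes groups normalizes each time step independently:

```matlab
H = 1; W = 1; C = 3; T = 4; B = 2;
X  = reshape(1:H*W*C*T*B, H, W, C, T, B);
Xr = reshape(X, H, W, C*T, B);

% Channels (t-1)*C+1 .. t*C of the folded array are exactly the C channels
% of time step t in the original array:
t = 2;
isequal(Xr(1,1,(t-1)*C+(1:C),1), reshape(X(1,1,:,t,1),1,1,C))  % returns true
```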
John Smith
14 Mar 2023
As I wrote, it's doable, but a PITA.
Well, I don't think lamenting that will get you anywhere. If you think there is an alternative solution, you can wait for other posts, but if we both haven't found one, I doubt it's coming.
In addition, the number of layers grows by 2 for every normalization layer. For a 12-level transformer this adds a whopping 24 layers. The performance hit is not insignificant.
I don't see why it would be. The functionLayers don't have any learnable parameters.
Matt J
14 Mar 2023
Well, I don't think lamenting that will get you anywhere.
That said, I do agree it would be useful to have a more configurable normalization layer type, where you could explicitly specify which dimensions are to be included in the normalization.
John Smith
15 Mar 2023
Matt J
15 Mar 2023
That happens sometimes, but usually you have to submit a formal enhancement request.