Should I use a sequence input layer or an image input layer for a combined CNN/LSTM neural network?

68 ビュー (過去 30 日間)
Jade
Jade 2024 年 11 月 6 日 16:05
コメント済み: Jade 2024 年 11 月 7 日 21:43
I am attempting to use a CNN/LSTM to take in a series of frames from a video of two liquids mixing together to predict their viscosities.
My initial layout is shown in the attached image and I planned on seperating a cell array of frames into stacks of sequences to use as inputs.
I was told that this would not work and an alternative approach is to use 2D or 3D (not sure which) image input layers and then use time as a seperate input for the LSTM portion. I'm not sure I understand what this means or why my approach was said to be wrong.
Which, if any, approach is best? Also, if neither of them are, is there a better method?
  4 件のコメント
Matt J
Matt J 2024 年 11 月 7 日 20:08
OK, well it doesn't look like network analyzer is showing any errors. Is there something that's not working?
Jade
Jade 2024 年 11 月 7 日 21:43
Matt,
It seems to run with 1 video so far, and I'm in the process of scaling it up now. Training loss returned NaN at first, but adjusting the learning rate seems to have solved that issue.
Just wanted to make sure this was the correct approach to the problem before going too far in the wrong direction. There's a lot of different network structures and I'm still learning. Really appreciate the help!

サインインしてコメントする。

回答 (0 件)

カテゴリ

Help Center および File ExchangeImage Data Workflows についてさらに検索

製品


リリース

R2024b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by