Blocking pixel label data for semantic segmentation DL training

32 views (last 30 days)
Software Developer on 26 Jun 2024 at 22:52
Answered: Ashish Uthama on 3 Jul 2024 at 20:52
I'm trying to block images and their pixel labels for training a U-Net. I can use a blockedImageDatastore for the input images, but I don't know how to get the same blocking behavior from the pixelLabelDatastore that holds the expected labels. I could achieve it myself by splitting all the images beforehand and saving them to disk, but I'd rather not deal with the file cleanup or lose the ability to change the blocking dynamically. Does anyone know a way to achieve this?

Answers (2)

Malay Agarwal on 27 Jun 2024 at 9:21
Edited: Malay Agarwal on 27 Jun 2024 at 9:44
Please refer to the following link for an example of how to train a U-Net on multispectral images: https://www.mathworks.com/help/images/multispectral-semantic-segmentation-using-deep-learning.html
The example suggests using "blockedImage" to preprocess both your training samples and the labels. Specifically, you can refer to the following section of the example for sample code: https://www.mathworks.com/help/images/multispectral-semantic-segmentation-using-deep-learning.html#SemanticSegmentationOfMultispectralImagesExample-7.
In the code:
inputTileSize = [256 256];
bim = blockedImage(train_data(:,:,1:6),BlockSize=inputTileSize);
bLabels = blockedImage(labelsTrain,BlockSize=inputTileSize);
bmask = blockedImage(maskTrain,BlockSize=inputTileSize);
  • "bim" is the first 6 channels of the training image, blocked with a block size of "[256 256]".
  • "bLabels" holds the corresponding labels, blocked with the same block size.
  • "bmask" is a binary mask marking the valid segmentation region, created from the 7th channel of the image and blocked with the same block size.
The example then selects the block locations that sufficiently overlap the mask using the following code:
overlapPct = 0.185;
blockOffsets = round(inputTileSize.*overlapPct);
bls = selectBlockLocations(bLabels, ...
BlockSize=inputTileSize,BlockOffsets=blockOffsets, ...
Masks=bmask,InclusionThreshold=0.95);
After one-hot encoding the labels, it then creates two "blockedImageDatastore" objects, one for the image and one for the labels. It uses the "BlockLocationSet" name-value argument to select only those image blocks and label blocks that overlap the mask:
bimds = blockedImageDatastore(bim,BlockLocationSet=bls,PadMethod=0);
bimdsLabels = blockedImageDatastore(bLabels,BlockLocationSet=bls,PadMethod=0);
Finally, it combines the block images and the labels into a single datastore using the "combine" function. This combined datastore can be used to train the U-Net.
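Following the example's variable names, the final combination step looks roughly like this (the name "dsTrain" is illustrative):

```matlab
% Pair each image block with its corresponding label block.
% Both datastores were built from the same BlockLocationSet,
% so reads stay aligned block-for-block.
dsTrain = combine(bimds, bimdsLabels);
```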
Hope this helps!

Ashish Uthama on 3 Jul 2024 at 20:52
I haven't tried this, but instead of a pixelLabelDatastore, could you try using another blockedImageDatastore to read the label data, and then use a transform() call to convert the pixel data into label categories?
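A minimal sketch of that suggestion, untested; the variables "labelImage", "inputTileSize", "bls", and "bimds", and the class names and pixel IDs, are all placeholders you would replace with your own:

```matlab
% Hypothetical class names and the numeric label IDs stored in labelImage
classNames = ["background" "foreground"];
pixelIDs   = [0 1];

% Block the label image with the same block size and block locations
% as the input images, so the two datastores stay in sync
bLabels = blockedImage(labelImage, BlockSize=inputTileSize);
labelDS = blockedImageDatastore(bLabels, BlockLocationSet=bls);

% transform() converts each raw numeric block to categorical lazily,
% as blocks are read during training
labelDS = transform(labelDS, ...
    @(block) categorical(block, pixelIDs, classNames));

% Combine with the matching input-image datastore for training
dsTrain = combine(bimds, labelDS);
```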

Products

Release

R2024a
