R-CNN Deep Learning with 3D Data
1 回表示 (過去 30 日間)
古いコメントを表示
Is it possible to "Train Object Detector Using R-CNN Deep Learning" with 3D data?
3D data: [x,y,z] and not [x,y,channel]
In this case, how do we define de "bounding boxes"?
For the 2D case is something like [x y width height]. And for a set of 2D images (i.e., 3D data)?
Any help please??
Thank you very much in advance!!
Best regards
0 件のコメント
採用された回答
Shashank Gupta
2020 年 10 月 15 日
Hi Ana,
Generalising the 2d RCNN model to 3d is not so easy as it seems. the relevance that you intent to make from 2d and then trying to create a 3d architecture may not result in good performance, Although you can give a shot. So, there are few things you need to change. The input of 3d data should look something in the format [x,y,z,channel,batch_size] and the bounding boxes here will be cuboid, so the format will look somthing like [XMIN YMIN ZMIN WIDTH HEIGHT DEPTH]. You also need to change the layer to their respective 3d layers and write the custom training loop to train.
Hope this sounds good or atleast I provide you enough information to explore.
Cheers.
その他の回答 (0 件)
参考
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!