グラウンドトゥルースのイメージおよびビデオ

AI アシストによる自動化を使用してイメージやビデオに対話形式でラベルを付け、AI モデル用の学習データを作成し、大規模データセットに対するチーム共同でのラベル付けを管理する

Computer Vision Toolbox™ は、オブジェクト検出、セマンティックセグメンテーション、インスタンスセグメンテーション、テキスト認識、イメージやビデオの分類といったタスクに向けて AI モデルに学習させるために、イメージやビデオからグラウンドトゥルースデータを生成するための完全なワークフローを提供します。まずは、イメージラベラーアプリとビデオラベラーアプリを使って、さまざまなラベルタイプでデータを対話形式で注釈付けすることから始められます。これらには、四角形、多角形、ポリライン、シーンラベル、およびピクセルレベルのラベルが含まれます。イメージコレクションのラベル付けを開始するには、イメージラベラー入門を参照してください。ビデオまたはイメージシーケンスのラベル付けを開始するには、ビデオラベラー入門を参照してください。

イメージラベラーアプリとビデオラベラーアプリは、手動、AI アシスト、自動による注釈付けをサポートしており、Segment Anything モデル (SAM) や Grounding DINO などの組み込み AI モデルを使用してラベル付けを高速化できます。詳細については、Get Started with AI-Assisted and Automated Labelingを参照してください。また、独自のオートメーションアルゴリズムを統合することで、ラベル付けプロセスを特定のニーズに合わせて調整することも可能です。詳細については、Create Custom Automation Algorithm for Labelingを参照してください。

ラベル付けが完了したら、注釈付きデータをエクスポートし、後処理を行って AI モデル用の学習データセットを作成できます。ツールボックスは、ラベル付きデータの整理と管理のためのワークフローをサポートしており、分類、検出、セグメンテーションなどのタスクのための学習パイプラインとのシームレスな統合を可能にします。

共同プロジェクト向けに、イメージラベラーアプリにはチームベースのラベル付けを管理する機能が搭載されており、ラベル付けタスクの割り当て、注釈のレビュー、フィードバックの提供、複数のコントリビューター間での進捗状況の追跡などが可能です。これにより、ラベル付け作業の規模拡大が容易になり、大規模なデータセット全体で一貫性を維持することが可能になります。詳細については、Get Started with Team-Based Labelingを参照してください。

Montage with image on the left showing rectangle and projected cuboid bounding boxes, while the image on the right shows semantic pixel labels and polygon ROI labels.

主要なトピック

注目の例

Automatically Label Ground Truth Using Segment Anything Model

Produce pixel labels for semantic segmentation using the Segment Anything Model (SAM) in the イメージラベラー app. The SAM is an automatic segmentation technique that you can use to segment object regions to label with just a few clicks, or automatically segment the entire image and instantaneously create labels for selected regions. In this example, you interactively label pixels for semantic segmentation in two ways.

R2024b 以降
ライブスクリプトを開く

新規

Automatically Label Ground Truth Using Vision-Language Model

Automatically label ground truth images for object detection using the Grounding DINO vision-language model (VLM).

R2026a 以降
ライブスクリプトを開く

新規

Automate Ground Truth Polygon Labeling Using Grounded SAM Model

Combine Grounding DINO and the Segment Anything Model 2 (SAM 2) to automatically produce polygon labels using the Video Labeler app.

R2026a 以降
ライブスクリプトを開く

セマンティックセグメンテーションのためのグラウンドトゥルースのラベル付けの自動化

事前学習済みのセマンティックセグメンテーションアルゴリズムを使用して、イメージ内の空と道路をセグメント化する。

ライブスクリプトを開く

新規

Automate Ground Truth Labeling for Instance Segmentation

Create an automation algorithm to automatically label data for instance segmentation using a pretrained SOLOv2 network in the Video Labeler app.

R2026a 以降
ライブスクリプトを開く

Automate Ground Truth Labeling for Object Detection

Create an automation algorithm to automatically label data for object detection using a pretrained object detector.

ライブスクリプトを開く

Automate Ground Truth Labeling for OCR

Automate the labeling of text for OCR training and evaluation.

ライブスクリプトを開く

Automate Labeling of Objects in Video Using RAFT Optical Flow

Use a pretrained RAFT optical flow estimation network to propagate a predefined object mask from one frame to the next in a video sequence.

R2024b 以降
ライブスクリプトを開く

カスタム JSON ファイルおよび COCO JSON ファイルへのグラウンドトゥルースオブジェクトのエクスポート

グラウンドトゥルースオブジェクトをカスタムデータ形式の JavaScript Object Notation (JSON) ファイルと COCO データ形式の JSON ファイルにエクスポートする。

ライブスクリプトを開く

Convert Image Labeler Polygons to Labeled Blocked Image for Semantic Segmentation

Convert polygon labels stored in a groundTruth object into a labeled blocked image for semantic segmentation workflows.

ライブスクリプトを開く

グラウンド トゥルースのイメージおよびビデオ

主要なトピック

カテゴリ

注目の例

グラウンドトゥルースのイメージおよびビデオ