retrieveImages

イメージセットでの類似イメージの検索

構文

imageIDs = retrieveImages(queryImage,imageIndex)

[imageIDs,scores] = retrieveImages(queryImage,imageIndex)

[imageIDs,scores,imageWords] = retrieveImages(queryImage,imageIndex)

[imageIDs,___] = retrieveImages(queryImage,imageIndex,Name,Value)

説明

imageIDs = retrieveImages(queryImage,imageIndex) は、クエリイメージと視覚的に似ている imageIndex 内のイメージに対応するイメージ識別子 imageIDs を返します。imageIDs は、類似度が最も高いものから最も低いものへとランク付けされた順序で返されます。

例

[imageIDs,scores] = retrieveImages(queryImage,imageIndex) は、イメージ取得結果のランク付けに使用される類似スコアをオプションで返します。scores 出力には、0 ～ 1 の対応するスコアが含まれます。

例

[imageIDs,scores,imageWords] = retrieveImages(queryImage,imageIndex) は、類似イメージの検索に使用される queryImage のビジュアルワードをオプションで返します。

例

[imageIDs,___] = retrieveImages(queryImage,imageIndex,Name,Value) は、前述の構文のいずれかを使用し、Name,Value ペアの引数を 1 つ以上指定したオプションを追加で使用します。

例

すべて折りたたむ

クエリイメージを使用したイメージセットの検索

ライブスクリプトを開く

ブックカバーのイメージセットを作成します。

dataDir = fullfile(toolboxdir('vision'),'visiondata','bookCovers');
bookCovers = imageDatastore(dataDir);

データセットを表示します。

thumbnailGallery = [];
for i = 1:length(bookCovers.Files)
    I = readimage(bookCovers,i);
    thumbnail = imresize(I,[300 300]);
    thumbnailGallery = cat(4,thumbnailGallery,thumbnail);
end

figure
montage(thumbnailGallery);

Figure contains an axes object. The hidden axes object contains an object of type image.

イメージセットをインデックス付けします。この手順には数分かかる場合があります。

imageIndex = indexImages(bookCovers);

Creating an inverted image index using Bag-Of-Features.
-------------------------------------------------------

Creating Bag-Of-Features.
-------------------------

* Selecting feature point locations using the Detector method.
* Extracting SURF features from the selected feature point locations.
** detectSURFFeatures is used to detect key points for feature extraction.

* Extracting features from 58 images...done. Extracted 29216 features.

* Keeping 80 percent of the strongest features from each category.

* Balancing the number of features across all image categories to improve clustering.
** Image category 1 has the least number of strongest features: 23373.
** Using the strongest 23373 features from each of the other image categories.

* Creating a 20000 word visual vocabulary.
* Number of levels: 1
* Branching factor: 20000
* Number of clustering steps: 1

* [Step 1/1] Clustering vocabulary level 1.
* Number of features          : 23373
* Number of clusters          : 20000
* Initializing cluster centers...100.00%.
* Clustering...completed 5/100 iterations (~1.73 seconds/iteration)...converged in 5 iterations.

* Finished creating Bag-Of-Features


Encoding images using Bag-Of-Features.
--------------------------------------

* Encoding 58 images...done.
Finished creating the image index.

クエリイメージを選択および表示します。

queryDir = fullfile(dataDir,'queries',filesep);
queryImage = imread([queryDir 'query3.jpg']);

imageIDs = retrieveImages(queryImage,imageIndex);

クエリイメージとその最適一致を左右に並べて表示します。

bestMatch = imageIDs(1);
bestImage = imread(imageIndex.ImageLocation{bestMatch});

figure
imshowpair(queryImage,bestImage,'montage')

Figure contains an axes object. The hidden axes object contains an object of type image.

ROI を使用した特定のオブジェクトのイメージセットの検索

ライブスクリプトを開く

クエリイメージの関心領域 (ROI) を使用してオブジェクトのイメージセットを検索します。

検索する一連のイメージを定義します。

imageFiles = ...
  {'elephant.jpg', 'cameraman.tif', ...
  'peppers.png',  'saturn.png',...
  'pears.png',    'stapleRemover.jpg', ...
  'football.jpg', 'mandi.tif',...
  'kids.tif',     'liftingbody.png', ...
  'office_5.jpg', 'gantrycrane.png',...
  'moon.tif',     'circuit.tif', ...
  'tape.png',     'coins.png'};

imds = imageDatastore(imageFiles);

検索インデックスを作成します。

 imageIndex = indexImages(imds);

Creating an inverted image index using Bag-Of-Features.
-------------------------------------------------------

Creating Bag-Of-Features.
-------------------------

* Selecting feature point locations using the Detector method.
* Extracting SURF features from the selected feature point locations.
** detectSURFFeatures is used to detect key points for feature extraction.

* Extracting features from 16 images...done. Extracted 3680 features.

* Keeping 80 percent of the strongest features from each category.

* Balancing the number of features across all image categories to improve clustering.
** Image category 1 has the least number of strongest features: 2944.
** Using the strongest 2944 features from each of the other image categories.

* Creating a 2944 word visual vocabulary.
* Number of levels: 1
* Branching factor: 2944
* Number of clustering steps: 1

* [Step 1/1] Clustering vocabulary level 1.
* Number of features          : 2944
* Number of clusters          : 2944
* Initializing cluster centers...100.00%.
* Clustering...completed 1/100 iterations (~0.05 seconds/iteration)...converged in 1 iterations.

* Finished creating Bag-Of-Features


Encoding images using Bag-Of-Features.
--------------------------------------

* Encoding 16 images...done.
Finished creating the image index.

クエリイメージと ROI を指定します。この ROI は検索対象のオブジェクトであるゾウの外郭を表示します。

queryImage = imread('clutteredDesk.jpg');
queryROI = [130 175 330 365];

figure
imshow(queryImage)
rectangle('Position',queryROI,'EdgeColor','yellow')

Figure contains an axes object. The hidden axes object contains 2 objects of type image, rectangle.

また、関数 imrect を使用して対話形式で ROI を選択することもできます。たとえば、queryROI = getPosition(imrect) です。

オブジェクトが含まれているイメージを検索します。

imageIDs = retrieveImages(queryImage,imageIndex,'ROI',queryROI)

imageIDs = 12×1 uint32 column vector

    1
   11
    6
   12
    2
    3
    8
    5
   14
   13
   10
   16
      ⋮

最適一致を表示します。

bestMatch = imageIDs(1);

figure
imshow(imageIndex.ImageLocation{bestMatch})

Figure contains an axes object. The hidden axes object contains an object of type image.

関数 `estimateGeometricTransform2D` を使用した幾何学的な検証

ライブスクリプトを開く

ビジュアルワードの位置を使用して最適な検索結果を検証します。幾何学的な情報に基づいて検索結果のランクを付け直すには、上位 N 件の検索結果に対してこの手順を繰り返します。

イメージの位置を指定します。

dataDir = fullfile(toolboxdir('vision'),'visiondata','bookCovers');
bookCovers = imageDatastore(dataDir);

イメージセットをインデックス付けします。この処理には数分かかることがあります。

imageIndex = indexImages(bookCovers);

Creating an inverted image index using Bag-Of-Features.
-------------------------------------------------------

Creating Bag-Of-Features.
-------------------------

* Selecting feature point locations using the Detector method.
* Extracting SURF features from the selected feature point locations.
** detectSURFFeatures is used to detect key points for feature extraction.

* Extracting features from 58 images...done. Extracted 29216 features.

* Keeping 80 percent of the strongest features from each category.

* Balancing the number of features across all image categories to improve clustering.
** Image category 1 has the least number of strongest features: 23373.
** Using the strongest 23373 features from each of the other image categories.

* Creating a 20000 word visual vocabulary.
* Number of levels: 1
* Branching factor: 20000
* Number of clustering steps: 1

* [Step 1/1] Clustering vocabulary level 1.
* Number of features          : 23373
* Number of clusters          : 20000
* Initializing cluster centers...100.00%.
* Clustering...completed 5/100 iterations (~1.90 seconds/iteration)...converged in 5 iterations.

* Finished creating Bag-Of-Features


Encoding images using Bag-Of-Features.
--------------------------------------

* Encoding 58 images...done.
Finished creating the image index.

クエリイメージを選択および表示します。

queryDir = fullfile(dataDir,'queries',filesep);
queryImage = imread([queryDir 'query3.jpg']);

figure
imshow(queryImage)

Figure contains an axes object. The hidden axes object contains an object of type image.

最適一致を取得します。queryWords 出力にはクエリイメージのビジュアルワードの位置情報が含まれます。検索結果を検証するには、この情報を使用します。

[imageIDs, ~, queryWords] = retrieveImages(queryImage,imageIndex);

イメージインデックスからビジュアルワードを抽出し、クエリイメージの最適一致を検索します。イメージインデックスにはインデックス内のすべてのイメージのビジュアルワード情報が含まれます。

bestMatch = imageIDs(1);
bestImage = imread(imageIndex.ImageLocation{bestMatch});
bestMatchWords = imageIndex.ImageWords(bestMatch);

ビジュアルワードの割り当てに基づいて、一連の仮の一致を生成します。クエリ内の各ビジュアルワードは、ビジュアルワードの割り当てに使用されるハード量子化によって、複数の一致をもつ可能性があります。

queryWordsIndex     = queryWords.WordIndex;
bestMatchWordIndex  = bestMatchWords.WordIndex;

tentativeMatches = [];
for i = 1:numel(queryWords.WordIndex)
    
    idx = find(queryWordsIndex(i) == bestMatchWordIndex);
    
    matches = [repmat(i, numel(idx), 1) idx];
    
    tentativeMatches = [tentativeMatches; matches];
    
end

仮の一致に対する点の位置を表示します。不適切な一致が数多くあります。

points1 = queryWords.Location(tentativeMatches(:,1),:);
points2 = bestMatchWords.Location(tentativeMatches(:,2),:);

figure
showMatchedFeatures(queryImage,bestImage,points1,points2,'montage')

Figure contains an axes object. The hidden axes object contains 4 objects of type image, line. One or more of the lines displays its values using only markers

関数 estimateGeometricTransform2D を使用して不適切なビジュアルワードの割り当てを削除します。有効な幾何学的変換に適合する割り当ては保持します。

[tform,inlierIdx] = ...
    estimateGeometricTransform2D(points1,points2,'affine',...
        'MaxNumTrials',2000);
inlierPoints1 = points1(inlierIdx, :);
inlierPoints2 = points2(inlierIdx, :);

インライアのパーセント比で検索結果のランクを付け直します。これは、幾何学的な検証手順が上位 N 件の検索結果に適用される際に行います。インライアのパーセント比が高いこれらのイメージは、関連している可能性が高くなります。

percentageOfInliers = size(inlierPoints1,1)./size(points1,1);

figure
showMatchedFeatures(queryImage,bestImage,inlierPoints1,...
    inlierPoints2,'montage')

Figure contains an axes object. The hidden axes object contains 4 objects of type image, line. One or more of the lines displays its values using only markers

推定された変換を適用します。

outputView = imref2d(size(bestImage));
Ir = imwarp(queryImage, tform, 'OutputView', outputView);

figure
imshowpair(Ir,bestImage,'montage')

Figure contains an axes object. The hidden axes object contains an object of type image.

イメージ検索の検索パラメーターの変更

ライブスクリプトを開く

関数 evaluateImageRetrieval を使用して適切な検索パラメーターを選択できます。

イメージセットを作成します。

setDir  = fullfile(toolboxdir('vision'),'visiondata','imageSets','cups');
imds = imageDatastore(setDir, 'IncludeSubfolders', true, 'LabelSource', 'foldernames');

イメージセットをインデックス付けします。

 imageIndex = indexImages(imds,'Verbose',false);

イメージ検索パラメーターを調整します。

imageIndex.MatchThreshold = 0.2;
imageIndex.WordFrequencyRange = [0 1]

imageIndex = 
  invertedImageIndex with properties:

         ImageLocation: {6×1 cell}
            ImageWords: [6×1 vision.internal.visualWords]
         WordFrequency: [0.1667 0.1667 0.1667 0.3333 0.1667 0.1667 0.1667 0.5000 0.3333 0.1667 0.3333 0.1667 0.1667 0.1667 0.1667 0.1667 0.1667 0.1667 0.1667 0.3333 0.1667 0.1667 0.1667 0.1667 0.1667 0.1667 0.1667 0.1667 0.1667 0.1667 … ] (1×1366 double)
         BagOfFeatures: [1×1 bagOfFeatures]
               ImageID: [1 2 3 4 5 6]
        MatchThreshold: 0.2000
    WordFrequencyRange: [0 1]

queryImage = readimage(imds, 1);
indices = retrieveImages(queryImage,imageIndex);

入力引数

すべて折りたたむ

`queryImage` — 入力クエリイメージ
M×N×3 のトゥルーカラーイメージ | M 行 N 列の 2 次元グレースケールイメージ

入力クエリイメージ。M x N x 3 のトゥルーカラーイメージ、または M 行 N 列の 2 次元グレースケールイメージのいずれかに指定します。

`imageIndex` — イメージ検索インデックス
`invertedImageIndex` オブジェクト

イメージ検索インデックス。invertedImageIndex オブジェクトとして指定します。関数 indexImages は、イメージの検索に使用されるデータを格納する invertedImageIndex オブジェクトを作成します。

名前と値の引数

すべて折りたたむ

オプションの引数のペアを Name1=Value1,...,NameN=ValueN として指定します。ここで、Name は引数名で、Value は対応する値です。名前と値の引数は他の引数の後に指定しなければなりませんが、ペアの順序は重要ではありません。

R2021a より前では、コンマを使用して名前と値をそれぞれ区切り、Name を引用符で囲みます。

例: 'NumResults' の 25 は 'NumResults' プロパティを 25 に設定する

`NumResults` — 結果の最大数
`20` (既定値) | 数値

返される結果の最大数。NumResults と数値で構成されるコンマ区切りのペアとして指定します。できる限り多くの一致イメージが返されるようにするにはこの値を Inf に設定します。

`ROI` — クエリイメージ探索領域
`[1 1 size(queryImage,2) size(queryImage,1)]` (既定値) | [x y width height] ベクトル

クエリイメージ探索領域。'ROI' と [x y width height] ベクトルで構成されるコンマ区切りのペアとして指定します。

`Metric` — 類似度メトリクス
`'cosine'` (既定値) | `'L1'`

イメージ検索結果のランク付けに使用する類似度メトリクス。'cosine' または 'L1' [3] として指定します。

出力引数

すべて折りたたむ

`imageIDs` — 取得されたイメージのランク付けされたインデックス
M 行 1 列のベクトル

取得されたイメージのランク付けされたインデックス。M 行 1 列のベクトルとして返されます。イメージ ID が、最も類似する一致イメージから最も類似しない一致イメージまで、ランク付けされた順序で返されます。

`scores` — 類似度メトリクス
N 行 1 列のベクトル

類似度メトリクス。N 行 1 列のベクトルとして返されます。この出力には、imageIDs 出力で取得したイメージに一致するスコアが含まれます。スコアは、Metric プロパティを使用して計算され、範囲は 0 ～ 1 です。

`imageWords` — ビジュアルワード割り当てを格納するオブジェクト
`visualWords` オブジェクト

ビジュアルワード割り当てを格納するオブジェクト。visualWords オブジェクトとして返されます。このオブジェクトには、queryImage のビジュアルワード割り当てと、そのイメージ内における位置が格納されます。

参照

[1] Sivic, J. and A. Zisserman. Video Google: A text retrieval approach to object matching in videos. ICCV (2003) pg 1470-1477.

[2] Philbin, J., O. Chum, M. Isard, J. Sivic, and A. Zisserman. Object retrieval with large vocabularies and fast spatial matching. CVPR (2007).

[3] Gálvez-López, Dorian, and Juan D. Tardos. Bags of binary words for fast place recognition in image sequences. IEEE Transactions on Robotics 28.5 (2012): 1188-1197.

拡張機能

すべて展開する

C/C++ コード生成
MATLAB® Coder™ を使用して C および C++ コードを生成します。

バージョン履歴

R2015a で導入

参考

evaluateImageRetrieval | imageDatastore | imageSet | bagOfFeatures | invertedImageIndex

retrieveImages

構文

説明

例

クエリ イメージを使用したイメージ セットの検索

ROI を使用した特定のオブジェクトのイメージ セットの検索

関数 estimateGeometricTransform2D を使用した幾何学的な検証

イメージ検索の検索パラメーターの変更

入力引数

queryImage — 入力クエリ イメージ M×N×3 のトゥルーカラー イメージ | M 行 N 列の 2 次元グレースケール イメージ

imageIndex — イメージ検索インデックス invertedImageIndex オブジェクト

名前と値の引数

NumResults — 結果の最大数 20 (既定値) | 数値

ROI — クエリ イメージ探索領域 [1 1 size(queryImage,2) size(queryImage,1)] (既定値) | [x y width height] ベクトル

Metric — 類似度メトリクス 'cosine' (既定値) | 'L1'

出力引数

imageIDs — 取得されたイメージのランク付けされたインデックス M 行 1 列のベクトル

scores — 類似度メトリクス N 行 1 列のベクトル

imageWords — ビジュアル ワード割り当てを格納するオブジェクト visualWords オブジェクト

参照

拡張機能

C/C++ コード生成 MATLAB® Coder™ を使用して C および C++ コードを生成します。

バージョン履歴

参考

トピック

クエリイメージを使用したイメージセットの検索

ROI を使用した特定のオブジェクトのイメージセットの検索

関数 `estimateGeometricTransform2D` を使用した幾何学的な検証

`queryImage` — 入力クエリイメージ
M×N×3 のトゥルーカラーイメージ | M 行 N 列の 2 次元グレースケールイメージ

`imageIndex` — イメージ検索インデックス
`invertedImageIndex` オブジェクト

`NumResults` — 結果の最大数
`20` (既定値) | 数値

`ROI` — クエリイメージ探索領域
`[1 1 size(queryImage,2) size(queryImage,1)]` (既定値) | [x y width height] ベクトル

`Metric` — 類似度メトリクス
`'cosine'` (既定値) | `'L1'`

`imageIDs` — 取得されたイメージのランク付けされたインデックス
M 行 1 列のベクトル

`scores` — 類似度メトリクス
N 行 1 列のベクトル

`imageWords` — ビジュアルワード割り当てを格納するオブジェクト
`visualWords` オブジェクト

C/C++ コード生成
MATLAB® Coder™ を使用して C および C++ コードを生成します。