Matching based on the first word

1 回表示 (過去 30 日間)
Danielle Leblance
Danielle Leblance 2017 年 11 月 10 日
編集済み: Jan 2017 年 11 月 10 日
Hi,
I have two cell arrays A and B where each contains companies names: for example A contains "Biotech Capital Corp" while B contains the same company but listed differently "BIOTECH CAP CORP". Can I match A and B based on the first word in the text? I mean can i write a code that tells that "Biotech Capital Corp" matches "BIOTECH CAP CORP" based on teh first word in the two strings "biotech" ?

採用された回答

Jan
Jan 2017 年 11 月 10 日
編集済み: Jan 2017 年 11 月 10 日
A_list = {'Biotech Capital Corp', 'Apple something', 'tesla something else'};
B_list = {'Volvo Car AB', 'BIOTECH CAP CORP', 'TESLA by Elon Musk'};
A1 = strtok(A_list, ' ');
B1 = strtok(B_list, ' ');
[LiA, LocB] = ismember(lower(A1), lower(B1)); % Or the other way around
[AInd, BInd] = CStrAinBP(strtok(A_list), strtok(B_list), 'i')

その他の回答 (2 件)

per isakson
per isakson 2017 年 11 月 10 日
編集済み: per isakson 2017 年 11 月 10 日
Yes, try this
match = cssm()
match =
0 1 0
0 0 0
0 0 1
A_list items on the rows and B_list on the columns, i.e.
  • the first item in A_list matches the second item in B_list.
  • the fourth item in A_list matches the fourth item in B_list.
where
function match = cssm()
A_list = {'Biotech Capital Corp', 'Apple something', 'tesla something else'};
B_list = {'Volvo Car AB', 'BIOTECH CAP CORP', 'TESLA by Elon Musk'};
match = false( length( A_list ), length( B_list ) );
for jj = 1 : length( A_list )
a1 = regexp( A_list{jj}, '\<\w+\>', 'once', 'match' );
xpr = sprintf( '\\<%s\\>', a1 );
cac = regexpi( B_list, xpr );
%
match( jj, : ) = cellfun( @(pos) not(isempty(pos))&&pos==1, cac );
end
end

KSSV
KSSV 2017 年 11 月 10 日
Read about strcmpi. It compares strings irrespective of what case they are in.
  1 件のコメント
Danielle Leblance
Danielle Leblance 2017 年 11 月 10 日
I don't think it will work since i am not comparing two complete strings. Rather I would like to compare the first word of two strings.

サインインしてコメントする。

カテゴリ

Help Center および File ExchangeGenomics and Next Generation Sequencing についてさらに検索

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by