Matching two texts
古いコメントを表示
Hi,
I have two texts . is it possible to extract the text where the two arrays match.For example: A='First Boston Corp Lehman Brothers ' B='Lehman Brothers Merill Lynch'; How can I get the match "Lehman Brothers"
3 件のコメント
Walter Roberson
2011 年 6 月 28 日
Should the matching be pure consecutive substring? For example, if B='Lehman Brother''s Merill Lynch' then should the algorithm notice the longer match available if you omit the '''' character?
C='Sam Brown eats a boiled egg on Fridays'
D='Sam Brown eats boiled eggs on Fridays'
should the algorithm match as
'Sam Brown eats boiled egg on Fridays'
as being a match if you remove the 'a ' in one and the 's' in the other?
Longest substring only? If several are of the longest size, first of those?
joseph Frank
2011 年 6 月 28 日
joseph Frank
2011 年 6 月 28 日
採用された回答
その他の回答 (2 件)
Matt Fig
2011 年 6 月 29 日
A = 'First Boston Corp Lehman Brothers ';
B = 'Lehman Brothers Merill Lynch';
Am = regexp(A,'\s','split');
Am = Am(ismember(Am,regexp(B,'\s','split')))
1 件のコメント
Walter Roberson
2011 年 6 月 29 日
That finds words in common, not substrings in common. For example if B='Brothers Merill Lehman Lynch' then that algorithm would output {'Lehman' 'Brothers'} even though 'Brothers ' is the longest common substring.
Longest substring could potentially be 'Lehman Brother' if one of the strings had 'Lehman Brothers' and the other had 'Lehman Brotherhood'. It is not completely clear from Joseph's description whether only "words" are to be matched or whether parts of words are okay as well.
Walter Roberson
2011 年 6 月 29 日
0 投票
This is the "longest common substring problem"; see http://en.wikipedia.org/wiki/Longest_common_substring_problem (which looks a bit biased in that it only presents one algorithm)
カテゴリ
ヘルプ センター および File Exchange で Grid Lines, Tick Values, and Labels についてさらに検索
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!