Why do I get 0*1 cells?
3 ビュー (過去 30 日間)
古いコメントを表示
I am trying to web scraping using this tutorial : https://medium.com/@roymilaniitd/web-scraping-to-extract-news-using-matlab-dd78b954684 , But when I test the following code:
html = webread('https://www.indiatoday.in/top-stories');
list = extractBetween(html,'<h3 class=”” title=','</a></h3><p>');
list2=extractAfter(list,'<a href="');
list3 = extractAfter(list2,'">');
I get the contents in a html char variable, but three 0*1 cells list1, list2, list3!
Why this happens?
0 件のコメント
回答 (2 件)
Jan
2019 年 1 月 28 日
編集済み: Jan
2019 年 1 月 28 日
You are searching for:
'<h3 class=”” title='
% ^^
I'm sure, you mean:
'<h3 class="" title='
with standard double quotes ".
The author of this page seems to use a tool like MS Word to create webpages and let the automatic replacement insert smart quotes. This is a very bad idea when posting code in the internet.
0 件のコメント
Milan Roy
2019 年 1 月 29 日
Yes, just use the standard " " instead of the formatted double quote. It should work fine.
0 件のコメント
参考
カテゴリ
Help Center および File Exchange で MATLAB Compiler についてさらに検索
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!