URL READ and URL WRITE ERROR

Priya on 7 Jun 2013

Direct link to this question

https://jp.mathworks.com/matlabcentral/answers/78279-url-read-and-url-write-error

I want to copy the contents of a web page into a file. With urlread or urlwrite I get the HTML code for the page; instead, I want the text content of the page stored in a text file or string array.

5 comments

Priya on 9 Jun 2013

Yes, the proxy error got resolved.

However, I don't want HTML code as my output; I want the contents of the web page saved in a text file.

Hence URLWRITE or URLREAD is not working in this case.

Walter Roberson on 9 Jun 2013

urlread() and urlwrite() are serving their intended purpose: fetching the page as-is. Processing the page afterwards is the responsibility of your code.

Do you blame your automobile for the fact that when you go grocery shopping, the automobile does not bring the groceries into the kitchen and take them out of the grocery bags?


Answers (1)

Walter Roberson on 7 Jun 2013

Direct link to this answer

https://jp.mathworks.com/matlabcentral/answers/78279-url-read-and-url-write-error#answer_88003

You will have to parse the text. The page you referenced before does not present the text in any simple way. Each individual letter of the text is surrounded by <font> tags that select the color for that letter.
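As a minimal sketch of what such parsing might look like, the snippet below uses regexprep() to strip the tags from a made-up fragment in which each letter sits inside its own <font> element. The sample string and variable names are assumptions for illustration, not the markup of the page in question, and a regular expression like this is not a full HTML parser (it ignores comments, scripts, and character entities).

```matlab
% Hypothetical fragment: every letter wrapped in its own <font> tag.
html = ['<font color="#ff0000">H</font><font color="#00ff00">i</font>' ...
        '<br><font color="#0000ff">!</font>'];

% Turn <br> into a newline first so line breaks survive the stripping.
text = regexprep(html, '<br\s*/?>', sprintf('\n'), 'ignorecase');

% Remove every remaining tag: '<', any run of non-'>' characters, then '>'.
text = regexprep(text, '<[^>]*>', '');

disp(text)
```

The same two calls could be applied to the string returned by urlread(), with the result written out using fopen(), fprintf(), and fclose().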

3 comments

Walter Roberson on 7 Jun 2013

Any of the standard techniques, including:

fread() a character at a time and have a bunch of ad-hoc code to figure out what to do with it
fread() a character at a time and use it to trigger a transition in a carefully constructed state machine
fgets() or fgetl() a line at a time and use basic string manipulation techniques such as find() or strfind() or ismember() or switch/case
fileread() or textscan() or fread() the entire file and use the basic techniques on the file that is now completely in memory
Use regexp() or regexprep() to process the file that is completely in memory
make a call to perl() with a perl script to do the work, perhaps having loaded in an HTML stripping routine from CPAN.org
On Linux or OS X machines, shell out to ed or sed or nawk to do the work
write a C program to do the work
write a lex grammar to do the tokenizing. Write a yacc routine to express the BNF and take appropriate actions
follow the approach in http://blogs.mathworks.com/pick/2010/08/20/get-html-table-data-into-matlab/
find some Java library that does for HTML roughly what is done for XML in http://www.mathworks.com/matlabcentral/answers/44811

and so on.
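To make one of these concrete, here is a rough sketch of the state-machine idea, applied to a string already in memory rather than to fread() output. The function name stripTags is hypothetical, and the machine has only two states (inside a tag, outside a tag), so it does not handle comments, <script> or <style> bodies, or character entities such as &amp;.

```matlab
function text = stripTags(html)
% stripTags  Remove HTML tags with a two-state machine (sketch only).
inTag = false;                  % state: are we between '<' and '>'?
text = blanks(numel(html));     % preallocate the output buffer
n = 0;                          % number of characters kept so far
for k = 1:numel(html)
    c = html(k);
    if inTag
        if c == '>'
            inTag = false;      % tag ended, back to the text state
        end
    elseif c == '<'
        inTag = true;           % tag started, stop copying characters
    else
        n = n + 1;
        text(n) = c;            % ordinary text character: keep it
    end
end
text = text(1:n);               % trim the unused part of the buffer
end
```

Saved as stripTags.m, this could be called on the output of urlread() or fileread(), and the returned text written to a file with fopen(), fprintf(), and fclose().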

Priya on 9 Jun 2013

Thanks a lot.

I am trying shell scripting.
