HTML file scraping for Fields in a Table

v k

2020 6 月 15

1 回答

14 ビュー (30 日間)

0 投票

clientData.txt

The HTML file that I am working on, is a long one and contains particulars as given in the attached text file. Although the structure is simple and repetitive, due to the large number of characters in between the data fields, I am having hard time in scraping the required data. The objective is to get a two-column excel spreadsheet containing Name in the first column and Email in the second column. How to obtain these required fields in the xlsx file ? Thanks.

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

サインインしてコメントする。

サインインしてこの質問に回答する。

Follow Question

回答 (1 件)

Sean de Wolski 2020 年 6 月 15 日

MATLAB Online で開く

0 投票

Start playing with htmlTree in the Text analytics toolbox.

t = htmlTree(fileread('clientdata.txt'))
t.findElement('TD').extractHTMLText

1 件のコメント
-1 件の古いコメントを表示 -1 件の古いコメントを非表示

v k 2020 年 6 月 16 日

How to extract the fields "Name " and "Email " after this ?

サインインしてコメントする。

サインインしてこの質問に回答する。

カテゴリ

ヘルプセンターおよび File Exchange で Text Files についてさらに検索

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by

HTML file scraping for Fields in a Table

0 件のコメント -2 件の古いコメントを表示 -2 件の古いコメントを非表示

回答 (1 件)

1 件のコメント -1 件の古いコメントを表示 -1 件の古いコメントを非表示

カテゴリ

タグ

参考

Community Treasure Hunt

0 件のコメント
-2 件の古いコメントを表示 -2 件の古いコメントを非表示

1 件のコメント
-1 件の古いコメントを表示 -1 件の古いコメントを非表示