HTML file scraping for Fields in a Table

v k on 15 Jun 2020
Commented: v k on 16 Jun 2020
The HTML file I am working on is long and contains entries like those in the attached text file. Although the structure is simple and repetitive, the large amount of markup between the data fields makes it hard to scrape the required data. The goal is a two-column Excel spreadsheet with Name in the first column and Email in the second. How can I extract these fields and write them to an .xlsx file? Thanks.

Answers (1)

Sean de Wolski on 15 Jun 2020
Start by experimenting with htmlTree in Text Analytics Toolbox.
t = htmlTree(fileread('clientdata.txt'))
t.findElement('TD').extractHTMLText
1 Comment
v k on 16 Jun 2020
How can I extract the "Name" and "Email" fields after this?
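
One possible way to continue from the answer above, as a rough sketch only. The attached file is not shown in the thread, so the layout here is an assumption: each value is taken to sit in the table cell immediately after a cell whose text is exactly "Name" or "Email". The inline sample HTML, the lowercase "td" selector, and the output file name clients.xlsx are all hypothetical placeholders; with the real file you would read it with fileread('clientdata.txt') as in the answer and adjust the label matching to whatever the cells actually contain.

```matlab
% Hypothetical sample standing in for the attached file; the real layout may differ.
html = [ ...
    "<table>" ...
    "<tr><td>Name </td><td>Alice Example</td></tr>" ...
    "<tr><td>Email </td><td>alice@example.com</td></tr>" ...
    "<tr><td>Name </td><td>Bob Example</td></tr>" ...
    "<tr><td>Email </td><td>bob@example.com</td></tr>" ...
    "</table>"];

% Parse the HTML and pull the text out of every table cell.
tree  = htmlTree(join(html, ""));
cells = extractHTMLText(findElement(tree, "td"));
cells = strtrim(cells(:));            % column vector, surrounding whitespace removed

% Assumption: the value is the cell right after its label cell.
names  = cells(find(cells == "Name")  + 1);
emails = cells(find(cells == "Email") + 1);

% Build the two-column table and write it to a spreadsheet.
T = table(names, emails, 'VariableNames', {'Name', 'Email'});
writetable(T, 'clients.xlsx');        % hypothetical output file name
```

If the labels and values share a single cell (for example "Name: Alice Example"), the cell-offset step would not apply, and splitting each cell's text with something like regexp or extractAfter would probably be a better fit.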
