How do I scrape a website using MATLAB?
Mostrar comentarios más antiguos
I want to scrape the text data from a table on a website. I have used urlfilter but this only scrapes numbers.
Respuestas (1)
Image Analyst
el 18 de Jul. de 2014
Editada: Image Analyst
el 18 de Jul. de 2014
1 voto
Use urlread() to get everything.
4 comentarios
azizullah khan
el 18 de Jul. de 2014
Image Analyst
el 18 de Jul. de 2014
You mean urlread gets only numbers for you? For me it gets everything, just as if you'd right clicked on the page and said "View Source". I know it's HTML and it will look like a mess and you'll have to parse it out, but what do you expect?
azizullah khan
el 19 de Jul. de 2014
Image Analyst
el 20 de Jul. de 2014
I don't have anything beyond urlread() for that. There is not going to be any built-in code that know exactly how your 200 web pages are laid out and how to extract the table from all the rest of the HTML code that is retrieved. There are billions of web pages out there and they just don't a page parser for all of them, or even yours. You're going to have to do that yourself.
Categorías
Más información sobre Web Services en Centro de ayuda y File Exchange.
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!