Extracting the text from pdf file

13 visualizaciones (últimos 30 días)
Gopalakrishnan venkatesan
Gopalakrishnan venkatesan el 9 de Jul. de 2015
Editada: Stephen23 el 9 de Jul. de 2015
Is it possible to extract the text from pdf file using matlab script?
I need to parse through the pdf and extract the particular text in the pdf.
Is there any way to do it?

Respuesta aceptada

Stephen23
Stephen23 el 9 de Jul. de 2015
Editada: Stephen23 el 9 de Jul. de 2015
"Is there any way to do it?"
Of course, in principal any data with a known specification can be parsed by MATLAB.
Is there an easy way of reading a PDF into MATLAB?
Not really, because PDF's are not sequentially organized text, although they might look like that when they are displayed or printed. This is also a topic that has been covered before on this forum, and a simple search will bring up these very informative discussions on the topic:

Más respuestas (0)

Categorías

Más información sobre Text Data Preparation en Help Center y File Exchange.

Etiquetas

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by