PDF to Latex

Suvadip Paul

Aug 15, 2011, 3:31:59 AM
to LaTeX Users Group
Is there an easy way to convert a PDF document to LaTeX? I can copy the
sentences from the PDF and paste them into LaTeX, but the problem is with
the mathematical expressions. Is there an easy way to do this job?

Jaspreet Sarao

Aug 15, 2011, 7:52:01 AM
to latexus...@googlegroups.com
Use 'pdftotext filename.pdf'; it writes the extracted text to filename.txt.
I hope it will be helpful for you.

--
Jaspreet sarao
email: jaspri...@gmail.com
Blog: jaspreetsarao.wordpress.com
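
For anyone who wants to script the suggestion above, here is a minimal sketch that calls pdftotext from Python. It assumes Poppler's pdftotext is installed and on the PATH; the function names build_pdftotext_cmd and pdf_to_text are made up for this illustration and are not part of any library. It only recovers plain text, so mathematical expressions will still come out as loose characters.

```python
import shutil
import subprocess
from pathlib import Path


def build_pdftotext_cmd(pdf_path, txt_path=None, keep_layout=False):
    """Build the argument list for pdftotext (hypothetical helper)."""
    cmd = ["pdftotext"]
    if keep_layout:
        # -layout asks pdftotext to keep the physical layout of the page
        cmd.append("-layout")
    cmd.append(str(pdf_path))
    if txt_path is not None:
        cmd.append(str(txt_path))
    return cmd


def pdf_to_text(pdf_path, keep_layout=False):
    """Run pdftotext and return the extracted text (hypothetical helper)."""
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext was not found on the PATH")
    txt_path = Path(pdf_path).with_suffix(".txt")
    cmd = build_pdftotext_cmd(pdf_path, txt_path, keep_layout)
    subprocess.run(cmd, check=True)
    return txt_path.read_text(encoding="utf-8", errors="replace")
```

Calling pdf_to_text("filename.pdf") would write filename.txt next to the PDF and return its contents as a string.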

Peter Flynn

Aug 15, 2011, 9:40:46 AM
to latexus...@googlegroups.com
On Mon, Aug 15, 2011 at 8:31 AM, Suvadip Paul <mr.su...@gmail.com> wrote:
> Is there an easy way to convert a PDF document to LaTeX?

No. A PDF contains only the font characters and their positions on the page. It has no information about why they are there or what they are there for. That information is in the source document (LaTeX, Word, etc.). If you don't have the source document, all you can get is the characters.
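
To illustrate the point, here is a toy sketch of a formula stored as positioned characters. The Glyph class, the coordinates, and naive_text are invented for this example and do not reflect the real PDF data model; they only show that once the structure is gone, "c squared" reads the same as "c" followed by "2".

```python
from dataclasses import dataclass


@dataclass
class Glyph:
    char: str    # the character drawn
    x: float     # horizontal position on the page (made-up units)
    y: float     # vertical position on the page
    size: float  # font size


# "E = mc^2" as positioned characters: the exponent is simply a
# smaller "2" drawn slightly higher; nothing marks it as a superscript.
glyphs = [
    Glyph("E", 0.0, 100.0, 10.0),
    Glyph("=", 8.0, 100.0, 10.0),
    Glyph("m", 16.0, 100.0, 10.0),
    Glyph("c", 24.0, 100.0, 10.0),
    Glyph("2", 29.0, 104.0, 7.0),
]


def naive_text(glyphs):
    """Join characters left to right, ignoring height and size."""
    ordered = sorted(glyphs, key=lambda g: g.x)
    return "".join(g.char for g in ordered)


print(naive_text(glyphs))  # prints "E=mc2"
```

Recovering `E = mc^2` from this would mean guessing from the raised, smaller "2" that it is an exponent, and that kind of guess is exactly the information the PDF does not carry.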

> I can copy the
> sentences from the PDF and paste them into LaTeX, but the problem is with
> the mathematical expressions. Is there an easy way to do this job?

I believe there are some very expensive programs that will try to extract the information and use the fonts and positioning to guess what the original meant (e.g., large and bold might be a heading), but with maths I don't think it is possible.
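
As a rough idea of what such guessing could look like, here is a small sketch of the "large and bold might be a heading" heuristic. Everything in it is an assumption made for illustration: the Span class, the 1.2 threshold, and the choice of \section are not taken from any real product, and a real tool would need far more than this.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class Span:
    text: str
    font_size: float
    bold: bool


def guess_role(span, body_size, factor=1.2):
    """Guess "heading" for bold text clearly larger than the body text."""
    if span.bold and span.font_size >= factor * body_size:
        return "heading"
    return "body"


def to_latex(spans):
    """Turn spans into LaTeX lines, using the median size as body size."""
    body_size = median(s.font_size for s in spans)
    lines = []
    for s in spans:
        if guess_role(s, body_size) == "heading":
            lines.append(r"\section{" + s.text + "}")
        else:
            lines.append(s.text)
    return "\n".join(lines)
```

A heuristic like this can only guess at coarse structure; it does nothing for mathematical expressions.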

///Peter

Gildas Cotomale

Aug 15, 2011, 9:56:00 AM
to latexus...@googlegroups.com
>> Is there an easy way to convert a PDF document to LaTeX? I can copy the
>> sentences from the PDF and paste them into LaTeX, but the problem is with
>> the mathematical expressions. Is there an easy way to do this job?
> Use 'pdftotext filename.pdf'

That way he won't have to copy and paste the text himself: this
program only extracts the textual content (as far as I can remember).
There are also tools to extract images and font information (so the
PDF can be converted to RTF, or maybe TeX or DVI), but as there is no
descriptive/semantic tagging you can't convert back to LaTeX, XML, or
the like. In short, you may get a mathematical expression either as an
image or as text (I mean characters with their associated visual
glyphs), but nothing suitable for LaTeX or even for mathematical
software like Mathematica or Maple.