Convert a PDF File to Text in Linux
March 22, 2007 4:19 PM
-----------------------------------------------------------------------------------------------------------
A PDF file can be converted to text with pdftotext, a tool from the Xpdf package:
pdftotext irgendein.pdf
This command produces a text file named irgendein.txt containing the text of irgendein.pdf. You can then work on this file like any other text file. The -layout parameter preserves the appearance of the PDF file in the text file as far as possible:
pdftotext -layout irgendein.pdf
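If you would rather not create a .txt file at all, pdftotext accepts an output name as a second argument, and the name - sends the text to standard output. The following is only a small sketch of that idea; the function name pdf_layout_to_stdout is made up for illustration and is not part of the Xpdf package.

```shell
# Hypothetical helper (name is an assumption, not part of Xpdf):
# print the text of a PDF to standard output, keeping the layout.
# pdftotext writes to stdout when the output file name is "-".
pdf_layout_to_stdout() {
    pdftotext -layout "$1" -
}

# Example usage, with the placeholder file name from this article:
#   pdf_layout_to_stdout irgendein.pdf | less
```

This can be handy for piping the text into tools such as grep or less without leaving a file behind.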
If you want to extract only certain pages from a PDF file, use the parameters -f (first page) and -l (last page):
pdftotext -f 3 -l 7 irgendein.pdf
In this example, only pages 3 through 7 are written to irgendein.txt.
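The options can also be combined, and the output file can be named explicitly as a second file argument. As a rough sketch, a small shell function could wrap this; the name pdf_pages_to_text and its argument order are assumptions chosen for this example, not anything provided by pdftotext itself.

```shell
# Hypothetical helper (name and argument order are assumptions):
# extract pages FIRST..LAST of a PDF into a named text file, keeping the layout.
# Usage: pdf_pages_to_text FIRST LAST input.pdf output.txt
pdf_pages_to_text() {
    if [ "$#" -ne 4 ]; then
        echo "usage: pdf_pages_to_text FIRST LAST input.pdf output.txt" >&2
        return 1
    fi
    # -f / -l select the page range; the last argument is the output file.
    pdftotext -layout -f "$1" -l "$2" "$3" "$4"
}

# Example usage, with the placeholder file name from this article:
#   pdf_pages_to_text 3 7 irgendein.pdf auszug.txt
```

Naming the output file avoids overwriting an irgendein.txt created by an earlier full conversion.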