>I need a tool that allows me to select the text from a PDF in batch mode.
>For example: extract the text from the file xxxx on page i.
Ghostscript's ps2ascii utility can do this, although the output is a little rough:
gs -q -dFirstPage=i -dLastPage=i -dNODISPLAY -dBATCH -dSAFER -dNOBIND \
   -dWRITESYSTEMDICT -dSIMPLE -c save pop -f ps2ascii.ps \
   xxxx.pdf > xxxx.txt
Ghostscript runs on all platforms and is freely available.
There is a better text extraction utility built on Ghostscript called pstotext. Look on the Web, or in the Ghostscript distribution, for more details.
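For batch use, the Ghostscript command above could be wrapped in a small shell function. This is only a sketch: the function name pdf_page_text is hypothetical, and it assumes that gs is on the PATH, that the installed Ghostscript still ships ps2ascii.ps where gs can find it, and that the extracted text arrives on standard output.

```shell
#!/bin/sh
# Sketch: extract the text of a single page of a PDF via Ghostscript's ps2ascii.ps.
# pdf_page_text is a hypothetical helper name, not part of Ghostscript.
# Assumes gs is on PATH and can locate ps2ascii.ps in its library directory.
pdf_page_text() {
    # $1 = input PDF, $2 = page number (1-based), $3 = output text file
    if [ "$#" -ne 3 ]; then
        echo "usage: pdf_page_text file.pdf page out.txt" >&2
        return 2
    fi
    # Reject empty, non-numeric, or zero page numbers before calling gs.
    case "$2" in
        ''|*[!0-9]*|0)
            echo "page must be a positive integer" >&2
            return 2
            ;;
    esac
    # Same switches as the command in the message; the text goes to stdout,
    # which is redirected into the output file.
    gs -q -dFirstPage="$2" -dLastPage="$2" -dNODISPLAY -dBATCH -dSAFER -dNOBIND \
       -dWRITESYSTEMDICT -dSIMPLE -c save pop -f ps2ascii.ps \
       "$1" > "$3"
}
```

For example, `pdf_page_text xxxx.pdf 5 xxxx.txt` would write the text of page 5 to xxxx.txt, and a shell loop over page numbers would cover several pages.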
--
L. Peter Deutsch | Aladdin Enterprises
ghost@aladdin.com | http://www.aladdin.com | 203 Santa Margarita Ave.
+1-650-322-0103 (9-12 M-F) | fax +1-650-322-1734 | Menlo Park, CA 94025
The future of software is at http://www.opensource.org