Dov Isaacs Re: OT Help with searchable text in PDF
Feb 11, 2002; 11:33
Dov Isaacs
Re: OT Help with searchable text in PDF
Nini,
You've gotten pieces of answers from several other members of these two lists, but just to summarize from our perspective at Adobe:
(1) PDF is NOT an ASCII or Unicode text file even if portions of the file may look that way. Any edits that add or delete characters can readily corrupt a PDF file.
(2) Text both within PostScript and with PDF cannot be guaranteed to be in ASCII-searchable form, even if you judiciously chose joboptions for PDF except in the most degenerate cases. Text line justification, line spacing, word spacing, pair kerning, tracking, etc. all contribute to production of PostScript and subsequently PDF in which text runs as viewable and searchable in an ASCII text editor will be disjoint.
- Dov
At 2/11/2002 06:51 AM, Nini TjŠder wrote: >Hi all > >I need help generating a .pdf where the text in the .pdf is searchable either in BBEdit or in NotePad on the PC. I am on a Mac and have Acrobat 5.0.5. >Have gotten files from one of our customers. >Original is in QuarkXpress 4.11. >I generate the postscript file with Adobe PS 8.7.3. and distill from that with Distiller 5.0.5 with Press optimized compatible with Acrobat 4 and pdf 1.3. All fonts are embedded. I have tried it both as ASCII and Binary - same problem. i even tried PDf 1.2 for Acrobat 3 with no luck. >Problem is neither the postscriptfile nor the .pdf file is searchable for textstrings in the pdf (have opened both postscirptfile and pdf in BBEdit which is one of few textprograms that can read that large files). >The searchable textstring is later going to be exchanged by another textstring in another application before printing the pdf. >I know this should be possible. >I don«t remember having this problem in Acrobat 4.05a (which I since long no longer have installed). > >Do I have to save it in an earlier version of pdf to get searchability???? >Would be grateful for any hints about how to proceed. Thanks beforehand. > > >(have sent most of the above to the Blueworld Acrobat Talklist as well, but that list is soooo quiet I doubt I will get any replies from there within a reasonable timespan so if anybody of you know, I would be eternally grateful). >-- >nini ;-)
----------------------------------------------------------------------- To Unsubscribe: <mailto:acrobat-off@blueworld.com> Archives : <http://listsearch.blueworld.com/acrobattalksearch.lasso>
Search
Lasso Programming
This site manages and broadcasts several email lists pertaining to Lasso Programming and technologies related and used by Lasso developers. Sign up today!