http://qs321.pair.com?node_id=425751


in reply to Search within a PDF file

Have you tried grep? If I remember correctly, strings are stored as plain text within the PDF format ...

Being right, does not endow the right to be rude; politeness costs nothing.
Being unknowing, is not the same as being stupid.
Expressing a contrary opinion, whether to the individual or the group, is more often a sign of deeper thought than of cantankerous belligerence.
Do not mistake your goals as the only goals; your opinion as the only opinion; your confidence as correctness. Saying you know better is not the same as explaining you know better.

Re^2: Search within a PDF file
by shenme (Priest) on Jan 28, 2005 at 03:29 UTC
    Much of what I see in PDF files is enclosed in 'stream' blocks, which appear to be compressed, so grep won't do it. When I am forced to do this myself, I sure hope one of the above-mentioned modules (or some other) will take care of pulling out the text I need to look at. (Oh, I'm not looking forward to this!)
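
To illustrate why a plain grep tends to miss text inside 'stream' blocks, here is a rough Python sketch. It assumes the streams are compressed with zlib/Flate (the common `/FlateDecode` filter); real PDFs may use other filters, encryption, or font-specific text encodings, so treat this as a demonstration of the problem rather than a text extractor. The function names and the tiny hand-built PDF fragment are made up for the example.

```python
import re
import zlib

# Matches the raw bytes between "stream" and "endstream".
STREAM_RE = re.compile(rb"stream\r?\n(.*?)endstream", re.DOTALL)


def inflate_streams(pdf_bytes):
    """Yield the inflated contents of every stream zlib can decode."""
    for match in STREAM_RE.finditer(pdf_bytes):
        try:
            # decompressobj tolerates the end-of-line bytes that trail the data.
            yield zlib.decompressobj().decompress(match.group(1))
        except zlib.error:
            # Not Flate-compressed (or damaged): skip it in this sketch.
            continue


def naive_search(pdf_bytes, needle):
    """What grep effectively does: look for the raw bytes in the file."""
    return needle in pdf_bytes


def stream_search(pdf_bytes, needle):
    """Look for the bytes inside the inflated streams instead."""
    return any(needle in data for data in inflate_streams(pdf_bytes))


def make_sample_fragment(text):
    """Build a hypothetical one-object PDF fragment with a Flate stream."""
    content = b"BT /F1 12 Tf 72 720 Td (" + text + b") Tj ET\n"
    packed = zlib.compress(content)
    header = b"4 0 obj\n<< /Length %d /Filter /FlateDecode >>\nstream\n" % len(packed)
    return header + packed + b"\nendstream\nendobj\n"


if __name__ == "__main__":
    sample = make_sample_fragment(b"Hello PDF")
    print("raw search finds it:   ", naive_search(sample, b"Hello PDF"))
    print("stream search finds it:", stream_search(sample, b"Hello PDF"))
```

Even when a stream inflates cleanly, the text operators inside it may split words or use encodings that do not match the search string, which is why a dedicated PDF module is usually the safer route.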