PDF::Parse is the one most recommended. PDF::Reuse is also good, but for a different purpose. If you are willing to wade through a little code, PDF::API2 is the swiss-army knife for Perl/PDF work. It does require some knowledge of the PDF format, though.
Being right, does not endow the right to be rude; politeness costs nothing. Being unknowing, is not the same as being stupid. Expressing a contrary opinion, whether to the individual or the group, is more often a sign of deeper thought than of cantankerous belligerence. Do not mistake your goals as the only goals; your opinion as the only opinion; your confidence as correctness. Saying you know better is not the same as explaining you know better.
I shouldn't have to say this, but any code, unless otherwise stated, is untested
| [reply] |
| [reply] |
Yeah, the monk in that thread asked the same question and got no answer.
| [reply] |
| [reply] |
Some PDF-files are just pictures, so you cannot extract any text from it, unless you consider using OCR-techniques. I guess it all depends on how the PDF-file was generated.
CountZero "If you have four groups working on a compiler, you'll get a 4-pass compiler." - Conway's Law
| [reply] |