PerlMonks
To put it bluntly, I need to extract data from a PDF file. More specifically, this two-page PDF contains a multi-row table with two or three columns (the number changes). Despite the oddly formatted table (you would have to see the document to understand what I mean, I guess), I believe I can parse it given the right module. The only one I see that may help is CAM::PDF. Do you know of anything more helpful for parsing PDF tables? Or should I convert it to a separate file format and go from there?

Update: Decided to just convert it to an HTML document (thanks, Popcorn Dave), but thanks to all who helped. I am still willing to listen to any further suggestions if you have them, though.

I'm so adjective, I verb nouns!
chomp; # nom nom nom

In reply to [Updated] Extracting information from a PDF file by Lawliet
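
For the CAM::PDF route mentioned in the question, here is a minimal sketch of one way the table rows might be recovered once the page text is in hand. It assumes the extracted text keeps at least two spaces (or a tab) between columns, which may or may not hold for this particular document; the file name and sample data are made up, and split_columns and parse_table are hypothetical helper names, not part of any module. Only core Perl is used so it runs as-is; the CAM::PDF calls appear in a comment.

```perl
use strict;
use warnings;

# Split one line of extracted text into cells, treating a run of two or
# more whitespace characters (or any tab) as a column separator.
# This is a heuristic about the text layout, not a PDF feature.
sub split_columns {
    my ($line) = @_;
    $line =~ s/^\s+|\s+$//g;    # trim both ends
    return () if $line eq '';
    return split /\s{2,}|\t+/, $line;
}

# Turn a block of page text into a list of rows (array refs of cells),
# skipping blank lines. Rows may have two or three cells.
sub parse_table {
    my ($text) = @_;
    my @rows;
    for my $line (split /\n/, $text) {
        my @cells = split_columns($line);
        push @rows, \@cells if @cells;
    }
    return @rows;
}

# Hypothetical sample standing in for text pulled from the PDF, e.g.:
#   use CAM::PDF;
#   my $pdf  = CAM::PDF->new('report.pdf');   # file name is an assumption
#   my $text = $pdf->getPageText(1);          # text of page 1
my $text = <<'END';
Name        Qty    Price
Widget A    3      4.50
Widget B    10
END

print join(' | ', @$_), "\n" for parse_table($text);
```

If the spacing between columns does not survive extraction, this heuristic will not work, and converting to HTML first (as in the update above) is probably the easier path.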