PerlMonks
Re: Need Help for Convert PDF to HTML
by LanX (Saint) on Feb 09, 2011 at 10:52 UTC [id://887167]
The answer depends heavily on the nature of your PDFs and on the result you want! There is no simple answer to such a general question, because a pure print format and a flowing format are different by nature: a PDF stores where things sit on the page, while HTML describes structure and lets the browser reflow it. Even simple cases need heuristics, and a general solution would need sophisticated artificial intelligence. This post lists some possibilities (especially pdftohtml -xml) and links to other related discussions: Parsing PDFs by text position?
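To give an idea of what "heuristics" means here, below is a minimal sketch (in Python, not a finished converter) that takes XML in the shape that pdftohtml -xml typically emits and glues text fragments back into lines by their position. The layout of the sample (`<page>` elements containing `<text>` elements with `top` and `left` attributes) is an assumption based on typical output, so check it against what your version actually writes. The function name `group_into_lines` and the `tolerance` parameter are made up for this illustration.

```python
import xml.etree.ElementTree as ET

def group_into_lines(xml_text, tolerance=2.0):
    """Return one list of text lines per page.

    Heuristic: fragments whose 'top' values differ by at most
    `tolerance` from the first fragment of a line are treated as
    belonging to that line, then ordered left to right.
    """
    root = ET.fromstring(xml_text)
    pages = []
    for page in root.iter("page"):
        fragments = []
        for node in page.iter("text"):
            # itertext() also collects text nested in <b>, <i>, <a> ...
            content = "".join(node.itertext()).strip()
            if content:
                fragments.append(
                    (float(node.get("top")), float(node.get("left")), content)
                )
        fragments.sort()  # top to bottom, then left to right

        lines = []  # each entry: (top of first fragment, [(left, text), ...])
        for top, left, content in fragments:
            if lines and abs(top - lines[-1][0]) <= tolerance:
                lines[-1][1].append((left, content))
            else:
                lines.append((top, [(left, content)]))

        pages.append(
            [" ".join(text for _, text in sorted(parts)) for _, parts in lines]
        )
    return pages

# Hand-written sample in the assumed format, not real tool output.
SAMPLE = """<pdf2xml>
<page number="1" position="absolute" top="0" left="0" height="1188" width="918">
<text top="100" left="300" width="80" height="16" font="0">World</text>
<text top="101" left="50" width="80" height="16" font="0"><b>Hello</b></text>
<text top="140" left="50" width="200" height="16" font="0">Second line</text>
</page>
</pdf2xml>"""

if __name__ == "__main__":
    for number, lines in enumerate(group_into_lines(SAMPLE), start=1):
        print("page", number, lines)
```

Even this toy case needs a tolerance value to decide what counts as "the same line", and it knows nothing about columns, tables, headers or hyphenation. That is exactly why the result depends so much on your particular PDFs.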
Cheers Rolf