good chemistry is complicated, and a little bit messy -LW |
|
PerlMonks |
Re^3: PDF alternative to mudrow to get XML structureby jcb (Parson) |
on Mar 07, 2020 at 00:02 UTC ( [id://11113923]=note: print w/replies, xml ) | Need Help?? |
A PDF file does not have an XML structure. Our questioner is using a tool that produces XML output describing PDF structure and now needs to replace that tool. There is no standard translation from PDF to XML. There is no easy replacement for mudraw because the XML our questioner is using is a mudraw-specific format because there is no standard XML mapping for PDF. The best solution is to process the PDF directly.
In Section
Seekers of Perl Wisdom
|
|