Beefy Boxes and Bandwidth Generously Provided by pair Networks
laziness, impatience, and hubris
 
PerlMonks  

Re^3: PDF alternative to mudrow to get XML structure

by vr (Curate)
on Mar 06, 2020 at 12:54 UTC ( [id://11113907]=note: print w/replies, xml ) Need Help??

Help for this page

Select Code to Download


  1. or download this
    >mutool.exe draw -F stext document.pdf 2>nul
    <?xml version="1.0"?>
    ...
    </font>
    </line>
    ...
    
  2. or download this
    >gswin64c -q -sDEVICE=txtwrite -dTextFormat=1 -o - document.pdf
    <page>
    ...
    </line>
    </block>
    ...
    
  3. or download this
    use strict;
    use warnings;
    ...
    ( 390.30, 533.24) ( 390.30, 533.24) UNSTRAPPED
    ( 487.91, 533.24) ( 487.91, 533.24) (A
    ....
    
  4. or download this
    package MyTestRenderer;
     
    ...
    }
    
    1;
    
  5. or download this
    use strict;
    use warnings;
    ...
    (x = 690.0, y = 549.2) (w = 0.500, h = 10.0) o
    (x = 695.0, y = 549.2) (w = 0.278, h = 10.0) t
    ....
    

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://11113907]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others meditating upon the Monastery: (5)
As of 2024-04-24 12:40 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found