http://qs321.pair.com?node_id=1211220


in reply to Re^4: Build a PDF book index
in thread Build a PDF book index

I am curious to what are you referring to. Are you referring to the content of the XML file output-ed ? e.g.:

<text top="78" left="108" width="540" height="21" font="3">The develop +er, on the other hand, feels like he’s interrupted several times a da +y for</text>
Or are you talking about some cli option to give to the command? In this case, I don't see anything related (pdftohtml version 0.24.3).

Replies are listed 'Best First'.
Re^6: Build a PDF book index
by LanX (Saint) on Mar 18, 2018 at 23:56 UTC
    > Are you referring to the content of the XML file output-ed

    yes, this includes

    • fontnumber font="3" some fonts may need special translations
    • box geometry top="78" left="108" width="540" height="21" if you want to exclude special areas (footnotes, pagenumber, ...)

    Cheers Rolf
    (addicted to the Perl Programming Language and ☆☆☆☆ :)
    Wikisyntax for the Monastery