http://qs321.pair.com?node_id=705615


in reply to [Updated] Extracting information from a PDF file

There is a non-Perl way to do it, depending on what you're after and how many files you have. Adobe's website offers that as a free service, or at least they used to, so if you're having problems you might check that out as well.


Revolution. Today, 3 O'Clock. Meet behind the monkey bars.

I would love to change the world, but they won't give me the source code

  • Comment on Re: Extracting information from a PDF file

Replies are listed 'Best First'.
Re^2: Extracting information from a PDF file
by Your Mother (Archbishop) on Aug 20, 2008 at 22:30 UTC

    IIRC, Gmail will parse PDF attachments out for display as HTML too... I think I'm remembering right. It was a few months ago that I was playing with it and Adobe's service was either super slow or down. Obviously can't speak to the parse quality.

Re^2: Extracting information from a PDF file
by Lawliet (Curate) on Aug 20, 2008 at 20:35 UTC

    Just one file. I need to upload the data I extract to a database. I'll try and see what I can find on their website.

    Update: Do you mean they can extract information or convert the file? :\

    I'm so adjective, I verb nouns!

    chomp; # nom nom nom

      Been a while since I needed it, but as I remember you give them a link to your file and then they send you back the text via e-mail. Hopefully it will do what you want.


      Revolution. Today, 3 O'Clock. Meet behind the monkey bars.

      I would love to change the world, but they won't give me the source code

        Ah, thank you. I assume this is what you are referring to?

        I'm so adjective, I verb nouns!

        chomp; # nom nom nom