http://qs321.pair.com?node_id=645044


in reply to Re^2: PDF::Template and character encodings
in thread PDF::Template and character encodings

That might actually be a sign of progress. Is the PDF_findfont error still thrown? If it isn't, or if the error is now a different one, there might be something wrong with your input files.

Do you mind posting a condensed example showing the problem? It's far easier to help if there's something to see, rather than being forced to stab blindfolded into the fog (guessing wildly, that is).

--shmem

_($_=" "x(1<<5)."?\n".q·/)Oo.  G°\        /
                              /\_¯/(q    /
----------------------------  \__(m.====·.(_("always off the crowd"))."·
");sub _{s./.($e="'Itrs `mnsgdq Gdbj O`qkdq")=~y/"-y/#-z/;$e.e && print}

Re^4: PDF::Template and character encodings
by geektron (Curate) on Oct 16, 2007 at 14:02 UTC
    I've updated the original node with a trimmed-down XML template. Essentially, <VAR NAME='LAST_NAME'> is the problem; it contains the (misrendered) accented character.

    Changing the encoding to iso8859-1 *does* fix the PDF_findfont error, but it doesn't fix the problem with the accented characters.

      As per your OP, do you really get "~Aj", or is it perchance æ (which is a à á - a grave acute)?

      If that is the case, you are getting utf-8 from your database - run that data through Encode. Alternatively, try using iso10646-1 (or utf8 without the hyphen).

      Using those fonts might fail since it seems likely that the strings coming from the database don't have the internal UTF8 flag set.
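
A minimal sketch of how one might check whether the data really is UTF-8 arriving as raw bytes. The value in `$raw` is a made-up stand-in for what the database returns (an assumption, not taken from the original node): the two bytes 0xC3 0xA1, which are "á" in UTF-8 and show up as "Ã¡" when read as ISO-8859-1.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical stand-in for a value fetched from the database:
# the raw bytes 0xC3 0xA1, i.e. "á" encoded as UTF-8.
my $raw = "\xC3\xA1";

# As a byte string it is two characters long and the UTF8 flag is off.
printf "raw:     length=%d flagged=%s\n",
    length($raw), utf8::is_utf8($raw) ? 'yes' : 'no';

# Reading the same bytes as ISO-8859-1 yields two separate characters,
# U+00C3 and U+00A1, which is the misrendering seen in the output.
my $misread = decode('iso-8859-1', $raw);
print "misread: ", join(' ', map { sprintf 'U+%04X', ord } split //, $misread), "\n";

# Decoding as UTF-8 yields the single intended character, U+00E1.
my $text = decode('UTF-8', $raw);
printf "decoded: length=%d flagged=%s U+%04X\n",
    length($text), utf8::is_utf8($text) ? 'yes' : 'no', ord($text);
```

If the real data behaves like `$raw` here (length one too many per accented character, flag off), it is most likely undecoded UTF-8.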

      --shmem

        re OP and problematic character: the top half of that 'pipe' character looks more like a dot, which is why i thought it was a 'j'. (it's supposed to be an a-accent, not a-grave)

        setting the pdf_encoding='utf8' also blows up with the same "can't find encoding" error.

        I'm reading up on Encode, though I'm not sure if I need an encode/decode sequence or a simpler transform.
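
For what it's worth, here is a sketch of both options, assuming the database hands back UTF-8 bytes and the PDF encoding stays at iso8859-1. The helper `utf8_to_latin1` and the sample name are hypothetical, made up for illustration; only `decode`, `encode` and `from_to` come from Encode itself.

```perl
use strict;
use warnings;
use Encode qw(decode encode from_to);

# Hypothetical helper (not part of PDF::Template): turn UTF-8 bytes,
# as assumed to come from the database, into ISO-8859-1 bytes.
sub utf8_to_latin1 {
    my ($bytes) = @_;
    my $chars = decode('UTF-8', $bytes);    # bytes -> Perl characters
    return encode('iso-8859-1', $chars);    # characters -> Latin-1 bytes
}

# Made-up sample value: "Hernández" with the á stored as UTF-8 (C3 A1).
my $last_name = "Hern\xC3\xA1ndez";

# Option 1: explicit decode/encode sequence.
my $latin1 = utf8_to_latin1($last_name);

# Option 2: the "simpler transform". from_to() converts the bytes
# in place, so work on a copy to keep the original intact.
my $in_place = $last_name;
from_to($in_place, 'UTF-8', 'iso-8859-1');

printf "before: %d bytes, after: %d bytes, same result: %s\n",
    length($last_name), length($latin1), $latin1 eq $in_place ? 'yes' : 'no';
```

Either result could then be passed as the LAST_NAME value to the template. The decode/encode pair is the more flexible of the two, since the decoded character string can also be used elsewhere in the program before it is encoded for output.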