http://qs321.pair.com?node_id=794030


in reply to Re^2: ( PDF::EasyPDF ) encoding problem
in thread ( PDF::EasyPDF ) encoding problem

I just had a closer look at the module...  Part of the problem is that PDF::EasyPDF specifies the encoding for its 14 Adobe Base Fonts (all it supports) as "MacRomanEncoding" — which is kind of unfortunate, as this encoding is rather different from ISO-8859-1, which Perl defaults to in most cases (for example, "é" is 0x8E in Mac Roman, while it's 0xE9 in ISO-Latin-1 and CP1252).  In other words, even if you had successfully solved the UTF-8 to ISO-8859-1 conversion issue, it still wouldn't work...

But you can get it working with two small changes to EasyPDF.pm  (tested, i.e. works for me):

  1. Replace all 14 occurrences of "/MacRomanEncoding" with "/WinAnsiEncoding" (case is important). Windows ANSI encoding (CP1252) is roughly the equivalent of ISO-8859-1/ISO-8859-15  (one of the differences is that the Euro symbol is in a different position (0xA4 in ISO-8859-15, 0x80 in CP1252).

  2. Change this line

    open (EASYPDF,">$self->{file}") or die ...

    to read

    open (EASYPDF, ">:encoding(cp1252)", $self->{file}) or die ...

    (and don't forget to put use utf8; in your script)

Update: alternatively, you could leave PDF::EasyPDF's /MacRomanEncoding declarations in place, and have Perl convert to that encoding directly (">:encoding(MacRoman)"), which would work, too (except for the Euro symbol)