Re^3: Encoding horridness

by Corion (Patriarch)
on Jul 12, 2017 at 14:20 UTC ( [id://1194938]=note )


in reply to Re^2: Encoding horridness
in thread Encoding horridness

No, because high-bit characters (octets) in Latin-1 are encoded as different octets in UTF-8, and Perl doesn't know which encoding to use for high-bit characters when writing them unless you tell it.
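To make the difference concrete, here is a minimal sketch (not from the original post) that serializes one high-bit character both ways. The choice of U+00E1 and the variable names are just for illustration.

```perl
use strict;
use warnings;
use Encode qw(encode);

# One character, U+00E1 (LATIN SMALL LETTER A WITH ACUTE).
my $char = "\x{E1}";

# The same character serialized in two encodings.
my $latin1 = encode('ISO-8859-1', $char);   # a single octet
my $utf8   = encode('UTF-8',      $char);   # two octets

printf "Latin-1: %s\n", uc(unpack('H*', $latin1));   # Latin-1: E1
printf "UTF-8:   %s\n", uc(unpack('H*', $utf8));     # UTF-8:   C3A1
```

The octet E1 on its own is not valid UTF-8, which is why the two encodings cannot be treated as interchangeable on output.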

Replies are listed 'Best First'.
Re^4: Encoding horridness
by Anonymous Monk on Jul 12, 2017 at 14:26 UTC
    What I'm wondering, though, is if there's ever a situation where
    encode('utf8', decode('Latin-1', $_))
    produces different output from
    encode('utf8', $_)
      Yes, for example:
      $_ = decode('utf-8', "\N{LATIN SMALL LETTER A WITH ACUTE}");
      say encode('utf8', $_);                    # Replacement character EFBFBD.
      say encode('utf8', decode('Latin-1', $_)); # Dies.
      ($q=q:Sq=~/;[c](.)(.)/;chr(-||-|5+lengthSq)`"S|oS2"`map{chr |+ord }map{substrSq`S_+|`|}3E|-|`7**2-3:)=~y+S|`+$1,++print+eval$q,q,a,
        Fine. If the decode doesn't die, does it ever produce different output? (One might argue that a call that dies doesn't produce any output, and therefore does not produce different output, but whatever.)
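One way to probe that question empirically is a brute-force comparison over every single-octet input. This is only a sketch under that assumption (one octet per string; the names @mismatches, $direct and $via_latin are made up here), not a proof for arbitrary input:

```perl
use strict;
use warnings;
use Encode qw(encode decode);

# Compare both expressions for every single-octet string 0x00..0xFF.
my @mismatches;
for my $ord (0 .. 255) {
    my $octets    = chr($ord);
    my $direct    = encode('utf8', $octets);
    my $via_latin = encode('utf8', decode('Latin-1', $octets));
    push @mismatches, $ord if $direct ne $via_latin;
}

printf "%d mismatches\n", scalar @mismatches;
```

Latin-1 decoding maps each octet to the code point with the same number, and encode('utf8', ...) treats each character below 256 the same way, so the two expressions would be expected to agree whenever the decode succeeds. The loop above only checks that for single octets.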
