Beefy Boxes and Bandwidth Generously Provided by pair Networks
No such thing as a small change
 
PerlMonks  

Re: How to sanely handle unicode in perl?

by choroba (Cardinal)
on Mar 20, 2015 at 17:24 UTC ( [id://1120777]=note: print w/replies, xml ) Need Help??


in reply to How to sanely handle unicode in perl?

Switching to binmode and manually picking the encoding from the $ENV seems to work for me:
#!/usr/bin/perl use strict; use warnings; open my $in, '-|:encoding(utf8)', "echo \xc3\xb6" or die $!; my $enc = $ENV{LC_ALL}; $enc =~ s/.*\.//; # TODO: en_US with no encoding not handled. binmode STDOUT, "encoding($enc)"; my $line = <$in>; chomp $line; print "I read a line, that is ", length $line, " chars long.\n"; print "That line is: $line\n"; $line =~ s/\x{f6}/o/; print "That line in ascii is: $line\n";

Update

It seems all that's needed in your original code is to remove the local encoding from the input:
open my $in, '-|:raw:encoding(utf8)', "echo \xc3\xb6" or die $!;
لսႽ† ᥲᥒ⚪⟊Ⴙᘓᖇ Ꮅᘓᖇ⎱ Ⴙᥲ𝇋ƙᘓᖇ

Replies are listed 'Best First'.
Re^2: How to sanely handle unicode in perl?
by Sec (Monk) on Mar 23, 2015 at 10:19 UTC
    Thanks. With ":raw" prepended it works just like it should.

    Fantastic!

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://1120777]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others musing on the Monastery: (4)
As of 2024-04-25 16:51 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found