Beefy Boxes and Bandwidth Generously Provided by pair Networks
The stupid question is the question not asked
 
PerlMonks  

Re^5: JSON::XS Cyrillic unicode not saving properly

by Anonymous Monk
on Mar 22, 2021 at 04:40 UTC ( [id://11130075]=note: print w/replies, xml ) Need Help??


in reply to Re^4: JSON::XS Cyrillic unicode not saving properly
in thread JSON::XS Cyrillic unicode not saving properly

Hi,

Here you go

perlunitut: Unicode in Perl#I/O flow (the actual 5 minute tutorial)

Now ask your program (or mine) , who is doing the byte encoding (what statement, what function/method )?

#!/usr/bin/perl -- use strict; use warnings; use Data::Dump qw/ dd /; use Path::Tiny qw/ path /; use JSON::XS(); use JSON::PP(); my @humps = "\x{FEFF}\x{1F42A} one hump two humps \x{1F42B}"; dd( JSON::XS->new->pretty(1)->encode( \@humps ) ); dd( JSON::PP->new->pretty(1)->encode( \@humps ) ); dd( JSON::XS->new->utf8(1)->pretty(1)->encode( \@humps ) ); dd( JSON::PP->new->utf8(1)->pretty(1)->encode( \@humps ) ); dd( JSON::XS->new->ascii(1)->pretty(1)->encode( \@humps ) ); dd( JSON::PP->new->ascii(1)->pretty(1)->encode( \@humps ) ); print "#" x 6, "\n"; path( 'deleteme.txt')->spew_raw( JSON::XS->new->pretty(1)->encode( \@h +umps ) ); dd( path( 'deleteme.txt')->slurp_raw ); path( 'deleteme.txt')->spew_raw( JSON::PP->new->pretty(1)->encode( \@h +umps ) ); dd( path( 'deleteme.txt')->slurp_raw ); print "#" x 6, "\n"; path( 'deleteme.txt')->spew_utf8( JSON::XS->new->pretty(1)->encode( \@ +humps ) ); dd( path( 'deleteme.txt')->slurp_raw ); dd( path( 'deleteme.txt')->slurp_utf8 ); path( 'deleteme.txt')->spew_utf8( JSON::PP->new->pretty(1)->encode( \@ +humps ) ); dd( path( 'deleteme.txt')->slurp_raw ); dd( path( 'deleteme.txt')->slurp_utf8 ); print "#" x 6, "\n"; path( 'deleteme.txt')->spew_utf8( JSON::XS->new->utf8(1)->pretty(1)->e +ncode( \@humps ) ); dd( path( 'deleteme.txt')->slurp_utf8 ); path( 'deleteme.txt')->spew_utf8( JSON::PP->new->utf8(1)->pretty(1)->e +ncode( \@humps ) ); dd( path( 'deleteme.txt')->slurp_utf8 ); print "#" x 6, "\n"; path( 'deleteme.txt')->spew_utf8( JSON::XS->new->ascii(1)->pretty(1)-> +encode( \@humps ) ); dd( path( 'deleteme.txt')->slurp_utf8 ); path( 'deleteme.txt')->spew_utf8( JSON::PP->new->ascii(1)->pretty(1)-> +encode( \@humps ) ); dd( path( 'deleteme.txt')->slurp_utf8 ); print "#" x 6, "\n"; path( 'deleteme.txt')->spew_raw( JSON::XS->new->utf8(1)->pretty(1)->en +code( \@humps ) ); dd( path( 'deleteme.txt')->slurp_utf8 ); path( 'deleteme.txt')->spew_raw( JSON::PP->new->utf8(1)->pretty(1)->en +code( \@humps ) ); dd( path( 'deleteme.txt')->slurp_utf8 ); # path( 'deleteme.txt')->remove; __END__ "[\n \"\x{FEFF}\x{1F42A} one hump two humps \x{1F42B}\"\n]\n" "[\n \"\x{FEFF}\x{1F42A} one hump two humps \x{1F42B}\"\n]\n" "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" "[\n \"\\ufeff\\ud83d\\udc2a one hump two humps \\ud83d\\udc2b\"\n]\ +n" "[\n \"\\ufeff\\ud83d\\udc2a one hump two humps \\ud83d\\udc2b\"\n]\ +n" ###### Wide character in print at C:/perl/site/lib/Path/Tiny.pm line 1848. "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" Wide character in print at C:/perl/site/lib/Path/Tiny.pm line 1848. "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" ###### "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" "[\n \"\x{FEFF}\x{1F42A} one hump two humps \x{1F42B}\"\n]\n" "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" "[\n \"\x{FEFF}\x{1F42A} one hump two humps \x{1F42B}\"\n]\n" ###### "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" ###### "[\n \"\\ufeff\\ud83d\\udc2a one hump two humps \\ud83d\\udc2b\"\n]\ +n" "[\n \"\\ufeff\\ud83d\\udc2a one hump two humps \\ud83d\\udc2b\"\n]\ +n" ###### "[\n \"\x{FEFF}\x{1F42A} one hump two humps \x{1F42B}\"\n]\n" "[\n \"\x{FEFF}\x{1F42A} one hump two humps \x{1F42B}\"\n]\n" "🐪 one hump two humps 🐫"
"🐪 one hump two humps 🐫"

Replies are listed 'Best First'.
Re^6: JSON::XS Cyrillic unicode not saving properly
by cormanaz (Deacon) on Mar 22, 2021 at 15:29 UTC
    I have no idea what I am supposed to make of that example.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://11130075]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others avoiding work at the Monastery: (5)
As of 2024-04-24 07:49 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found