Beefy Boxes and Bandwidth Generously Provided by pair Networks
more useful options
 
PerlMonks  

Re^2: substr on UTF-8 strings

by ikegami (Patriarch)
on Jun 26, 2020 at 19:42 UTC ( [id://11118583]=note: print w/replies, xml ) Need Help??


in reply to Re: substr on UTF-8 strings
in thread substr on UTF-8 strings

Ah, if the input string contains only characters in 00..7F, the returned string will use the UTF8=0 internal storage format.

$ perl -e' use Devel::Peek qw( Dumpe ); my $s = "ab"; utf8::upgrade($s); # Force UTF8=1 storage format. Dump($s); my $ss = substr($s, 0, 1); Dump($ss); ' SV = PV(0x7fffdcf3dca0) at 0x7fffdcf6ca78 REFCNT = 1 FLAGS = (POK,pPOK,UTF8) PV = 0x7fffdcf5dbf0 "ab"\0 [UTF8 "ab"] CUR = 2 LEN = 10 SV = PV(0x7fffdcf3dd30) at 0x7fffdcf6cb50 REFCNT = 1 FLAGS = (POK,pPOK) PV = 0x7fffdcf54050 "a"\0 CUR = 1 LEN = 10

But like I said, it's Perl's perogative to pick whatever internal storage format it wants.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://11118583]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others lurking in the Monastery: (5)
As of 2024-04-19 06:48 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found