Beefy Boxes and Bandwidth Generously Provided by pair Networks
Problems? Is your data what you think it is?
 
PerlMonks  

Re^4: Length and Chomp ??

by afoken (Canon)
on Aug 23, 2009 at 19:22 UTC ( #790684=note: print w/replies, xml ) Need Help??


in reply to Re^3: Length and Chomp ??
in thread Length and Chomp ??

Some bean counting:

With a Unicode argument, length returns the number of characters in the argument. Unicode has the (no so) new / unusual / odd property that a character may be represented by more than one byte.

With a non-Unicode / pre-Unicode / legacy encoding argument, length still returns the number of characters in the argument. Those legacy encodings have the old / usual / familiar property that a character is represented by exactly one byte.

So, there is no need to remember any special cases. length always returns the character count.

Before Unicode support was added to Perl, there was no need to distinguish between byte and character, because both were equal. And as long as you don't work with Unicode, they still are. The quote from perlfunc, "if the EXPR is in Unicode, you will get the number of characters, not the number of bytes", is a hint that bytes and characters are different things when you work with Unicode, nothing more, nothing less.

Alexander

--
Today I will gladly share my knowledge and experience, for there are no sweeter words than "I told you so". ;-)

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://790684]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others taking refuge in the Monastery: (5)
As of 2020-08-05 11:46 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    Which rocket would you take to Mars?










    Results (35 votes). Check out past polls.

    Notices?