Beefy Boxes and Bandwidth Generously Provided by pair Networks
Pathologically Eclectic Rubbish Lister
 
PerlMonks  

Re: (OT) How to deal with non-ascii names

by jonadab (Parson)
on Aug 13, 2010 at 17:34 UTC ( [id://854956]=note: print w/replies, xml ) Need Help??


in reply to (OT) How to deal with non-ascii names

Is there any Perl module that helps translating things into ASCII in a reliable way?

HTML::Entities. HTH.HAND.

In all seriousness, I agree to a large extent with what the others have said. If you have to do this automatically (without getting a romanized version from the user), the transliteration method is going to need to be language-specific.

For instance, for Japanese you might check out Lingua::JA::Hepburn::Passport. It doesn't appear to support kanji, but I'm not sure it's possible to automatically romanize kanji, since most of them have at least half a dozen different readings. The same character might romanize to "mei" in one name, "myo" or "myou" in another name, "min" in another, "a" in another, "aka" in another, "aki" in another (this is a real example). If you can't get furigana (pronunciation guide characters, usually kana) from the user, names are going to get romanized very incorrectly.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://854956]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others making s'mores by the fire in the courtyard of the Monastery: (7)
As of 2024-03-28 19:18 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found