
Re: (OT) How to deal with non-ascii names

by jonadab (Parson)
on Aug 13, 2010 at 17:34 UTC (#854956)

in reply to (OT) How to deal with non-ascii names

Is there any Perl module that helps translating things into ASCII in a reliable way?

HTML::Entities. HTH. HAND.

In all seriousness, I agree to a large extent with what the others have said. If you have to do this automatically (without getting a romanized version from the user), the transliteration method is going to need to be language-specific.

For instance, for Japanese you might check out Lingua::JA::Hepburn::Passport. It doesn't appear to support kanji, but I'm not sure it's possible to automatically romanize kanji, since most of them have at least half a dozen different readings. The same character might romanize to "mei" in one name, "myo" or "myou" in another name, "min" in another, "a" in another, "aka" in another, "aki" in another (this is a real example). If you can't get furigana (pronunciation guide characters, usually kana) from the user, names are going to get romanized very incorrectly.
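To make the ambiguity concrete, here is a minimal sketch of what a naive per-character lookup runs into. The table is hypothetical and hand-written for illustration. The choice of 明 is my assumption about which character fits the readings listed above, since the post does not name it. `candidate_readings` is a made-up helper, not part of any CPAN module.

```perl
use strict;
use warnings;
use utf8;    # the source code itself contains a kanji literal

# Hypothetical lookup table, for illustration only: one kanji mapped to
# the readings mentioned in the post. The character 明 is an assumption;
# the post does not say which kanji it has in mind.
my %readings = (
    '明' => [qw(mei myo myou min a aka aki)],
);

# Return every candidate romanization for a single character.
# Characters missing from the table yield an empty list.
sub candidate_readings {
    my ($char) = @_;
    return @{ $readings{$char} // [] };
}

my @candidates = candidate_readings('明');
printf "%d candidate readings: %s\n",
    scalar @candidates, join(', ', @candidates);
```

With nothing but the character to go on, the code has no basis for picking one of the candidates, and that is the gap furigana from the user would fill.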
