Beefy Boxes and Bandwidth Generously Provided by pair Networks
go ahead... be a heretic
 
PerlMonks  

Re: Re: abbreviation checking

by Anonymous Monk
on Dec 02, 2002 at 22:05 UTC ( [id://217066]=note: print w/replies, xml ) Need Help??


in reply to Re: abbreviation checking
in thread abbreviation checking

While making an MP3-renaming script, which attacks a problem similar to yours, I used a combination of Metaphone and "distance" modules. My approach:
  1. Get a list of "known-good" words. I use already-verified MP3 filenames as a source of these.
  2. Calculate their Metaphones.
  3. Calculate the Metaphone of any new words and look for matches. If none, see if there are any matches with a distance of 1 or 2. Distances larger than 2 produce too many matches.
  4. Have the user confirm the 'corrections'.
It's not an exact science, and human intervention is unavoidable if correctness matters.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://217066]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others rifling through the Monastery: (7)
As of 2024-03-28 11:20 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found