in reply to Re: Spell Check Logic in thread Spell Check Logic
you could speed this up a bit by maintaining a list of the words themselves in a separate hash, and checking that before you calculate the phonetic code (step 2). that way, you only have to calculate the code for words that are not in the list. of course, that only helps if most words are spelled correctly :)
Re: Re: Re: Spell Check Logic
by dree (Monsignor) on Jul 31, 2002 at 21:31 UTC
|
You are right! :)
But an hash (DB_File) with 2/3 millions words+phonetic codes is around 150+Mb.
So to gain some speed-up you have to twice the database: one that has keys as phonetic codes and the second that has keys as correct words.
And this is not always a good thing.
But in this particulary case, with only 100.000+ words, your suggestion is ok :) | [reply] |
|