PerlMonks
Re: Re: Offsite Perlmonks Search Engine
by blakem (Monsignor) on Jul 07, 2002 at 14:39 UTC ( [id://179971]=note )
Earlier this week, someone complained about not being able to search this site for 'AI', so some two-letter words are worth keeping.
I do have a very short list of "stopwords" that I can tweak if need be. As for load on the server... I have no idea... guess I'll find out. ;-)
I could make the "word search" behavior optional. The current matching (done in the SQL) is similar to /\b$term\b/, but it would be easy enough to let the user turn off those boundary assertions.

The "Terms are split on spaces after non-word chars are stripped" is a roundabout way of saying that I'm ignoring quotes. Searching for dogs cats and "perl 6" gets broken down into five terms: dogs, cats, and, perl, 6. '6' gets tossed out because it's too short, and 'and' is one of the stopwords, so it is removed as well. That leaves us with dogs, cats, perl and a bunch of bad results. The underscore gives us an easy way out, à la perl_6.

Thanks for the feedback... I'll probably incorporate the optional "word search" feature in the next rev.

Update: A partial word matching option has now been implemented...

-Blake
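
For anyone who wants to poke at the term handling described above, here is a rough sketch in Python. It is not the search engine's actual code (the real matching is done in the SQL). The stopword list, the minimum length of 2, and the names `parse_terms` and `term_pattern` are all assumptions made up for illustration; only the strip-then-split order, the stopword and length filtering, and the /\b$term\b/ style match come from the post.

```python
import re

# Hypothetical values: the post only says the stopword list is "very short"
# and that '6' is too short while two-letter words like 'AI' should survive.
STOPWORDS = {"and", "or", "the"}
MIN_LEN = 2


def parse_terms(query):
    """Strip non-word characters, split on whitespace, drop short terms and stopwords."""
    # Quotes and other punctuation are removed; the underscore counts as a
    # word character, so perl_6 survives as a single term.
    cleaned = re.sub(r"[^\w\s]", "", query)
    return [
        term
        for term in cleaned.split()
        if len(term) >= MIN_LEN and term.lower() not in STOPWORDS
    ]


def term_pattern(term, whole_word=True):
    """Build a regex for one term, with or without the boundary assertions."""
    escaped = re.escape(term)
    return re.compile(rf"\b{escaped}\b" if whole_word else escaped)


if __name__ == "__main__":
    print(parse_terms('dogs cats and "perl 6"'))
    print(parse_terms("perl_6"))
    print(bool(term_pattern("cat").search("two cats")))
    print(bool(term_pattern("cat", whole_word=False).search("two cats")))
```

With `whole_word=True`, the term cat does not match cats, which is the current behavior; passing `whole_word=False` corresponds to turning the boundary assertions off for partial word matching.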