PerlMonks  

Re: Re: Re: Site Suggestion

by shotgunefx (Parson)
on Jun 19, 2002 at 01:34 UTC ( [id://175548]=note )


in reply to Re: Re: Site Suggestion
in thread Site Suggestion

This might not be a bad idea. You could cache the results so that, once you hit the limit, you could show the slightly stale cached results instead. Perhaps even run the search for each word of the query (after removing common words like "the") rather than for the whole string, and then merge the results for display.
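A rough sketch of that idea, in Python for brevity. Everything here is an assumption made for illustration: the `QuotaCachedSearch` class, the `backend` callable standing in for the external search service, the stop-word list, and the default limit of 1,000. It does not use any real API.

```python
import time

# Assumed stop-word list; a real one would be longer.
STOP_WORDS = {"the", "a", "an", "of", "and", "or", "to", "in"}


class QuotaCachedSearch:
    """Per-word search with a daily quota and a stale-cache fallback."""

    def __init__(self, backend, daily_limit=1000):
        self.backend = backend          # hypothetical callable: word -> list of ids
        self.daily_limit = daily_limit
        self.used = 0                   # backend calls made so far today
        self.cache = {}                 # word -> (timestamp, results)

    def reset_quota(self):
        """Call once a day (for example from a scheduled job)."""
        self.used = 0

    def search_word(self, word):
        if self.used < self.daily_limit:
            # Under the limit: ask the backend and refresh the cache.
            self.used += 1
            results = self.backend(word)
            self.cache[word] = (time.time(), results)
            return results
        # At the limit: serve the (possibly stale) cached results, if any.
        cached = self.cache.get(word)
        return cached[1] if cached else []

    def search(self, query):
        # Drop common words, search each remaining word, merge without duplicates.
        words = [w for w in query.lower().split() if w not in STOP_WORDS]
        merged, seen = [], set()
        for word in words:
            for result in self.search_word(word):
                if result not in seen:
                    seen.add(result)
                    merged.append(result)
        return merged
```

Note that each word in a query costs one backend call, so a three-word query uses three units of quota while the cache is cold.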

-Lee

"To be civilized is to deny one's nature."

Replies are listed 'Best First'.
Re(4):: Site Suggestion
by Molt (Chaplain) on Jun 19, 2002 at 09:36 UTC

    I don't think merging results would work. It would take about as much power per search as a standard search engine (where you effectively keep a lookup table of which words appear in which documents and merge the document sets for the words in the query, sometimes with relevancy weightings and other such funky things), and being limited to updating only 1,000 words a day would mean that any results that could be merged would soon be very out of date.
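For reference, the lookup table described above is usually called an inverted index. A minimal sketch, again in Python and without any relevancy weighting; `build_index` and `search` are made-up names, and splitting on whitespace is a simplifying assumption:

```python
from collections import defaultdict


def build_index(docs):
    """Map each lowercased word to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.lower().split():
            index[word].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents that contain every word in the query."""
    words = query.lower().split()
    if not words:
        return set()
    # One document set per query word, merged by intersection.
    doc_sets = [index.get(word, set()) for word in words]
    return set.intersection(*doc_sets)


if __name__ == "__main__":
    docs = {
        1: "Perl CGI tutorial",
        2: "XML parsing in Perl",
        3: "DNS and NFS notes",
    }
    index = build_index(docs)
    print(search(index, "perl cgi"))
```

Nothing in this sketch imposes a minimum word length, so three-letter terms are indexed like any other word.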

    Any saving you made by not needing to index the documents yourself would be lost by the added network traffic and latency required to use an external site for this.

    The idea of using Google's API to offload the searches may be doable, but it would rely on there being no more than about 1,000 searches per day. And don't forget this needs to scale too, so fewer than 500 would be more realistic.

    Is there a Perl module that provides a nice, efficient search engine for a site? I don't think I've seen one, but I'd be amazed if one doesn't exist.

      I certainly don't think Google's API is anything but a kludge for the Monastery. No offense to the monks who work very hard improving this site, but I can't believe a site full of programmers can't figure out a better search.

      You can't search for CGI, XML, DNS, NFS, and many other important things. I've thought about taking my own stab at this but have no idea where to start in Everything. Would anyone object if search weren't a part of Everything?

      -Lee

      "To be civilized is to deny one's nature."
