PerlMonks  

Re: Writing a site search engine

by tune (Curate)
on Aug 13, 2001 at 20:05 UTC ( [id://104473]=note )


in reply to Writing a site search engine

Glimpse/Webglimpse is a good search engine too.

Some tips if you want to develop one yourself:

    Weight the words by where they appear in the document. An example set of weights: words from the title: 100, words from meta keywords: 75, words from headlines: 50, words from the document body: 10.

    Store the position of each word in its document, e.g. whether the word falls within the first 100 words. When the user searches for two or more words, you can compare the hits: a document where the found words are closer to each other should score higher than one where they are far apart. From that you can compute a rank.
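
A rough sketch of both tips together, in Python rather than Perl to keep it short. The weights are the example values above. Everything else is my assumption for illustration: the field names ("title", "keywords", "headline", "body"), the helper names build_index and search, and especially the proximity bonus of 50 / (1 + gap), which is just one simple way to reward words that sit close together.

```python
from collections import defaultdict

# Example weights from the tip above; tune them for your own site.
FIELD_WEIGHTS = {"title": 100, "keywords": 75, "headline": 50, "body": 10}


def build_index(docs):
    """Map word -> doc_id -> list of (position, weight) postings.

    `docs` maps a document id to a dict of field name -> text
    (hypothetical layout). Positions count words across the whole
    document, field by field, in the order of FIELD_WEIGHTS.
    """
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, fields in docs.items():
        position = 0
        for field, weight in FIELD_WEIGHTS.items():
            for word in fields.get(field, "").lower().split():
                index[word][doc_id].append((position, weight))
                position += 1
    return index


def search(index, query):
    """Return (doc_id, score) pairs, best first, for docs containing every query word."""
    words = query.lower().split()
    if not words:
        return []
    # Only documents that contain all of the query words are candidates.
    candidates = set(index.get(words[0], {}))
    for word in words[1:]:
        candidates &= set(index.get(word, {}))
    results = []
    for doc_id in candidates:
        postings = [index[w][doc_id] for w in words]
        # Source weight: take the best-weighted occurrence of each word.
        score = sum(max(weight for _, weight in p) for p in postings)
        # Proximity bonus (assumed formula): the smaller the gap between
        # neighbouring query words, the larger the bonus.
        for first, second in zip(postings, postings[1:]):
            gap = min(abs(a - b) for a, _ in first for b, _ in second)
            score += 50 / (1 + gap)
        results.append((doc_id, score))
    return sorted(results, key=lambda r: r[1], reverse=True)
```

To try it, build the index from something like {"a": {"title": "perl search engine"}, "b": {"body": "perl is used here for a search"}} and call search(index, "perl search"). Document "a" should rank first, since both words come from the title and sit next to each other. A real engine would also want stemming, stop words, and persistent storage, none of which is shown here.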

--
tune
