PerlMonks  

Re^7: speeding up a file-based text search (word list for phrase search)

by Aristotle (Chancellor)
on May 09, 2003 at 19:04 UTC


in reply to Re: Re: Re: Re: Re: Re: speeding up a file-based text search
in thread speeding up a file-based text search

Depends on your word list. You could store the in-record location(s) of the word as well; then, when doing a phrase search, you can intersect the sets for each word by record and then check for consecutive locations in the correct order. This is, AFAIK and at least roughly, the way all of the big web search engines work.
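
A minimal sketch of that idea, in Python for brevity; it is an illustration under stated assumptions, not how any particular search engine implements it. The names `build_index` and `phrase_search` are hypothetical, records are assumed to be plain strings, and words are assumed to be split on whitespace and lowercased. The index maps each word to the records it occurs in, together with its positions inside each record.

```python
from collections import defaultdict


def build_index(records):
    """Map each word to {record_id: [positions of the word in that record]}."""
    index = defaultdict(lambda: defaultdict(list))
    for rec_id, text in enumerate(records):
        # Assumption: whitespace tokenization, case-insensitive matching.
        for pos, word in enumerate(text.lower().split()):
            index[word][rec_id].append(pos)
    return index


def phrase_search(index, phrase):
    """Return the sorted ids of records containing the words of `phrase` consecutively."""
    words = phrase.lower().split()
    if not words or any(w not in index for w in words):
        return []

    # Step 1: intersect the record sets of all words in the phrase.
    candidates = set(index[words[0]])
    for w in words[1:]:
        candidates &= set(index[w])

    # Step 2: within each candidate record, look for consecutive positions
    # in the correct order: word i must sit at (start + i).
    hits = []
    for rec_id in sorted(candidates):
        later = [set(index[w][rec_id]) for w in words[1:]]
        starts = index[words[0]][rec_id]
        if any(all(start + i + 1 in s for i, s in enumerate(later))
               for start in starts):
            hits.append(rec_id)
    return hits
```

Build the index once with `build_index(records)`, then call `phrase_search(index, "quick brown")` for each query. Only records that survive the intersection get their position lists checked, so the cost of the phrase check is limited to records that contain every word.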

Makeshifts last the longest.
