artist has asked for the wisdom of the Perl Monks concerning the following question:
Dear Monks,
I am looking to build or use a search engine for my Internet Website. I searched here and at CPAN but found no useful results. (May be I am missing right terms in CPAN search). Search will be for only HTML files. Search engine has to search different set of files for queries coming from different customers. The platform could be Windows or Unix using Apache server (no mod_perl) if that matters.Building Google like search engine would excellent but anything and everything that has good value would help. I don't want to put Google front-end on my website.
Thus, what I am looking for is exisiting search mechanism or CPAN modules that helps me to build a search engine. Of course, I am open to any other suggestions.
Thank you,
{artist}
Update: Please note that I do not have access to database for this purpose. My files are not changed. They are added/deleted on daily base. Volume: Around 10000 files
Update2:(20031114)
- Volume : total around 200,000 files and 5000 files are added daily.. Some are removed at periodical interval.
- Perlfect Issues:.No incremental/differential index. And it takes large amount of time to build the index. So cannot search on recently updated things. The data has to be published ASAP. So cannot wait for indexing to finish before publishing. If someone has idea: How to encorporate differential indexing with perlfect, I would really appreciate it.
- Cannot use Google. The main reason is I have password protected sites and not everybody should be able see everybody else's contents. Think: -> Building a search system for mail accounts.
20031113 Edit by Corion: Fixed unclosed blockquote
Back to
Seekers of Perl Wisdom