http://qs321.pair.com?node_id=332274


in reply to Re: Machine learning
in thread Machine learning

Thanks for your tip, but that's not really what i wanted.

I already reached the state of PerlFect, i even implemented massive storage clustering.

The key feature of the search engine should be categorization of results and learning from the users input, changed keywords, wants more than first ten results...
The whole concept of the user interactivity goes far too wide to explain here.

What i am seeking is simply a good learning algorithm! ;)

Replies are listed 'Best First'.
Re: Re: Re: Machine learning
by inman (Curate) on Feb 27, 2004 at 16:28 UTC
    Automatic categorisation is the panacea of Knowledge Management and it is something that a great many people are working on with a view to making some serious financial gain. The auto-cat software is therfore expensive but the blurb on the vendor's websites may be of interest.

    I use search engine software from Verity who implement machine assisted categorisation in a workbench tool such that the output keyword net can be applied to content as it is indexed. This works well in a corporate environment where content doesn't change that much and you just want to locate it in a defined categorisation structure. Verity also have a 'social network' product that allows people to see locate subject matter experts. I haven't worked with this bit yet but the demo looked cool.

    I have also looked at Autonomy who popularised Baysian techniques for clustering results. Their search engine works really well for newsfeeds where the clustering is generally unknown and fluid. The search results can appear really random until the internals have caught up with a new cluster of information. I am told that the BBC News website uses this technique to create the 'related stories' links on it's website.