perlquestion
cLive ;-)
Hi all,
<p>In my current incarnation I'm developing a search engine from scratch. One thing I'd like to deal with is "headwords" and their "derivatives" (excuse me if the terms aren't correct, I'm not a linguist :). An example:</p>
<code>
host
hosts
hosted
hosting
</code>
<p>I'm sure you get the idea: I'd like all four forms to be treated as variants of the headword "host".</p>
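<p>To make the idea concrete, here is a naive sketch of the kind of grouping I have in mind, in Python purely for illustration. The suffix list, the min_stem cutoff and the function names are all assumptions of mine, not taken from any existing module, and blind suffix stripping like this will certainly get plenty of words wrong (e.g. "hoping" comes out as "hop").</p>

```python
# Hypothetical suffix list, ordered longest first so "ing" is tried before "s".
SUFFIXES = ("ing", "ed", "s")


def headword(word, min_stem=3):
    """Strip the first matching suffix, keeping at least min_stem letters."""
    lower = word.lower()
    for suffix in SUFFIXES:
        stem = lower[: len(lower) - len(suffix)]
        # Only strip when the word really ends in the suffix and the
        # remaining stem is not implausibly short (protects "red", "bus").
        if lower.endswith(suffix) and len(stem) >= min_stem:
            return stem
    return lower


def group_by_headword(words):
    """Map each guessed headword to the list of forms that reduce to it."""
    groups = {}
    for w in words:
        groups.setdefault(headword(w), []).append(w)
    return groups


if __name__ == "__main__":
    print(group_by_headword(["host", "hosts", "hosted", "hosting"]))
```

<p>Run on the four words above, group_by_headword files them all under "host", which is the behaviour I'm after, only done properly.</p>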
<p>Half of my problem is not knowing what to search for. I tried searching for "suffix dictionaries" and "suffix tree" and found a few interesting articles, but CPAN appears to be rather sparse on this front (or maybe I'm searching for the wrong terms; I do find CPAN's search rather strange at times).</p>
<p>So,</p>
<ul>
<li>is there anything out there that might help here; or</li>
<li>does anyone know of any good books/papers that discuss this issue?</li>
</ul>
<p>Thoughts welcome.</p>
<p>cLive ;-)</p>