Beefy Boxes and Bandwidth Generously Provided by pair Networks
There's more than one way to do things
 
PerlMonks  

Re: Filtering out stop words

by Ea (Chaplain)
on Feb 26, 2020 at 09:45 UTC ( [id://11113427]=note: print w/replies, xml ) Need Help??


in reply to Filtering out stop words

Is your stop words file growing because you are adding all possible endings for your words? In which case you might start thinking about Stemming. The standard module that implements Porter's stemming is Lingua::Stem or maybe Text::Context::Porter is more to your liking. If you're text processing you'll probably end up here eventually.

Ea

Sometimes I can think of 6 impossible LDAP attributes before breakfast.

Mojoconf was great!

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://11113427]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others lurking in the Monastery: (8)
As of 2024-03-28 09:21 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found