PerlMonks  

Re: how to improve this?

by lima1 (Curate)
on Nov 13, 2007 at 12:33 UTC ( [id://650488]=note )


in reply to how to improve this?

You may find Bio::Grep useful. The fastest solution for your task is probably an enhanced suffix array (for a very short introduction to simple suffix arrays see http://en.wikipedia.org/wiki/Suffix_array). Constructing one takes a few minutes (and probably a lot of RAM), but afterwards you can search for exact matches in O(m) time, where m is the query length.
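To make the idea concrete, here is a minimal sketch of a plain (not enhanced) suffix array in pure Perl. It does not use Bio::Grep; the sub names build_suffix_array and find_exact and the sample sequence are made up for illustration. Construction is the naive "sort all suffixes" approach, and the lookup is a binary search, so a query costs roughly O(m log n) rather than the O(m) an enhanced suffix array gives you.

```perl
use strict;
use warnings;

# Build a plain suffix array: the start positions of all suffixes of $text,
# ordered by the suffix that begins there. Naive construction (hypothetical
# helper, only suitable for small inputs).
sub build_suffix_array {
    my ($text) = @_;
    return [ sort { substr($text, $a) cmp substr($text, $b) }
             0 .. length($text) - 1 ];
}

# Return the sorted start positions of every exact occurrence of $query.
sub find_exact {
    my ($text, $sa, $query) = @_;
    my $m = length $query;
    return () if $m == 0;    # nothing sensible to report for an empty query

    # Binary search for the first suffix whose first $m characters
    # are not less than $query.
    my ($lo, $hi) = (0, scalar @$sa);
    while ($lo < $hi) {
        my $mid = int(($lo + $hi) / 2);
        if (substr($text, $sa->[$mid], $m) lt $query) { $lo = $mid + 1 }
        else                                          { $hi = $mid }
    }

    # All matching suffixes sit next to each other in the array.
    my @hits;
    while ($lo < @$sa && substr($text, $sa->[$lo], $m) eq $query) {
        push @hits, $sa->[$lo++];
    }
    return sort { $a <=> $b } @hits;
}

my $text = 'GATTACAGATTACA';              # made-up sample sequence
my $sa   = build_suffix_array($text);
my @pos  = find_exact($text, $sa, 'TTA');
print "@pos\n";                            # prints "2 9"
```

The array is built once and can then answer any number of queries, which is where the up-front construction cost pays off.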

Bio::Grep also supports GUUGle, a small tool that is not as fast but requires no precomputation and needs less RAM. In addition, GUUGle supports GU wobble pairs.

Update: If you don't want to use this module, you should at least fetch your queries only once and then use index or, perhaps better, a regex.
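A rough sketch of that suggestion, assuming the queries are already in an array (how you fetch them is up to you; the data and sub names here are invented). The first sub uses index, the second uses regexes compiled once with qr// and \Q...\E so that any metacharacters in a query are matched literally.

```perl
use strict;
use warnings;

# Hypothetical data: in real code @queries would be fetched once, up front.
my @queries = ('TTA', 'ACA', 'GGG');
my $target  = 'GATTACAGATTACA';

# Option 1: index() returns -1 when the substring is absent.
sub hits_with_index {
    my ($target, @queries) = @_;
    return grep { index($target, $_) >= 0 } @queries;
}

# Option 2: compile each query into a regex once, then reuse the
# compiled patterns for every target sequence.
my @compiled = map { [ $_, qr/\Q$_\E/ ] } @queries;

sub hits_with_regex {
    my ($target, @compiled) = @_;
    return map { $_->[0] } grep { $target =~ $_->[1] } @compiled;
}

print join(' ', hits_with_index($target, @queries)),  "\n";   # prints "TTA ACA"
print join(' ', hits_with_regex($target, @compiled)), "\n";   # prints "TTA ACA"
```

For plain literal substrings index is usually all you need; the regex variant becomes interesting once you want character classes or other patterns in the queries.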
