in reply to Re^3: Fast common substring matching
in thread Fast common substring matching
The down side is that it is not only a single character repeated ('AAAA'), but short repeating sequences ('ACTACTACT') that can be missed or truncated. The up side is that for bioMan's problem a minimum match quanta of 128 is probably optimum and I'd guess that that is long enough to be unlikely to be a problem.
At this time I've not thought of a fast way of dealing with the issue and am somewhat inclined to ignore it unless someone can convince me that this is really useful code, but needs this bug fixed.
Perl is Huffman encoded by design.
|
---|
Replies are listed 'Best First'. | |
---|---|
Re^5: Fast common substring matching
by BrowserUk (Patriarch) on Aug 25, 2005 at 02:23 UTC | |
by GrandFather (Saint) on Aug 25, 2005 at 02:41 UTC | |
by bioMan (Beadle) on Aug 25, 2005 at 16:42 UTC | |
by GrandFather (Saint) on Aug 25, 2005 at 21:46 UTC | |
by bioMan (Beadle) on Aug 26, 2005 at 15:52 UTC |
In Section
Cool Uses for Perl