http://qs321.pair.com?node_id=886998


in reply to search a large text file

For previous discussion of this problem see statistics of a large text and Reaped: a large text file into hash.

As I pointed out to you in the previous discussions, this is likely to be slow. The next step that I suggested is to parallelize work with Hadoop. Have you tried that yet?

Replies are listed 'Best First'.
Re^2: search a large text file
by BrowserUk (Patriarch) on Feb 08, 2011 at 17:31 UTC

    Doesn't hadoop require a cluster of servers and extensive software setup?