Beefy Boxes and Bandwidth Generously Provided by pair Networks
The stupid question is the question not asked

Re: renaming 1000's of FASTA files

by Anonymous Monk
on Jul 11, 2011 at 12:05 UTC ( #913683=note: print w/replies, xml ) Need Help??

in reply to renaming 1000's of FASTA files

If the total data size will sensibly -fit- in memory at the same time, then you can use a hash as you are doing now. Otherwise sort the files identically and write code that compares the two sorted files. Or ... put the data into an SQLite database file, which is a flat-file requiring no server at all, and use queries.

Replies are listed 'Best First'.
Re^2: renaming 1000's of FASTA files
by garyboyd (Acolyte) on Jul 11, 2011 at 15:00 UTC

    Thanks anonymous monk, would using index instead of the regex speed things up noticeably? I decided to split the input file and run multiple processes.

Log In?

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://913683]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others exploiting the Monastery: (2)
As of 2022-08-15 03:53 GMT
Find Nodes?
    Voting Booth?

    No recent polls found