in reply to Processing ~1 Trillion records
I've only glanced at the various answers quickly, so maybe I'm off the mark, but:
My immediate reaction to needing to process that many rows is to try to parallelize the process. It will put a higher load on the DB, but that's what the DB is really good at. Obviously your dataset needs to be partitionable, but I can't imagine a dataset of that size that can't be split in some way.
Michael
|
---|
Replies are listed 'Best First'. | |
---|---|
Re^2: Processing ~1 Trillion records
by Anonymous Monk on Oct 26, 2012 at 12:47 UTC |
In Section
Seekers of Perl Wisdom