Beefy Boxes and Bandwidth Generously Provided by pair Networks
good chemistry is complicated,
and a little bit messy -LW

Re^10: Best way to store/access large dataset?

by stonecolddevin (Parson)
on Jun 29, 2018 at 16:51 UTC ( #1217618=note: print w/replies, xml ) Need Help??

in reply to Re^9: Best way to store/access large dataset?
in thread Best way to store/access large dataset?

ETL is extract/transform/load. So basically you'd be taking your raw data, extracting it out of the files and transforming it into a sensible format or data structure, and loading it up into a persistent data store.

A million rows per column is much more reasonable than a million columns. That's still a ton of data depending on how many parent rows the associated rows have. I have some ideas but honestly it's probably best to get your co-workers feedback since they know the data and ask whatever other specific questions you have.

Three thousand years of beautiful tradition, from Moses to Sandy Koufax, you're god damn right I'm living in the fucking past

  • Comment on Re^10: Best way to store/access large dataset?

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1217618]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others exploiting the Monastery: (4)
As of 2021-04-11 08:15 GMT
Find Nodes?
    Voting Booth?

    No recent polls found