This misses the point - sometimes is much easier to take 1Tb of text data and manipulate it using standard "big data" tools, than it is to figure out how to do it using a single machine and RAM. I don't care where it fits. I care about doing the job as quickly and efficiently (and reproducible) as possible.