Big data is dead (2023) | Hacker News

· algiegray's blog


Big Data is Dead (2023) #

The Problem with Trick Questions #

Simple Versus Complex Solutions #

The Importance of Understanding Data Scale #

The Value of Flexibility #

Top Quotes #

"The winner of course was the guy who understood that 6TiB is what 6 of us in the room could store on our smart phones, or a enterprise HDD (or three of them for redundancy), and it could be loaded (multiple times) to memory as CSV and simply run awk scripts on it."

"I'm prone to the same fallacy: when I learn how to use a hammer, everything looks like a nail."

"Consulting service: you bring your big data problems to me, I say "your data set fits in RAM", you pay me ,000 for saving you ,000."

"I think more like, how would you prepare and cook the best five course gala dinner for only . That requires true skill."

TL;DR #

The article argues that "big data" is not as big a problem as people think, and that many data sets can be easily managed with simple tools and approaches. It criticizes the tendency to overengineer solutions and emphasizes the importance of understanding the actual data scale and needs of the problem before jumping to complex and expensive "big data" solutions.

source