Hacker News
- Benchmarking Big Data SQL Platforms in the Cloud https://databricks.com/blog/2017/07/12/benchmarking-big-data-sql-platforms-in-the-cloud.html 2 comments
- I've benchmarked speed and cost of 3 major big data processing solutions on gcp: BigQuery, Dataflow and Spark. Dataflow turned out to be 30 times slower and more expensive than everything else https://mgaiduk.substack.com/p/big-data-on-gcp-dataflow-bigquery?sd=pf 2 comments googlecloud
- "While it is flattering that the Library of Congress is used as a benchmark which others measure their data capacity, we are far “bigger” than many of them might think"... Like "3 and 5 petabytes of content per year" for *decades* big. http://blogs.loc.gov/loc/2009/02/how-big-is-the-library-of-congress/ 4 comments reddit.com