Hacker News
- Cloudera taken private for $5.3B, acquires Datacoral and Cazena https://blog.cloudera.com/turning-the-page/ 109 comments
- Common Probability Distributions: The Data Scientist’s Crib Sheet http://blog.cloudera.com/blog/2015/12/common-probability-distributions-the-data-scientists-crib-sheet/ 7 comments
- BinaryPig: Malware analysis with Hadoop, Django and Elasticsearch http://blog.cloudera.com/blog/2013/11/binarypig-scalable-static-binary-analysis-over-hadoop/ 2 comments
- Algorithms Every Data Scientist Should Know: Reservoir Sampling http://blog.cloudera.com/blog/2013/04/hadoop-stratified-randosampling-algorithm/ 40 comments
- A Guide to Python Frameworks for Hadoop http://blog.cloudera.com/blog/2013/01/a-guide-to-python-frameworks-for-hadoop/ 15 comments
- A nice introduction to and cheatsheet on Probability Distributions https://blog.cloudera.com/blog/2015/12/common-probability-distributions-the-data-scientists-crib-sheet/ 5 comments datascience
- Common Probability Distributions: The Data Scientist's Crib Sheet http://blog.cloudera.com/blog/2015/12/common-probability-distributions-the-data-scientists-crib-sheet/?amp%3Butm_campaign=buffer&%3Butm_medium=social&%3Butm_source=facebook.com 10 comments statistics
- How Apache Spark, Scala, and Functional Programming Made Hard Problems Easy at Barclays https://blog.cloudera.com/blog/2015/08/how-apache-spark-scala-and-functional-programming-made-hard-problems-easy-at-barclays/ 7 comments scala
- How Impala Uses Runtime Code Generation (LLVM) to Maximize Query Performance http://blog.cloudera.com/blog/2013/02/inside-cloudera-impala-runtime-code-generation/ 6 comments programming
- Cloudera Impala: Real-Time Queries in Apache Hadoop Based on Google Dremel is Now in Public Beta http://blog.cloudera.com/blog/2012/10/cloudera-impala-real-time-queries-in-apache-hadoop-for-real/ 6 comments programming