Hacker News
- Apache iceberg the Hadoop of the modern-data-stack? https://blog.det.life/apache-iceberg-the-hadoop-of-the-modern-data-stack-c83f63a4ebb9 63 comments
- Poleposition: Open-Source Apache Bigtop Hadoop Installer for the Masses https://github.com/beartell/PolePosition 2 comments
- Introducing Apache Hadoop: The Modern Data Operating System http://www.infoq.com/presentations/Introducing-Apache-Hadoop 2 comments
- Apache Hadoop 2.0 (Alpha) Released http://hortonworks.com/blog/apache-hadoop-2-0-alpha-released/ 4 comments
- NextGen Apache Hadoop MapReduce http://www.slideshare.net/hortonworks/nextgen-apache-hadoop-mapreduce 2 comments
- Map-Reduce With Ruby Using Apache Hadoop http://www.cloudera.com/blog/2011/01/map-reduce-with-ruby-using-apache-hadoop/ 3 comments
- A New Analytics Toolbox with Apache Spark – Going Beyond Hadoop http://planetcassandra.org/blog/the-new-analytics-toolbox-with-apache-spark-going-beyond-hadoop/ 27 comments
- Apache Hadoop 2 goes GA http://hortonworks.com/blog/apache-hadoop-2-is-ga/ 4 comments
- Looking at the code behind our three uses of Apache Hadoop http://www.facebook.com/notes/facebook-engineering/looking-at-the-code-behind-our-three-uses-of-apache-hadoop/468211193919 3 comments
- Apache Hadoop Wins Terabyte Sort Benchmark (1 terabyte of data in 209 seconds) http://developer.yahoo.com/blogs/hadoop/2008/07/apache_hadoop_wins_terabyte_sort_benchmark.html 3 comments
- Apache Drill 1.0 – Schema-Free SQL Query Engine for Hadoop and NoSQL http://drill.apache.org/ 30 comments
- Apache Drill: Schema-free SQL Query Engine for Hadoop and NoSQL http://drill.apache.org/ 7 comments
- Users of Popular Open Source Software including jQuery, Apache Hadoop under Patent Attack https://www.jdsupra.com/legalnews/popular-open-source-software-under-68551/ 7 comments programming
- Apache Hadoop 2.0 (Alpha) released http://hortonworks.com/blog/apache-hadoop-2-0-alpha-released/ 7 comments programming
- NextGen MapReduce Hits Apache Hadoop Mainline http://www.hortonworks.com/nextgen-mapreduce-hits-apache-hadoop-mainline/ 10 comments programming
- MapReduce programming with Apache Hadoop http://www.javaworld.com/javaworld/jw-09-2008/jw-09-hadoop.html 4 comments programming
- Apache Hadoop Wins Terabyte Sort Benchmark http://developer.yahoo.com/blogs/hadoop/2008/07/apache_hadoop_wins_terabyte_sort_benchmark.html 31 comments programming
- Hadoop Becomes a Top Level Project in Apache Software http://www.jaxmag.com/itr/news/psecom,id,39870,nodeid,146.html 3 comments programming
- Hadoop: MapReduce implementation from Apache Foundation http://wiki.apache.org/lucene-hadoop/ 2 comments programming
- Apache Hadoop v3.0.0 General Availability http://mail-archives.apache.org/mod_mbox/www-announce/201712.mbox/%3C1513249258.1489113.1204802568.68bb3f6e%40webmail.messagingengine.com%3E 4 comments programming
- Apache Hadoop Explained: Kafka, ZooKeeper, HDFS and Cassandra. http://www.grokit.ca/cnt/apachehadoop/ 12 comments programming
- Using Apache Cassandra with Apache Hadoop http://www.orpiske.net/2014/07/using-apache-cassandra-with-apache-hadoop/ 4 comments programming
- Apache Avro (Hadoop) quick start guide. Avro is a data serialization system. http://github.com/phunt/avro-rpc-quickstart 3 comments programming
- Google grants Apache Hadoop a license for patent 7,650,331 ["System and method for efficient large- scale data processing"] http://markmail.org/message/iquq3vxe4opmk4lt 39 comments programming
- Apache Hadoop vs Apache Spark: Two Popular Big Data Frameworks Compared http://www.evontech.com/what-we-are-saying/entry/apache-hadoop-vs-apache-spark-two-popular-big-data-frameworks-compared.html 4 comments datascience
- Analyzing some ‘Big’ Data Using C#, Azure And Apache Hadoop – Analyzing Stack Overflow Data Dumps http://www.codeproject.com/articles/398563/analyzing-some-big-data-using-csharp-azure-and-apa 4 comments programming
- Cloudera Impala: Real-Time Queries in Apache Hadoop Based on Google Dremel is Now in Public Beta http://blog.cloudera.com/blog/2012/10/cloudera-impala-real-time-queries-in-apache-hadoop-for-real/ 6 comments programming
- Apache Kylin is an open source Distributed Analytics Engine designed to provide SQL interface and multi-dimensional analysis (OLAP) on Hadoop supporting extremely large datasets https://kylin.apache.org/ 11 comments programming