Hacker News
- Spark 2.0.0 Released https://spark.apache.org/news/spark-2-0-0-released.html 22 comments
- Apache Spark 1.0.0 http://spark.apache.org/releases/spark-release-1-0-0.html 39 comments
- Pyspark Pandas API https://spark.apache.org/docs/latest/api/python/user_guide/pandas_on_spark/index.html 3 comments aws
- Spark Release 3.0.0 https://spark.apache.org/releases/spark-release-3-0-0.html 3 comments scala
- Spark Release 3.0.0 https://spark.apache.org/releases/spark-release-3-0-0.html 5 comments java
- How do you host your Scala documentation for private projects? http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.Dataset 11 comments scala
- Preview release of Spark 3.0 https://spark.apache.org/news/spark-3.0.0-preview.html 5 comments scala
- Spark Release 2.4.0 (including Scala 2.12 support) https://spark.apache.org/releases/spark-release-2-4-0.html 10 comments scala
- Apache Spark 2.0.0 http://spark.apache.org/releases/spark-release-2-0-0.html 11 comments scala
- Apache Spark 2.0.0 has been released! http://spark.apache.org/releases/spark-release-2-0-0.html 4 comments programming
- Using Apache's Spark library https://spark.apache.org/docs/latest/quick-start.html 10 comments learnprogramming
- Apache Spark v1.0 released http://spark.apache.org/releases/spark-release-1-0-0.html 8 comments programming
- Cluster computing with Haskell? http://spark.apache.org/ 9 comments haskell
Linking pages
- Against SQL https://scattered-thoughts.net/writing/against-sql/ 708 comments
- GitHub - karanpratapsingh/system-design: Learn how to design systems at scale and prepare for system design interviews https://github.com/karanpratapsingh/system-design 331 comments
- Why isn't differential dataflow more popular? https://scattered-thoughts.net/writing/why-isnt-differential-dataflow-more-popular/ 137 comments
- GitHub - tobymao/sqlglot: Python SQL Parser and Transpiler https://github.com/tobymao/sqlglot 135 comments
- What every software engineer should know about search https://scribe.rip/p/what-every-software-engineer-should-know-about-search-27d1df99f80d 132 comments
- Advantages of Using R Notebooks For Data Analysis Instead of Jupyter Notebooks | Max Woolf's Blog http://minimaxir.com/2017/06/r-notebooks/ 122 comments
- GitHub - databricks/scala-style-guide: Databricks Scala Coding Style Guide https://github.com/databricks/scala-style-guide 112 comments
- GitHub - mikeroyal/Self-Hosting-Guide: Self-Hosting Guide. Learn all about locally hosting (on premises & private web servers) and managing software applications by yourself or your organization. Including Cloud, LLMs, WireGuard, Automation, Home Assistant, and Networking. https://github.com/mikeroyal/Self-Hosting-Guide 108 comments
- Open-sourcing Polynote: an IDE-inspired polyglot notebook | by Netflix Technology Blog | Netflix TechBlog https://medium.com/netflix-techblog/open-sourcing-polynote-an-ide-inspired-polyglot-notebook-7f929d3f447 100 comments
- Why Scala? Following Martin Odersky’s keynote at… | by Adam Warski | SoftwareMill Tech Blog https://blog.softwaremill.com/why-scala-a6ac8c98c541 97 comments
- Leaving Apple Inc. | Max Woolf's Blog http://minimaxir.com/2017/05/leaving-apple/ 93 comments
- Databricks Is an RDBMS | Blog | Fivetran https://fivetran.com/blog/databricks-is-an-rdbms 89 comments
- Ten Years and Counting: My Affair with Microservices · allegro.tech https://blog.allegro.tech/2024/04/ten-years-microservices.html 73 comments
- The Most In-Demand Skills for Data Scientists | by Jeff Hale | Towards Data Science https://towardsdatascience.com/the-most-in-demand-skills-for-data-scientists-4a4a8db896db 64 comments
- Tracking down the Villains: Outlier Detection at Netflix | by Netflix Technology Blog | Netflix TechBlog http://techblog.netflix.com/2015/07/tracking-down-villains-outlier.html 63 comments
- How to Misunderstand Free Software | get GNU/Linux! http://getgnulinux.org/linux/misunderstanding_free_software/ 60 comments
- GitHub - robinhood/faust: Python Stream Processing https://github.com/robinhood/faust 59 comments
- Apache Hop 2.0 is available!! - Hop https://hop.apache.org/blog/2022/06/hop-2.0.0/ 53 comments
- There Is No Big Data | Daan Debie https://dandydev.net/blog/there-is-no-big-data 51 comments
- Super-structured Data | Brim Data https://www.brimdata.io/blog/super-structured-data/ 47 comments