Data Lakehouse Architecture and AI Company - Databricks - discu.eu

Hacker News

DBRX: A new open LLM https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm 343 comments 27/3/2024

Reddit

Linking pages

Membership | BSA | The Software Alliance http://www.bsa.org/about-bsa/bsa-members 171 comments
GitHub - tobymao/sqlglot: Python SQL Parser and Transpiler https://github.com/tobymao/sqlglot 135 comments
Databricks Is an RDBMS | Blog | Fivetran https://fivetran.com/blog/databricks-is-an-rdbms 89 comments
dolly/data at master · databrickslabs/dolly · GitHub https://github.com/databrickslabs/dolly/tree/master/data 89 comments
GitHub - ipyflow/ipyflow: A reactive Python kernel for Jupyter notebooks https://github.com/ipyflow/ipyflow 73 comments
Replit - How to train your own Large Language Models https://blog.replit.com/llm-training 60 comments
Databricks open-sources Delta Lake to make data lakes more reliable | TechCrunch https://techcrunch.com/2019/04/24/databricks-open-sources-delta-lake-to-make-data-lakes-more-reliable/ 54 comments
GitHub - devinpleuler/analytics-handbook: Getting started with soccer analytics https://github.com/devinpleuler/analytics-handbook 43 comments
GitHub - deepset-ai/haystack: :mag: Haystack is an open source NLP framework to interact with your data using Transformer models and LLMs (GPT-4, ChatGPT and alike). Haystack offers production-ready tools to quickly build complex question answering, semantic search, text generation applications, and more. https://github.com/deepset-ai/haystack 35 comments
How companies make millions on Open Source – Palark | Blog https://blog.palark.com/open-source-business-models/ 33 comments
Building an open data pipeline in 2024 - by Dan Goldin https://blog.twingdata.com/p/building-an-open-data-pipeline-in 32 comments
GitHub - graphistry/pygraphistry: PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer https://github.com/graphistry/pygraphistry 27 comments
Databricks acquires Redash, a visualizations service for data scientists | TechCrunch https://techcrunch.com/2020/06/24/databricks-acquires-redash-a-visualizations-service-for-data-scientists/ 26 comments
How open-source software took over the world | TechCrunch https://techcrunch.com/2019/01/12/how-open-source-software-took-over-the-world/ 18 comments
Membership | BSA | The Software Alliance https://bsa.org/membership 12 comments
Startups That Will Be Huge in 2016 http://www.businessinsider.com/startups-that-will-be-huge-in-2016-2015-12 8 comments
High-performance Inferencing with Transformer Models on Spark | by Dannie Sim | Towards Data Science https://towardsdatascience.com/high-performance-inferencing-with-large-transformer-models-on-spark-beb82e71ecc9 8 comments
Polyaxon, Argo and Seldon for model training, package and deployment in Kubernetes https://danielfrg.com/blog/2018/10/model-management-polyaxon-argo-seldon/ 7 comments
Meet VC Jeremy Fiance, UC Berkeley's 24-year-old superconnector • TechCrunch http://techcrunch.com/2016/04/18/meet-vc-jeremy-fiance-uc-berkeleys-24-year-old-superconnector/ 6 comments
Dr Alex Ioannides – Building a Data Science Platform for R&D, Part 1 - Setting-Up AWS https://alexioannides.com/2016/08/16/building-a-data-science-platform-for-rd-part-1-setting-up-aws/ 5 comments