Hacker News
- Lean Data Automation: a principal components approach https://blog.dagworks.io/p/lean-data-automation-a-principal 0 comments
Linked pages
- Metaflow - a framework for real-life data science and ML https://metaflow.org/ 151 comments
- Cloud Data Warehouse – Amazon Redshift – Amazon Web Services http://aws.amazon.com/redshift/ 93 comments
- GitHub - kedro-org/kedro: A Python framework for creating reproducible, maintainable and modular data science code. https://github.com/kedro-org/kedro 40 comments
- Machine Learning – Amazon Web Services https://aws.amazon.com/sagemaker/ 29 comments
- GitHub - spotify/luigi: Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in. https://github.com/spotify/luigi 17 comments
- Modal: High-performance AI infrastructure https://modal.com/ 13 comments
- Apache Airflow https://airflow.apache.org/ 10 comments
- Presto: Free, Open-Source SQL Query Engine for any Data https://prestodb.io/ 8 comments
- Rent Cloud GPUs from $0.2/hour https://runpod.io 4 comments
- Data Lakehouse Architecture and AI Company - Databricks https://databricks.com/ 3 comments
- Dagster | Cloud-native orchestration of data pipelines https://dagster.io/ 3 comments
- GitHub - dbt-labs/dbt-core: dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications. https://github.com/dbt-labs/dbt-core 3 comments
- GitHub - TobikoData/sqlmesh: SQLMesh is a DataOps framework that brings the benefits of DevOps to data teams. It enables data scientists, analysts, and engineers to efficiently run and deploy data transformations written in SQL or Python. https://github.com/TobikoData/sqlmesh 2 comments
- Please separate orchestration and execution https://www.run.house/blog/orchestration-and-execution 2 comments
- https://www.snowflake.com/blog/introducing-polaris-catalog/ 2 comments
- Flyte https://flyte.org/ 0 comments
- Vertex AI | Google Cloud https://cloud.google.com/vertex-ai 0 comments
- GitHub - dlt-hub/dlt: data load tool (dlt) is an open source Python library that makes data loading easy 🛠️ https://github.com/dlt-hub/dlt 0 comments
- How well-structured should your data code be? https://blog.dagworks.io/p/how-well-structured-should-your-data 0 comments
- GitHub - run-house/runhouse: The fastest way to iterate and deploy AI workloads on your own infra. Unobtrusive, debuggable, PyTorch-like APIs. https://github.com/run-house/runhouse 0 comments
Related searches:
Search whole site: site:blog.dagworks.io
Search title: Lean Data Automation: A Principal Components Approach
See how to search.