- [D] Do you use data engineering pipelines for real life projects? https://github.com/great-expectations/great_expectations 9 comments machinelearning
Linking pages
- Ruff https://beta.ruff.rs/docs/ 67 comments
- GitHub - tensorchord/Awesome-LLMOps: An awesome & curated list of best LLMOps tools for developers https://github.com/tensorchord/Awesome-LLMOps 5 comments
- How to get your data scientists and data engineers rowing in the same direction | VentureBeat https://venturebeat.com/2020/08/09/how-to-get-your-data-scientists-and-data-engineers-rowing-in-the-same-direction/ 4 comments
- How Provectus Built a High-Load Data Quality Pipeline on AWS for Lane Health | AWS Partner Network (APN) Blog https://aws.amazon.com/blogs/apn/how-provectus-built-a-high-load-data-quality-pipeline-on-aws-for-lane-health/ 3 comments
- The Most Underrated Python Packages | by Eyal Trabelsi | Towards Data Science https://towardsdatascience.com/the-most-underrated-python-packages-e22bf6049b5e 2 comments
- GitLab-spinoff Meltano raises another $8.2M for its open source DataOps platform | TechCrunch https://techcrunch.com/2022/06/08/gitlab-spinoff-meltano-raises-another-8-2m-for-its-open-source-dataops-platform/ 2 comments
- Data Distribution Shifts and Monitoring https://huyenchip.com/2022/02/07/data-distribution-shifts-and-monitoring.html 1 comment
- Introducing Dagster. A open-source Python library for… | by Nick Schrock | Dagster | Medium https://medium.com/@schrockn/introducing-dagster-dbd28442b2b7 1 comment
- So you want Data Quality Control? | DoltHub Blog https://www.dolthub.com/blog/2022-11-23-data-quality-control/ 1 comment
- How to build TRUST in Machine Learning, the sane way | by Eyal Trabelsi | Bigabid’s Brain | Medium https://medium.com/bigabids-dataverse/how-to-build-trust-in-machine-learning-the-sane-way-39d879f22e69 0 comments
- How to Build a Production Grade Workflow with SQL Modelling — Data Science & Engineering https://shopify.engineering/build-production-grade-workflow-sql-modelling 0 comments
- GitHub - ml-tooling/best-of-python: 🏆 A ranked list of awesome Python open-source libraries and tools. Updated weekly. https://github.com/ml-tooling/best-of-python 0 comments
- Down with Pipeline debt / Introducing Great Expectations | by Great Expectations | Medium https://medium.com/@expectgreatdata/down-with-pipeline-debt-introducing-great-expectations-862ddc46782a 0 comments
- Open Source Patterns in Python | DoltHub Blog https://www.dolthub.com/blog/2021-08-16-doltpy-automation/ 0 comments
- Testing Pandas Code - MungingData https://mungingdata.com/pandas/unit-testing-pandas/ 0 comments
- GitHub - unionai-oss/pandera: A light-weight, flexible, and expressive statistical data testing library https://github.com/pandera-dev/pandera 0 comments
- GitHub - tensorchord/awesome-open-source-mlops: An awesome & curated list of best open source MLOps/LLMOps tools for data scientists. https://github.com/tensorchord/awesome-open-source-mlops 0 comments
- GitHub - vihar/awesome-oss-saas: A collection of open-source saas tools https://github.com/vihar/awesome-oss-saas 0 comments
Linked pages
- Data Version Control · DVC https://dvc.org/ 123 comments
- GitHub - charliermarsh/ruff: An extremely fast Python linter, written in Rust. https://github.com/charliermarsh/ruff 101 comments
- Apache Airflow https://airflow.apache.org/ 10 comments
- Apache Spark™ - Unified Engine for large-scale data analytics http://spark.apache.org/ 9 comments
- Workflow Orchestration Made Simple | Prefect https://prefect.io 7 comments
- dbt - Transform data in your warehouse https://www.getdbt.com/ 6 comments
- GitHub - dagster-io/dagster: An orchestration platform for the development, production, and observation of data assets. https://github.com/dagster-io/dagster 2 comments
- GitHub - quiltdata/quilt: Quilt is a data mesh for connecting people with actionable data https://github.com/quiltdata/quilt 2 comments
- GitHub - kedro-org/kedro: A Python framework for creating reproducible, maintainable and modular data science code. https://github.com/quantumblacklabs/kedro 0 comments
- Flyte https://flyte.org/ 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:github.com
Search title: GitHub - great-expectations/great_expectations: Always know what to expect from your data.
See how to search.