Linking pages
- Why data scientists shouldn’t need to know Kubernetes https://huyenchip.com/2021/09/13/data-science-infrastructure.html 121 comments
- GitHub - bayandin/awesome-awesomeness: A curated list of awesome awesomeness https://github.com/bayandin/awesome-awesomeness 66 comments
- GitHub - jnv/lists: The definitive list of lists (of lists) curated on GitHub and elsewhere https://github.com/jnv/lists 28 comments
- GitHub - grailbio/reflow: A language and runtime for distributed, incremental data processing in the cloud https://github.com/grailbio/reflow 15 comments
- GitHub - ropensci/drake: An R-focused pipeline toolkit for reproducibility and high-performance computing https://github.com/ropensci/drake 5 comments
- Viewflow 1.0. I’m glad to announce 1.0 release of… | by Django Viewflow | Medium https://medium.com/@viewflow/viewflow-1-0-835b72a65fea 4 comments
- GitHub - visenger/awesome-mlops: A curated list of references for MLOps https://github.com/visenger/awesome-mlops 2 comments
- GitHub - cuuupid/awesome-lists: A curated list of curated lists. https://github.com/cuuupid/awesome-lists 1 comment
- GitHub - r0f1/datascience: Curated list of Python resources for data science. https://github.com/r0f1/datascience 0 comments
- Workflow/BPM Systems, 2020 : Neuroning https://neuroning.com/post/workflow-systems-2020/ 0 comments
- Building a Data Engineering Project in 20 Minutes | https://www.sspaeti.com/blog/data-engineering-project-in-twenty-minutes/ 0 comments
- Data Science workflows at insitro: using redun on AWS Batch | AWS HPC Blog https://aws.amazon.com/blogs/hpc/data-science-workflows-at-insitro-using-redun-on-aws-batch/ 0 comments
- GitHub - asoplata/open-science-resources: A publicly-editable collection of open science resources, including tools, datasets, meta-resources, etc. https://github.com/asoplata/open-science-resources 0 comments
- Bioinformatics pipeline example from the bottom up https://ricomnl.com/blog/bottom-up-bioinformatics-pipeline/ 0 comments
- Building a Data Engineering Project in 20 Minutes | https://www.ssp.sh/blog/data-engineering-project-in-twenty-minutes/ 0 comments
- redun documentation https://insitro.github.io/redun/ 0 comments
Linked pages
- Bazel http://bazel.io/ 374 comments
- Project Jupyter | Home http://jupyter.org 240 comments
- Apache NiFi http://nifi.apache.org/index.html 223 comments
- GitHub - kahun/awesome-sysadmin: A curated list of amazingly awesome open source sysadmin resources inspired by Awesome PHP. https://github.com/kahun/awesome-sysadmin 194 comments
- Data Version Control · DVC https://dvc.org/ 175 comments
- Metaflow - a framework for real-life data science and ML https://metaflow.org/ 151 comments
- GitHub - uber/cadence: Cadence is a distributed, scalable, durable, and highly available orchestration engine to execute asynchronous long-running business logic in a scalable and resilient way. https://github.com/uber/cadence 133 comments
- GitHub - mara/mara-pipelines: A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow https://github.com/mara/data-integration 125 comments
- GitHub - kestra-io/kestra: Orchestration and automation platform to execute millions of scheduled and event-driven workflows declaratively in code and from the UI https://github.com/kestra-io/kestra 108 comments
- GitHub - substantic/rain: Framework for large distributed pipelines https://github.com/substantic/rain 108 comments
- Open Source Durable Execution | Temporal Technologies https://temporal.io 95 comments
- SCons: A software construction tool - SCons http://scons.org/ 88 comments
- GitHub - ovh/cds: Enterprise-Grade Continuous Delivery & DevOps Automation Open Source Platform https://github.com/ovh/cds 58 comments
- GitHub - treeverse/lakeFS: lakeFS - Data version control for your data lake | Git for data https://github.com/treeverse/lakeFS 55 comments
- http://beakernotebook.com 53 comments
- GitHub - ploomber/ploomber: The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️ https://github.com/ploomber/ploomber 50 comments
- Guix Workflow Language | Guix Workflow Language https://www.guixwl.org/ 50 comments
- A DSL for parallel and scalable computational pipelines | Nextflow https://www.nextflow.io/ 40 comments
- Kiba ETL http://www.kiba-etl.org/ 32 comments
- Zeppelin http://zeppelin.apache.org/ 31 comments
Related searches:
Search whole site: site:github.com
Search title: GitHub - pditommaso/awesome-pipeline: A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
See how to search.