Hacker News
- Pull data from 100s of sources in 1 Python statement with in-memory ELT library https://airbyte.com/blog/announcing-pyairbyte 3 comments
- Navigating the Data Engineering Landscape in 2024 https://airbyte.com/blog/data-engineering-landscape-2024 3 comments
- A Technical Dive into PostgreSQL's replication mechanisms https://airbyte.com/blog/a-guide-to-logical-replication-and-cdc-in-postgresql 25 comments
- ELTP: Extending ELT for Modern AI and Analytics https://airbyte.com/blog/eltp-extending-elt-for-modern-ai-and-analytics 15 comments
- Show HN: Chat with your data using LangChain, Pinecone, and Airbyte https://airbyte.com/tutorials/chat-with-your-data-using-openai-pinecone-airbyte-and-langchain 59 comments
- Airbyte API & Terraform Provider – available in open source https://airbyte.com/blog/airbytes-official-api-and-terraform-provider-now-in-open-source 6 comments
- Using Adapt and Beam for Effective Data Modeling https://airbyte.com/blog/data-modeling-unsung-hero-data-engineering-architecture-pattern-tools 2 comments
- Airbyte makes 100 alpha / beta connectors free https://airbyte.com/blog/why-airbyte-made-alpha-and-beta-connectors-free 22 comments
- Data Warehouse vs. Operational Database What? How? Which One? https://airbyte.com/blog/data-warehouse-vs-operational-database 4 comments
- The evolution of the data engineer role https://airbyte.com/blog/data-engineering-past-present-and-future 117 comments
- Using EtLT to improve GDPR compliance https://airbyte.com/blog/etlt-gdpr-compliance 2 comments
- Data Integration Guide: Techniques, Technologies, and Tools https://airbyte.com/blog/data-integration 2 comments
- Airbyte acquires Grouparoo to accelerate Data Movement https://airbyte.com/blog/airbyte-acquires-grouparoo-to-accelerate-data-movement 2 comments
- Scaling Data Pipelines on Kubernetes https://airbyte.com/blog/scaling-data-pipelines-kubernetes 2 comments
- From NumPy to Arrow: How Pandas 2.0 is Changing Data Processing for the Better https://airbyte.com/blog/pandas-2-0-ecosystem-arrow-polars-duckdb/ 89 comments datascience
- From NumPy to Arrow: How Pandas 2.0 is Changing Data Processing for the Better https://airbyte.com/blog/pandas-2-0-ecosystem-arrow-polars-duckdb/ 2 comments python
- You have collected unstructured data! Now what? https://airbyte.com/blog/analyze-unstructured-data 2 comments database
- The fundamental architectural difference that makes data warehouses appropriate for analytics, and that makes operational databases appropriate for operational workloads. https://airbyte.com/blog/data-warehouse-vs-operational-database 3 comments database
- Redshift Turns 10: The Evolution of Amazon's Cloud Data Warehouse https://airbyte.com/blog/amazon-redshift-data-warehouse-evolution 5 comments aws
- How to optimize Redshift performance and reduce costs https://airbyte.com/blog/optimize-redshift-performance-and-reduce-costs 3 comments aws
- Will Rust Take over Data Engineering? https://airbyte.com/blog/rust-for-data-engineering 4 comments coding
- How to build a Data Lake with Python APIs on top of Table Formats (Apache Hudi, Iceberg, Delta Lake) https://airbyte.com/blog/data-lake-lakehouse-guide-powered-by-table-formats-delta-lake-iceberg-hudi 2 comments python
- Best practices for data modeling with SQL and dbt https://airbyte.com/blog/sql-data-modeling-with-dbt 2 comments sql
- Python vs SQL for Data Analysis: comparing performance, functionality and dev XP https://airbyte.com/blog/sql-vs-python-data-analysis 9 comments python
- How we run database migrations with Flyway, jOOQ, and testcontainers https://airbyte.com/blog/database-migrations-with-flyway-jooq-and-testcontainers 24 comments java
Linking pages
- GitHub - remoteintech/remote-jobs: A list of semi to fully remote-friendly companies (jobs) in tech. https://github.com/remoteintech/remote-jobs 158 comments
- GitHub - Qovery/Replibyte: Seed your development database with real data ⚡️ https://github.com/Qovery/replibyte 115 comments
- GitHub - airbytehq/airbyte: Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes. https://github.com/airbytehq/airbyte 39 comments
- Extract, load, and transform your data with PlanetScale Connect https://planetscale.com/blog/extract-load-and-transform-your-data-with-planetscale-connect 20 comments
- Self Hostable Open Source Alternatives to Commercial products - DEV Community 👩💻👨💻 https://dev.to/rupeshpadhye/self-hostable-open-source-alternatives-to-commercial-products-ga4 19 comments
- GitHub - CrowdDotDev/awesome-oss-investors: Awesome list of VCs investing in commercial open-source startups 💸 https://github.com/CrowdDotDev/awesome-oss-investors 8 comments
- Replicate RDS PostgreSQL to BigQuery using AirByte CDC | by RK Kuppala | The Cloudside View https://blog.thecloudside.com/replicate-rds-postgresql-to-bigquery-using-airbyte-cdc-9be9f294e71a 4 comments
- GitHub - sample-resume/awesome-easy-apply: 🚀 A curated list of 800+ software engineering companies that use easy-to-apply job platforms like Lever and Greenhouse https://github.com/sample-resume/awesome-easy-apply 4 comments
- Introducing Open-Source Indexes: Databases, Headless CMSs and Static Site Generators | by Bogdan Semenov | Runa Capital // Writings | Medium https://medium.com/runacapital/introducing-real-time-open-source-indexes-databases-headless-cmss-and-static-site-generators-5b53cbf87188 3 comments
- ETL vs. ELT https://glossary.airbyte.com/term/etl-vs-elt/ 3 comments
- GitHub - valmi-io/valmi-activation: valmi.io reverse-ETL (data activation) is the open-source data activation platform to load data from warehouses into SaaS platforms, Webhook Apis etc. https://github.com/valmi-io/valmi-activation 2 comments
- Benchmarking Postgres Replication: PeerDB vs Airbyte https://blog.peerdb.io/benchmarking-postgres-replication-peerdb-vs-airbyte 2 comments
- GitHub - faros-ai/faros-community-edition: BI, API and Automation layer for your Engineering Operations data https://github.com/faros-ai/faros-community-edition 1 comment
- Investing in Software Infrastructure in a Downmarket | by Cowboy Ventures | Cowboy Ventures | Medium https://medium.com/cowboy-ventures/investing-in-software-infrastructure-in-a-downmarket-893631ef841e 1 comment
- Philosophy Major to Shopify Staff Developer - PorchLab https://www.porchlab.com/philosophy-shopify-developer/ 1 comment
- Data50: The World’s Top 50 Data Startups | Future https://future.a16z.com/data50/ 1 comment
- Remote-friendly companies – Remote In Tech https://remoteintech.company 1 comment
- How to Automate Data Analytics Using CI/CD | by Patrik Braborec | GoodData Developers | Medium https://medium.com/gooddata-developers/how-to-automate-data-analytics-using-ci-cd-9f1475065d61 1 comment
- How we Created an in-Browser Kubernetes Experience https://www.plural.sh/blog/how-we-created-an-in-browser-kubernetes-experience/ 1 comment
- How to build a DAG based Task Scheduling tool for Multiprocessor systems using python | by Ramses Alexander Coraspe Valdez | ITNEXT https://coraspe-ramses.medium.com/how-to-build-a-dag-based-task-scheduling-tool-for-multiprocessor-systems-using-python-d11a093a835b?sk=cd97481b16fea0e941c32362eaded7c5 1 comment