- Splink 3: Fast, accurate and scalable fuzzy record linkage in Python with support for multiple backends (FOSS) https://github.com/moj-analytical-services/splink/ 7 comments datascience
Linking pages
- GitHub - tobymao/sqlglot: Python SQL Parser and Transpiler https://github.com/tobymao/sqlglot 135 comments
- sqlglot/python_sql_engine.md at main · tobymao/sqlglot · GitHub https://github.com/tobymao/sqlglot/blob/main/posts/python_sql_engine.md 21 comments
- GitHub - ropeladder/record-linkage-resources: Resources for tackling record linkage / deduplication / data matching problems https://github.com/ropeladder/record-linkage-resources 3 comments
- Fuzzy Matching and Deduplicating Hundreds of Millions of Records with Splink | by Robin Linacre | Towards Data Science https://towardsdatascience.com/fuzzy-matching-and-deduplicating-hundreds-of-millions-of-records-using-apache-spark-93d0f095001f 0 comments
- GitHub - davidgasquez/awesome-duckdb: 🦆 A curated list of awesome DuckDB resources https://github.com/davidgasquez/awesome-duckdb 0 comments
- Fuzzy regex matching in Python • Max Halford https://maxhalford.github.io/blog/fuzzy-regex-matching-in-python/ 0 comments
Would you like to stay up to date with Python? Checkout Python
Weekly.