Hacker News
Linked pages
- GitHub - dedupeio/dedupe: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution. https://github.com/dedupeio/dedupe 13 comments
- GitHub - moj-analytical-services/splink: Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends https://github.com/moj-analytical-services/splink 9 comments
- Thicket [beta] https://thicket.io 1 comment
- GitHub - vintasoftware/entity-embed: PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors. https://github.com/vintasoftware/entity-embed/ 0 comments
Related searches:
Search whole site: site:github.com
Search title: GitHub - ropeladder/record-linkage-resources: Resources for tackling record linkage / deduplication / data matching problems
See how to search.