Hacker News
- Finding near-duplicates with Jaccard similarity and MinHash https://blog.nelhage.com/post/fuzzy-dedup/ 36 comments
Lobsters
- Finding near-duplicates with Jaccard similarity and MinHash https://blog.nelhage.com/post/fuzzy-dedup/ 2 comments (tags: math, programming)
Linked pages
- [2005.14165] Language Models are Few-Shot Learners https://arxiv.org/abs/2005.14165 201 comments
- Jaccard index - Wikipedia https://en.wikipedia.org/wiki/Jaccard_index 43 comments
- Locality-sensitive hashing - Wikipedia https://en.wikipedia.org/wiki/Locality-sensitive_hashing 40 comments
- HyperLogLog - Wikipedia https://en.wikipedia.org/wiki/HyperLogLog 3 comments
- UAX #15: Unicode Normalization Forms https://unicode.org/reports/tr15/ 0 comments
- [2101.00314] SetSketch: Filling the Gap between MinHash and HyperLogLog https://arxiv.org/abs/2101.00314 0 comments
- MinHash - Wikipedia http://en.wikipedia.org/wiki/MinHash 0 comments