Linking pages
- Chicken-story/README.md at main · eyal0/Chicken-story · GitHub https://github.com/eyal0/Chicken-story/blob/main/README.md 107 comments
- Archiving URLs - Gwern.net https://www.gwern.net/Archiving-URLs 80 comments
- Finding near-duplicates with Jaccard similarity and MinHash - Made of Bugs https://blog.nelhage.com/post/fuzzy-dedup/ 40 comments
- GitHub - openvenues/libpostal: A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data. https://github.com/openvenues/libpostal 32 comments
- GitHub - orlp/foldhash: A fast, non-cryptographic, minimally DoS-resistant hashing algorithm for Rust. https://github.com/orlp/foldhash 15 comments
- The sort --key Trick · Gwern.net https://www.gwern.net/Sort 0 comments
- How ProPublica's Message Machine Reverse Engineers Political Microtargeting — ProPublica http://www.propublica.org/nerds/item/how-propublicas-message-machine-reverse-engineers-political-microtargeting 0 comments
- Black-Box Auditing: Verifying End-to-End Replication Integrity between MySQL and Redshift https://engineeringblog.yelp.com/2018/04/black-box-auditing.html 0 comments
- Distributed Top-N Similarity Join with Hive and Perl | by Bosko Devetak | Booking.com Engineering | Medium http://blog.booking.com/top-N-similarity-join-with-hive-and-perl-part-I.html 0 comments
- GitHub - vortext/clj-similar: Experimental library for similar set lookup using MinHash and k-d trees https://github.com/vortext/clj-similar 0 comments
- GitHub - dynatrace-oss/hash4j: Dynatrace hash library for Java https://github.com/dynatrace-oss/hash4j 0 comments
- Pass The Salt 2023 Wrap-Up - /dev/random https://blog.rootshell.be/2023/07/05/pass-the-salt-2023-wrap-up/ 0 comments
- A pure python LSH nearest neighbors implementation https://softwaredoug.com/blog/2023/08/21/implementing-random-projections 0 comments
- Algebraic Locality-Sensitive Hashing | Hopping to Numbers https://drorspei.com/2024/08/01/algebraic-locality-sensitive-hashing/ 0 comments
Related searches:
Search whole site: site:en.wikipedia.org
Search title: MinHash - Wikipedia
See how to search.