Hacker News
- Textdistance: Compute distance between sequences with 30 algorithms https://github.com/life4/textdistance 2 comments
Linking pages
- GitHub - life4/textdistance.rs: 🦀📏 Rust library to compare strings (or any sequences). 25+ algorithms, pure Rust, common interface, Unicode support. https://github.com/life4/textdistance.rs 27 comments
- Near duplicate image detection https://neuml.hashnode.dev/near-duplicate-image-detection 4 comments
- Diving into PyPI package name squatting | Gram Publishing v2 https://blog.orsinium.dev/posts/py/pypi-squatting/ 3 comments
- GitHub - dbousque/batch_jaro_winkler: Fast batch jaro winkler distance implementation in C99 with Ruby, OCaml and Python bindings. https://github.com/dbousque/batch_jaro_winkler 2 comments
- GitHub - r0f1/datascience: Curated list of Python resources for data science. https://github.com/r0f1/datascience 0 comments
- GitHub - ml-tooling/best-of-ml-python: 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly. https://github.com/ml-tooling/best-of-ml-python 0 comments
- Machine Learning Toolbox https://amitness.com/toolbox/ 0 comments
Linked pages
- Cosine similarity - Wikipedia https://en.wikipedia.org/wiki/Cosine_similarity 274 comments
- Levenshtein distance - Wikipedia https://en.wikipedia.org/wiki/Levenshtein_distance 173 comments
- Burrows–Wheeler transform - Wikipedia https://en.wikipedia.org/wiki/Burrows%E2%80%93Wheeler_transform 58 comments
- Jaccard index - Wikipedia https://en.wikipedia.org/wiki/Jaccard_index 43 comments
- Entropy (information theory) - Wikipedia https://en.wikipedia.org/wiki/Entropy_(information_theory)#Entropy_as_a_measure_of_diversity 16 comments
- Jaro–Winkler distance - Wikipedia https://en.wikipedia.org/wiki/Jaro%E2%80%93Winkler_distance 9 comments
- Home | Task https://taskfile.dev/ 6 comments
- Normalized compression distance - Wikipedia https://en.wikipedia.org/wiki/Normalized_compression_distance 2 comments
- Guide to Fuzzy Matching with Python - Open Source Automation http://theautomatic.net/2019/11/13/guide-to-fuzzy-matching-with-python/ 0 comments
- Run-length encoding - Wikipedia https://en.wikipedia.org/wiki/Run-length_encoding 0 comments
- Hamming distance - Wikipedia https://en.wikipedia.org/wiki/Hamming_distance 0 comments
- Longest common subsequence - Wikipedia https://en.wikipedia.org/wiki/Longest_common_subsequence_problem 0 comments
- Arithmetic coding - Wikipedia http://en.wikipedia.org/wiki/Arithmetic_coding 0 comments
- Damerau–Levenshtein distance - Wikipedia https://en.wikipedia.org/wiki/Damerau%E2%80%93Levenshtein_distance 0 comments
- Needleman–Wunsch algorithm - Wikipedia https://en.wikipedia.org/wiki/Needleman%E2%80%93Wunsch_algorithm 0 comments
- GitHub - jamesturk/jellyfish: 🪼 a python library for doing approximate and phonetic matching of strings. https://github.com/jamesturk/jellyfish 0 comments