- LZ77 Is All You Need? Why Gzip + KNN Works for Text Classification https://codeconfessions.substack.com/p/lz77-is-all-you-need 3 comments datascience
- LZ77 Is All You Need? Why Gzip + KNN Works for Text Classification https://codeconfessions.substack.com/p/lz77-is-all-you-need 2 comments coding
Linked pages
- https://kenschutte.com/gzip-knn-paper/ 130 comments
- https://aclanthology.org/2023.findings-acl.426/ 17 comments
- Asymmetric numeral systems - Wikipedia https://en.wikipedia.org/wiki/Asymmetric_numeral_systems 15 comments
- Huffman coding - Wikipedia https://en.wikipedia.org/wiki/Huffman_coding 12 comments
- ACL Paper Decoded: Gzip + KNN Rival BERT in Text Classification https://codeconfessions.substack.com/p/decoding-the-acl-paper-gzip-and-knn 11 comments
- Large Language Models and Nearest Neighbors https://magazine.sebastianraschka.com/p/large-language-models-and-nearest 4 comments
- Normalized compression distance - Wikipedia https://en.wikipedia.org/wiki/Normalized_compression_distance 2 comments
- Arithmetic coding - Wikipedia http://en.wikipedia.org/wiki/Arithmetic_coding 0 comments
- Lempel–Ziv–Markov chain algorithm - Wikipedia https://en.wikipedia.org/wiki/Lempel%E2%80%93Ziv%E2%80%93Markov_chain_algorithm 0 comments
- zstd - Wikipedia https://en.wikipedia.org/wiki/Zstd 0 comments
- GitHub - cyrilou242/ftcc: Fast Text Classification with Compressors dictionary https://github.com/cyrilou242/ftcc 0 comments
- LZ77 and LZ78 - Wikipedia https://en.wikipedia.org/wiki/LZ77_and_LZ78 0 comments
Related searches:
Search whole site: site:codeconfessions.substack.com
Search title: Why Gzip-KNN Works: The LZ77 Factor in Text Classification
See how to search.