GitHub - karpathy/minbpe: Minimal, clean, educational code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. - discu.eu

Hacker News

Code for the Byte Pair Encoding algorithm, commonly used in LLM tokenization https://github.com/karpathy/minbpe 31 comments 17/2/2024

Linking pages

GitHub - naklecha/llama3-from-scratch: llama3 implementation one matrix multiplication at a time https://github.com/naklecha/llama3-from-scratch 269 comments
GitHub - therealoliver/Deepdive-llama3-from-scratch: Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code. https://github.com/therealoliver/Deepdive-llama3-from-scratch 14 comments
GitHub - kuprel/minbpe-pytorch: Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA https://github.com/kuprel/minbpe-pytorch 9 comments
GitHub - SalvatoreRa/ML-news-of-the-week: A collection of the the best ML and AI news every week (research, news, resources) https://github.com/SalvatoreRa/ML-news-of-the-week 8 comments
GitHub - Camb-ai/MARS5-TTS: MARS5 speech model (TTS) from CAMB.AI https://github.com/Camb-ai/MARS5-TTS 6 comments
GitHub - mukel/llama3.java: Practical Llama 3 inference in Java https://github.com/mukel/llama3.java 4 comments
Direct Preference Optimization Explained In-depth https://www.tylerromero.com/posts/2024-04-dpo/ 0 comments
GitHub - BobMcDear/minbpe-hs: Byte-level byte pair encoding (BPE) in Haskell https://github.com/BobMcDear/minbpe-hs 0 comments
(Opinionated) Guide to ML Engineer Job Hunting | Yuan Meng https://www.yuan-meng.com/posts/mle_interviews/ 0 comments
Implementing A Byte Pair Encoding (BPE) Tokenizer From Scratch https://sebastianraschka.com/blog/2025/bpe-from-scratch.html 0 comments
Deepdive-llama3-from-scratch/README.md at main · therealoliver/Deepdive-llama3-from-scratch · GitHub https://github.com/therealoliver/Deepdive-llama3-from-scratch/blob/main/README.md 0 comments

Linked pages

Related searches:

Search whole site: site:github.com

Search title: GitHub - karpathy/minbpe: Minimal, clean, educational code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

See how to search.

Submit link to: