Hacker News
- Code for the Byte Pair Encoding algorithm, commonly used in LLM tokenization https://github.com/karpathy/minbpe 31 comments
Linking pages
- GitHub - naklecha/llama3-from-scratch: llama3 implementation one matrix multiplication at a time https://github.com/naklecha/llama3-from-scratch 269 comments
- GitHub - kuprel/minbpe-pytorch: Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA https://github.com/kuprel/minbpe-pytorch 9 comments
- GitHub - SalvatoreRa/ML-news-of-the-week: A collection of the best ML and AI news every week (research, news, resources) https://github.com/SalvatoreRa/ML-news-of-the-week 8 comments
- GitHub - Camb-ai/MARS5-TTS: MARS5 speech model (TTS) from CAMB.AI https://github.com/Camb-ai/MARS5-TTS 6 comments
- GitHub - mukel/llama3.java: Practical Llama 3 inference in Java https://github.com/mukel/llama3.java 4 comments
- Direct Preference Optimization Explained In-depth https://www.tylerromero.com/posts/2024-04-dpo/ 0 comments
- GitHub - BobMcDear/minbpe-hs: Byte-level byte pair encoding (BPE) in Haskell https://github.com/BobMcDear/minbpe-hs 0 comments
- (Opinionated) Guide to ML Engineer Job Hunting | Yuan Meng https://www.yuan-meng.com/posts/mle_interviews/ 0 comments
Linked pages
- GitHub - openai/tiktoken https://github.com/openai/tiktoken 74 comments
- GitHub - openai/gpt-2: Code for the paper "Language Models are Unsupervised Multitask Learners" https://github.com/openai/gpt-2 2 comments
- Language Models are Unsupervised Multitask Learners (PDF) https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf 1 comment