Hacker News
- NanoGPT: The simplest, fastest repository for training medium-sized GPTs https://github.com/karpathy/nanoGPT 21 comments
- NanoGPT https://github.com/karpathy/nanoGPT 320 comments
- I Created a Neural Network which Beats the Transformer in a Metric by Quite a Bit [Project][Discussion] https://github.com/karpathy/nanoGPT 22 comments machinelearning
- [P] Nano GPT https://github.com/karpathy/nanoGPT 2 comments machinelearning
Linking pages
- The AI research job market shit show (and my experience) https://www.interconnects.ai/p/ai-research-job-market 183 comments
- GitHub - karpathy/minGPT: A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training https://github.com/karpathy/minGPT 133 comments
- PyTorch 2.0: Our next generation release that is faster, more Pythonic and Dynamic as ever | PyTorch https://pytorch.org/blog/pytorch-2.0-release/ 122 comments
- Chess-GPT’s Internal World Model | Adam Karvonen https://adamkarvonen.github.io/machine_learning/2024/01/03/chess-world-models.html 112 comments
- GitHub - suno-ai/bark: 🔊 Text-Prompted Generative Audio Model https://github.com/suno-ai/bark 93 comments
- It's not just statistics: GPT-4 does reason. https://jbconsulting.substack.com/p/its-not-just-statistics-gpt-4-does 93 comments
- GitHub - Lightning-AI/lit-llama: Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed. https://github.com/Lightning-AI/lit-llama 69 comments
- bark/README.md at main · suno-ai/bark · GitHub https://github.com/suno-ai/bark/blob/main/README.md 60 comments
- Understanding Large Language Models - by Sebastian Raschka https://magazine.sebastianraschka.com/p/understanding-large-language-models 53 comments
- GitHub - pytorch/torchchat: Run PyTorch LLMs locally on servers, desktop and mobile https://github.com/pytorch/torchchat 41 comments
- Inside the Matrix: Visualizing Matrix Multiplication, Attention and Beyond | PyTorch https://pytorch.org/blog/inside-the-matrix/ 34 comments
- Monosemanticity at Home: My Attempt at Replicating Anthropic's Interpretability Research from Scratch https://jakeward.substack.com/p/monosemanticity-at-home-my-attempt 31 comments
- Understanding Large Language Models -- A Transformative Reading List https://sebastianraschka.com/blog/2023/llm-reading-list.html 26 comments
- GitHub - certik/fastGPT: Fast GPT-2 inference written in Fortran https://github.com/certik/fastGPT 13 comments
- GitHub - dabochen/spreadsheet-is-all-you-need: A nanoGPT pipeline packed in a spreadsheet https://github.com/dabochen/spreadsheet-is-all-you-need 13 comments
- GitHub - sedthh/BeatLearning: Open Source Generative AI Models for Automatic Rhythm Game Beatmap Generation (for acoustic people) https://github.com/sedthh/BeatLearning 11 comments
- GitHub - google/maxtext: A simple, performant and scalable Jax LLM! https://github.com/google/maxtext 9 comments
- GitHub - max-ng/recurser: Reduce VRAM usage on transformer models https://github.com/max-ng/recurser 7 comments
- GitHub - arpytanshu1/ts-tok: Time Series Tokenizer for Transformers https://github.com/arpytanshu1/ts-tok 6 comments
- GitHub - vithursant/nanoGPT_mlx https://github.com/vithursant/nanoGPT_mlx 6 comments
Linked pages
- PyTorch 2.0 | PyTorch https://pytorch.org/get-started/pytorch-2.0/ 153 comments
- GitHub - karpathy/minGPT: A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training https://github.com/karpathy/minGPT 133 comments
- Let's build GPT: from scratch, in code, spelled out. - YouTube https://www.youtube.com/watch?v=kCc8FmEb1nY 105 comments
- PyTorch http://pytorch.org/ 100 comments
- Neural Networks: Zero To Hero https://karpathy.ai/zero-to-hero.html 69 comments
- Start Locally | PyTorch https://pytorch.org/get-started/locally/ 3 comments
- GPU Cloud, Workstations, Servers, Laptops for Deep Learning | Lambda https://lambdalabs.com/ 0 comments