Linked pages
- GitHub - karpathy/nanoGPT: The simplest, fastest repository for training/finetuning medium-sized GPTs. https://github.com/karpathy/nanoGPT 366 comments
- GitHub - jzhang38/TinyLlama https://github.com/jzhang38/TinyLlama 60 comments
- Let's build the GPT Tokenizer - YouTube https://www.youtube.com/watch?v=zduSFxRajkE 51 comments
- The Illustrated Transformer – Jay Alammar – Visualizing machine learning one concept at a time. https://jalammar.github.io/illustrated-transformer/ 36 comments
- The spelled-out intro to neural networks and backpropagation: building micrograd - YouTube https://www.youtube.com/watch?v=VMj-3S1tku0 17 comments
- [2104.09864] RoFormer: Enhanced Transformer with Rotary Position Embedding https://arxiv.org/abs/2104.09864 8 comments
- TensorDock — Easy & Affordable Cloud GPUs https://tensordock.com/ 2 comments
- [2203.15556] Training Compute-Optimal Large Language Models https://arxiv.org/abs/2203.15556 0 comments
- [2307.09288] Llama 2: Open Foundation and Fine-Tuned Chat Models https://arxiv.org/abs/2307.09288 0 comments
- HuggingFaceTB/smoltalk · Datasets at Hugging Face https://huggingface.co/datasets/HuggingFaceTB/smoltalk 0 comments