Hacker News
- TinyLlama project aims to pretrain a 1.1B Llama model on 3T tokens https://github.com/jzhang38/TinyLlama 60 comments
Linking pages
- GitHub - mlabonne/llm-course: Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. https://github.com/mlabonne/llm-course 10 comments
- GitHub - tairov/llama2.mojo: Inference Llama 2 in one file of pure 🔥 https://github.com/tairov/llama2.mojo 1 comment
- GitHub - Lightning-AI/litdata: Transform datasets at scale. Optimize datasets for fast AI model training. https://github.com/Lightning-AI/litdata 1 comment
- Running LLaVA on iOS With llama.cpp and TinyLlama – Prashanth Sadasivan – The Chief Questions Officer 💻🍳☕️ https://prashanth.world/llava-on-ios/ 1 comment
- GitHub - microsoft/Samba: Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling" https://github.com/microsoft/Samba 1 comment
- The AI OS (Sept 2023 Recap) - by swyx - Latent Space https://www.latent.space/p/sep-2023 0 comments
- Research Papers in January 2024 - by Sebastian Raschka, PhD https://magazine.sebastianraschka.com/p/research-papers-in-january-2024 0 comments
- GitHub - NexaAI/Awesome-LLMs-on-device: Awesome LLMs on Device: A Comprehensive Survey https://github.com/NexaAI/Awesome-LLMs-on-device 0 comments
Linked pages
- GitHub - ggerganov/llama.cpp: Port of Facebook's LLaMA model in C/C++ https://github.com/ggerganov/llama.cpp 286 comments
- GitHub Star History https://star-history.com/#microsoft/playwright&cypress-io/cypress&Date 78 comments
- [2304.01373] Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling https://arxiv.org/abs/2304.01373 7 comments
- GitHub - vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs https://github.com/vllm-project/vllm 0 comments
Related searches:
Search whole site: site:github.com
Search title: GitHub - jzhang38/TinyLlama
See how to search.