Linking pages
- What We Know About LLMs (Primer) https://willthompson.name/what-we-know-about-llms-primer 164 comments
- GitHub - kingoflolz/mesh-transformer-jax: Model parallel transformers in JAX and Haiku https://github.com/kingoflolz/mesh-transformer-jax 146 comments
- GitHub - EleutherAI/gpt-neox: An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library. https://github.com/EleutherAI/gpt-neox 67 comments
- DeepSpeed/blogs/deepspeed-chat/README.md at master · microsoft/DeepSpeed · GitHub https://github.com/microsoft/DeepSpeed/blob/master/blogs/deepspeed-chat/README.md 55 comments
- GitHub - QwenLM/Qwen: The official repo of Qwen (通义千问), the chat and pretrained large language model proposed by Alibaba Cloud. https://github.com/QwenLM/Qwen 51 comments
- GitHub - punica-ai/punica: Serving multiple LoRA-finetuned LLMs as one https://github.com/punica-ai/punica 26 comments (a minimal LoRA sketch follows this list)
- GitHub - linkedin/Liger-Kernel: Efficient Triton Kernels for LLM Training https://github.com/linkedin/Liger-Kernel 19 comments
- Snowflake Arctic - LLM for Enterprise AI https://www.snowflake.com/blog/arctic-open-efficient-foundation-language-models-snowflake/ 6 comments
- GitHub - tensorchord/Awesome-LLMOps: An awesome & curated list of best LLMOps tools for developers https://github.com/tensorchord/Awesome-LLMOps 5 comments
- GitHub - Alpha-VLLM/LLaMA2-Accessory: An Open-source Toolkit for LLM Development https://github.com/Alpha-VLLM/LLaMA2-Accessory 3 comments
- GitHub - janhq/awesome-local-ai: An awesome repository of local AI tools https://github.com/janhq/awesome-local-ai 3 comments
- PyTorch Lightning vs DeepSpeed vs FSDP vs FFCV vs … | by William Falcon | Towards Data Science https://william-falcon.medium.com/pytorch-lightning-vs-deepspeed-vs-fsdp-vs-ffcv-vs-e0d6b2a95719 2 comments
- GitHub - OpenGVLab/InternImage: [CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions https://github.com/OpenGVLab/InternImage 2 comments
- GitHub - OpenLLMAI/OpenRLHF: A Ray-based high-performance RLHF framework (7B models on an RTX 4090, 34B on an A100) https://github.com/OpenLLMAI/OpenRLHF 2 comments
- OneFlow Made Training GPT-3 Easier (Part 1) | by OneFlow | Medium https://oneflow2020.medium.com/oneflow-made-training-gpt-3-easier-part-1-5b6b65d70d3c 1 comment
- GitHub - ai-forever/ru-gpts: Russian GPT3 models. https://github.com/sberbank-ai/ru-gpts 1 comment
- GitHub - enhuiz/vall-e: An unofficial PyTorch implementation of the audio LM VALL-E, WIP https://github.com/enhuiz/vall-e 1 comment
- DeepSpeedExamples/applications/DeepSpeed-Chat at master · microsoft/DeepSpeedExamples · GitHub https://github.com/microsoft/DeepSpeedExamples/tree/master/applications/DeepSpeed-Chat 1 comment
- GitHub - deepseek-ai/DeepSeek-Coder: DeepSeek Coder: Let the Code Write Itself https://github.com/deepseek-ai/DeepSeek-Coder 1 comment
- GitHub - HazyResearch/aisys-building-blocks: Building blocks for foundation models. https://github.com/HazyResearch/aisys-building-blocks 1 comment
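
Several entries above presuppose LoRA finetuning (punica, per its tagline, serves many LoRA-finetuned models as one). For orientation only, here is a minimal LoRA sketch using the Hugging Face peft library; the base model, rank, and target modules are illustrative assumptions, not taken from the punica repo.

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Placeholder base model; any causal LM works the same way.
base = AutoModelForCausalLM.from_pretrained("gpt2")

lora_cfg = LoraConfig(
    r=8,                        # rank of the low-rank update matrices
    lora_alpha=16,              # scaling applied to the update
    target_modules=["c_attn"],  # GPT-2's fused QKV projection
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(base, lora_cfg)
model.print_trainable_parameters()  # only the small adapter matrices are trainable
```

Because each adapter is just a pair of small rank-r matrices, many adapters can share one copy of the base weights at serving time, which is the premise behind punica's "as one" framing.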
Linked pages
- GitHub - yandex/YaLM-100B: Pretrained language model with 100B parameters https://github.com/yandex/YaLM-100B 902 comments
- Turing-NLG: A 17-billion-parameter language model by Microsoft - Microsoft Research https://www.microsoft.com/en-us/research/blog/turing-nlg-a-17-billion-parameter-language-model-by-microsoft/ 139 comments
- PyTorch http://pytorch.org/ 100 comments
- 20B-parameter Alexa model sets new marks in few-shot learning - Amazon Science https://www.amazon.science/blog/20b-parameter-alexa-model-sets-new-marks-in-few-shot-learning 87 comments
- GitHub - EleutherAI/gpt-neox: An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library. https://github.com/EleutherAI/gpt-neox 67 comments
- Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, the World’s Largest and Most Powerful Generative Language Model - Microsoft Research https://www.microsoft.com/en-us/research/blog/using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model/ 11 comments
- GitHub - THUDM/GLM-130B: GLM-130B: An Open Bilingual Pre-Trained Model https://github.com/THUDM/GLM-130B 1 comment
- [2101.06840] ZeRO-Offload: Democratizing Billion-Scale Model Training https://arxiv.org/abs/2101.06840 1 comment
- Ultimate Guide To Scaling ML Models - Megatron-LM | ZeRO | DeepSpeed | Mixed Precision - YouTube https://youtu.be/hc0u4avAkuM 0 comments
- Latest News - DeepSpeed https://www.deepspeed.ai/ 0 comments
- The Technology Behind BLOOM Training https://huggingface.co/blog/bloom-megatron-deepspeed 0 comments
- [2206.01859] Extreme Compression for Pre-trained Transformers Made Simple and Efficient https://arxiv.org/abs/2206.01859 0 comments
- [2104.07857] ZeRO-Infinity: Breaking the GPU Memory Wall for Extreme Scale Deep Learning https://arxiv.org/abs/2104.07857 0 comments (a ZeRO config sketch follows this list)
- DeepSpeed/blogs/deepspeed-chat at master · microsoft/DeepSpeed · GitHub https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-chat 0 comments
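
A large share of the links on this page (the DeepSpeed-Chat READMEs, the ZeRO-Offload and ZeRO-Infinity papers, the BLOOM and Megatron-Turing training writeups) revolve around DeepSpeed's ZeRO memory optimizations. As a rough sketch only, this is the usual shape of enabling ZeRO stage 3 with CPU offload; the toy model and every hyperparameter below are placeholder assumptions, not values from any linked page.

```python
import torch
import deepspeed

# Toy stand-in for a real transformer; everything here is a placeholder.
model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096),
    torch.nn.ReLU(),
    torch.nn.Linear(4096, 1024),
)

ds_config = {
    "train_micro_batch_size_per_gpu": 8,
    "optimizer": {"type": "AdamW", "params": {"lr": 1e-4}},
    "fp16": {"enabled": True},
    "zero_optimization": {
        "stage": 3,                              # shard params, grads, and optimizer state
        "offload_optimizer": {"device": "cpu"},  # ZeRO-Offload: optimizer state on CPU
        "offload_param": {"device": "cpu"},      # ZeRO-Infinity-style parameter offload
    },
}

# deepspeed.initialize returns an engine that handles sharding, offload,
# and loss scaling; scripts using it are launched with `deepspeed train.py`.
engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)
```

Stage 3 shards parameters as well as gradients and optimizer state across data-parallel ranks; the two offload entries push optimizer state and parameters to CPU memory, which is the core idea of the ZeRO-Offload and ZeRO-Infinity papers linked above.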