Linking pages
- DeepSeek-V3 Technical Report https://arxiv.org/html/2412.19437v1 42 comments
- GitHub - NVIDIA/NeMo: NeMo: a toolkit for conversational AI https://github.com/NVIDIA/NeMo 8 comments
- Benchmarking and Dissecting the Nvidia Hopper GPU Architecture https://arxiv.org/html/2402.13499v1 4 comments
- GitHub - ArcInstitute/evo2: Genome modeling and design across all domains of life https://github.com/ArcInstitute/evo2 0 comments
Linked pages
- GitHub - EleutherAI/gpt-neox: An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library. https://github.com/EleutherAI/gpt-neox 67 comments
- Inflection-2: The Next Step Up https://inflection.ai/inflection-2 30 comments
- Benchmarking Large Language Models on NVIDIA H100 GPUs with CoreWeave (Part 1) https://www.mosaicml.com/blog/coreweave-nvidia-h100-part-1 15 comments
- GitHub - hpcaitech/ColossalAI: Making large AI models cheaper, faster and more accessible https://github.com/hpcaitech/ColossalAI 9 comments
- Apache License, Version 2.0 – Open Source Initiative https://opensource.org/licenses/Apache-2.0 6 comments
- Attention Is All You Need https://proceedings.neurips.cc/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf 0 comments
- GitHub - stanford-crfm/levanter: Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax https://github.com/stanford-crfm/levanter 0 comments