Hacker News
- Opentensor and Cerebras announce BTLM-3B-8K, a leading 3B param. language model https://huggingface.co/cerebras/btlm-3b-8k-base 2 comments
Linking pages
- Everything about Distributed Training and Efficient Finetuning | Sumanth's Personal Website https://sumanthrh.com/post/distributed-and-efficient-finetuning/ 1 comment
- ALiBi FlashAttention - Speeding up ALiBi by 3-5x with a hardware-efficient implementation | Princeton Language and Intelligence https://pli.princeton.edu/blog/2024/alibi-flashattention-speeding-alibi-3-5x-hardware-efficient-implementation 1 comment
- BTLM-3B-8K: 7B Performance in a 3 Billion Parameter Model - Cerebras https://www.cerebras.net/machine-learning/btlm-3b-8k-7b-performance-in-a-3-billion-parameter-model/ 0 comments