Linking pages
Linked pages
- Aya | Cohere For AI https://cohere.com/research/aya 59 comments
- [2405.09818] Chameleon: Mixed-Modal Early-Fusion Foundation Models https://arxiv.org/abs/2405.09818 40 comments
- GitHub - deepseek-ai/DeepSeek-Math https://github.com/deepseek-ai/DeepSeek-Math 39 comments
- Open Language Models (OLMos) and the LLM landscape https://www.interconnects.ai/p/olmo 0 comments
- [2406.11794] DataComp-LM: In search of the next generation of training sets for language models https://arxiv.org/abs/2406.11794 0 comments
- [2407.21783] The Llama 3 Herd of Models https://arxiv.org/abs/2407.21783 0 comments
Related searches:
Search whole site: site:interconnects.ai
Search title: OLMoE and the hidden simplicity in training better foundation models
See how to search.