Hacker News
- Oumi-AI/oumi: Everything you need to build foundation models, end-to-end https://github.com/oumi-ai/oumi 0 comments
Linked pages
- QwQ: Reflect Deeply on the Boundaries of the Unknown | Qwen https://qwenlm.github.io/blog/qwq-32b-preview/ 421 comments
- Introducing DBRX: A New State-of-the-Art Open LLM | Databricks https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm 343 comments
- Mixtral of experts | Mistral AI | Open source models https://mistral.ai/news/mixtral-of-experts/ 300 comments
- meta-llama/Llama-3.3-70B-Instruct · Hugging Face https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct 224 comments
- [2005.14165] Language Models are Few-Shot Learners https://arxiv.org/abs/2005.14165 201 comments
- [2404.14219] Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone https://arxiv.org/abs/2404.14219 130 comments
- [2310.06825] Mistral 7B https://arxiv.org/abs/2310.06825 124 comments
- Qwen2.5: A Party of Foundation Models! | Qwen https://qwenlm.github.io/blog/qwen2.5/ 38 comments
- GitHub - huggingface/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. https://github.com/huggingface/transformers 26 comments
- Introducing MPT-7B: A New Standard for Open-Source, Commercially Usable LLMs https://www.mosaicml.com/blog/mpt-7b 11 comments
- GitHub - psf/black: The uncompromising Python code formatter https://github.com/psf/black 8 comments
- Apache License, Version 2.0 – Open Source Initiative https://opensource.org/licenses/Apache-2.0 6 comments
- deepseek-ai/DeepSeek-R1 · Hugging Face https://huggingface.co/deepseek-ai/DeepSeek-R1 6 comments
- EleutherAI/gpt-j-6B · Hugging Face https://huggingface.co/EleutherAI/gpt-j-6B 2 comments
- SmolLM - blazingly fast and remarkably powerful https://huggingface.co/blog/smollm 2 comments
- [2309.16609] Qwen Technical Report https://arxiv.org/abs/2309.16609 1 comment
- bigcode/starcoder2-15b · Hugging Face https://huggingface.co/bigcode/starcoder2-15b 1 comment
- google/gemma-2-2b-it · Hugging Face https://huggingface.co/google/gemma-2-2b-it 1 comment
- Qwen2-VL: To See the World More Clearly | Qwen https://qwenlm.github.io/blog/qwen2-vl/ 1 comment
- gpt2 · Hugging Face https://huggingface.co/gpt2 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:github.com
Search title: GitHub - oumi-ai/oumi: Everything you need to build state-of-the-art foundation models, end-to-end.
See how to search.