GitHub - hiyouga/LLaMA-Factory: A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Linking pages

Hello Qwen2 | Qwen https://qwenlm.github.io/blog/qwen2/ 130 comments
GitHub - 01-ai/Yi-1.5: Yi-1.5 is an upgraded version of Yi, delivering stronger performance in coding, math, reasoning, and instruction-following capability. https://github.com/01-ai/Yi-1.5 67 comments
Qwen2.5: A Party of Foundation Models! | Qwen https://qwenlm.github.io/blog/qwen2.5/ 38 comments
Sky-T1: Train your own O1 preview model within $450 https://novasky-ai.github.io/posts/sky-t1/ 6 comments
Introducing Qwen1.5 | Qwen https://qwenlm.github.io/blog/qwen1.5/ 3 comments
Aman's AI Journal • Primers • Overview of Large Language Models https://aman.ai/primers/ai/LLM/ 1 comment
Qwen2-VL: To See the World More Clearly | Qwen https://qwenlm.github.io/blog/qwen2-vl/ 1 comment
GitHub - open-thoughts/open-thoughts: Fully open data curation for reasoning models https://github.com/open-thoughts/open-thoughts 1 comment
Scaling up Open Reasoning with OpenThinker-32B | Open Thoughts https://www.open-thoughts.ai/blog/scale 1 comment
GitHub - google-gemini/gemma-cookbook: A collection of guides and examples for the Gemma open models from Google. https://github.com/google-gemini/gemma-cookbook 0 comments
Installing and Developing vLLM with Ease | vLLM Blog https://blog.vllm.ai/2025/01/10/dev-experience.html 0 comments
GitHub - NovaSky-AI/SkyThought: Sky-T1: Train your own O1 preview model within $450 https://github.com/NovaSky-AI/SkyThought 0 comments
GitHub - QwenLM/Qwen3: Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. https://github.com/QwenLM/Qwen3 0 comments

Linking pages

Linked pages