Hacker News
- Qwen: chat and pretrained large language model by Alibaba Cloud https://github.com/QwenLM/Qwen 51 comments
Linking pages
- What I learned from looking at 900 most popular open source AI tools https://huyenchip.com/2024/03/14/ai-oss.html 40 comments
- 10 Noteworthy AI Research Papers of 2023 https://magazine.sebastianraschka.com/p/10-ai-research-papers-2023 24 comments
- GitHub - SalvatoreRa/ML-news-of-the-week: A collection of the the best ML and AI news every week (research, news, resources) https://github.com/SalvatoreRa/ML-news-of-the-week 8 comments
- GitHub - eosphoros-ai/Awesome-Text2SQL: Curated tutorials and resources for Large Language Models, Text2SQL, and more. https://github.com/eosphoros-ai/Awesome-Text2SQL 1 comment
- Getting Started with Qwen1.5-72B-Chat https://www.secondstate.io/articles/qwen1.5-72b-chat/ 1 comment
- GitHub - nlpfromscratch/nlp-llms-resources: Master list of curated resources on NLP and LLMs https://github.com/nlpfromscratch/nlp-llms-resources 0 comments
- China is overtaking the West in the open AI race https://tobiaschen.substack.com/p/china-is-overtaking-the-west-in-the 0 comments
- Introducing Qwen | Qwen https://qwenlm.github.io/blog/qwen/ 0 comments
- FLiPStackWeekly/141-10June2024.md at main · tspannhw/FLiPStackWeekly · GitHub https://github.com/tspannhw/FLiPStackWeekly/blob/main/141-10June2024.md 0 comments
- GitHub - Kedreamix/Linly-Dubbing: 智能视频多语言AI配音/翻译工具 - Linly-Dubbing — “AI赋能,语言无界” https://github.com/Kedreamix/Linly-Dubbing 0 comments
Linked pages
- [2305.14314] QLoRA: Efficient Finetuning of Quantized LLMs https://arxiv.org/abs/2305.14314 129 comments
- [2106.09685] LoRA: Low-Rank Adaptation of Large Language Models https://arxiv.org/abs/2106.09685 8 comments
- Fully Sharded Data Parallel: faster AI training with fewer GPUs Engineering at Meta - https://engineering.fb.com/2021/07/15/open-source/fsdp/ 2 comments
- GitHub - microsoft/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. https://github.com/microsoft/DeepSpeed 1 comment
- GitHub - PanQiWei/AutoGPTQ: An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm. https://github.com/PanQiWei/AutoGPTQ 0 comments
- Qwen/Qwen-14B · Hugging Face https://huggingface.co/Qwen/Qwen-14B 0 comments
Related searches:
Search whole site: site:github.com
Search title: GitHub - QwenLM/Qwen: The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
See how to search.