Hacker News
- Qwen: chat and pretrained large language model by Alibaba Cloud https://github.com/QwenLM/Qwen 51 comments
Linking pages
- What I learned from looking at 900 most popular open source AI tools https://huyenchip.com/2024/03/14/ai-oss.html 41 comments
- 10 Noteworthy AI Research Papers of 2023 https://magazine.sebastianraschka.com/p/10-ai-research-papers-2023 24 comments
- GitHub - langmanus/langmanus: A community-driven AI automation framework that builds upon the incredible work of the open source community. Our goal is to combine language models with specialized tools for tasks like web search, crawling, and Python code execution, while giving back to the community that made this possible. https://github.com/langmanus/langmanus 13 comments
- GitHub - SalvatoreRa/ML-news-of-the-week: A collection of the the best ML and AI news every week (research, news, resources) https://github.com/SalvatoreRa/ML-news-of-the-week 8 comments
- GitHub - eosphoros-ai/Awesome-Text2SQL: Curated tutorials and resources for Large Language Models, Text2SQL, and more. https://github.com/eosphoros-ai/Awesome-Text2SQL 1 comment
- Getting Started with Qwen1.5-72B-Chat https://www.secondstate.io/articles/qwen1.5-72b-chat/ 1 comment
- GitHub - nlpfromscratch/nlp-llms-resources: Master list of curated resources on NLP and LLMs https://github.com/nlpfromscratch/nlp-llms-resources 0 comments
- China is overtaking the West in the open AI race https://tobiaschen.substack.com/p/china-is-overtaking-the-west-in-the 0 comments
- Introducing Qwen | Qwen https://qwenlm.github.io/blog/qwen/ 0 comments
- FLiPStackWeekly/141-10June2024.md at main · tspannhw/FLiPStackWeekly · GitHub https://github.com/tspannhw/FLiPStackWeekly/blob/main/141-10June2024.md 0 comments
- GitHub - Kedreamix/Linly-Dubbing: 智能视频多语言AI配音/翻译工具 - Linly-Dubbing — “AI赋能,语言无界” https://github.com/Kedreamix/Linly-Dubbing 0 comments
Linked pages
- [2305.14314] QLoRA: Efficient Finetuning of Quantized LLMs https://arxiv.org/abs/2305.14314 129 comments
- [2106.09685] LoRA: Low-Rank Adaptation of Large Language Models https://arxiv.org/abs/2106.09685 8 comments
- Fully Sharded Data Parallel: faster AI training with fewer GPUs Engineering at Meta - https://engineering.fb.com/2021/07/15/open-source/fsdp/ 2 comments
- GitHub - microsoft/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. https://github.com/microsoft/DeepSpeed 1 comment
- GitHub - PanQiWei/AutoGPTQ: An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm. https://github.com/PanQiWei/AutoGPTQ 0 comments
- Qwen/Qwen-14B · Hugging Face https://huggingface.co/Qwen/Qwen-14B 0 comments
- GitHub - Dao-AILab/flash-attention: Fast and memory-efficient exact attention https://github.com/Dao-AILab/flash-attention 0 comments
Related searches:
Search whole site: site:github.com
Search title: GitHub - QwenLM/Qwen: The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
See how to search.