Hacker News
- What I learned from looking at 900 most popular open source AI tools https://huyenchip.com/2024/03/14/ai-oss.html 39 comments
Linking pages
- GitHub - SalvatoreRa/ML-news-of-the-week: A collection of the the best ML and AI news every week (research, news, resources) https://github.com/SalvatoreRa/ML-news-of-the-week 8 comments
- Stability's fast-growing list of models https://aisupremacy.substack.com/p/stabilitys-fast-growing-list-of-models 0 comments
Linked pages
- [2402.17764] The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits https://arxiv.org/abs/2402.17764 575 comments
- GitHub - BlinkDL/RWKV-LM: RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding. https://github.com/BlinkDL/RWKV-LM 179 comments
- GitHub - dair-ai/Prompt-Engineering-Guide: 🐙 Guides, papers, lecture, notebooks and resources for prompt engineering https://github.com/dair-ai/Prompt-Engineering-Guide 149 comments
- Vector database - Milvus https://milvus.io/ 121 comments
- What I learned from looking at 200 machine learning tools https://huyenchip.com/2020/06/22/mlops.html 106 comments
- GitHub - facebookresearch/faiss: A library for efficient similarity search and clustering of dense vectors. https://github.com/facebookresearch/faiss 100 comments
- GitHub - f/awesome-chatgpt-prompts: This repo includes ChatGPT prompt curation to use ChatGPT better. https://github.com/f/awesome-chatgpt-prompts 57 comments
- GitHub - QwenLM/Qwen: The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. https://github.com/QwenLM/Qwen 51 comments
- GitHub - FMInference/FlexGen: Running large language models on a single GPU for throughput-oriented scenarios. https://github.com/FMInference/FlexGen 45 comments
- GitHub - qdrant/qdrant: Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/ https://github.com/qdrant/qdrant 42 comments
- GitHub - guidance-ai/guidance: A guidance language for controlling large language models. https://github.com/guidance-ai/guidance 41 comments
- GitHub - skypilot-org/skypilot: SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface. https://github.com/skypilot-org/skypilot 10 comments
- https://github.com/QwenLM/ 4 comments
- Sam Altman wants AI to create a one-person unicorn with a billion-dollar valuation | Fortune https://fortune.com/2024/02/04/sam-altman-one-person-unicorn-silicon-valley-founder-myth/ 3 comments
- [2212.09720] The case for 4-bit precision: k-bit Inference Scaling Laws https://arxiv.org/abs/2212.09720 2 comments
- GitHub - huggingface/safetensors: Simple, safe way to store and distribute tensors https://github.com/huggingface/safetensors 1 comment
- GitHub - lancedb/lancedb: Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps! https://github.com/lancedb/lancedb 1 comment
- GitHub - triton-inference-server/server: The Triton Inference Server provides an optimized cloud and edge inferencing solution. https://github.com/triton-inference-server/server 1 comment
- Machine learning is going real-time https://huyenchip.com/2020/12/27/real-time-machine-learning.html 0 comments
- GitHub - arogozhnikov/einops: Deep learning operations reinvented (for pytorch, tensorflow, jax and others) https://github.com/arogozhnikov/einops 0 comments
Related searches:
Search whole site: site:huyenchip.com
Search title: What I learned from looking at 900 most popular open source AI tools
See how to search.