Hacker News
Linked pages
- GitHub - deepset-ai/haystack: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots. https://github.com/deepset-ai/haystack 35 comments
- Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA https://huggingface.co/blog/4bit-transformers-bitsandbytes 15 comments
- [2210.03629] ReAct: Synergizing Reasoning and Acting in Language Models https://arxiv.org/abs/2210.03629#google 3 comments
- GitHub - jerryjliu/llama_index: LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLMs with external data. https://github.com/jerryjliu/llama_index 0 comments
- [2210.17323] GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers https://arxiv.org/abs/2210.17323 0 comments
Related searches:
Search whole site: site:pinecone.io
Search title: LLM Ecosystem: Quantization, RAG, Agents, and More | Pinecone