Hacker News
- Making my local LLM voice assistant faster and more scalable with RAG https://johnthenerd.com/blog/faster-local-llm-assistant/ 16 comments
Linked pages
- Building a fully local AI smart home assistant | John's Website https://johnthenerd.com/blog/local-llm-assistant/ 186 comments
- [2312.10997] Retrieval-Augmented Generation for Large Language Models: A Survey https://arxiv.org/abs/2312.10997 2 comments
- Mastering LLM Techniques: Inference Optimization | NVIDIA Technical Blog https://developer.nvidia.com/blog/mastering-llm-techniques-inference-optimization/ 0 comments
- Ollama https://ollama.com/ 0 comments
Related searches:
Search whole site: site:johnthenerd.com
Search title: Making my local LLM voice assistant faster and more scalable with RAG | John's Website
See how to search.