Hacker News
Lobsters
- Edge AI Just Got Faster https://justine.lol/mmap/ 5 comments ai , performance
- "We modified llama.cpp to load weights using mmap() instead of C++ standard I/O. That enabled us to load LLaMA 100x faster using half as much memory." https://justine.lol/mmap/ 80 comments programming
Linking pages
Related searches:
Search whole site: site:justine.lol
Search title: Edge AI Just Got Faster
See how to search.