Hacker News
Linked pages
- [2408.07055] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs https://arxiv.org/abs/2408.07055 1 comment
- GitHub - vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs https://github.com/vllm-project/vllm 0 comments
- GitHub - Dao-AILab/flash-attention: Fast and memory-efficient exact attention https://github.com/Dao-AILab/flash-attention 0 comments
Related searches:
Search whole site: site:github.com
Search title: GitHub - THUDM/LongWriter: LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
See how to search.