Linking pages
- Consistency Large Language Models: A Family of Efficient Parallel Decoders | Hao AI Lab @ UCSD https://hao-ai-lab.github.io/blogs/cllm/ 98 comments
- GitHub - AIoT-MLSys-Lab/Efficient-LLMs-Survey: Efficient Large Language Models: A Survey https://github.com/AIoT-MLSys-Lab/Efficient-LLMs-Survey 0 comments
- Speculative Decoding - philkrav https://philkrav.com/posts/speculative/ 0 comments