[2306.13649] GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models - discu.eu

Linking pages

Consistency Large Language Models: A Family of Efficient Parallel Decoders | Hao AI Lab @ UCSD https://hao-ai-lab.github.io/blogs/cllm/ 98 comments
GitHub - AIoT-MLSys-Lab/Efficient-LLMs-Survey: Efficient Large Language Models: A Survey https://github.com/AIoT-MLSys-Lab/Efficient-LLMs-Survey 0 comments
Speculative Decoding - philkrav https://philkrav.com/posts/speculative/ 0 comments

Related searches:

Search whole site: site:arxiv.org

Search title: [2306.13649] GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models

See how to search.

Submit link to: