Linking pages
- Large Transformer Model Inference Optimization | Lil'Log https://lilianweng.github.io/posts/2023-01-10-inference-optimization/ 20 comments
- Knowing Enough About MoE to Explain Dropped Tokens in GPT-4 - 152334H https://152334h.github.io/blog/knowing-enough-about-moe/ 1 comment
- The Next Generation Of Large Language Models https://www.forbes.com/sites/robtoews/2023/02/07/the-next-generation-of-large-language-models/ 0 comments
- GitHub - koayon/awesome-adaptive-computation: A curated reading list of research in Adaptive Computation (AC). https://github.com/koayon/awesome-adaptive-computation 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [2209.01667] A Review of Sparse Expert Models in Deep Learning
See how to search.