- When is somebody going to use TokenFormer in prompt-to-video, chatbots, and robots? https://github.com/Haiyang-W/TokenFormer 2 comments deeplearning
Linked pages
- GitHub - EleutherAI/gpt-neox: An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library. https://github.com/EleutherAI/gpt-neox 67 comments
- [2312.00752] Mamba: Linear-Time Sequence Modeling with Selective State Spaces https://arxiv.org/abs/2312.00752 42 comments
- [2410.23168] TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters https://arxiv.org/abs/2410.23168 38 comments
- GitHub - EleutherAI/lm-evaluation-harness: A framework for few-shot evaluation of autoregressive language models. https://github.com/EleutherAI/lm-evaluation-harness 0 comments
- [2407.04620] Learning to (Learn at Test Time): RNNs with Expressive Hidden States https://arxiv.org/abs/2407.04620 0 comments