Linking pages
- How Good Are the Latest Open LLMs? And Is DPO Better Than PPO? https://magazine.sebastianraschka.com/p/how-good-are-the-latest-open-llms 1 comment
- 🦅 EagleX v2 : Soaring past LLaMA2 7B in both English and Multi-lang evals (RWKV-v5) https://blog.rwkv.com/p/eaglex-v2-soaring-past-llama2-7b 0 comments
- GitHub - sustcsonglin/flash-linear-attention: Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton https://github.com/sustcsonglin/flash-linear-attention 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [2404.05892] Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence
See how to search.