Hacker News
- Distributed Inference and Fine-Tuning of Large Language Models over the Internet https://browse.arxiv.org/html/2312.08361v1 22 comments
- [D] Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces https://browse.arxiv.org/pdf/2402.00789.pdf 6 comments machinelearning
- [2402.00795] LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law https://browse.arxiv.org/abs/2402.00795 4 comments machinelearning