site:browse.arxiv.org - discu.eu

Hacker News

Distributed Inference and Fine-Tuning of Large Language Models over the Internet https://browse.arxiv.org/html/2312.08361v1 22 comments 2/1/2024

Reddit

[D] Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces https://browse.arxiv.org/pdf/2402.00789.pdf 6 comments 2/2/2024 machinelearning
[2402.00795] LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law https://browse.arxiv.org/abs/2402.00795 4 comments 2/2/2024 machinelearning