Hacker News
- Generalized Language Models https://lilianweng.github.io/lil-log/2019/01/31/generalized-language-models.html 3 comments
- Transformer language models are doing something more general – LessWrong https://www.lesswrong.com/posts/YwqSijHybF9GFkDab/transformer-language-models-are-doing-something-more-general 9 comments
- Large Language Models As General Pattern Machines https://arxiv.org/abs/2307.04721 35 comments
- Natural language benchmarks don’t measure AI models’ general knowledge well https://venturebeat.com/2020/08/12/natural-language-benchmarks-dont-measure-ai-models-general-knowledge-well-research-shows/ 69 comments
- InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning https://arxiv.org/abs/2305.06500 5 comments machinelearning
- AndroMeta is a software platform for technical and scientific computing - machine learning and artificial intelligence in general, distributed and concurrent computing, language design, modeling and simulation http://dextk.org/andrometa/home.html 4 comments programming
- [R] LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models - University of Illinois 2023 https://arxiv.org/abs/2308.16137 2 comments machinelearning
- [R] Microsoft introduce Kosmos-1, a Multimodal Large Language Model (MLLM) that can perceive general modalities, learn in context (i.e., few-shot), and follow instructions (i.e., zero-shot) https://arxiv.org/abs/2302.14045 82 comments machinelearning