Generalized Language Models - discu.eu

Hacker News

Generalized Language Models https://lilianweng.github.io/lil-log/2019/01/31/generalized-language-models.html 3 comments 3/2/2019

Reddit

InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning https://arxiv.org/abs/2305.06500 5 comments 14/5/2023 machinelearning
Large language models (AI) surpass human experts in predicting neuroscience results, according to a new paper in Nature Human Behaviour. When asked to predict scientific results based on past findings, general AIs did better than experts, with a neuroscience-trained AI doing better than both. https://www.nature.com/articles/s41562-024-02046-9 9 comments 17/1/2025 science
AndroMeta is a software platform for technical and scientific computing - machine learning and artificial intelligence in general, distributed and concurrent computing, language design, modeling and simulation http://dextk.org/andrometa/home.html 4 comments 17/9/2009 programming
[R] LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models - University of Illinois 2023 https://arxiv.org/abs/2308.16137 2 comments 31/8/2023 machinelearning
Testing theory of mind in large language models and humans - GPT4 generally performed as well as and sometimes exceeded humans, but it struggled with detecting faux pas. However, detection of faux pas was the only domain in which LLaMA2 scored better than humans. https://www.nature.com/articles/s41562-024-01882-z 99 comments 25/5/2024 science
How close is AI to human-level intelligence? - Large language models such as OpenAI’s o1 have electrified the debate over achieving artificial general intelligence, or AGI. But they are unlikely to reach this milestone on their own. https://www.nature.com/articles/d41586-024-03905-1 110 comments 7/12/2024 futurology
[R] Microsoft introduces Kosmos-1, a Multimodal Large Language Model (MLLM) that can perceive general modalities, learn in context (i.e., few-shot), and follow instructions (i.e., zero-shot) https://arxiv.org/abs/2302.14045 82 comments 28/2/2023 machinelearning