- NVIDIA AI Researchers Introduce FFN Fusion: A Novel Optimization Technique that Demonstrates How Sequential Computation in Large Language Models LLMs can be Effectively Parallelized https://www.marktechpost.com/2025/03/29/nvidia-ai-researchers-introduce-ffn-fusion-a-novel-optimization-technique-that-demonstrates-how-sequential-computation-in-large-language-models-llms-can-be-effectively-parallelized/ 1 comment machinelearningnews
- New study finds large language models are prone to social identity biases similar to the way humans are—but LLMs can be trained to stem these outputs https://www.nyu.edu/about/news-publications/news/2024/december/-us--vs---them--biases-plague-ai--too.html 31 comments science
- 2024's Biggest Breakthroughs in Computer Science (2024) The year's biggest breakthroughs in computer science included a new understanding of what’s going on in large language models (LLMs) and a breakthrough in computing Hamiltonians — models that represent complex quantum systems. [10:46] https://www.youtube.com/watch?v=fTMMsreAqX0 4 comments documentaries
- New study finds large language models are prone to social identity biases similar to the way humans are—but LLMs can be trained to stem these outputs https://www.nyu.edu/about/news-publications/news/2024/december/-us--vs---them--biases-plague-ai--too.html 12 comments science
- [R] The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits https://arxiv.org/abs/2402.17764 121 comments machinelearning
- OpenLLM: An open platform for operating large language models (LLMs) in production. https://github.com/bentoml/OpenLLM 3 comments opensource
- A Comparison of Large Language Models (LLMs) in Biomedical Domain https://provectus.com/blog/comparison-large-language-models-biomedical-domain/ 2 comments artificial
- [R] The Invention of Large Language Models (LLMs) - A brief history 🧵 https://twitter.com/MJayatilake 3 comments machinelearning
- Researchers at Stanford Introduce Parsel: An Artificial Intelligence AI Framework That Enables Automatic Implementation And Validation of Complex Algorithms With Code Large Language Models LLMs https://www.marktechpost.com/2023/01/29/researchers-at-stanford-introduce-parsel-an-artificial-intelligence-ai-framework-that-enables-automatic-implementation-and-validation-of-complex-algorithms-with-code-large-language-models-llms/ 2 comments programming
- A new study explores how human confidence in large language models (LLMs) often surpasses their actual accuracy. It highlights the 'calibration gap' - the difference between what LLMs know and what users think they know. https://doi.org/10.1038/s42256-024-00976-7 22 comments science
- Deception abilities emerged in large language models | State-of-the-art LLMs are able to understand and induce false beliefs in other agents. These abilities were nonexistent in earlier LLMs. https://www.pnas.org/doi/full/10.1073/pnas.2317967121 2 comments artificial
- 🚀 AI Unlocked: A Practical Guide to Mastering Large Language Models (LLMs) 🧠🔓 https://medium.com/@yusufsevinir/building-llms-from-poc-to-production-an-overview-ea7ceb9aa8d8 3 comments learnmachinelearning
- Deception abilities emerged in large language models: Experiments show state-of-the-art LLMs are able to understand and induce false beliefs in other agents. Such strategies emerged in state-of-the-art LLMs, but were nonexistent in earlier LLMs. https://www.pnas.org/doi/full/10.1073/pnas.2317967121 24 comments science
- GitHub - mrphrazer/reverser_ai: Provides automated reverse engineering assistance through the use of local large language models (LLMs) on consumer hardware. https://github.com/mrphrazer/reverser_ai 2 comments reverseengineering
- US Intelligence Advanced Research Projects Activity (IARPA) issues request to identify potential threats that large language models (LLMs) may pose https://sociable.co/government-and-policy/spy-community-threats-tech-chatgpt-language-models/ 5 comments futurology
- AI can predict study results better than human experts | The study demonstrates that large language models (LLMs) trained on vast datasets of text can distil patterns from scientific literature, enabling them to forecast scientific outcomes with superhuman accuracy. https://www.ucl.ac.uk/news/2024/nov/ai-can-predict-study-results-better-human-experts 4 comments science
- Best YouTube Channels for Learning Large Language Models (LLMs) https://www.linkedin.com/posts/neeraj-125601238_llms-artificialintelligence-machinelearning-activity-7243092362841743361-mYvS 4 comments learnmachinelearning
- Large language models (LLMs) are more likely to criminalise users that use African American English, the results of a new Cornell University study show https://www.euronews.com/next/2024/03/09/ai-models-found-to-show-language-bias-by-recommending-black-defendents-be-sentenced-to-dea 62 comments technology
- The Rise of Generative AI Large Language Models (LLMs) like ChatGPT — Information is Beautiful https://informationisbeautiful.net/visualizations/the-rise-of-generative-ai-large-language-models-llms-like-chatgpt 3 comments dataisbeautiful
- Large Language Models and the Socratic Method: Exploring how LLMs can simulate Socratic dialogues to stimulate critical thinking. Introducing the Tree of Thoughts method to improve AI’s performance on complex reasoning tasks and emphasizing the importance of critical thinking in the era of AI. https://www.cbrincoveanu.com/posts/large-language-models-and-the-socratic-method/ 2 comments learnmachinelearning
- ChatGPT and other large language models (LLMs) cannot learn independently or acquire new skills, meaning they pose no existential threat to humanity, according to new research. They have no potential to master new skills without explicit instruction. https://www.bath.ac.uk/announcements/ai-poses-no-existential-threat-to-humanity-new-study-finds/ 1423 comments science
- [P] Bringing Open Large Language Models to Consumer Devices. The project enables 'small' LLMs like Vicuna 7B or Red Pajama INCITE 3B to run locally on mobile phones, with hardware acceleration, using WebAssembly and WebGPU. https://mlc.ai/blog/2023/05/22/bringing-open-large-language-models-to-consumer-devices 7 comments machinelearning
- Generating Mathematical Derivations with Large Language Models. "In this paper, we leverage a symbolic engine to generate derivations of equations at scale, and investigate the capabilities of LLMs when deriving goal equations from premises." [abstract + link to PDF, 95pp] https://arxiv.org/abs/2307.09998 2 comments math
- Large Language Models appear to be more liberal: A new study of 24 state-of-the-art conversational LLMs, including ChatGPT, shows that today's AI models lean left of center. LLMs show an average score of -30 on a political spectrum, indicating a left-leaning bias. https://www.psychologytoday.com/au/blog/the-digital-self/202408/are-large-language-models-more-liberal 653 comments science