Linking pages
- What can LLMs never do? - by Rohit Krishnan https://www.strangeloopcanon.com/p/what-can-llms-never-do 376 comments
- ☞ Living with the Veil of Progress - by Samuel Arbesman https://arbesman.substack.com/p/living-with-the-veil-of-progress 0 comments
- No, LLMs are not "scheming" - by Rohit Krishnan https://www.strangeloopcanon.com/p/no-llms-are-not-scheming 0 comments
Linked pages
- Centaurs and Cyborgs on the Jagged Frontier https://www.oneusefulthing.org/p/centaurs-and-cyborgs-on-the-jagged 208 comments
- How Rogue AIs may Arise - Yoshua Bengio https://yoshuabengio.org/2023/05/22/how-rogue-ais-may-arise/ 142 comments
- Consensus: AI-powered Academic Search Engine https://consensus.app/ 56 comments
- AMIE: A research AI system for diagnostic medical reasoning and conversations – Google Research Blog https://blog.research.google/2024/01/amie-research-ai-system-for-diagnostic_12.html 32 comments
- [2104.02145] What Will it Take to Fix Benchmarking in Natural Language Understanding? https://arxiv.org/abs/2104.02145 12 comments
- Geoffrey Hinton tells us why he’s now scared of the tech he helped build | MIT Technology Review https://www.technologyreview.com/2023/05/02/1072528/geoffrey-hinton-google-why-scared-ai/ 2 comments
- [2305.14763] Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models https://arxiv.org/abs/2305.14763 1 comment
- [1907.07355] Probing Neural Network Comprehension of Natural Language Arguments https://arxiv.org/abs/1907.07355 0 comments
- https://www.science.org/doi/10.1126/science.adj5957 0 comments
- [2312.11671] Evaluating Language-Model Agents on Realistic Autonomous Tasks https://arxiv.org/abs/2312.11671#arc 0 comments
Related searches:
Search whole site: site:strangeloopcanon.com
Search title: Evaluations are all we need - by Rohit Krishnan
See how to search.