Linking pages
- AI leaderboards are no longer useful. It's time to switch to Pareto curves. https://www.aisnakeoil.com/p/ai-leaderboards-are-no-longer-useful 14 comments
- New paper: AI agents that matter https://www.aisnakeoil.com/p/new-paper-ai-agents-that-matter 10 comments
- GitHub - JShollaj/awesome-llm-interpretability: A curated list of Large Language Model (LLM) Interpretability resources. https://github.com/JShollaj/awesome-llm-interpretability 1 comment
- Evaluating LLMs is a minefield https://www.aisnakeoil.com/p/evaluating-llms-is-a-minefield 0 comments
- Will AI transform law? https://www.aisnakeoil.com/p/will-ai-transform-law 0 comments
Related searches:
Search whole site: site:cs.princeton.edu
Search title: Evaluating LLMs is a minefield
See how to search.