Linked pages
- Elo rating system - Wikipedia https://en.wikipedia.org/wiki/Elo_rating_system 386 comments
- Open LLM Leaderboard - a Hugging Face Space by HuggingFaceH4 https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard 51 comments
- [2009.03300] Measuring Massive Multitask Language Understanding https://arxiv.org/abs/2009.03300 0 comments
- Evaluating and uncovering open LLMs - by Nathan Lambert https://www.interconnects.ai/p/evaluating-open-llms 0 comments
- How the open-source LLM ecosystem & leaderboards work https://www.interconnects.ai/p/how-the-open-source-llm-ecosystem 0 comments
- https://arena.lmsys.org/ 0 comments
- [2306.05685] Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena https://arxiv.org/abs/2306.05685 0 comments