- Release of Awesome Foundation Model Leaderboard List https://github.com/SAILResearch/awesome-foundation-model-leaderboards 3 comments computervision
Linked pages
- The Pile http://pile.eleuther.ai/ 294 comments
- SuperGLUE Benchmark https://super.gluebenchmark.com/leaderboard 238 comments
- Alpaca Eval Leaderboard https://tatsu-lab.github.io/alpaca_eval/ 132 comments
- The latest in Machine Learning | Papers With Code https://paperswithcode.com/ 118 comments
- Model & API Provider Analysis | Artificial Analysis https://artificialanalysis.ai 70 comments
- Open LLM Leaderboard - a Hugging Face Space by HuggingFaceH4 https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard 51 comments
- GitHub - FranxYao/chain-of-thought-hub: Benchmarking large language models' complex reasoning ability with chain-of-thought prompting https://github.com/FranxYao/chain-of-thought-hub 26 comments
- https://www.nuscenes.org/ 7 comments
- SWE-bench http://www.swebench.com/ 6 comments
- LMSys Chatbot Arena Leaderboard - a Hugging Face Space by lmsys https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard 3 comments
- ML.ENERGY Leaderboard https://ml.energy/leaderboard/ 2 comments
- MTEB Leaderboard - a Hugging Face Space by mteb https://huggingface.co/spaces/mteb/leaderboard 1 comment
- Bird Homepage https://bird-bench.github.io/ 1 comment
- Hallucinations Leaderboard - a Hugging Face Space by hallucinations-leaderboard https://huggingface.co/spaces/hallucinations-leaderboard/leaderboard 1 comment
- GitHub - OpenGenerativeAI/llm-colosseum: Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM https://github.com/OpenGenerativeAI/llm-colosseum 1 comment
- 3D Arena - a Hugging Face Space by dylanebert https://huggingface.co/spaces/dylanebert/3d-arena 1 comment
- AI2 Leaderboard https://leaderboard.allenai.org/ 0 comments
- Can Ai Code Results - a Hugging Face Space by mike-ravkine https://huggingface.co/spaces/mike-ravkine/can-ai-code-results 0 comments
- Leaderboard | C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models https://cevalbenchmark.com/static/leaderboard.html 0 comments
- Open ASR Leaderboard - a Hugging Face Space by hf-audio https://huggingface.co/spaces/hf-audio/open_asr_leaderboard 0 comments
Related searches:
Search whole site: site:github.com
Search title: GitHub - SAILResearch/awesome-foundation-model-leaderboards: A curated list of awesome leaderboards for foundation models
See how to search.