GitHub - SAILResearch/awesome-foundation-model-leaderboards: A curated list of awesome leaderboards for foundation models - discu.eu

Reddit

Release of Awesome Foundation Model Leaderboard List https://github.com/SAILResearch/awesome-foundation-model-leaderboards 3 comments 8/7/2024 computervision

Linked pages

The Pile http://pile.eleuther.ai/ 294 comments
SuperGLUE Benchmark https://super.gluebenchmark.com/leaderboard 238 comments
Alpaca Eval Leaderboard https://tatsu-lab.github.io/alpaca_eval/ 132 comments
The latest in Machine Learning | Papers With Code https://paperswithcode.com/ 118 comments
Model & API Provider Analysis | Artificial Analysis https://artificialanalysis.ai 70 comments
Open LLM Leaderboard - a Hugging Face Space by HuggingFaceH4 https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard 51 comments
GitHub - FranxYao/chain-of-thought-hub: Benchmarking large language models' complex reasoning ability with chain-of-thought prompting https://github.com/FranxYao/chain-of-thought-hub 26 comments
https://www.nuscenes.org/ 7 comments
SWE-bench Leaderboard http://www.swebench.com/ 6 comments
LMSys Chatbot Arena Leaderboard - a Hugging Face Space by lmsys https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard 3 comments
ML.ENERGY Leaderboard https://ml.energy/leaderboard/ 2 comments
MTEB Leaderboard - a Hugging Face Space by mteb https://huggingface.co/spaces/mteb/leaderboard 1 comment
Bird Homepage https://bird-bench.github.io/ 1 comment
Hallucinations Leaderboard - a Hugging Face Space by hallucinations-leaderboard https://huggingface.co/spaces/hallucinations-leaderboard/leaderboard 1 comment
GitHub - OpenGenerativeAI/llm-colosseum: Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM https://github.com/OpenGenerativeAI/llm-colosseum 1 comment
3D Arena - a Hugging Face Space by dylanebert https://huggingface.co/spaces/dylanebert/3d-arena 1 comment
AI2 Leaderboard https://leaderboard.allenai.org/ 0 comments
Can Ai Code Results - a Hugging Face Space by mike-ravkine https://huggingface.co/spaces/mike-ravkine/can-ai-code-results 0 comments
Leaderboard | C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models https://cevalbenchmark.com/static/leaderboard.html 0 comments
Open ASR Leaderboard - a Hugging Face Space by hf-audio https://huggingface.co/spaces/hf-audio/open_asr_leaderboard 0 comments

Related searches:

Search whole site: site:github.com

Search title: GitHub - SAILResearch/awesome-foundation-model-leaderboards: A curated list of awesome leaderboards for foundation models

See how to search.

Submit link to: