- [R] All about evaluating Large language models https://explodinggradients.com/all-about-evaluating-large-language-models 8 comments machinelearning
Linked pages
- Beautiful Free Images & Pictures | Unsplash https://unsplash.com/ 274 comments
- Open LLM Leaderboard - a Hugging Face Space by HuggingFaceH4 https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard 51 comments
- GitHub - explodinggradients/ragas: Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines https://github.com/explodinggradients/ragas 35 comments
- LMSys Chatbot Arena Leaderboard - a Hugging Face Space by lmsys https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard 3 comments
- GitHub - inverse-scaling/prize: A prize for finding tasks that cause large language models to show inverse scaling https://github.com/inverse-scaling/prize 1 comment
- Holistic Evaluation of Language Models (HELM) https://crfm.stanford.edu/helm/latest/ 1 comment
- ARC/README.md at master · fchollet/ARC · GitHub https://github.com/fchollet/ARC/blob/master/README.md 0 comments
- GitHub - EleutherAI/lm-evaluation-harness: A framework for few-shot evaluation of autoregressive language models. https://github.com/EleutherAI/lm-evaluation-harness 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:explodinggradients.com
Search title: All about evaluating Large language models
See how to search.