All about evaluating Large language models - discu.eu

Reddit

[R] All about evaluating Large language models https://explodinggradients.com/all-about-evaluating-large-language-models 8 comments 10/7/2023 machinelearning

Linked pages

Beautiful Free Images & Pictures | Unsplash https://unsplash.com/ 274 comments
Open LLM Leaderboard - a Hugging Face Space by HuggingFaceH4 https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard 51 comments
GitHub - explodinggradients/ragas: Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines https://github.com/explodinggradients/ragas 35 comments
LMSys Chatbot Arena Leaderboard - a Hugging Face Space by lmsys https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard 3 comments
GitHub - inverse-scaling/prize: A prize for finding tasks that cause large language models to show inverse scaling https://github.com/inverse-scaling/prize 1 comment
Holistic Evaluation of Language Models (HELM) https://crfm.stanford.edu/helm/latest/ 1 comment
ARC/README.md at master · fchollet/ARC · GitHub https://github.com/fchollet/ARC/blob/master/README.md 0 comments
GitHub - EleutherAI/lm-evaluation-harness: A framework for few-shot evaluation of autoregressive language models. https://github.com/EleutherAI/lm-evaluation-harness 0 comments
