Linking pages
- GitHub - alopatenko/LLMEvaluation: A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use cases, promote the adoption of best practices in LLM assessment, and critically assess the effectiveness of these evaluation methods. https://github.com/alopatenko/LLMEvaluation
- GitHub - mlfoundations/evalchemy: Automatic evals for LLMs https://github.com/mlfoundations/evalchemy