Search whole site: site:github.com
Search title: GitHub - carlini/yet-another-applied-llm-benchmark: A benchmark to evaluate language models on questions I've previously asked them to solve.
See how to search.