Linking pages
GitHub - THUDM/AgentBench: A Comprehensive Benchmark to Evaluate LLMs as Agents
https://github.com/THUDM/AgentBench
1 comment