Skip to main content

Agents

Executes the mat-agent-bench benchmark against a local server, fetching questions, executing tasks, and submitting results for grading.

Install this skill

or
Agents341 files

Comments

Sign in to leave a comment.

No comments yet. Be the first to comment!
Installation guide →
GitHub Stars 1
Rate this skill
Categoryproductivity
UpdatedMay 31, 2026
ruoyuwang1995nya/mat_agent_bench