Agents
Executes the mat-agent-bench benchmark against a local server, fetching questions, executing tasks, and submitting results for grading.
Install this skill
or
Agents341 files
Comments
Sign in to leave a comment.
No comments yet. Be the first to comment!
GitHub Stars 1
Rate this skill
Categoryproductivity
UpdatedMay 31, 2026
ruoyuwang1995nya/mat_agent_bench