Skip to main content

run-benchmark

Facilitates MCP evaluations using mcpbr on SWE-bench datasets, ensuring valid and reproducible benchmarking for AI agents.

Install this skill

or
run-benchmark2 files

Comments

Sign in to leave a comment.

No comments yet. Be the first to comment!
Installation guide →