agent-eval

Compares coding agents such as Claude Code and Aider on custom tasks, measuring pass rates, costs, time, and consistency.

Security score: 100/100

The agent-eval skill was audited on May 15, 2026. Our scanner tested it across 12 threat categories and found no security issues.

Security Issues

No security issues detected

This skill passed all security checks.

Scanned on May 15, 2026