Evals
Provides a comprehensive framework for evaluating AI agents, focusing on workflows and multiple grading methods for robust performance assessment.
Install this skill
or
Evals1 files
Comments
Sign in to leave a comment.
No comments yet. Be the first to comment!
GitHub Stars 106
Rate this skill
Categorydevelopment
UpdatedMay 21, 2026
claude-codeapiml-ai-engineerdata-scientistqa-engineerproduct-managertechnical-pmdevelopmentdata analyticsproduct
majiayu000/claude-skill-registry