evaluation
Evaluates agent systems using quality gates and LLM-as-judge for nuanced performance measurement and continuous improvement.
Install this skill
or
evaluation1 files
Comments
Sign in to leave a comment.
No comments yet. Be the first to comment!
GitHub Stars 106
Rate this skill
Categorydata analytics
UpdatedMay 21, 2026
majiayu000/claude-skill-registry