evaluation

Evaluates agent systems using quality gates and LLM-as-judge for nuanced performance measurement and continuous improvement.

Install this skill

or

evaluation1 files

Comments

Sign in to leave a comment.

No comments yet. Be the first to comment!

Installation guide →

GitHub Stars 106

Rate this skill

Categorydata analytics

UpdatedMay 21, 2026

openclaw api data-analyst ml-ai-engineer product-manager data analytics development product

majiayu000/claude-skill-registry