Skip to main content

advanced-evaluation

This skill should be used for advanced LLM evaluation: LLM-as-judge systems, direct scoring, pairwise comparison, rubric calibration, evaluator bias mitigation, confidence scoring, and automated quality assessment.

Install this skill

or
advanced-evaluation6 files

Comments

Sign in to leave a comment.

No comments yet. Be the first to comment!
Installation guide →
GitHub Stars 16.7K
Rate this skill
Categorydevops
UpdatedJune 24, 2026
muratcankoylan/Agent-Skills-for-Context-Engineering