Skip to main content

ai-research-11-evaluation-bigcode-evaluation-harness

Evaluates code generation models using multiple benchmarks to assess coding abilities and quality across various programming languages.

Install this skill

or
ai-research-11-evaluation-bigcode-evaluation-harness4 files

Comments

Sign in to leave a comment.

No comments yet. Be the first to comment!
Installation guide →