ai-research-11-evaluation-bigcode-evaluation-harness
Evaluates code generation models using multiple benchmarks to assess coding abilities and quality across various programming languages.
Files: 4
Category: development
Updated: June 15, 2026
Tags: openclaw, api, testing, ml-ai-engineer, data-scientist, data-analyst, product-manager, technical-pm, github, docker, gcp, aws, development, data analytics, product
zxmengde/Playgroud