hamelsmu

GitHub profile for hamelsmu

34 skills

hamelsmu / evaluate-rag

1.4K

Facilitates the evaluation of retrieval-augmented generation systems by measuring retrieval quality and optimizing generation strategies.

openclaw
100
100

hamelsmu / error-analysis

1.4K

Helps identify and categorize failure modes in LLM pipelines by analyzing traces, supporting better evaluation and debugging.

openclaw
100
100

hamelsmu / eval-audit

1.4K

Audits LLM evaluation pipelines to identify issues and provide actionable insights for improving evaluation trustworthiness.

openclaw
100
93

hamelsmu / write-judge-prompt

1.4K

Designs evaluators for subjective criteria such as tone, relevance, and completeness, which code-based checks cannot assess.

openclaw
100
100

hamelsmu / build-review-interface

1.4K

Creates a custom browser-based annotation interface for reviewing LLM traces and collecting structured feedback efficiently.

opencode, codex
83
100

hamelsmu / generate-synthetic-data

1.4K

Generates diverse synthetic test inputs for LLM evaluation, aiding in dataset bootstrapping and stress-testing failure hypotheses.

openclaw
83
100

hamelsmu / validate-evaluator

1.4K

Calibrates LLM judges against human labels using data splits and bias correction to ensure reliable outputs.

75
100

hamelsmu / How to Write Good Skills

1.4K

Provides guidelines for creating and maintaining skills for AI coding agents, focusing on clarity and domain-specific directives.

claude-code, cursor
0
100