dataset-engineering
Create, clean, and optimize datasets for LLM fine-tuning. Covers formats (Alpaca, ShareGPT, ChatML), synthetic data generation, quality assessment, and augmentation. Use when preparing data for training.
Install this skill
dataset-engineering1 files
Comments
Sign in to leave a comment.
No comments yet. Be the first to comment!
Install this skill with one command
/learn @majiayu000/dataset-engineeringGitHub Stars 80
Rate this skill
Categorydevelopment
UpdatedFebruary 16, 2026
majiayu000/claude-skill-registry