simpo-training
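SimPO (Simple Preference Optimization) is a reference-free method for LLM preference optimization. It uses the length-normalized log-probability of a response under the policy as the implicit reward, so training needs no reference model. This makes training cheaper in memory and compute than reference-based methods such as DPO, and the SimPO authors report improved benchmark performance as well.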
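To make the reference-free idea concrete, here is a minimal sketch of the per-pair SimPO objective in plain Python. It assumes you already have per-token log-probabilities for the chosen and rejected responses from the policy model. The function name `simpo_loss` and the default values for `beta` and `gamma` are illustrative assumptions, not part of this skill's files.

```python
import math


def simpo_loss(chosen_token_logps, rejected_token_logps, beta=2.0, gamma=1.0):
    """Per-pair SimPO loss (illustrative sketch, not this skill's implementation).

    chosen_token_logps / rejected_token_logps: per-token log-probabilities
    of each response under the policy being trained. No reference model
    log-probabilities are needed.
    beta: reward scale (default here is an assumed, illustrative value).
    gamma: target reward margin (default here is an assumed, illustrative value).
    """
    # Implicit reward is the average (length-normalized) token log-probability.
    avg_chosen = sum(chosen_token_logps) / len(chosen_token_logps)
    avg_rejected = sum(rejected_token_logps) / len(rejected_token_logps)

    # Scaled reward difference minus the target margin.
    margin = beta * (avg_chosen - avg_rejected) - gamma

    # -log(sigmoid(margin)) equals softplus(-margin); computed stably.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


if __name__ == "__main__":
    chosen = [-0.2, -0.3, -0.1]
    rejected = [-1.0, -1.5, -0.8, -1.2]
    print(simpo_loss(chosen, rejected))
```

Because each response's log-probability is divided by its token count, a longer response is not penalized or favored simply for its length. In a real training loop the same expression would be computed on batched tensors so that gradients flow back into the policy.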