
model-pruning

Compresses LLMs and speeds up inference using pruning techniques such as Wanda and SparseGPT.

model-pruning (2 files)
