speculative-decoding
Speeds up LLM inference and reduces latency for real-time applications using speculative decoding techniques.
speculative-decoding (3 files)
GitHub Stars 22.3K
Category: development
Updated: April 30, 2026
davila7/claude-code-templates