hypura
Hypura is a scheduler for LLM inference, optimizing tensor placement and benchmarking for efficient model serving on Apple Silicon and CUDA.
Install this skill
or
hypura1 files
Comments
Sign in to leave a comment.
No comments yet. Be the first to comment!