
hypura

Hypura is a scheduler for LLM inference. It optimizes tensor placement and provides benchmarking for efficient model serving on Apple Silicon and CUDA.

