Hypura – A storage-tier-aware LLM inference scheduler for Apple Silicon

136 points | by tatef 3 hours ago

47 comments