Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.
audio eagle quantization diffusion vlm llm qwen speculative-decoding llm-compression hunyuan deepseek fp4 dflash
-
Updated
Apr 2, 2026 - Python