原文整理页

mlx-vlm 发布 v0.4.3 版本,首发支持 Gemma 4 等多模态模型,并引入 TurboQuant 压缩技术

来源作者:Garry Tan (@garrytan)原始来源:https://x.com/garrytan/status/2039957916263002343

中文导读

mlx-vlm 发布 v0.4.3 版本,首发支持 Gemma 4、Falcon-OCR 等多模态模型,并引入 TurboQuant 压缩技术。

正文 Markdown

mlx-vlm v0.4.3 is here 🚀 Day-0 support: 🔥 Gemma 4 (vision, audio, MoE) by @GoogleDeepMind 🦅 Falcon-OCR + Falcon Perception by @TIIuae 🪨 Granite Vision 4.0 by @IBMResearch New models: 🎯 SAM 3.1 with Object Multiplex by @facebook 🔍 RF-DETR detection & segmentation by @roboflow Infra: ⚡ TurboQuant (KV cache compression) 🖥️ CUDA support for vision models (Sam and RF-DETR) Get started today: > uv pip install -U mlx-vlm Leave us a star ⭐️ https://t.co/7BvnEuzKvj