MiniMax Releases M3 Multimodal Model Series with Base and Quantized Versions

MiniMax has officially released the M3 multimodal model series on Hugging Face, featuring both the base M3 model and the quantized MiniMax-M3-MXFP8. The models adopt a Mixture of Experts (MoE) architecture, supporting image-text-to-text multimodal tasks and offering agent and coding capabilities.

According to the Hugging Face page, the M3 series quickly gained traction, with over 572,000 downloads and 43 likes. This reflects strong community demand for open-source multimodal models.

MiniMax 正式发布 M3 多模态模型系列，含基础版与量化版 — Image source: huggingface.co

The key highlight of the MiniMax M3 series is its multimodal fusion capability, processing both image and text inputs to generate text outputs. This makes it suitable for visual question answering, image-text generation, intelligent customer service, and more.

The quantized MXFP8 version uses 8-bit floating-point quantization to reduce deployment cost, enabling large model inference on consumer-grade GPUs and lowering the barrier for developers.

MiniMax has been committed to large model R&D, and the M3 series represents a significant step in multimodal direction. Compared to similar models, M3 strikes a good balance between parameter count and performance.

Looking ahead, MiniMax may release larger models or optimize for specific scenarios. Developers can expect richer toolchains and community support.

The release also underscores the growing activity of Chinese AI companies in the open-source space, offering more choices to global developers.

Sources

Source 1: https://huggingface.co/MiniMaxAI/MiniMax-M3-MXFP8

Why it matters

MiniMax's M3 multimodal series is a notable open-source contribution, advancing multimodal model accessibility and cost-effective deployment.

微博 X LinkedIn Facebook Telegram 邮件

MiniMaxMulti-modalModelHugging Face

MiniMax Releases M3 Multimodal Model Series with Base and Quantized Versions

Nearby Updates

MiniMax Officially Releases M3 Multimodal MoE Model on Hugging Face with Image-Text-to-Text Pipeline

First AI Agent Payment Completed in France, Marking a FinTech Milestone

GSMA Intelligence Releases Agentic Core White Paper, Defining New Paradigm for Intelligent Core Network Evolution

Om AI Lianhui Releases VLX: World's First Edge Streaming Multimodal Model for the Physical World