
CHAT
MiMo-V2-Flash API
All You Need To Know About MiMo-V2-Flash API
Overview
Model Provider:XiaomiMiMo
Model Type:Chat
State:Ready
Key Specs
Quantization:BF16
Parameters:310B
Context:256K
Pricing:$0.08 input / $0.24 output
Try Model API
Quick Start
Reserve Dedicated Endpoint
Introduction
MiMo-V2-Flash is a Mixture-of-Experts (MoE) language model with 309B total parameters and 15B active parameters. Designed for high-speed reasoning and agentic workflows, it utilizes a novel hybrid attention architecture and Multi-Token Prediction (MTP) to achieve state-of-the-art performance while significantly reducing inference costs.
MiMo-V2-Flash API Usage
Endpoint
mimo/mimo-v2-flash

