MiMo-V2-Flash API

All You Need To Know About MiMo-V2-Flash API

Overview

Model Provider: XiaomiMiMo
Model Type: Chat
State: Ready

Key Specs

Quantization: BF16
Parameters: 309B total (15B active)
Context: 256K
Pricing: $0.08 input / $0.24 output

Introduction

MiMo-V2-Flash is a Mixture-of-Experts (MoE) language model with 309B total parameters, of which 15B are active per token. It is designed for high-speed reasoning and agentic workflows, and it combines a hybrid attention architecture with Multi-Token Prediction (MTP) to achieve state-of-the-art performance while significantly reducing inference costs.

MiMo-V2-Flash API Usage

Model

Endpoint

mimo/mimo-v2-flash


curl -X POST https://api.canopywave.io/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $CANOPYWAVE_API_KEY" \
  -d '{
    "model": "xiaomimimo/mimo-v2-flash",
    "messages": [
      {"role": "user", "content": "tell me a story"}
    ],
    "max_tokens": 1000,
    "temperature": 0.7
  }'
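For callers who prefer a script over the shell, the curl request above can be sketched in Python using only the standard library. The URL, model id, headers, and body fields are copied from the curl example. The helper names `build_request` and `chat` are hypothetical, and reading the key from a `CANOPYWAVE_API_KEY` environment variable is an assumption that mirrors the shell variable. The response schema is not documented on this page, so the sketch returns the parsed JSON without assuming any particular fields.

```python
import json
import os
import urllib.request

# Endpoint and model id are copied from the curl example above.
API_URL = "https://api.canopywave.io/v1/chat/completions"
MODEL = "xiaomimimo/mimo-v2-flash"


def build_request(prompt, api_key, max_tokens=1000, temperature=0.7):
    """Build (but do not send) the same POST request as the curl example.

    Hypothetical helper: the name and signature are illustrative only.
    """
    payload = {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
        "temperature": temperature,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def chat(prompt, api_key=None):
    """Send the request and return the parsed JSON response as a dict.

    Assumption: the key lives in the CANOPYWAVE_API_KEY environment
    variable when it is not passed explicitly.
    """
    api_key = api_key or os.environ["CANOPYWAVE_API_KEY"]
    request = build_request(prompt, api_key)
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)
```

With the key exported in the environment, `print(json.dumps(chat("tell me a story"), indent=2))` would send the request and print whatever JSON the service returns. Keeping `build_request` separate from `chat` makes it possible to inspect the payload and headers without making a network call.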