
CHATCODELLM
Kimi-K2-Thinking API
All You Need To Know About Kimi-K2-Thinking API
Overview
Model Provider:Moonshot AI
Model Type:Chat
State:Ready
Key Specs
Quantization:bf16
Parameters:1T
Context:256K
Pricing:$0.48 input / $2.00 output
Try Model API
Quick Start
Reserve Dedicated Endpoint
Introduction
The Kimi-K2-Thinking API provides access to Moonshot AI’s latest open-source thinking agent, designed for exceptional long-horizon reasoning. It combines step-by-step thought processes with dynamic tool use, maintaining stable performance across 200–300 sequential calls and achieving state-of-the-art results on benchmarks such as HLE and BrowseComp.
Built on a trillion-parameter MoE architecture with a 256k context window, the Kimi-K2-Thinking API leverages native INT4 quantization for a 2× inference speedup—delivering powerful deep-reasoning capabilities with high efficiency.
Kimi-K2-Thinking API Usage
Endpoint
moonshotai/kimi-k2-thinking

