
Unlimited Token Plan
The World’s First-Ever
Unlimited Token Plan
The World’s First-Ever Unlimited Token Plan
90% Cost Saving | Real unlimited tokens | Best Open Model
Unlimited Tokens, 90% Lower Cost

Up to 90% lower cost than closed models such as Claude, with a 65:1 input-to-output ratio.

Up to 90% lower cost than closed models such as Claude, with a 65:1 input-to-output ratio.
Deeply optimized for Best-in-class Performance
We offer professionally optimized Kimi K2.5, delivering significant performance advantages over other providers.
Up to 21x faster than other providers, delivering both high speed and stability under heavy load and complex requests.


Up to 21x faster than other providers, delivering both high speed and stability under heavy load and complex requests.

Up to 5x higher Total TPS than other providers, delivering superior throughput across the entire inference pipeline (prefill + decode).

Up to 5x higher Total TPS than other providers, delivering superiorthroughput across the entire inference pipeline (prefill + decode).
100% Privacy Protected
We guarantee that all inference runs with Canopy Wave's assurance of data privacy and compliance for international users. Your inference data is retained at zero by Canopy Wave, eliminating data leakage risks.
Choose Your Plan
Unlimited* 50M
per month, billed monthly
Model : Kimi K2.5
Threshold : 50M
- Unlimited token calls
- Standard priority queue
- Community support channel

Unlimited* 200M
per month, billed monthly
Model : Kimi K2.5
Threshold : 200M
- Unlimited token calls
- High-priority inference
- Priority technical support channel
Unlimited* 500M
per month, billed monthly
Model : Kimi K2.5
Threshold : 500M
- Unlimited token calls
- Highest inference priority
- Dedicated support channel
Enterprise
- Fully customized solution
- Private deployment options
- SLA guarantee
- Dedicated technical account manager
- Compliance & security customization
*Unlimited API call claims
Premium high-speed inference up to your plan's limit, then unlimited at reduced throughput
If you consume more than your plan's premium token allocation (50M / 200M / 500M), your API requests will continue to work without interruption but may be processed at low priority with limited concurrent request until your next billing cycle or plan upgrade.
No overage fees. No hard stops. Your workflows keep running.
