
The World’s First-Ever Unlimited Token Plan
90% Cost Saving | Real Unlimited Tokens | Best Open Model
Unlimited Tokens, 90% Cost Saving
| Usage tier | Closed-model cost | Canopy Wave plan |
| --- | --- | --- |
| Light User-50M | $159 | $15.99 Unlimited 50M |
| Medium User-200M | $636 | $59.99 Unlimited 200M |
| Heavy User-500M | $1,590 | $159.99 Unlimited 500M |
Up to 90% lower cost than closed models such as Claude, based on a 65:1 input-to-output token ratio.
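The arithmetic behind the saving claim can be checked with a short sketch. The tier figures below are copied from the pricing table on this page; reading the middle column as the closed-model cost is an assumption, and the $3 input and $15 output prices per million tokens are hypothetical values used only to illustrate how a 65:1 input-to-output ratio turns into a single blended rate. All function names are illustrative.

```python
# Monthly figures copied from the pricing table on this page (USD).
# Assumption: the middle column is the closed-model cost for the same volume.
TIERS = [
    # (tier name, millions of tokens, closed-model cost, plan price)
    ("Light User-50M", 50, 159.00, 15.99),
    ("Medium User-200M", 200, 636.00, 59.99),
    ("Heavy User-500M", 500, 1590.00, 159.99),
]


def saving_fraction(closed_cost, plan_price):
    """Fraction of the closed-model cost saved by paying the plan price."""
    return 1 - plan_price / closed_cost


def blended_price_per_million(input_price, output_price, input_per_output=65):
    """Average price per million tokens when `input_per_output` input tokens
    are sent for every output token."""
    return (input_per_output * input_price + output_price) / (input_per_output + 1)


if __name__ == "__main__":
    for name, millions, closed_cost, plan_price in TIERS:
        rate = closed_cost / millions
        print(f"{name}: {saving_fraction(closed_cost, plan_price):.1%} saved, "
              f"closed-model rate ${rate:.2f} per million tokens")
    # Hypothetical per-million prices, not stated on this page.
    print(f"Blended rate at 65:1: ${blended_price_per_million(3.00, 15.00):.2f}")
```

On the table's figures, each tier works out to a saving of roughly 90% and a closed-model rate of $3.18 per million tokens, which the hypothetical prices above happen to reproduce at a 65:1 ratio.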

See How We Stack Up Against Closed Models
| Unlimited tokens and unlimited requests | Single payment, unlimited tokens | Pay per token used | Pay per token used |
| No surprise bills | Use with confidence, no cost worries | Worried about surprise bills | Worried about surprise bills |
| Priority access at high-traffic times | | | |
| 100% Privacy Protected | | | |
| Open Source | | | |
Deeply Optimized for Best-in-Class Performance
We offer professionally optimized Kimi K2.5 and MiniMax M2.5, delivering significant performance advantages over other providers.
Canopy Wave keeps P95 time to first token (TTFT) consistently below 6.0 s, even under high load and complex requests. That is up to 21x faster than other providers.


Up to 21x faster than other providers, delivering both high speed and stability under heavy load and complex requests.
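To make the P95 TTFT figure concrete, here is a minimal sketch of how a client could check its own measurements against the 6.0 s target. It uses the nearest-rank percentile definition, which is one common choice; the page does not say how Canopy Wave computes its P95, and the function names are hypothetical.

```python
import math


def percentile_nearest_rank(samples, pct):
    """Return the nearest-rank percentile of a non-empty list of numbers."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    # Nearest rank: the smallest value with at least pct% of samples at or below it.
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def meets_ttft_target(ttft_seconds, target_s=6.0, pct=95):
    """True when the chosen percentile of measured TTFT is below the target."""
    return percentile_nearest_rank(ttft_seconds, pct) < target_s


if __name__ == "__main__":
    # Hypothetical measurements in seconds, for illustration only.
    measured = [0.8, 1.1, 0.9, 1.4, 2.0, 1.2, 0.7, 5.1, 1.0, 1.3]
    print(percentile_nearest_rank(measured, 95), meets_ttft_target(measured))
```

A P95 target tolerates a few slow requests: up to 5% of samples may exceed 6.0 s and the check still passes.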

Canopy Wave delivers a Total TPS of 1078.13 tokens/s. Measured as end-to-end inference throughput (prefill + decode), that is up to 5x higher than other providers.

Up to 5x higher Total TPS than other providers, delivering superior throughput across the entire inference pipeline (prefill + decode).
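The page does not define Total TPS precisely. The sketch below assumes it means all tokens processed (input tokens during prefill plus output tokens during decode) divided by wall-clock time; the function name and the sample numbers are illustrative.

```python
def total_tps(prompt_tokens, completion_tokens, elapsed_seconds):
    """Tokens per second across prefill and decode combined.

    Assumption: "Total TPS" counts input and output tokens together.
    """
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return (prompt_tokens + completion_tokens) / elapsed_seconds


if __name__ == "__main__":
    # Hypothetical request: 6,500 input tokens and 100 output tokens in 6 seconds.
    print(f"{total_tps(6500, 100, 6.0):.2f} tokens/s")
```

Under this definition, input-heavy workloads report far higher numbers than decode-only tokens per second, so the two metrics should not be compared directly.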
100% Privacy Protected
All inference runs under Canopy Wave's data privacy and compliance guarantees for international users. Canopy Wave retains none of your inference data, eliminating data leakage risks.
Choose Your Plan
Unlimited* 50M
$15.99 per month, billed monthly
Unlimited tokens
Unlimited requests
50 million high-speed tokens (after 50M, your API requests may be processed at low priority)
Access to Kimi K2.5 & MiniMax M2.5
Standard priority queue
Community support channel


Unlimited* 200M
$59.99 per month, billed monthly
Unlimited tokens
Unlimited requests
200 million high-speed tokens (after 200M, your API requests may be processed at low priority)
Access to Kimi K2.5 & MiniMax M2.5
High-priority inference
Priority technical support channel
Unlimited* 500M
$159.99 per month, billed monthly
Unlimited tokens
Unlimited requests
500 million high-speed tokens (after 500M, your API requests may be processed at low priority)
Access to Kimi K2.5 & MiniMax M2.5
Highest inference priority
Dedicated support channel
Enterprise
Fully customized solution
Private deployment options
SLA guarantee
Dedicated technical account manager
Compliance & security customization
*Unlimited API call claims
Premium high-speed inference, followed by unlimited usage at reduced throughput.
If you consume more than your plan's high-speed token allocation (50, 200, or 500 million tokens), your API requests will continue to work without interruption, but they may be processed at low priority with limited concurrent requests until your next billing cycle or plan upgrade.
No overage fees. No hard stops. Your workflows keep running.
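The fallback rule above can be modeled in a few lines for capacity planning. This is a client-side sketch under stated assumptions, not Canopy Wave's API: the plan keys, the function name, and the returned labels are hypothetical, and only the three allocations come from this page.

```python
# High-speed token allocations per billing cycle, as listed on this page.
HIGH_SPEED_ALLOCATION = {
    "Unlimited 50M": 50_000_000,
    "Unlimited 200M": 200_000_000,
    "Unlimited 500M": 500_000_000,
}


def usage_status(plan, tokens_used_this_cycle):
    """Describe how requests are expected to be treated at a given usage level."""
    allocation = HIGH_SPEED_ALLOCATION[plan]
    over = tokens_used_this_cycle > allocation
    return {
        # Past the allocation, requests still work but may be deprioritized.
        "tier": "may be low priority" if over else "high-speed",
        "high_speed_tokens_left": max(0, allocation - tokens_used_this_cycle),
        # The page states there are no overage fees and no hard stops.
        "overage_fee_usd": 0.0,
        "blocked": False,
    }


if __name__ == "__main__":
    print(usage_status("Unlimited 50M", 42_000_000))
    print(usage_status("Unlimited 50M", 61_000_000))
```

Tracking `high_speed_tokens_left` against the days remaining in the cycle gives an early signal for when an upgrade to the next plan would keep requests in the high-speed tier.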
