Claude Code Proxy Costs
Documentation and guides for OutcomeDev.
By enabling OutcomeDev's ability to proxy the Claude Code CLI through Anthropic-compatible providers like MiniMax, we unlock significant cost savings without sacrificing the robust terminal-native capabilities of the Claude Code harness.
This document breaks down the cost implications of using native Anthropic models (like the latest 4.x series) versus proxying to high-performance open-weight or alternative models like MiniMax M2.5 and M2.7.
Current Pricing (per 1 Million Tokens)
Prices as of April 2026
| Provider / Model | Input Cost | Output Cost | Context Window |
|---|---|---|---|
| Anthropic Claude Opus 4.7 | $5.00 | $25.00 | 1M |
| Anthropic Claude Sonnet 4.6 | $3.00 | $15.00 | 1M |
| Anthropic Claude Haiku 4.5 | $1.00 | $5.00 | 1M |
| MiniMax M2.7 | ~$0.30 | ~$1.20 | 256K |
| MiniMax M2.5 | ~$0.15 | ~$1.15 | 256K |
Scenario: Complex Agentic Task
A typical complex software engineering task using the Claude Code CLI. Because the CLI is highly agentic, it makes many API calls and heavily utilizes the context window to "read" the workspace.
Assumptions:
- Turns/Steps: 50 API calls
- Average Input Size: 40,000 tokens per turn (Context builds up as the CLI reads files)
- Average Output Size: 2,000 tokens per turn
- Total Input Tokens: 2,000,000 (2M)
- Total Output Tokens: 100,000 (0.1M)
Cost Breakdown per Session
1. Using Claude Opus 4.7 (Native)
- Input Cost: 2M * $5.00 = $10.00
- Output Cost: 0.1M * $25.00 = $2.50
- Total Session Cost: $12.50
2. Using Claude Sonnet 4.6 (Native)
- Input Cost: 2M * $3.00 = $6.00
- Output Cost: 0.1M * $15.00 = $1.50
- Total Session Cost: $7.50
3. Using Claude Haiku 4.5 (Native)
- Input Cost: 2M * $1.00 = $2.00
- Output Cost: 0.1M * $5.00 = $0.50
- Total Session Cost: $2.50
4. Proxying via MiniMax M2.7
- Input Cost: 2M * $0.30 = $0.60
- Output Cost: 0.1M * $1.20 = $0.12
- Total Session Cost: $0.72
5. Proxying via MiniMax M2.5
- Input Cost: 2M * $0.15 = $0.30
- Output Cost: 0.1M * $1.15 = $0.12
- Total Session Cost: $0.42
Conclusion: The "Proxy" Advantage
Using the MiniMax M2.7 proxy instead of Claude Sonnet 4.6 yields a ~10.4x reduction in cost (saving nearly $6.80 per complex task). Against the top-tier Claude Opus 4.7, the savings are even more staggering: ~17.3x cheaper (saving almost $12.00 per task).
Even against Anthropic's fastest/cheapest current model (Haiku 4.5), MiniMax M2.7 is nearly 3.5x cheaper.
By utilizing OutcomeDev's proxying capability (ANTHROPIC_BASE_URL and ANTHROPIC_CUSTOM_MODEL_OPTION injection), teams can deploy the highly-polished Claude Code harness at scale across hundreds of automated tasks daily, turning what would be a massive infrastructure expense into a trivial operational cost.
A Note on Interoperability and Ecosystem Health
Proxying the Claude Code CLI through Anthropic-compatible backends is a standard exercise of developer interoperability.
Why this benefits the ecosystem:
- Standardized Protocols: The CLI's support for
ANTHROPIC_BASE_URLis a deliberate architectural feature designed to support diverse networking environments and compatible endpoints. - Mindshare Over Marketshare: Even when proxying, developers remain within the Anthropic toolchain. This cements the Claude Code CLI as the industry standard for agentic coding.
- Low-Friction Upsell: By using the Claude harness as their daily driver, developers can effortlessly switch back to native Anthropic models (like Claude Opus) for critical, high-reasoning tasks with a single command.
OutcomeDev promotes a transparent, modular AI stack where developers are free to choose the best "brain" for their current budget and task requirements.