Technical Context
I looked into MiniMax's recent releases not out of idle curiosity, but because I immediately wanted to see how such models fit into AI automation and high-throughput tasks. On paper, the picture looks very appealing: an MoE architecture, low cost thanks to a small number of active parameters, and impressive results in coding, tool calling, and agent-based scenarios.
Specifically, MiniMax’s notable 2026 models include the M1 and the more recent M2.5 and M2.7 lines. They have large total parameter counts but significantly smaller active sets: for example, the M2.5 has about 229B total and ~10B active parameters, while the M1 has ~456B total and ~45.9B active. This is where the cost savings come from: per-token inference compute scales with the active parameters, not the total, so a model can score close to a top-tier one on benchmarks while costing several times less to run than dense flagships like Opus.
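As a rough sketch of why active parameters dominate the economics, here is the common ~2 FLOPs-per-active-parameter decoding rule of thumb applied to the M2.5 figures quoted above. This is a back-of-the-envelope estimate, not a pricing model; real cost also depends on memory bandwidth, batching, and hardware.

```python
# Rule of thumb: decoding one token costs roughly 2 FLOPs per *active* parameter.
def flops_per_token(active_params: float) -> float:
    return 2.0 * active_params

total_params = 229e9   # M2.5 total parameters (figure quoted above)
active_params = 10e9   # M2.5 active parameters per token

# If the same model were dense, every parameter would be active on every token.
ratio = flops_per_token(total_params) / flops_per_token(active_params)
print(f"Per-token compute advantage of MoE over a same-size dense model: ~{ratio:.0f}x")
```

The same total capacity is available, but only a small slice of it is paid for on each token, which is where the "several times cheaper" claim comes from.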
And this is usually where I temper my enthusiasm. The quality of MoE models almost always depends on their routing: if a query hits the right experts, the response is excellent; if the router misses, the same model can suddenly falter on a very similar case. That's why statements like "it's almost as good as Opus on SWE-Bench, so we can replace it seamlessly" seem too bold to me.
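The gating step that the paragraph above worries about can be sketched in a few lines. This is an illustrative top-2 softmax router over hypothetical experts, not MiniMax's actual routing implementation: the point is that a single linear scoring function decides which experts see the token, so a near-miss in those scores changes which slice of the model actually runs.

```python
import numpy as np

def top_k_route(x, gate_weights, k=2):
    """Score all experts with a linear gate and keep only the top-k.
    If the gate ranks the 'wrong' experts highest, the token is processed
    by experts that fit it poorly -- the failure mode described above."""
    logits = gate_weights @ x                      # one score per expert
    top = np.argsort(logits)[-k:]                  # indices of the k best experts
    weights = np.exp(logits[top] - logits[top].max())
    weights /= weights.sum()                       # softmax over the selected k only
    return top, weights

rng = np.random.default_rng(0)
num_experts, dim = 8, 16
gate = rng.normal(size=(num_experts, dim))         # toy gate matrix
token = rng.normal(size=dim)                       # toy token representation

experts, mix = top_k_route(token, gate)
print(experts, mix)  # two expert ids and their mixture weights (sum to 1)
```

Two similar inputs that land on opposite sides of a routing boundary get different experts, which is exactly why quality on "a very similar case" can wobble.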
On benchmarks, MiniMax does look strong, especially on tasks involving tools, long contexts, and repetitive patterns. But in live production, what matters isn't the best attempt; it's consistent quality across attempts, and that's where dense models typically behave more predictably.
Impact on Business and Automation
For automations, this isn't necessarily a drawback; in fact, it's often a workable compromise. If I have a narrow pipeline, a solid system prompt, format control, output validation, and a clear set of tools, MiniMax can provide a very cost-effective AI integration.
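The "format control plus output validation" loop can be sketched as follows. This is a minimal illustration, not production code: `call_model` is a hypothetical stand-in for whatever LLM API the pipeline uses, and the schema here is a toy support-triage example.

```python
import json

def call_model(prompt: str) -> str:
    # Hypothetical stand-in for the actual LLM API call.
    return '{"category": "billing", "priority": 2}'

# The contract the pipeline enforces on every response.
REQUIRED_FIELDS = {"category": str, "priority": int}

def validate(raw: str) -> dict:
    data = json.loads(raw)  # raises json.JSONDecodeError on malformed output
    for field, typ in REQUIRED_FIELDS.items():
        if not isinstance(data.get(field), typ):
            raise ValueError(f"bad or missing field: {field}")
    return data

def run_pipeline(prompt: str, retries: int = 2) -> dict:
    """Validate the model's output and retry on failure, so a bad sample
    degrades to a retry or an escalation instead of a wrong answer."""
    for _ in range(retries + 1):
        try:
            return validate(call_model(prompt))
        except (ValueError, json.JSONDecodeError):
            continue
    raise RuntimeError("escalate to a stronger model or a human")

result = run_pipeline("Classify this support ticket: ...")
print(result)
```

Guardrails like this are what make a cheaper, more variable model safe to use: the pipeline, not the model, guarantees the output shape.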
Who wins? Teams that need massive throughput: support triage, data extraction, draft generation, code agents, internal copilots. Who loses? Products where users can speak freely, jump between topics, and expect a consistently intelligent dialogue without guardrails.
I would put it simply: MiniMax hasn't "killed Opus" but has significantly tightened the economics in areas where process architecture matters more than the model's charisma. At Nahornyi AI Lab, this is precisely what we help clients figure out: sometimes a cheap MoE is sufficient, while other times, trying to cut costs breaks the entire UX. If you're planning an AI implementation and are unsure what to deploy, we can quickly analyze your scenario and build an AI solution without risky experiments on live users.