X (Twitter) Facebook Pinterest LinkedIn E-mail

OpenAI is reportedly considering a significant price cut for its tokens to defend its market position against Anthropic’s growing presence in the professional AI sector. This information, initially reported by The Wall Street Journal and picked up by Reuters, has not been officially confirmed by the company but aligns with a trend already noticeable among businesses and developers: the cost of using advanced models has become as critical as the quality of responses.

The potential reduction comes at a time when AI is shifting from experimentation to more continuous, widespread use. Many companies are no longer conducting isolated tests with ChatGPT, Claude, or integrated internal models. Instead, they are deploying agents, coding assistants, support systems, document analysis tools, process automation, and internal tools generating thousands or millions of requests. When that happens, the token price stops being merely a technical detail in an API documentation and becomes a line item in the budget.

The token becomes the new economic unit of AI

In the language model market, the token functions as the basic billing unit. Companies pay for the information they send to the model and for the responses they receive. Costs vary depending on the model, context size, processing speed, usage type, and whether discounts are applied for batching, caching, or low-priority tasks.

Currently, OpenAI maintains a tiered structure. GPT-5.5 is priced at $5 per million input tokens and $30 per million output tokens. GPT-5.4 drops to $2.50 and $15, respectively, while GPT-5.4 Mini is at $0.75 and $4.50. There are also more economical options, such as GPT-5.4 Nano, and significant discounts through Batch API or Flex for asynchronous or less urgent workloads.

Anthropic, on the other hand, offers a range where Claude Fable 5 and Mythos 5 are priced at $10 per million input tokens and $50 per million output tokens. Opus 4.8 is at $5 and $25, Sonnet 4.6 at $3 and $15, and Haiku 4.5 at $1 and $5.

Provider and Model	Input per 1M tokens	Output per 1M tokens	Positioning
OpenAI GPT-5.5	$5	$30	Advanced model for demanding tasks
OpenAI GPT-5.4	$2.50	$15	Mid-range tier
OpenAI GPT-5.4 mini	$0.75	$4.50	Mass deployment and lower-cost tasks
OpenAI GPT-5.4 nano	$0.20	$1.25	Simple automation, high volume
Anthropic Claude Fable 5	$10	$50	Premium model for agents and complex tasks
Anthropic Claude Opus 4.8	$5	$25	Advanced range
Anthropic Claude Sonnet 4.6	$3	$15	Balance between capacity and cost
Anthropic Claude Haiku 4.5	$1	$5	Economical option within Claude lineup

The table shows that competition is not simply “OpenAI expensive versus Anthropic cheap.” Both catalogs feature models at various levels. The real battleground is the effective cost per task completed. A model might charge more per token but require fewer steps. Another might be cheaper but need more context, retries, or human supervision.

Anthropic can respond, but not necessarily by cutting everything

The key question is whether Anthropic will engage in an open price war. The most likely answer is that they will respond, but cautiously. It doesn’t seem reasonable to immediately cut prices across their entire premium range, especially after introducing Claude Fable 5 and Mythos 5 as high-value models for professional work, agents, and coding. A blanket discount could defend market share but would also reduce margins on a costly infrastructure to operate.

Anthropic has other strategies to compete. They can offer volume discounts to large clients, expand enterprise credits, improve actual costs through prompt caching, push workloads to Batch Processing with 50% discounts, adjust intermediate models like Sonnet or Haiku, or create specific pricing for high-volume use cases like coding agents.

Potential moves by Anthropic	Probability	Likely effect
General discount on Fable 5	Low to medium	Defend market share aggressively but impact margins
Enterprise volume discounts	High	Competing for large accounts without reducing public prices
More credits in subscriptions	High	Retain professional users
Boost in Batch Processing	High	Lower costs for less urgent workloads
Prompt caching improvements	High	Reduce actual costs for long-running tasks
Adjustments to Sonnet or Haiku	Medium-high	Defend volume in common tasks
Specific rates for coding agents	Medium	Target developer market directly

This selective response makes sense because Anthropic has gained a reputation in professional applications where quality is paramount. Claude Code has gained visibility among developers, and Claude models are used for programming, long analysis, and document work. The company can try to defend this space not just through price but also through performance, reliability, and user experience.

Coding and agents: where the billing skyrockets

Pricing pressure is especially evident in coding agents. A code assistant doesn’t just answer questions; it reads multiple files, understands dependencies, suggests changes, generates tests, runs tools, interprets errors, and retries. Each step consumes tokens.

For individual users, costs may seem manageable. But for engineering teams with dozens or hundreds of developers working daily with agents, the costs scale up quickly. A difference of just a few dollars per million tokens can turn into thousands or millions annually, depending on volume.

The same applies to customer support, document review, contract analysis, back-office automation, or internal agents executing chained tasks. The more autonomous the system, the more context it needs and the more iterations it performs. Model efficiency matters, but so does price.

Use case	Why it consumes many tokens	Price sensitivity
Coding agents	Read code, generate changes, and test repeatedly	Very high
Automated support	Process histories, policies, and long responses	High
Document analysis	Work with extensive documents and accumulated context	High
Internal agents	Chain steps, tools, and validations	Very high
Content generation	Create multiple variants and revisions	Medium
Simple classification	Short, repetitive tasks	Very high if volume is massive

For this reason, OpenAI is incentivized to lower prices. If they can reduce the cost of GPT-5.5 or its intermediate models without degrading performance, they can capture more enterprise workloads before Anthropic secures a strong position in professional flows.

Lowering prices doesn’t eliminate infrastructure costs

The issue is that generative AI doesn’t have the typical profit margins of traditional software. Every model call consumes computation, memory, network, energy, and data center capacity. Advanced inference, especially in large models and long contexts, is not free even with optimized software.

Both OpenAI and Anthropic have raised capital and signed large infrastructure deals precisely because serving AI at scale requires huge investment. Reducing prices might boost adoption but also increase demand for GPUs and strain availability. If the additional volume doesn’t compensate for the lower revenue per token, margins will tighten.

So, the strategy isn’t straightforward. OpenAI could lower prices to gain market share but will need to rely on more efficient models, caching, batching, proprietary hardware, or more favorable cloud agreements. Anthropic faces the same dilemma. Ultimately, both are competing to demonstrate that they can sell AI at scale without serving costs eating into profitability.

The real battleground will be the cost per task, not per token

In the medium term, companies won’t just compare the price per million tokens but will look at the total cost to complete an entire task. How much does it cost to close a support ticket? To review a code repository? To analyze a thousand contracts? To keep an agent running for an hour? How much human work is saved, and how much supervision is needed?

This could favor multi-model strategies—using cheap models for simple tasks, premium models for complex decisions, self-hosted models for sensitive data, or different providers based on latency, cost, availability, or compliance. Price wars may not produce a single winner; they might accelerate market fragmentation.

Business strategy	Advantage	Risk
Always use the most powerful model	Better overall quality	High cost
Always use the cheapest model	Immediate savings	Lower reliability in complex tasks
Multi-model architecture	Cost-quality balance	Greater technical complexity
Partial self-hosting	Data and cost control at scale	More demanding operation
Batched, non-urgent tasks	Significant discounts	Less immediacy
Prompt caching	Lower costs for repeated contexts	Requires good prompt and workflow design

Good news for users, pressure for providers

For developers, startups, and companies already using AI, a price decrease would be positive. It would enable more experimentation, greater automation, and the launch of products at lower entry costs. It could also help make AI tools accessible to users who were previously limited by budgets.

For model providers, the outlook is more challenging. Competition might turn much of the market into a volume game, where only those with the best combination of models, infrastructure, distribution, and capital survive. Smaller, more efficient models will gain importance, as not all tasks justify using the most expensive models.

Regarding employment, it’s not only about token prices. Cheaper AI can accelerate automation, but replacing human work still depends on reliability, integration, legal responsibility, data quality, and oversight. Many companies have learned that poorly implemented automation can be more costly than maintaining human teams. Lower prices will make more cases viable but won’t solve organizational complexity on their own.

OpenAI’s potential price cut hints at a more mature and less idealistic phase of AI. It’s no longer enough to present an impressive model; convincing the market that daily use is economically sensible is crucial. Anthropic may enter the price war but will likely choose where to cut margins and where to defend pricing carefully.

Enterprise AI is beginning to resemble any infrastructure market: performance, cost, availability, and trustworthiness. The winner won’t necessarily be the one with the most brilliant model but the one who can make each task cheaper, faster, and with less risk.

Frequently Asked Questions

Has OpenAI confirmed the price reduction?

No. The potential reduction comes from a report by The Wall Street Journal, confirmed by Reuters, which stated they couldn’t verify it independently.

Why does OpenAI want to lower prices?

The assumption is that they aim to better compete with Anthropic in professional clients, coding, AI agents, and high-volume enterprise workloads.

Could Anthropic engage in a price war?

Yes, though probably selectively, through volume discounts, credits, batch processing, prompt caching, or adjustments to specific models.

What metric should companies focus on?

Beyond price per million tokens, it’s more relevant to consider the cost per completed task: how much it costs to close a support ticket, review code, analyze contracts, or run an agent for an hour.