OpenAI GPT-5.6: Sol, Terra, and Luna Pricing, Benchmarks July 2026
Quick summary
OpenAI launched GPT-5.6 Sol, Terra, and Luna June 26 to 20 vetted partners. Sol is $5/$30, Terra $2.50/$15, Luna $1/$6 per 1M tokens. General access expected mid-July.
Read next
- ChatGPT Had 90% of the US Enterprise AI Market in 2025. Claude Now Has 70%. What Happened in 12 Months.In February 2025, ChatGPT held 90% of the US business AI market. By February 2026, Claude enterprise share surged to nearly 70%. Here is what drove the shift and what it means for developers choosing AI platforms.
- OpenAI Just Launched GPT-5.4. Native Computer Use, 1 Million Token Context, 33% Fewer Errors. Here Is What Changes for Developers.OpenAI released GPT-5.4 on March 5, 2026 with native computer use — AI agents that operate desktop and web apps without wrapper code. 1 million token context, 33% fewer errors. Here is what this means for every developer building AI agents.
OpenAI launched GPT-5.6 on June 26, 2026 as three distinct capability tiers: Sol, Terra, and Luna. Sol, the flagship, tops Terminal-Bench 2.1 at 91.9% and costs $5 per million input tokens. As of July 5, all three models remain restricted to roughly 20 vetted partner organizations, with general availability expected in mid-July. Claude Sonnet 5 shipped to all users the same day GPT-5.6 launched and is already available through every Claude plan.
Here is what you need to know before general access opens.
What Is the GPT-5.6 Model Family?
GPT-5.6 introduces a new naming system built around durable capability tiers rather than version suffixes. Sol, Terra, and Luna are not just cost tiers. OpenAI describes them as independent capability levels that can advance on their own release cadence. Sol is the highest capability tier, comparable to the old "Opus-class" framing. Terra is the balanced everyday model. Luna is the fast, cost-optimised option.
This naming change is significant for developers building on the API. GPT-5.6 Sol will stay Sol as the model improves, rather than being renamed GPT-5.7. The tier name becomes a stable reference point for capability expectations.
Sol, Terra, and Luna: What Each Tier Is For
Sol is the frontier model. It is what you reach for when quality is the only constraint, not latency or cost. The 91.9% score on Terminal-Bench 2.1 under the "Sol Ultra" configuration is the highest recorded for any model on that benchmark. Standard Sol runs at 88.8%. The gap between Sol Ultra and standard Sol suggests the Ultra configuration uses extended thinking or multi-pass reasoning, similar to how Claude's extended thinking mode changes output quality on hard tasks.
Terra is the balanced model. At $2.50 input and $15 output per million tokens, it sits in the same pricing neighbourhood as Claude Sonnet 5 and Gemini 2.5 Pro. For most business automation, coding assistance, and document processing, Terra is where the cost-quality tradeoff makes sense. OpenAI describes it as suitable for "everyday work."
Luna is the speed and cost tier. At $1 input and $6 output per million tokens, it competes directly with Claude Haiku 4.5 and Gemini 2.5 Flash. High-volume pipelines where latency and cost matter more than frontier reasoning belong on Luna.
GPT-5.6 Pricing vs Current Alternatives
| Model | Input (per 1M tokens) | Output (per 1M tokens) | Status |
|---|---|---|---|
| GPT-5.6 Sol | $5.00 | $30.00 | Restricted |
| GPT-5.6 Terra | $2.50 | $15.00 | Restricted |
| GPT-5.6 Luna | $1.00 | $6.00 | Restricted |
| Claude Sonnet 5 | $3.00 ($2.00 intro) | $15.00 ($10.00 intro) | Live |
| Claude Opus 4.8 | $15.00 | $75.00 | Live |
| Claude Haiku 4.5 | $0.80 | $4.00 | Live |
| Gemini 2.5 Pro | $1.25 | $10.00 | Live |
Sonnet 5's introductory pricing at $2/$10 undercuts GPT-5.6 Terra at $2.50/$15 on both input and output. Until GPT-5.6 reaches general availability, Sonnet 5 is the practical choice for developers who need a capable model today at competitive pricing.
Benchmark Comparison: GPT-5.6 vs Claude and Gemini
| Benchmark | GPT-5.6 Sol Ultra | GPT-5.6 Sol | Claude Opus 4.8 | Claude Sonnet 5 |
|---|---|---|---|---|
| Terminal-Bench 2.1 | 91.9% | 88.8% | 78.9% | — |
| Claude Opus 4.8 overall | — | — | 67.9 | — |
| GPT-5.5 (prior) | — | 88.0% | — | — |
GPT-5.6 Sol represents a meaningful improvement over GPT-5.5 on agentic coding tasks. The jump from 88.0% (GPT-5.5) to 88.8% (Sol) and 91.9% (Sol Ultra) on Terminal-Bench is the clearest signal that the model materially improves on the kinds of tasks developers care about most: autonomous code execution, tool use, and multi-step agent workflows.
The missing Claude Sonnet 5 data on Terminal-Bench is because the benchmark results were published before Sonnet 5 could be fully evaluated. That data will arrive as independent researchers run evaluations over the next few weeks.
Why GPT-5.6 Is Government-Gated
OpenAI shared GPT-5.6 Sol with the US government before announcing it publicly. General availability was delayed while OpenAI worked through security review with federal stakeholders. The pattern mirrors what happened with Anthropic's Fable 5 and Mythos 5, where the government imposed export controls after researchers found a jailbreak post-launch.
OpenAI avoided that outcome by coordinating with government before release. The tradeoff is a delayed general rollout for roughly 20 vetted partners to date, with a promised mid-July general availability window.
GPT-5.6 Sol will also run on Cerebras hardware at up to 750 tokens per second, which would make it the fastest frontier model available by token throughput. Cerebras access is launching in July for select customers as capacity expands.
What the Sol/Terra/Luna System Means for Developer Tooling
The new naming system creates a stable routing surface for agent frameworks and middleware. Instead of encoding specific version numbers into routing logic, teams can route by capability tier.
That said, OpenAI has changed its naming conventions multiple times. GPT-4 became GPT-4o became GPT-5 became GPT-5.5 became GPT-5.6, with capability suffixes and mini variants appearing and disappearing. The Sol/Terra/Luna framework is a meaningful step toward stability, but developers who have been burned by deprecation cycles should treat it as a commitment to watch rather than a guarantee.
Practically, this means: do not hardcode "Sol" as a permanent alias in critical production systems until the tier has been stable for at least two release cycles.
What to Do Before General Access Opens
If you are on the waitlist or have API access today, the priority tests are the ones that matter for your specific use case. Benchmarks like Terminal-Bench are useful signals but they measure a particular class of agentic coding tasks. Your pipeline may have different characteristics.
For teams currently on Claude Sonnet 5: the introductory pricing window is an opportunity to build cost baselines before GPT-5.6 Terra is available for comparison. When Terra ships, you will want real cost-per-task data, not benchmark extrapolations.
For teams on GPT-5.5 or GPT-4o: Sol's improvement on agentic tasks suggests it is worth evaluating for coding agents and tool-use workflows. The pricing is higher than 4o but the capability gap is significant.
Key Takeaways
- GPT-5.6 launched June 26, 2026 to approximately 20 vetted partners; general availability expected mid-July 2026
- Sol: $5/$30, Terra: $2.50/$15, Luna: $1/$6 per million input/output tokens
- Sol Ultra scores 91.9% on Terminal-Bench 2.1, the highest recorded for any model on that benchmark
- Claude Sonnet 5 at $2/$10 introductory pricing undercuts GPT-5.6 Terra and is available today across all Claude plans
- Sol/Terra/Luna are durable capability tiers, not version-specific model names, representing a shift toward stable API references
- For developers: build cost-per-task baselines on currently available models now; use them to evaluate GPT-5.6 Terra objectively when it ships rather than benchmarking from scratch
- What to watch: GPT-5.6 general availability date and Cerebras rollout for 750 tokens/second Sol access
FAQ
Frequently Asked Questions
What is OpenAI GPT-5.6 and when is it available?
GPT-5.6 is a three-tier model family launched by OpenAI on June 26, 2026, comprising Sol (flagship), Terra (balanced), and Luna (fast/affordable). As of July 5, the models are only accessible to approximately 20 vetted partner organizations. OpenAI has indicated general availability is expected in mid-July 2026. Claude Sonnet 5, which launched on the same day, is already available to all users.
What is the pricing for GPT-5.6 Sol, Terra, and Luna?
GPT-5.6 Sol costs $5 per million input tokens and $30 per million output tokens. Terra is $2.50 input and $15 output per million tokens. Luna is $1 input and $6 output per million tokens. Claude Sonnet 5 at its introductory rate of $2/$10 per million tokens undercuts GPT-5.6 Terra on both input and output pricing and is currently available.
How does GPT-5.6 Sol benchmark against Claude and other models?
GPT-5.6 Sol Ultra scores 91.9% on Terminal-Bench 2.1, the highest recorded for any model on that agentic coding benchmark. Standard Sol scores 88.8%, ahead of GPT-5.5 at 88.0%. Claude Opus 4.8 scores 78.9% on the same benchmark. Claude Sonnet 5 had not been fully evaluated on Terminal-Bench as of this writing. Overall model quality scores from LMSys as of June 2026 put Claude Opus 4.8 at 67.9 ahead of GPT-5.5 at 62.9.
Why is GPT-5.6 only available to 20 partners and not the public?
OpenAI coordinated GPT-5.6 Sol with the US government before the public announcement and provided access to vetted partners first, in part to avoid the situation Anthropic faced with Fable 5, which was hit with an export control order three days after launch after Amazon researchers found a jailbreak. By coordinating with government beforehand, OpenAI avoided a surprise restriction. The tradeoff is a staged rollout with general availability expected mid-July.
What is the difference between Sol, Terra, and Luna in the GPT-5.6 family?
Sol is the frontier capability tier for tasks where quality is the only constraint. Terra is the balanced model for everyday business automation, coding assistance, and document processing. Luna is the speed and cost tier for high-volume pipelines where latency and economics matter more than frontier reasoning. Unlike previous OpenAI model naming that used version suffixes like mini or nano, Sol, Terra, and Luna are intended as stable tier identifiers that persist across model improvements within each tier.
Free Weekly Briefing
The AI & Dev Briefing
One honest email a week — what actually matters in AI and software engineering. No noise, no sponsored content. Read by developers across 30+ countries.
No spam. Unsubscribe anytime.
More on AI
All posts →ChatGPT Had 90% of the US Enterprise AI Market in 2025. Claude Now Has 70%. What Happened in 12 Months.
In February 2025, ChatGPT held 90% of the US business AI market. By February 2026, Claude enterprise share surged to nearly 70%. Here is what drove the shift and what it means for developers choosing AI platforms.
OpenAI Just Launched GPT-5.4. Native Computer Use, 1 Million Token Context, 33% Fewer Errors. Here Is What Changes for Developers.
OpenAI released GPT-5.4 on March 5, 2026 with native computer use — AI agents that operate desktop and web apps without wrapper code. 1 million token context, 33% fewer errors. Here is what this means for every developer building AI agents.
RAG in Production 2026: Chunking Strategies, Embedding Costs, and What Actually Works at Scale
Most RAG tutorials show you how to build a demo. This post covers what breaks in production: chunking at 512 tokens beats semantic splitting, embedding costs range from $0.02 to $0.18 per million tokens, re-ranking boosts precision by 18–42%, and agentic RAG is now the 2026 standard. A practical guide for developers shipping RAG to real users.
Microsoft Launches 7 In-House AI Models at Build 2026 to Cut OpenAI Reliance
Microsoft unveiled 7 in-house MAI models at Build 2026: MAI-Thinking-1 (35B active params, no OpenAI data) and MAI-Code-1-Flash now live in GitHub Copilot and VS Code.
Free Tool
Will AI replace your job?
4 questions. Get a personalised developer risk score based on your stack, role, and what you actually build day to day.
Check Your AI Risk Score →Written by
Software Engineer based in Delhi, India. Writes about AI models, semiconductor supply chains, and tech geopolitics — covering the intersection of infrastructure and global events. 996+ posts cited by ChatGPT, Perplexity, and Gemini. Read in 167 countries.
