The Model Is Only 10%: The Real Lesson of the New SDLC

📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

The latest research emphasizes that AI system success depends more on configuration, harness, and context engineering than on the model itself. This shift impacts how organizations should approach AI development and investment.

A new Google whitepaper by Addy Osmani, Shubham Saboo, and Sokratis Kartakis states that the AI model constitutes only about 10% of what determines system behavior. The paper argues that harness, configuration, and verification are far more influential, shifting the focus from model development to system design and context engineering. This insight challenges common assumptions in AI development and suggests a strategic reorientation for organizations investing in AI tools.

The whitepaper, titled The New SDLC With Vibe Coding, emphasizes that the biggest shift in software engineering is moving from simply writing code to expressing intent and trusting machines to interpret that intent. According to the authors, 85% of developers use AI coding agents regularly, with 41% generating most new code via AI. However, the core message is that the model itself is only a small part of the system. The real value lies in the harness—the prompts, tools, rules, and context policies surrounding the model—which collectively shape behavior and performance.

Concrete evidence cited includes experiments where changing only the harness or prompts significantly improved agent performance, despite using the same underlying model. The whitepaper also highlights that failures are often due to configuration errors, missing tools, or vague rules, not the model’s capabilities. This reframes AI development as a cost-effective system design problem, where strategic investments in harness and context yield better results than chasing the latest models.

At a glance
reportWhen: published early 2026
The developmentA Google whitepaper introduces a new SDLC framework, highlighting that the AI model is only 10% of the system, with harness and verification comprising the rest.
The Model Is Only 10% — The New SDLC With Vibe Coding
AI Dispatch · Field Notes
Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified
Vibe Coding
Casual prompts · “does it seem to work?” · disposable code · high risk
Structured AI-Assisted
Detailed prompts + constraints · manual testing · features in real codebases
Agentic Engineering
Formal specs · automated tests + evals + CI gates · production scale · low risk
Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.
The idea worth building your strategy around
Agent = Model + Harness
~10%
HARNESS — prompts · tools · context · hooks · sandboxes · observability
MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S
Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.
“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.
The economics: it’s a token-cost problem (CapEx vs OpEx)
Vibe Coding
Low CapEx · High OpEx
Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.
Agentic Engineering
High CapEx · Low OpEx
Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.
85%
of devs use AI coding agents (51% daily)
41%
of all new code is AI-generated
~90%
of agent behavior is the harness, not the model
+19%
longer on some tasks (METR) — verification is the cost
The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.
thorstenmeyerai.com

Why AI System Design Focuses on Harness and Context

This shift matters because it redefines where organizations should allocate resources in AI development. Instead of prioritizing model upgrades, companies should invest in building robust harnesses, precise context management, and verification processes. This approach can lead to significant cost savings and more reliable, secure AI systems. It also underscores the importance of system architecture and configuration as critical factors in AI success, potentially democratizing AI deployment by reducing dependence on cutting-edge models.

MUCAR 892BT AI-Assisted Bidirectional Scan Tool, Full System OBD2 Scanner, Bi-Directional OBD2 Scanner Diagnostic Tool,ECU Coding, 35 Services, FCA Autoauth, CANFD and DOIP, Free Lifetime Upgrade

MUCAR 892BT AI-Assisted Bidirectional Scan Tool, Full System OBD2 Scanner, Bi-Directional OBD2 Scanner Diagnostic Tool,ECU Coding, 35 Services, FCA Autoauth, CANFD and DOIP, Free Lifetime Upgrade

【Powerful Performance】: OBD2 scanner, featuring an 8-inch ultra-large display, the MUCAR 892BT runs on Android 10 with a…

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Background of the Model-Centric Paradigm in AI Development

Historically, AI development has centered on acquiring and fine-tuning larger, more capable models. As of early 2026, the industry has seen widespread adoption of AI coding agents, with 85% of developers using them regularly. The prevailing belief has been that better models lead to better outcomes. However, recent experiments and the new whitepaper challenge this notion, demonstrating that system configuration and context engineering are more impactful than the model itself. This represents a paradigm shift from model-centric to system-centric AI development, emphasizing the importance of harness design and verification.

“The AI model constitutes only about 10% of what determines system behavior; the rest is harness, configuration, and verification.”

— Addy Osmani

AI Engineering: Building Applications with Foundation Models

AI Engineering: Building Applications with Foundation Models

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Unclear Aspects of the New SDLC Framework

While the whitepaper presents compelling evidence and experiments, the long-term implications of shifting focus from models to harness and context are still being evaluated. It remains unclear how this approach scales across different AI applications and industries, and whether it will lead to sustained improvements in AI reliability and cost-efficiency. Additionally, the specific strategies for optimizing harness design and context management are still emerging and not yet standardized.

AI-Native Software Delivery: Proven Practices to Produce High-Quality Software Faster

AI-Native Software Delivery: Proven Practices to Produce High-Quality Software Faster

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Next Steps for AI Development Strategies

Organizations are expected to reevaluate their AI investment strategies, prioritizing system architecture, harness, and context engineering. Further research and case studies will likely emerge to quantify the benefits of this approach across various domains. Industry leaders may develop new tools and frameworks to facilitate better harness design and verification processes, making AI deployment more reliable and cost-effective. Monitoring how these practices evolve will be crucial in understanding the future of AI development.

The Secure Harness: Shipping Production Code with AI Coding Agents

The Secure Harness: Shipping Production Code with AI Coding Agents

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of the system’s behavior?

According to the whitepaper, most AI behavior is shaped by how the model is configured, prompted, and integrated with tools and rules—collectively called the harness—which accounts for about 90% of the system’s outcome.

How does this shift affect AI development costs?

Focusing on harness and context engineering can reduce costs by minimizing token usage, improving system reliability, and decreasing maintenance and security expenses, making AI deployment more economical in the long run.

What practical steps should organizations take based on this insight?

Organizations should invest in designing robust harnesses, developing verification and evaluation processes, and managing context effectively, rather than solely chasing newer or larger models.

Does this mean models are no longer important?

Models remain foundational, but their role is now viewed as part of a larger system. The focus shifts toward how models are integrated, configured, and controlled within the system architecture.

Will this approach work for all AI applications?

The whitepaper suggests promising results, but further research is needed to confirm how universally applicable this system-centric approach is across different industries and use cases.

Source: ThorstenMeyerAI.com

You May Also Like

Here’s How iOS 27’s New Liquid Glass Slider Works

Apple’s iOS 27 update adds a slider to adjust Liquid Glass transparency, enhancing customization after community feedback on design clarity.

OpenEuroLLM. The third path.

OpenEuroLLM, Europe’s pan-European LLM project, faces significant compute resource challenges as it aims to develop multilingual models across 20 organizations.

Data: The One Thing You Can’t Rent

As data becomes scarce and expensive, AI companies face new barriers, shifting focus from compute to proprietary, verified data sources.

Forezai · TradingAgents: A Trading Firm Made of Agents

Forezai introduces TradingAgents, a novel multi-agent research framework mimicking a trading desk with specialized AI agents and oversight, emphasizing structured disagreement.