The Model Is Only 10%: The Real Lesson of the New SDLC

📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent whitepaper emphasizes that in AI-assisted software development, the core value lies in configuring and controlling the environment around the model, not the model itself. This shifts focus from model innovation to harness design and context engineering.

A new Google whitepaper, “The New SDLC With Vibe Coding”, asserts that the AI model accounts for only about 10% of system behavior in AI-driven development. The paper emphasizes that the real control lies in the harness and context engineering, which constitute the remaining 90%. This insight challenges the common focus on the latest models and suggests a strategic shift for developers and organizations.

The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, states that the biggest shift in software engineering is moving from writing code to expressing intent and trusting machines to interpret that intent. It highlights that 85% of professional developers now use AI coding agents, with 51% doing so daily and about 41% of all new code being AI-generated.

The core message: the model itself is only a small part of the system. The paper provides evidence that changing the harness—prompts, tools, rules—can significantly improve agent performance. For example, a team improved a coding agent’s ranking from outside the Top 30 to Top 5 by tweaking only the harness, not the model. Similarly, a LangChain experiment increased performance by 13.7 points through prompt and middleware adjustments.

It underscores that most failures are configuration issues, such as missing tools or vague rules, rather than model limitations. Consequently, the surface area for strategic advantage is in harness design and context management, not in acquiring the latest models.

At a glance
reportWhen: published early 2026
The developmentThe new Google whitepaper highlights that only 10% of system behavior depends on the AI model, with the remaining 90% determined by harness and context configuration.
The Model Is Only 10% — The New SDLC With Vibe Coding
AI Dispatch · Field Notes
Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified
Vibe Coding
Casual prompts · “does it seem to work?” · disposable code · high risk
Structured AI-Assisted
Detailed prompts + constraints · manual testing · features in real codebases
Agentic Engineering
Formal specs · automated tests + evals + CI gates · production scale · low risk
Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.
The idea worth building your strategy around
Agent = Model + Harness
~10%
HARNESS — prompts · tools · context · hooks · sandboxes · observability
MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S
Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.
“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.
The economics: it’s a token-cost problem (CapEx vs OpEx)
Vibe Coding
Low CapEx · High OpEx
Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.
Agentic Engineering
High CapEx · Low OpEx
Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.
85%
of devs use AI coding agents (51% daily)
41%
of all new code is AI-generated
~90%
of agent behavior is the harness, not the model
+19%
longer on some tasks (METR) — verification is the cost
The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.
thorstenmeyerai.com

Why Focus on Harness and Context Matters

This shift redefines how organizations should invest in AI development. Instead of chasing the newest, most powerful models, companies should prioritize building robust harnesses and mastering context engineering. This approach offers a more cost-effective and controllable path to reliable AI systems, reducing token waste, maintenance costs, and security vulnerabilities. It also democratizes AI development, allowing teams to optimize existing models rather than depend solely on model improvements, which are often expensive and incremental.

Amazon

AI development environment configuration tools

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Background on AI Development and the SDLC Shift

Until early 2026, the focus in AI development was largely on acquiring and deploying advanced models, with many believing that model size and sophistication dictated performance. The whitepaper marks a paradigm shift, emphasizing that the true value lies in how models are integrated and controlled. Prior to this, the industry often overlooked the importance of configuration, prompts, and environment design, which now emerge as critical factors. The paper builds on recent trends of widespread AI adoption, with over half of developers using AI daily, and reflects a maturation in understanding AI’s role in software engineering.

“The model is only 10% of what determines behavior; the harness is 90%. Focus on configuration and context.”

— Addy Osmani

Amazon

AI prompt engineering software

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Unclear Aspects of Implementation and Industry Adoption

While the whitepaper provides compelling evidence that harness and context are key, it remains unclear how quickly organizations will adopt this perspective at scale. Specific best practices for building effective harnesses and managing context dynamically are still emerging, and the long-term impact on AI development costs and security is yet to be fully understood. Additionally, how this paradigm shift influences the competitive landscape across industries is still developing.

Amazon

AI model harness design tools

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Next Steps for Developers and Organizations

Organizations should evaluate their current AI workflows to identify opportunities for better harness and context engineering. Investing in tools and training focused on configuration, prompt engineering, and environment management will be crucial. Industry standards and best practices are expected to evolve quickly, and further research will clarify effective strategies. Monitoring how early adopters implement these principles will inform broader adoption and optimize AI development costs and reliability.

Amazon

AI system context management software

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of system behavior?

The whitepaper shows that most of an AI agent’s performance depends on how it is configured, including prompts, tools, rules, and environment setup, rather than the underlying model itself.

How can organizations improve their AI systems based on this insight?

By focusing on harness design—such as prompts, context management, and configuration—organizations can significantly enhance performance and reliability without needing the latest models.

Does this mean model development is less important?

Model development remains valuable, but the whitepaper emphasizes that harness and context engineering are more impactful for system behavior and cost-efficiency.

What are the risks of focusing on harness and context?

Potential risks include underestimating the importance of model improvements and misconfiguring environments, which could lead to security vulnerabilities or inconsistent results if not managed carefully.

Will this approach reduce AI development costs?

Yes, focusing on harness and context can lower token consumption, maintenance, and security remediation costs, making AI development more economical over time.

Source: ThorstenMeyerAI.com

You May Also Like

Delvasta: Forms That Build Themselves

Delvasta introduces a platform where forms build themselves using AI and branching logic to improve lead quality and user experience. Early access now available.

SpaceX launches 7.5-ton SiriusXM satellite as part of constellation refresh

SpaceX successfully launched a 7.5-ton SiriusXM satellite to enhance the satellite radio provider’s network, marking a key step in its constellation refresh.

Data: The One Thing You Can’t Rent

The industry faces a shift as data, the last un-rentable asset, becomes scarce and fenced, changing AI development dynamics and market power.

Forezai · TradingAgents: A Trading Firm Made of Agents

Forezai unveils TradingAgents, a multi-agent research framework mimicking a trading desk, emphasizing structured disagreement and oversight in AI-driven trading.