TL;DR

A recent Google whitepaper reveals that in AI-assisted software engineering, the AI model itself is only a small part of the system. The key to success lies in the harness and context engineering, which dominate system behavior and costs.

A new Google whitepaper titled The New SDLC With Vibe Coding argues that the AI model accounts for only about 10% of system behavior in AI-driven software development. The real expertise, the paper says, lies in designing the harness and managing context, which together shape roughly the remaining 90% of behavior and drive most of the cost. This shift has significant implications for how organizations approach AI integration.

The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, highlights that the dominant factor in AI system behavior is not the model itself but the surrounding harness — including prompts, tools, rules, and observability. Evidence from experiments shows that changing only the harness can dramatically improve performance, often more than switching models.
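To make the "same model, different harness" point concrete, here is a minimal sketch. Everything in it is an assumption made for illustration: the `Harness` class, the `CALL <tool> <argument>` reply format, and `stub_model` are hypothetical and do not come from the whitepaper or from any real agent framework.

```python
from dataclasses import dataclass, field
from typing import Callable

# Hypothetical type: any function that maps a prompt string to a reply string.
Model = Callable[[str], str]


@dataclass
class Harness:
    """Everything around the model: prompt, rules, tools, and a trace log."""
    system_prompt: str
    rules: list[str] = field(default_factory=list)
    tools: dict[str, Callable[[str], str]] = field(default_factory=dict)
    trace: list[str] = field(default_factory=list)  # minimal observability

    def build_prompt(self, task: str) -> str:
        rule_text = "\n".join(f"- {r}" for r in self.rules)
        tool_text = ", ".join(sorted(self.tools)) or "none"
        return (
            f"{self.system_prompt}\n"
            f"Rules:\n{rule_text}\n"
            f"Tools: {tool_text}\n"
            f"Task: {task}"
        )

    def run(self, model: Model, task: str) -> str:
        prompt = self.build_prompt(task)
        self.trace.append(f"prompt_chars={len(prompt)}")
        reply = model(prompt)
        # Toy tool protocol (an assumption): "CALL <tool> <argument>".
        if reply.startswith("CALL "):
            _, name, arg = reply.split(" ", 2)
            self.trace.append(f"tool={name}")
            if name not in self.tools:
                return f"error: tool '{name}' is not configured"
            return self.tools[name](arg)
        return reply


def stub_model(prompt: str) -> str:
    """Stand-in for a real model; it always asks for the same tool."""
    return "CALL word_count the model is only ten percent"


bare = Harness(system_prompt="You are a coding agent.")
equipped = Harness(
    system_prompt="You are a coding agent.",
    rules=["Run tests before finishing."],
    tools={"word_count": lambda text: str(len(text.split()))},
)

bare_result = bare.run(stub_model, "count the words")
equipped_result = equipped.run(stub_model, "count the words")
print(bare_result)
print(equipped_result)
```

The model function is identical in both runs. Only the harness configuration differs, and the bare harness fails because a tool is missing, which is the kind of configuration failure the paper describes.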

Furthermore, the paper introduces the concept of context engineering as a critical skill, involving the careful selection and structuring of instructions, knowledge, examples, and guardrails. The authors stress that effective context management can scale AI capabilities without increasing token costs exponentially.
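One way to picture context engineering is as a selection problem under a token budget. The sketch below is an assumption-laden illustration, not the paper's method: `ContextItem`, the priority scheme, the word-count token estimate, and the sample texts are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ContextItem:
    kind: str       # "instruction", "guardrail", "knowledge", or "example"
    text: str
    priority: int   # lower number = more important (hypothetical scheme)


def estimate_tokens(text: str) -> int:
    """Crude stand-in for a tokenizer: one token per whitespace-separated word."""
    return len(text.split())


def assemble_context(items: list[ContextItem], budget: int) -> tuple[str, list[ContextItem]]:
    """Greedily keep the highest-priority items that fit in the token budget."""
    kept: list[ContextItem] = []
    used = 0
    for item in sorted(items, key=lambda i: i.priority):
        cost = estimate_tokens(item.text)
        if used + cost <= budget:
            kept.append(item)
            used += cost
    # Group by kind in a fixed order so the prompt layout stays stable.
    order = ["instruction", "guardrail", "knowledge", "example"]
    kept.sort(key=lambda i: order.index(i.kind))
    prompt = "\n".join(f"[{i.kind}] {i.text}" for i in kept)
    return prompt, kept


items = [
    ContextItem("example", "Earlier chat transcript about an unrelated logging "
                           "refactor that nobody needs here.", 3),
    ContextItem("knowledge", "Invoices are rounded to two decimals.", 2),
    ContextItem("instruction", "Fix the failing test in the billing module.", 0),
    ContextItem("guardrail", "Never edit files outside src/billing.", 1),
]

prompt, kept = assemble_context(items, budget=20)
print(prompt)
```

With a budget of 20 estimated tokens, the instruction, guardrail, and knowledge items fit and the noisy transcript is dropped. A real system would use the provider's tokenizer and a smarter relevance score, but the shape of the decision is the same.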

Economically, the whitepaper warns that vibe coding — minimal prompts and loose oversight — appears cheap initially but incurs high long-term costs due to token inefficiency, maintenance, and security risks. In contrast, disciplined agentic engineering involves upfront investment in design and testing, resulting in lower marginal costs over time.
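The cost argument can be written as a simple break-even calculation. The numbers below are invented for illustration: the unit costs are hypothetical, and the 5x per-feature ratio is only an assumed point inside the 3–10× range the paper cites.

```python
from typing import Optional


def total_cost(upfront: int, per_feature: int, features: int) -> int:
    """Total spend after shipping a number of features, in arbitrary cost units."""
    return upfront + per_feature * features


def break_even(
    vibe_per_feature: int,
    agentic_upfront: int,
    agentic_per_feature: int,
    max_features: int = 1000,
) -> Optional[int]:
    """Return the first feature count at which the agentic approach is strictly cheaper."""
    for n in range(1, max_features + 1):
        agentic = total_cost(agentic_upfront, agentic_per_feature, n)
        vibe = total_cost(0, vibe_per_feature, n)  # vibe coding: no upfront investment
        if agentic < vibe:
            return n
    return None


# Hypothetical figures: vibe coding costs 5 units per feature with nothing upfront,
# agentic engineering costs 40 units upfront and 1 unit per feature.
crossover = break_even(vibe_per_feature=5, agentic_upfront=40, agentic_per_feature=1)
print(crossover)
```

Under these assumed figures the two approaches cost the same at 10 features, and the disciplined approach is cheaper from the 11th feature onward. If the per-feature costs were equal, the function would return `None`, because the upfront investment would never pay back.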

At a glance

Report

When: published March 2026

The development: The whitepaper argues that the core of the modern SDLC shifts from model development to harness and context engineering, with the model representing only about 10% of system influence.

The Model Is Only 10% — The New SDLC With Vibe Coding

AI Dispatch · Field Notes

Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified

Vibe Coding

Casual prompts · “does it seem to work?” · disposable code · high risk

Structured AI-Assisted

Detailed prompts + constraints · manual testing · features in real codebases

Agentic Engineering

Formal specs · automated tests + evals + CI gates · production scale · low risk

Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.
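As a rough illustration of "tests plus evals" acting as a single gate, consider the sketch below. The function under test, the keyword-based eval metric, the 0.8 threshold, and the stub generators are all assumptions chosen to keep the example small. Real evals typically use richer scoring.

```python
from typing import Callable


def slugify(title: str) -> str:
    """Deterministic code under test (a hypothetical example function)."""
    return "-".join(title.lower().split())


def run_tests() -> bool:
    """Tests: exact, repeatable checks on deterministic behavior."""
    return slugify("The New SDLC") == "the-new-sdlc"


def keyword_score(answer: str, required: list[str]) -> float:
    """Toy eval metric: fraction of required keywords present in the answer."""
    hits = sum(1 for word in required if word.lower() in answer.lower())
    return hits / len(required)


def run_evals(
    generate: Callable[[str], str],
    cases: list[tuple[str, list[str]]],
    threshold: float,
) -> bool:
    """Evals: score non-deterministic outputs and compare the mean to a threshold."""
    scores = [keyword_score(generate(prompt), required) for prompt, required in cases]
    return sum(scores) / len(scores) >= threshold


def ci_gate(
    generate: Callable[[str], str],
    cases: list[tuple[str, list[str]]],
    threshold: float = 0.8,
) -> bool:
    """Pass only if both the deterministic tests and the evals pass."""
    return run_tests() and run_evals(generate, cases, threshold)


cases = [
    ("Summarize the paper's main claim.", ["harness", "model"]),
    ("What dominates agent behavior?", ["harness"]),
]


def good_agent(prompt: str) -> str:
    """Stub generator that covers the required keywords."""
    return "The harness shapes most behavior; the model is a small part."


def weak_agent(prompt: str) -> str:
    """Stub generator that misses the required keywords."""
    return "It depends."


print(ci_gate(good_agent, cases))
print(ci_gate(weak_agent, cases))
```

The deterministic test passes in both runs. Only the eval distinguishes the two generators, which is why the gate needs both checks.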

The idea worth building your strategy around

Agent = Model + Harness

MODEL ~10%

HARNESS ~90%: prompts · tools · context · hooks · sandboxes · observability

~90% is your surface area, not the provider’s

Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.

“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.

The economics: it’s a token-cost problem (CapEx vs OpEx)

Vibe Coding

Low CapEx · High OpEx

Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.

Agentic Engineering

High CapEx · Low OpEx

Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.
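Model routing can be as simple as a heuristic that sends easy work to a cheaper tier. The sketch below is a hypothetical illustration: the tier names, prices, keyword hints, and file-count threshold are all invented, and nothing here reflects a real provider's API or pricing.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelTier:
    name: str                  # placeholder label, not a real model ID
    cost_per_1k_tokens: float  # hypothetical price in arbitrary units


CHEAP = ModelTier("small-model", 0.1)
STRONG = ModelTier("large-model", 2.0)

# Hypothetical keywords that suggest a task needs the stronger tier.
HARD_HINTS = ("refactor", "architecture", "security", "migrate")


def route(task: str, files_touched: int) -> ModelTier:
    """Send a task to the strong tier only when a simple heuristic flags it as hard."""
    looks_hard = any(hint in task.lower() for hint in HARD_HINTS)
    if looks_hard or files_touched > 3:
        return STRONG
    return CHEAP


def estimated_cost(tier: ModelTier, tokens: int) -> float:
    """Estimated spend for a request of the given size on the given tier."""
    return tier.cost_per_1k_tokens * tokens / 1000


for task, files in [("Rename a variable", 1), ("Refactor the auth module", 1)]:
    tier = route(task, files)
    print(task, "->", tier.name, estimated_cost(tier, 2000))
```

A keyword heuristic is deliberately naive. The point is only that the routing decision lives in the harness, so it can be tuned or replaced without changing models.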

85% of devs use AI coding agents (51% daily)

41% of all new code is AI-generated

~90% of agent behavior is the harness, not the model

+19% longer on some tasks (METR); verification is the cost

The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.

thorstenmeyerai.com

Why Focus on Harness and Context Matters

This shift redefines AI development priorities. Instead of obsessing over the latest model, organizations should invest in building robust harnesses and mastering context engineering. These elements are where durable competitive advantages can be created, as they are more controllable and customizable than the underlying models. The insight challenges the common perception that model quality alone drives AI success, emphasizing instead the importance of configuration and design.

For businesses, this means that effective AI deployment hinges on system architecture, tooling, and process design. It also impacts costs, security, and maintainability, making a disciplined approach more economically viable in the long run.

Background on AI in Software Development

As of early 2026, AI-assisted coding has become mainstream: by the paper's figures, 85% of developers use AI coding agents (51% daily), and 41% of all new code is AI-generated. The industry has moved rapidly from simple prompt-based vibe coding toward more structured, agentic engineering practices, driven by the recognition that system behavior is shaped more by configuration than by model improvements.

The whitepaper builds on prior trends of integrating AI into the SDLC, emphasizing that the focus has shifted from developing new models to optimizing how they are used and controlled. This aligns with broader industry observations that AI system performance depends heavily on how well it is configured and monitored.

“The biggest shift in software engineering isn’t a new language or framework — it’s moving from writing code to expressing intent and trusting machines to turn that into working software.”
— Addy Osmani

Unclear Aspects of Model-Harness Dynamics

While the whitepaper presents compelling evidence that harness and context dominate system behavior, it does not specify precise thresholds for different types of AI applications or how these principles scale across varied industries. The long-term impact on AI model development priorities remains to be fully validated in real-world deployments.

Additionally, the optimal balance between upfront investment in design versus iterative tuning is still under discussion, with some experts questioning whether the 90/10 split applies universally.

Next Steps for AI Development and Adoption

Organizations should prioritize developing robust harnesses and mastering context engineering, investing in tooling, testing, and training. Industry leaders are likely to focus on building standards and best practices for configuration management and system architecture.

Further research and case studies will clarify how these principles perform across different domains and whether the 90/10 split holds in practice. Expect ongoing innovation in tools that facilitate better harness and context management, along with evolving cost models.

Key Questions

Why is the model only 10% of system behavior?

According to the whitepaper, the AI model itself provides the core capabilities, but the surrounding harness — prompts, tools, rules, and observability — determines how effectively those capabilities are applied, making it the dominant factor.

How does this shift affect AI development costs?

While initial investments in designing harnesses and contexts might be higher, this approach reduces long-term operational costs by improving efficiency, security, and maintainability, often lowering marginal costs compared to vibe coding.

Can organizations just focus on the model instead?

No. The whitepaper emphasizes that focusing solely on the model ignores the majority of the system’s behavior. Effective configuration and system design are more impactful for performance and cost management.

What skills are most important for future AI teams?

Skills in system architecture, context engineering, configuration management, and testing are increasingly vital, as they determine how well AI systems perform in real-world settings.

Will this change how AI tools are built?

Yes. Developers will prioritize building flexible, configurable harnesses and tools that enable precise control over AI behavior, rather than focusing solely on model improvements.

Source: ThorstenMeyerAI.com


Author

PepperEyes Team
