Sovereignty beats price: the new procurement signal
Angelika Beierlein
8 min read The German federal government has commissioned SAP and Deutsche Telekom to build its central ...
6 min Reading Time
Google has released an AI model that leads on 13 of 16 benchmarks – and sells it for a fraction of the market price. Anyone mistaking this for a discount promotion has missed Google’s strategy entirely. Gemini 3.1 Pro is not a mass-market product. It’s a signal: to the industry, to competitors, and to everyone trying to understand where the AI race is really headed.
This matters for IT leaders – not because they need to deploy Gemini 3.1 Pro tomorrow, but because its model architecture and pricing policy raise fundamental questions: Which AI model belongs in which pipeline – and when? And who holds the competence to decide? That answer is already strategic, not technical.
Benchmarks are seductive. They suggest the model with the highest scores is automatically the best choice for every use case. That’s false. Gemini 3.1 Pro is explicitly engineered for deep reasoning – solving logical problems it has never encountered before. That’s a fundamentally different discipline than what Anthropic optimized Claude Opus for: agent-based workflows.
Concretely: If you need a model to structure documentation, scan code repositories, and coordinate multiple external tools, Gemini 3.1 Pro alone won’t deliver optimal results. For those tasks, specialized competitors may hold situational advantages. The benchmark table doesn’t lie – it just answers the wrong question.
Why can Google offer a top-tier model at low prices? The answer lies not solely in model architecture – but in its vertical stack of TPU chips, data centers, and billion-user base. Google distributes AI capabilities across Search, Android, and Google Workspace – a level of cost control no pure-play AI lab can replicate.
DeepMind adds further leverage. The research division that revolutionized protein-structure prediction with AlphaFold drives foundational research. Google isn’t solving isolated applications. Google is attempting to solve intelligence itself – then waiting for that intelligence to resolve all other problems.
For CIOs, this means Google isn’t playing the same game as Anthropic or OpenAI. Anyone planning an AI strategy, should take this distinction seriously – because it reshapes the dependencies and lock-in risks tied to any model decision.
The central question is no longer: Which AI model is the best? It’s: What kind of problem do you need to solve? These categories differ fundamentally – and should guide every model decision.
Pure reasoning tasks – scientific analysis, legal case review, complex root-cause investigation – are Gemini 3.1 Pro’s sweet spot. High-volume, low-complexity tasks benefit from other approaches: fast, cost-efficient models with strong instruction-following.
Coordination problems – workflows requiring orchestration across multiple tools, APIs, and data sources – demand models with robust tool interaction. And then there are problems no current model handles reliably: ambiguity resolution, emotional intelligence, genuine judgment in unstructured situations. Honest AI architecture must account for those too.
Be honest: How many IT departments have systematically embedded these distinctions into their AI architecture? AI adoption in the German economy is growing – but the strategic depth of model selection lags behind.
The increasing fragmentation of the AI landscape creates a new requirement: model routing. Selecting the right model for the right problem at the right time is becoming the decisive operational capability – comparable to load balancing in network architecture. A router that sends all traffic to a single resource isn’t a router. It’s a bottleneck.
In practice, many enterprises still operate with one preferred model – out of habit, procurement decisions, or because evaluation is labor-intensive. That was acceptable when models were relatively homogeneous. With Gemini 3.1 Pro, Claude Opus, and a growing number of specialized models, this simplification is now a competitive disadvantage.
Three things commonly go wrong: First, models are selected by brand recognition – not task profile. Second, internal frameworks defining which problem categories matter most to the business are missing. Third, quality control of model outputs is underestimated – especially when models generate answers that sound highly plausible but are factually incorrect.
“Solve intelligence, then use that to solve everything else.”
– Demis Hassabis, CEO Google DeepMind, Nobel Laureate 2024
The more powerful AI models become, the more critical it is to critically assess their outputs. This isn’t a truism. It’s a structural shift in the competencies required of IT leaders and their teams – and it changes which skills are in demand within AI-integrated organizations.
Gemini 3.1 Pro can produce reasoning chains that appear convincing at first glance. Whether the conclusion holds up, whether premises are correctly established, whether the problem was even framed accurately – no model evaluates that itself. That remains a human responsibility.
Domain expertise, error-assessment ability, and awareness of a model’s limitations thus become key competencies – not as substitutes for technical understanding, but as essential complements. Similar dynamics emerge where companies roll back AI automation after underestimating the need for human judgment.
Gemini 3.1 Pro isn’t an assault on the mass market. It’s a long-term statement: Foundational research beats feature competition. Whoever solves intelligence first wins everything else along with it. For Google, this isn’t hope – it’s a blueprint already proven by DeepMind’s AlphaFold.
For IT decision-makers, this means auditing your own AI architecture now. Which problem types dominate your workflows? Where do you need deep reasoning, where rapid execution, where tool coordination? Answering those questions concretely leads to better model decisions – regardless of which benchmark ranking dominates headlines today.
Sounds good – but it only works if companies stop treating AI as a plug-and-play solution, and start evaluating it like any other complex IT tool: with specifications, test protocols, and clearly defined operating conditions.
“It’s a long-term statement: Foundational research beats feature competition.”
Gemini 3.1 Pro is optimized for deep reasoning – solving complex, novel logical problems. Claude Opus leans toward agent-based workflows; OpenAI models show strength in coding pipelines. No model is universally superior; suitability depends entirely on the specific task profile.
Google designs its own TPU chips, operates proprietary data centers, and distributes AI across its billion-user base in Search, Android, and Workspace. This vertical integration enables cost control no pure-play AI labs like Anthropic or OpenAI can match.
Model routing refers to the ability to select and deploy the optimal AI model for a given task. As models grow increasingly differentiated, this capability becomes an operational core competency – akin to load balancing in network architecture. Relying on a single model forfeits performance and cost efficiency.
Its clear sweet spot is deep reasoning tasks: scientific analysis, legal case review, complex root-cause investigation, and logical problem-solving in unfamiliar contexts. For high-volume, simple tasks – or multi-tool orchestration – specialized models often perform better.
The three most frequent errors: choosing models by brand recognition rather than task fit; lacking an internal framework to classify relevant problem types; and underestimating quality control for outputs that sound plausible but are factually wrong.
Domain expertise, error-assessment ability, and awareness of model limitations are becoming key competencies. As models grow more capable, the human capacity to critically interpret their outputs grows more vital – not as a replacement for technical fluency, but as an indispensable complement.
Google bets on foundational research over feature competition. The thesis: Whoever solves the problem of intelligence first wins all adjacent markets. For the AI market, this implies a structural advantage for vertically integrated players – and mounting pressure on pure-model vendors without proprietary infrastructure.
“The thesis: Whoever solves the problem of intelligence first wins all other markets along with it.”
Header Image Source: Pexels / Google DeepMind