LIVE
South Korea rally to beat Czechia 2-1 on World Cup opening dayCheaper, faster, and culturally aware, Avataar's video AI is built for India's scaleA New Vaccine Was Designed by AI and Safey Tested on HumansSpaceX raising $75 billion in record-setting IPO as Nasdaq debut awaits'Massive body blow' as PM loses his defence secretary - and another resignation followsUntil Dawn Characters Will Never Not Look Cursed, I GuessShinyHunters Exploits Oracle PeopleSoft Zero-Day (CVE-2026-35273) to Breach UniversitiesElon Musk's SpaceX prices shares at $135, raising $75 billion in largest-ever IPOBluesky launches group chats, as company shifts focus to community featuresTed Cruz and Ron Wyden try to fight censorship with bipartisan JAWBONE ActScientists Measure Earth’s Vast Underground Fungal Webs'The Love Hypothesis' Sets September Streaming Date On Prime VideoWhy this will be a World Cup like no otherNOAA Issues El Nino AdvisoryHome Sales Just Dropped in New York and 2 Other Major Cities. Here’s What’s Driving the Surprising SlumpSouth Korea rally to beat Czechia 2-1 on World Cup opening dayCheaper, faster, and culturally aware, Avataar's video AI is built for India's scaleA New Vaccine Was Designed by AI and Safey Tested on HumansSpaceX raising $75 billion in record-setting IPO as Nasdaq debut awaits'Massive body blow' as PM loses his defence secretary - and another resignation followsUntil Dawn Characters Will Never Not Look Cursed, I GuessShinyHunters Exploits Oracle PeopleSoft Zero-Day (CVE-2026-35273) to Breach UniversitiesElon Musk's SpaceX prices shares at $135, raising $75 billion in largest-ever IPOBluesky launches group chats, as company shifts focus to community featuresTed Cruz and Ron Wyden try to fight censorship with bipartisan JAWBONE ActScientists Measure Earth’s Vast Underground Fungal Webs'The Love Hypothesis' Sets September Streaming Date On Prime VideoWhy this will be a World Cup like no otherNOAA Issues El Nino AdvisoryHome Sales Just Dropped in New York and 2 Other Major Cities. Here’s What’s Driving the Surprising Slump
AI

AI agents keep giving confident wrong answers. The context layer is enterprise AI's next production problem.

Photo by Immo Wegmann on Unsplash

Enterprise artificial intelligence deployments are entering a critical new phase of operational failure, one that has little to do with the sophistication of the underlying language models themselves. The problem emerged with stark clarity at Snowflake Summit 26 in San Francisco, where the data cloud company unveiled a comprehensive response to what has become a silent production crisis: the same business data, when queried through different agents, tools or systems, returns fundamentally different answers. The fundamental issue centres on semantic fragmentation. When a single metric such as revenue exists simultaneously within a business intelligence dashboard, a SQL database table, and an agent's instruction set, each system interprets it according to its own embedded logic. This architectural inconsistency means enterprises deploying multiple AI agents across their infrastructure face an untenable situation where automated systems confidently deliver contradictory answers drawn from identical underlying data. The problem accelerated as organisations invested heavily in retrieval infrastructure over the past two years, optimising for speed and cost reduction in vector search without establishing shared definitions of what the data actually represents. Snowflake's response—a layered context system called Horizon Context and Cortex Sense—represents the first major vendor acknowledgment that data retrieval infrastructure alone cannot solve semantic coherence at enterprise scale.

The semantic layer concept is not new to technology infrastructure, yet the emergence of agentic AI has fundamentally changed the cost of failure. For years, semantic layers existed as optional architectural components, valuable but not critical to core business operations. Enterprises could tolerate inconsistency between their data warehouse and their analytics tools because humans performed the ultimate verification step, catching errors before they triggered automated actions. Agentic systems eliminate this human review checkpoint. When an AI agent makes a decision—allocating resources, processing transactions, prioritising customer communications—based on incorrect semantic interpretation, the consequences cascade immediately and at scale. The timing of Snowflake's announcement reflects industry recognition that hybrid retrieval architectures, which combine multiple search methods to optimise for different data types and query patterns, have become standard enterprise practice. VentureBeat's Q1 2026 survey data of organisations with more than 100 employees reveals the acceleration of this shift: hybrid retrieval intent nearly tripled in just two months, rising from 10.3% in January to 33.3% in March, representing the fastest-growing strategic position in the entire dataset. This rapid adoption explosion means enterprises are building production systems that compound the semantic fragmentation problem precisely as stakes for consistency rise. Christian Kleinerman, Executive Vice President of Product at Snowflake, distilled the core challenge in a single observation: organisations can ask AI agents questions and receive answers delivered with complete confidence, but correctness remains an entirely separate matter.

Snowflake's architectural response bifurcates context management into two distinct layers, each serving a different function within enterprise data governance. Horizon Context functions as the customer-managed layer, built on Snowflake's prior acquisition of Select Star, the data catalogue specialist. This layer aggregates metadata from diverse enterprise systems including Postgres databases, SQL Server, Tableau visualisation platforms and Power BI dashboards, consolidating them into a unified Horizon Catalog. The critical advantage is that every agent, business intelligence tool and external system queries this shared governed definition rather than reasoning independently over raw physical schemas. Semantic View Autopilot extends this capability by automatically creating and refining semantic views over time, allowing curated business logic to evolve without constant manual intervention. Cortex Sense operates as the platform-derived layer, automatically building and enriching context from customer data and usage patterns on an ongoing basis. Unlike Horizon Context, which requires explicit customer declaration and curation, Cortex Sense functions implicitly, improving the baseline user experience before any team has invested effort in semantic authoring. Kleinerman distinguished between the approaches with precision: Horizon Context encompasses everything that customers explicitly declare, while Cortex Sense represents anything the platform derives implicitly. The two layers feed into Snowflake's existing retrieval infrastructure, with Cortex Search serving as the retrieval-augmented generation implementation that pipes context-enriched information into both CoCo and Cowork agent platforms. Notably, Snowflake is committing to interoperability through the Open Semantic Interchange framework, making customer-declared definitions portable across third-party catalogues and tools rather than locked within Snowflake's ecosystem.

The practical implications for enterprises deploying agent systems today centre on a fundamental operational reality: agent reliability depends entirely on data semantic accuracy, not model capability. An enterprise deploying a state-of-the-art language model atop inconsistent business definitions will consistently receive confident wrong answers, making the deployed system actively unreliable rather than merely imperfect. This creates immediate problems for common enterprise use cases. A customer service agent trained to resolve inquiries might calculate customer lifetime value using definitions that diverge from the finance system's calculations, leading to systematically incorrect eligibility determinations across thousands of daily interactions. A supply chain optimisation agent might interpret inventory thresholds differently than the warehouse management system, creating phantom shortages or safety stock violations. A sales intelligence agent might parse commission structures inconsistently, leading to disputes with the sales organisation or incorrect forecasting. These are not hypothetical edge cases but predictable failure modes that emerge as soon as agents touch real enterprise data. The context layer problem becomes particularly acute because these failures are invisible until they scale. A single agent might behave reasonably when manual oversight exists, but deploying identical agent logic across multiple business units reveals inconsistencies that expose the underlying semantic fragmentation. Mike Leone, Vice President and Principal Analyst at Moor Insights and Strategy, identifies a critical danger: most vendors currently marketing context layer solutions as drop-in fixes are systematically overpromising their capabilities. When enterprises actually integrate these solutions into production systems, Leone warns, they typically discover how profoundly messy their existing data definitions and semantic foundations already are, a reckoning many organisations are about to experience simultaneously as agent adoption accelerates.

The competitive landscape around context layers reveals a market-wide recognition that the problem Snowflake is targeting represents the actual battleground for agentic AI viability. Microsoft has opened its Fabric IQ business ontology through model context protocol, enabling any vendor's agents to draw from a shared semantic layer rather than building proprietary definitions. Redis launched Iris, positioning itself as a context and memory platform sitting between agents and enterprise data, redesigned specifically for agent-scale retrieval volumes. Pinecone has repositioned itself from vector database company to knowledge engine, introducing Nexus, which precompiles enterprise data into task-specific artifacts before agents ever query it. Devin Pratt, Research Director at IDC, confirms that Snowflake's direction aligns with where the entire market is heading, with the context layer becoming the actual focus rather than model sophistication. Pratt emphasises that the real architectural advantage of Snowflake's approach is the deliberate split between explicit and implicit context. Horizon Context captures what teams actively declare and curate, while Cortex Sense captures what the platform discovers automatically. Because these two categories carry different confidence levels and governance requirements, treating them separately represents a more honest architectural acknowledgment than pretending all context is equivalent. Leone reinforces this assessment, arguing that the inability to trust explicit and implicit context equally would make a unified layer approach fundamentally flawed. The real test of viability, in Leone's estimation, becomes whether Snowflake can demonstrate that these two layers reconcile cleanly and provide complete lineage showing where every answer originates, moving beyond theoretical promise to demonstrable trustworthiness.

Enterprises evaluating context layer solutions must now assess along three specific technical dimensions that separate viable solutions from those likely to stall. Governance and lineage capabilities must be built into the core architecture rather than bolted on afterward, allowing teams to audit exactly why an agent produced its specific answer. Portability prevents vendors from capturing enterprise semantics, ensuring context and policy are not locked to a single provider, creating exit pathways and supporting multi-vendor agent ecosystems. Measurable accuracy requires context layers to establish quantifiable performance benchmarks that can be reused across different agents and tools, avoiding the creation of new semantic silos. Pratt articulates the fundamental requirement sharply: enterprises do not need another silo of semantics but rather a context layer that is governed, portable and trustworthy enough to audit completely. Looking forward, organisations should monitor three