The Future of Search

Part of the Beyond SEO series — on what search optimisation was never designed to solve.

The Split Is Already Happening


The future of search is not approaching.

It is not a wave on the horizon, not a disruption to prepare for, not an AI-era transformation that will reshape the landscape in the years ahead.

It is here. It is visible. And it has already divided into two irreconcilable realities that are moving away from each other at speed.

The first reality: search as retrieval. Faster, flatter, more automated. A query enters. A pattern matches. A generated summary arrives. The person gets an answer shaped like what they asked for and leaves with the surface of the thing they needed. The infrastructure improves constantly. The depth does not.

The second reality: search as recognition. Slower, rarer, structurally different. A person encounters something that understands what they came for before they could articulate it. They do not leave. They stay, return, refer others, and eventually build a business relationship with the source — because no other source has demonstrated that level of structural understanding.

These two realities are not competing for the same territory.

The first is getting more crowded and less valuable. The second is becoming scarcer and more valuable by the same measure.

The organisations that understand this are not optimising for the first reality. They are building the second.


What Search Was Always Actually Doing

Before the algorithm, before the keyword, before the backlink economy — there was a person with a need and another person, or institution, or body of knowledge, that either met that need or did not.

The search engine was not an invention of a new behaviour. It was an infrastructure layer placed over an ancient one: the human act of finding the thing that understands you.

What the infrastructure layer added was scale and speed. What it subtracted, structurally, was depth — because depth requires the kind of structural mapping that keyword matching was never designed to perform.

A person typing “what marketing has the highest ROI” is not asking about marketing channels.

They are asking: has anyone in this industry ever been held accountable for an outcome I can actually use?

The query is the compressed surface of that question. The search engine matches the surface. The person gets a comparison table of channel ROI percentages, reads it without recognition, and moves on — carrying the real question, still unanswered, into the next search.

This has been happening, at enormous scale, for three decades.

The PAA (People Also Ask) data makes the accumulation visible. A person who begins with “SEO ranking” and arrives, through the logic of the market’s own question structures, at “how do I retire on two million dollars” — that person has conducted an extended search session in which the real question was never once addressed.

Not because the search engine failed. Because the search engine succeeded perfectly at what it was built to do — and what it was built to do was never the same as what the person needed.


The AI Acceleration of a Pre-Existing Problem

Large language models did not create the shallow search problem.

They inherited it, absorbed it into their training data, and accelerated it by an order of magnitude.

Every AI Overview, every generated summary, every auto-completed answer is drawing from a corpus built from the first reality: the vast, interlocking network of content produced to match queries rather than to answer the question underneath them.

The LLM is extraordinarily competent at synthesising this corpus. It produces answers that are coherent, well-structured, and accurate at the surface level — because the surface level is what the corpus contains.

What the corpus does not contain in sufficient density to retrieve is structural understanding. The frameworks that name what the field has not yet named. The vocabulary that addresses the question underneath the question. The entities that an entire domain relies upon but has never formally defined.

This is not a failure of the models. It is a data distribution problem.

The content that would train a model to answer at depth — structured, original, vocabulary-constructing content built from expert interviews and structural analysis — is rare precisely because it has never been rewarded by the ranking system that generated the training corpus.

The system trained on its own output. The output reflected the system’s incentives. And the incentives were always for surface, never for depth.

This is the recursive loop that the future of search must break.


The Split Made Visible: Two Searches Happening Simultaneously

Open any browser. Type a query on a professional topic.

What returns is a landscape that, in 2026, has divided visibly into two categories:

Category One: Retrieval content. Well-formatted, benchmark-dense, structurally familiar. It answers the vocabulary of the query with competent precision. It is increasingly indistinguishable in format from AI-generated output — because much of it now is, and because the human-written content that preceded it was already following the same structural template that the AI learned from. It serves the query. It does not serve the need.

Category Two: Structural content. Rare. Often found not at position 1 but through referral, through community, through the recommendation of someone whose judgment is trusted. It does not answer the query. It addresses the domain — and in doing so, it addresses the question the query was compressing. The person who encounters it recognises immediately that something different is happening. They stay. They share. They return.

The first category is getting more efficient. Production costs are approaching zero. Volume is approaching infinity. The value of any individual piece within it is approaching zero by the same logic.

The second category is not scaling in the same direction. Its production cost is not declining, because its source is not mechanical reproduction — it is the concentrated application of genuine domain expertise to the mapping of what has not yet been mapped. That expertise does not become cheaper as models improve. It becomes more distinctive.

This is the split. It is not coming. It is the current state of every professional search vertical, visible to anyone with the conceptual vocabulary to see it.


What LLMs Actually Reward — and What Most Missed

Here is the development that the reproduction-driven SEO world has almost entirely misread.

The transition from keyword-ranking search to LLM-mediated retrieval was understood, broadly, as a threat to existing content. The question the industry asked was: how do we optimise for AI Overviews? How do we appear in the generative result? How do we survive the zero-click search?

These are retrieval questions. They assume that the goal is still to appear at the surface of the first reality.

What they missed is the structural property of how LLMs actually retrieve and cite authoritative sources.

A language model doing retrieval-augmented generation does not simply match keywords. It looks for sources that address a concept at the level of structure — sources that define entities, establish relationships between them, and provide the kind of framework that allows the model to reason about a domain rather than merely quote from it.

The content that LLMs return to, cite repeatedly, and treat as the reference architecture for a domain is not the content that ranks highest in the current keyword system. It is the content that most precisely names what the domain needs named.
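The contrast between keyword matching and structure-level retrieval can be sketched in a toy example. Everything below is invented for illustration — the two documents, the three-dimensional “embeddings”, and the scores all stand in for what a real retrieval system would compute with learned, high-dimensional vectors:

```python
# Toy contrast: keyword overlap vs embedding-style semantic retrieval.
# All documents and vectors here are illustrative assumptions, not
# output from any real search engine or language model.
import math

docs = {
    "channel_roi_table": "marketing channel roi benchmark table percentages",
    "accountability_framework": "who is accountable for marketing outcomes and how to verify claimed results",
}

def keyword_score(query: str, doc: str) -> float:
    """Fraction of query terms that literally appear in the document."""
    q_terms = set(query.lower().split())
    d_terms = set(doc.lower().split())
    return len(q_terms & d_terms) / len(q_terms)

# Hand-made 3-dimensional stand-ins for learned embeddings: the second
# axis is imagined to encode the "accountability" concept underneath
# the query, which the literal wording never mentions.
embeddings = {
    "query": (0.2, 0.9, 0.4),
    "channel_roi_table": (0.9, 0.2, 0.1),
    "accountability_framework": (0.3, 0.8, 0.5),
}

def cosine(a, b) -> float:
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

query = "what marketing has the highest roi"

for name, text in docs.items():
    kw = keyword_score(query, text)
    sem = cosine(embeddings["query"], embeddings[name])
    print(f"{name}: keyword={kw:.2f}  semantic={sem:.2f}")
```

In this sketch the keyword scorer prefers the benchmark table (it shares the literal terms “marketing” and “roi”), while the embedding comparison prefers the accountability framework — a minimal picture of why retrieval that operates on meaning can surface the source that addresses the question underneath the query rather than its surface wording.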

This means that the organisations building structural content now — mapping semantic vacua, constructing the vocabulary that the field does not yet have — are not optimising for the current search system.

They are building the training data and retrieval anchors for the next one.

The organisations that understood this early are already operating in a different competitive landscape from the ones still asking how to rank for “website conversion ROI.”


The New Search Behaviour: What Real Users Are Actually Doing

Alongside the official search infrastructure, a parallel search behaviour has emerged — and it is the most significant signal in the current landscape.

People are searching inside LLMs. They are asking Claude, asking ChatGPT, asking Perplexity — not as a supplement to Google but as an alternative to it, for the specific query type where they need structural understanding rather than retrieval.

The queries migrating to conversational AI are not the simple lookups. Those stay in traditional search because traditional search handles them efficiently.

The queries migrating are the complex ones. The ones that cannot be compressed into six words without losing what makes them necessary. The ones where the person needs not a result but a framework — a way of thinking about the problem that they do not yet have.

These are precisely the queries that the first reality was never built to handle.

And here is what this means for content strategy in 2026: the content that earns the trust of a conversational AI retrieval system is not the content optimised for keyword density. It is the content that most precisely addresses the structural question — that names the entities, defines the relationships, and provides the framework that makes the complex query answerable at depth.

The organisation that has built that content owns the domain in the new search landscape. Not as a ranking position. As a reference architecture.


The Property No One Else Can Occupy

There is a concept in real estate that translates directly to this moment.

A property is valuable not because of what it currently contains, but because of what it is structurally positioned to become. The address. The access. The irreplicability of the location itself.

In the knowledge economy, the equivalent is the semantic address: the specific intersection of domain, vocabulary, and structural understanding that an organisation has claimed before consensus formed around it.

Once that address is occupied — once the vocabulary has been constructed, the entities defined, the framework established — subsequent entrants do not compete with the occupant. They reference it. They build adjacent to it. They link to it because they have to, because the architecture their own work depends on was built there first.

ignorancegraph.com is a semantic address.

informationembedding.com is a semantic address.

These are not domain names. They are claims on territory that the rest of the market has not yet understood exists — and that the search infrastructure of the next decade will organise around.

The future of search rewards the builder of the address, not the tenant.


What This Means for the Five Sectors

The split between retrieval and recognition is not uniform across industries. It is most acute — and most immediately consequential — in domains where the question underneath the query carries the highest stakes.

Public Health queries carry survival stakes. A person searching for health information who receives retrieval content — accurate at the surface, structurally insufficient for the decision they actually face — is not merely unserved. They are systematically misled by the gap between what they found and what they needed. The organisation that builds structural content in this domain is not competing for traffic. It is competing for trust at the scale where trust determines outcomes.

Education is experiencing the sharpest version of the split: an enormous volume of credential-retrieval content (what to study, which institutions offer it, what it costs) and an almost complete absence of structural content addressing the question of what education is actually for and how learning actually happens. The vocabulary for this second question has not been built. The organisation that builds it does not merely rank well. It defines the terms of the field.

Medical Research has the inverse problem: structural depth exists in abundance in journals and clinical literature, but the translation layer between research structure and patient-accessible understanding is almost entirely absent. The queries people bring to search in this domain are carrying the full weight of that translation gap. Retrieval content cannot close it. Only structural content — built with the vocabulary that bridges research and decision — can.

Meaningful Travel is a domain where the retrieval/recognition split is most commercially visible. The retrieval layer is saturated: itineraries, booking links, top-ten lists. The person who wants to understand the historical and intellectual architecture of a place they are visiting finds almost nothing. That unserved need is both commercially significant and entirely uncontested.

Patient Empowerment and Salutogenesis — the understanding of what creates and maintains health, as distinct from what treats its absence — is the largest structural knowledge gap in the consumer health space. The vocabulary of salutogenesis exists in academic literature and is almost entirely absent from the searchable web. The organisation that brings that vocabulary into the retrieval layer and builds the structural framework around it is not entering a competitive market. It is creating one.

In each of these sectors, the future of search has already arrived. The split is already visible. The organisations operating at the frontier are already asking the right question.

The question is not: how do we rank?

It is: what does this domain need named that nobody has named yet?


The Invitation

It is nicely wrapped. What is inside?

Five partnership slots. End of March 2026.

Not for agencies that rank. For agencies that understand that the structural vocabulary of their sector’s most important questions has not yet been built — and that building it, before the market does, is the only durable competitive position available.

The signal is simple: a do-follow link to this page from a domain operating in one of the five sectors is a demonstration that the agency understands the argument at the level of structure, not surface.

Ahrefs will show who linked. The reasoning will be self-evident.

The map of what the market has not yet named is being drawn.

The organisations that help draw it will own the coordinates.


Further Reading in This Series

Beyond SEO — The parent framework: why the optimisation era is structurally complete and what replaces it.

The Ranking Myth — Why position 1 costs money instead of earning it, and what the PAA data reveals about the gap between ranking and revenue.

The AI Agency Myth — Why no generative system replaces the mind that first understood what the audience needed. On Ogilvy, Nicholas, Schwartz — and why their work compounds while AI output depreciates.

Information Embedding Beyond Consensus — The methodology for constructing domain vocabulary before the market does.


ignorancegraph.com — The architecture of what the market has not yet named.