The Evidence That Earned Media Drives AI Citations: Research From 2025–2026
Machine Relations

The Evidence That Earned Media Drives AI Citations: Research From 2025–2026

A synthesis of the research proving earned media is the dominant driver of AI citations. Data from Muck Rack, Ahrefs, Stacker, McKinsey, and academic studies on how AI engines choose what to cite.

The research from 2025 and 2026 says the same thing regardless of where it comes from: AI engines overwhelmingly cite earned media, not brand-owned content. Muck Rack analyzed more than one million AI citations and found 82% came from earned media sources and 94% from non-paid sources. Stacker and Scrunch ran a controlled study across five leading LLMs and found that distributing content through third-party news outlets produced a 239% median lift in AI search visibility, with some cases reaching 325%. Ahrefs studied 75,000 brands and found brand web mentions correlated three times more strongly with AI Overview visibility than backlinks.

These are not adjacent findings. They are three independent measurement approaches, from a PR analytics company, a content distribution firm, and an SEO data company, all arriving at the same structural conclusion: AI engines prefer third-party editorial coverage over anything a brand publishes about itself. The pattern holds across every platform measured. For a detailed breakdown of which publications AI engines cite most frequently across ChatGPT, Perplexity, and Google, see which publications get cited most by AI search engines in 2026.

This post synthesizes the full body of evidence, explains the mechanism behind the bias, and maps what it means for brand visibility strategy in 2026.

Key takeaways

  • 82%–95% of AI citations come from non-paid, earned media sources across major studies
  • Distributing content through third-party news outlets produces a median 239% lift in AI citation visibility
  • Brand web mentions correlate 3x more strongly with AI Overview visibility than backlinks (0.664 vs. 0.218)
  • AI engines systematically favor third-party authoritative domains over brand-owned and social content
  • A brand's own website accounts for only 5–10% of the sources AI search references when answering questions
  • 94% of B2B buyers now use AI in their purchasing process, making AI citation a direct revenue variable
  • The framework that explains why this pattern holds is Machine Relations — the discipline of earning AI citations and recommendations through earned authority in trusted publications

The data: what major studies found about AI citation sources

Six significant research efforts have now measured where AI engines pull their citations from. Their methodologies differ, their sample sizes differ, and their publication sources differ. Their findings do not.

Muck Rack Generative Pulse (December 2025)

Muck Rack's "What Is AI Reading?" report analyzed more than one million citations from leading AI models and found that earned media accounts for 82% of all AI citations, with non-paid sources accounting for 94%. Journalism remains the dominant source category. Press releases grew 5x since July 2025, but still represent under 1% of total citations. For brand discovery questions — when users ask who leads a category rather than specific facts about a brand they already know — AI models rely more on earned media and journalism than on owned content. (Muck Rack, December 2025)

Muck Rack CEO Greg Galant noted a striking gap in how PR teams currently operate: "What we saw in our July report is even clearer now: earned media still shapes how AI understands brands. The gap between who PR teams pitch and who AI cites is striking — only a 2% overlap, which shows the industry hasn't fully adapted."

The study also found that structure affects citation selection. Cited press releases had a 30% higher rate of objective sentences and 2.5x as many bullet points as non-cited ones.

Stacker and Scrunch citation lift study (December 2025 / March 2026)

Stacker and Scrunch ran the first controlled study measuring how earned media distribution changes AI citation rates. The study tested 8 articles across 944 prompt-platform combinations on five leading LLMs. The baseline citation rate for content on a brand's own site was 8%. When the same content was distributed through third-party news outlets, the citation rate reached 34% — a 325% lift. (Stacker, December 2025)

A follow-up report in March 2026 found a 239% median lift across a broader sample. (Stacker/GlobeNewswire, March 2026) The key variable was not content quality but distribution context: AI systems weight the authority of the domain citing the content, not just the content itself. Stories republished across diverse, trusted news outlets earn substantially more AI citation than the same content sitting on a brand's own site.

Ahrefs brand visibility correlations (2025)

Ahrefs studied 75,000 brands to identify which signals correlate most strongly with AI visibility, and found brand web mentions are the dominant predictor — correlating 0.664 with AI Overview visibility versus 0.218 for backlinks. That is a 3x stronger correlation. The top three correlating factors were all off-site signals: brand web mentions (0.664), branded anchors (0.527), and brand search volume (0.392). (Ahrefs, May 2025)

The distribution effect was sharp at scale: brands in the top 25% for web mentions earned 10x more AI Overview mentions than the next quartile. Brands in the bottom 50% were essentially absent from AI answers. According to Ahrefs CMO Tim Soulo: "You need to get mentions on the pages where AI chatbots will do a search and find those pages and create their answer based on what they see — because then you will be mentioned."

A December 2025 follow-up study expanded the analysis to ChatGPT, Google AI Mode, and AI Overviews simultaneously. (Ahrefs, December 2025) The finding held across all three platforms, with YouTube channel mentions emerging as the highest signal (0.737) and brand mentions consistently dominating backlinks across every surface measured.

Chen et al. large-scale GEO empirical study (arXiv, September 2025)

Researchers from the University of Toronto ran large-scale controlled experiments across multiple AI search platforms and found AI search exhibits a "systematic and overwhelming bias towards Earned media — third-party, authoritative sources — over Brand-owned and Social content." Social platforms were almost entirely absent from AI answers. The contrast with Google's more balanced citation mix was described as stark. (Chen et al., arXiv:2509.08919, September 2025)

The study formulated a strategic framework with a clear priority: "dominate earned media to build AI-perceived authority." The finding that AI search differs fundamentally from traditional search in how it weights source types was described as the most consequential result for practitioners to understand.

GEO-16 framework study (Kumar et al., arXiv, September 2025)

Researchers at Berkeley introduced the GEO-16 framework, analyzing 1,702 citations from Brave, Google AI Overviews, and Perplexity across 70 B2B SaaS prompts. They found that on-page quality signals are necessary but insufficient. The paper explicitly noted that "recent comparative research emphasises that generative engines heavily weight earned media and often exclude brand-owned and social platforms. This implies that even high-quality pages may not be cited if they reside solely on vendor blogs." (Kumar et al., arXiv:2509.10762, September 2025)

The practical recommendation: pursue earned media relationships and diversify content distribution across platforms to counteract the engine bias toward third-party, authoritative domains. Technical optimization alone is not sufficient for AI visibility.

Fullintel and UConn academic study (February 2026)

A study presented at the International Public Relations Research Conference found that 47% of all AI citations in responses came from journalistic sources, with 89%+ of cited links from earned media and 95% from unpaid sources. (Fullintel, February 2026) The academic context is relevant: this was peer-reviewed research on citation behavior, establishing earned media's dominance in AI citation patterns as an academic finding rather than an industry claim.

McKinsey AI Discovery Survey (August 2025)

McKinsey's AI Discovery Survey found that a brand's own website accounts for only 5–10% of the sources that AI search references. (McKinsey, October 2025) The rest comes from affiliates, third-party editorial coverage, user-generated content, and review sites. The survey also found that 50% of consumers now intentionally seek out AI-powered search engines, with a majority of users naming it their top digital source for buying decisions. By 2028, $750 billion in US consumer spend is projected to flow through AI-powered search.

The McKinsey data confirms what the citation studies measure at the source level: from the user's perspective, AI is already the primary discovery channel for many categories. From the citation perspective, that channel is overwhelmingly populated by earned media, not brand-owned content.

Why AI engines are biased toward earned media

The data is consistent enough to ask the mechanism question: why do AI engines prefer third-party editorial sources?

The answer is not arbitrary. AI language models are trained on corpora that weighted high-authority, high-credibility sources. Publications like Reuters, the Financial Times, Forbes, TechCrunch, and the Wall Street Journal were over-represented in training data precisely because they were trusted sources of accurate, fact-checked information. When AI search systems ground their answers in external sources, they apply the same credibility weighting.

Brand-owned content has an inherent credibility problem for AI systems. A brand saying positive things about itself is recognized as self-interested. A third-party publication saying positive things about a brand is recognized as a credible signal. This distinction is built into how the models were trained, which is why no amount of on-page optimization can substitute for earned editorial presence.

Zhang et al.'s arXiv study found that 37% of AI-cited domains are entirely absent from traditional search results. (Zhang et al., arXiv:2512.09483, December 2025) AI citation and SEO ranking are separate phenomena operating on different signals. Moz's 2026 analysis of 40,000 queries found that 88% of Google AI Mode citations do not appear in the organic top 10. The channels are distinct. Ranking in traditional search does not transfer to AI citation.

The Princeton and Georgia Tech GEO research (Aggarwal et al., SIGKDD 2024) found that adding statistics improves AI visibility by 30–40%, and that citing credible sources increases citation probability. Both findings reinforce the same mechanism: AI engines are trying to surface content that itself demonstrates credibility. A piece that references authoritative external sources earns more citations than a piece that does not, because the signal structure mirrors what AI engines recognize as credibility.

The PR industry is confirming this from the inside

Separate from the data studies, something structurally significant is happening in the PR industry itself: practitioners are independently arriving at the machine citation thesis and describing it as the new definition of their work.

WorldCom Group, a consortium of 160 independent PR agencies operating globally, stated: "Research shows that up to 90% of citations driving brand visibility in LLMs come from earned media, positioning public relations at the center of this transformation." This is not a GEO company claiming PR's territory. This is the organized global PR industry using machine citation data to redefine their own function.

Todd Ringler, head of U.S. media at Edelman, said in Campaign Asia: "So-called generative engine optimization is going to be front-and-center in any successful brand or reputation campaign. Earned media and content strategies need to be savvy to where and how AI search is finding and structuring its answers." Edelman is the world's largest PR firm. Their U.S. media head is adopting GEO language to describe what PR needs to become.

Brian Olson, brand PR lead at Hormel Foods, told PR Daily: "By the end of 2026, appearing in LLM responses will stand shoulder-to-shoulder with impressions, which continue to lose relevance as a primary KPI." A corporate communications professional at a Fortune 500 company is describing AI citation as a primary success metric.

Gab Ferree, founder of communications community Off the Record, put it directly at an Axios HQ webinar in February 2026: "Media relations are becoming machine relations. It's on the comms professionals to learn the patterns of AI and then take action on them." (Stacker, February 2026)

These are not academics theorizing about a future state. These are practitioners describing what they are observing in their current work: earned media in trusted publications is what AI engines cite, which means PR's core product has become the foundation of AI visibility.

The GEO research community is proving the same thing from different data

Meanwhile, the GEO and SEO research community has been building an independent evidentiary case from the other direction. Their data is not about PR's importance. It is about AI citation behavior. The conclusion is the same.

Search Engine Land published a February 2026 guide stating directly: "Digital PR and thought leadership aren't just brand plays anymore. They're direct GEO levers. Research shows AI engines favor earned media — third-party coverage, reviews, and industry mentions — over content on your own site." The canonical search industry publication is telling its readers that PR is now a GEO strategy.

The Stacker and Scrunch research explicitly positioned earned media distribution as "no longer just a traffic strategy, but a fundamental component of AI visibility." This came from a content distribution company with no stake in the PR or GEO debate — they measured what happened when content moved from owned to earned channels, and the citation lift was the result.

Firebrand Marketing, a GEO agency, wrote in their practice guide: "89% of LLM citations come from earned sources, including 27% from journalistic outlets... GEO is a multidisciplinary approach, so assemble a team involving SEO, content, web, affiliate marketing and PR. This has all your citation sources covered." A GEO firm is prescribing PR as a required component of their practice.

The pattern is clear. PR practitioners are proving the GEO thesis: AI citation depends on earned media, which means PR's mechanism has become central to AI-era visibility. GEO practitioners are proving the PR thesis: the most effective thing you can do for AI visibility is earn coverage in publications AI engines trust, which is the definition of PR's core function. Neither side has a name for the architecture that connects both. That is the gap this evidence base is pointing toward.

What this means for brand visibility strategy in 2026

The evidence produces several strategic implications that are now empirically grounded rather than speculative.

Your own website is a minority citation source for AI

McKinsey's data found that a brand's own website accounts for only 5–10% of what AI search references. This is a structural ceiling on what on-site optimization can accomplish for AI visibility. The remaining 90–95% of citations come from third-party sources. A brand that has invested entirely in SEO and owned content has built a strategy optimized for the 5–10% channel while ignoring the 90–95% channel.

Backlinks are not the same as brand mentions for AI visibility

Traditional SEO prioritized backlinks as the dominant off-site signal. The Ahrefs data shows that for AI visibility, brand web mentions (0.664 correlation) are three times more predictive than backlinks (0.218 correlation). These require different tactics. Backlinks are acquired through content and technical relationships. Brand mentions are acquired through editorial coverage, media placements, thought leadership in publications, and the kind of third-party credibility signals that have always been the domain of PR.

Distribution multiplies AI citation rates

The Stacker and Scrunch study established that the same content earns dramatically more AI citations when distributed through third-party news outlets than when it sits on a brand's own site. The citation rate moved from 8% to 34% through distribution alone. This means earned media placement is not just about reach to human readers. It is the mechanism by which content becomes AI-citable. A story in Forbes earns citations from AI engines that reference Forbes as an authority. The same story on a brand blog does not.

B2B buying is already running through AI

Forrester's 2026 Buyer Insights data found that 94% of B2B buyers are using AI in their purchasing process, with twice as many buyers naming generative AI or conversational search as a more meaningful source of information than any other channel. AI visibility is not a future concern for B2B brands. It is a current pipeline variable. The brand that is not cited when a buyer asks their AI assistant who the category leaders are is absent from the consideration set. For a deeper analysis of how AI-mediated buying affects B2B pipeline, see how AI agents are making B2B vendor decisions in 2026.

Structure improves extractability, but does not replace earned authority

The GEO-16 and Princeton/Georgia Tech research both found that content structure affects citation probability. Pages with proper metadata, semantic HTML structure, and answer-first organization are cited more than those without. Statistics improve AI visibility by 30–40%. Tables are cited 2.5x more often than prose. (Aggarwal et al., SIGKDD 2024) Structure matters, but it is insufficient without earned authority. The GEO-16 paper explicitly stated that even high-quality pages may not be cited if they reside on vendor blogs. Structure is necessary; earned authority is necessary; both together produce the outcome.

Machine Relations: the architecture that connects both sides of the evidence

The evidence points to a clear architecture. AI engines prefer earned media because earned media is what they were trained to recognize as credible. Third-party editorial coverage in publications with track records for accuracy and editorial standards carries a credibility signal that self-published content cannot replicate. This is why the mechanism works and why no amount of on-page optimization changes the fundamental dynamic.

The timeline of how earned media compounds into AI citation advantage is documented in how earned media citation authority accumulates over time. The pattern is consistent: placements in trusted publications generate citations that compound as AI engines index and cross-reference authoritative sources over months, not days.

This is the core of Machine Relations — the discipline coined by Jaxon Parrott, founder of AuthorityTech, in 2024. Machine Relations (MR) is the discipline of earning AI citations and recommendations for a brand by making that brand legible, retrievable, and credible inside AI-driven discovery. The five-layer Machine Relations stack places Earned Authority as Layer 1, not because it is easy or obvious, but because the data consistently shows it is the primary predictor of AI citation outcomes.

The comparison table from the Machine Relations framework maps how each discipline relates to the full system:

DisciplineOptimizes forSuccess conditionScope
SEORanking algorithmsTop 10 position on SERPTechnical + content
GEOGenerative AI enginesCited in AI-generated answersContent formatting + distribution
AEOAnswer boxes / featured snippetsSelected as the direct answerStructured content
Digital PRHuman journalists/editorsMedia placementOutreach + storytelling
Machine RelationsAI-mediated discovery systemsResolved and cited across AI enginesFull system: earned authority, entity clarity, citation architecture, distribution, measurement

GEO and AEO describe the distribution and formatting layers of the system. SEO addresses the technical foundation. Digital PR addresses outreach to human editors. Machine Relations is the full architecture, starting with Earned Authority because the research shows that without it, everything else operates on a degraded foundation.

PR got the mechanism exactly right: earned media in trusted publications is the most powerful credibility signal that exists. It was true when your buyers were human. The research in this post confirms it is true now that AI systems are doing the first cut of research on behalf of your buyers. The publications that shaped human brand perception for decades are the same publications AI engines treat as authoritative sources. What changed is the reader.

As Jaxon Parrott described in his Machine Relations breakdown on Medium, the five-layer stack is the operational system that makes earned authority compound into AI citation, entity resolution, and measurable share of citation over time. The evidence base in this post is what the stack is built on.

For brands in 2026, the question is not whether earned media drives AI citations. Six independent research bodies have answered that. The question is whether their current strategy accounts for it.

[Start your visibility audit →](https://app.authoritytech.io/visibility-audit)

Frequently asked questions

What percentage of AI citations come from earned media?

Multiple studies from 2025 and 2026 find that earned media accounts for 82–95% of AI citations. Muck Rack's Generative Pulse analysis of more than one million AI citations found 82% came from earned media and 94% from non-paid sources. Fullintel's academic study found 89%+ of cited links were from earned media and 95% from unpaid sources. The Golin figure, presented at PR Moment, found 90% of AI visibility is driven by citations from earned editorial media. The consistency across independent measurement approaches is the signal: earned media dominates AI citation regardless of the methodology used to measure it.

Why do AI engines prefer earned media over brand-owned content?

AI language models learned credibility from training data that weighted high-authority, editorially independent sources. Publications with track records for accuracy, fact-checking, and editorial standards were over-represented in the training corpora. When AI search systems select sources to ground their answers, they apply the same credibility weighting. Brand-owned content carries an inherent credibility discount because self-interest is recognized. Third-party editorial coverage in recognized publications carries a positive credibility signal. According to Chen et al. (arXiv:2509.08919), AI search exhibits a "systematic and overwhelming bias towards Earned media" — this is a structural feature of how the systems were built, not a temporary pattern that will change with optimization tactics.

What is the relationship between GEO and earned media?

GEO (Generative Engine Optimization) describes the practice of optimizing content for AI-generated answers. The research shows that the most effective GEO lever is not on-page formatting — it is earned media distribution. Stacker and Scrunch's controlled study found that distributing the same content through third-party news outlets increased AI citation rates by 325%. Ahrefs found that brand web mentions correlate 3x more strongly with AI visibility than backlinks. According to the GEO-16 framework (Kumar et al., arXiv:2509.10762), even high-quality pages may not be cited if they reside solely on vendor blogs. GEO requires both on-page structure and earned authority. In the Machine Relations framework, GEO and AEO are Layer 4 (Distribution Across Answer Surfaces), which requires Layer 1 (Earned Authority) to function at full effectiveness. Structure without earned authority is optimization without credibility.

Who coined Machine Relations?

Jaxon Parrott, founder of AuthorityTech, coined the term Machine Relations in 2024 to name the discipline of earning AI citations and recommendations for brands by making them legible, retrievable, and credible inside AI-driven discovery systems. He published the five-layer Machine Relations stack and the origin story at machinerelations.ai. As Jaxon Parrott defined the category in March 2026: Machine Relations is the canonical name for the shift from human-mediated to machine-mediated brand discovery, a shift the research in this post quantifies and explains.

Does improving SEO rankings improve AI citation rates?

No — the research shows these are separate channels with low overlap. Moz's 2026 analysis of 40,000 queries found that 88% of Google AI Mode citations do not appear in the organic top 10 SERP results. Zhang et al. (arXiv:2512.09483) found that 37% of AI-cited domains are entirely absent from traditional search results. Ahrefs found that backlinks (the core SEO metric) correlate only 0.218 with AI visibility, while brand web mentions correlate 0.664. Optimizing for traditional search rankings does not transfer to AI citation rates. The two channels require different strategies, and the dominant factor in AI citation is earned media, not SERP position.

Related Reading