Machine Relations

AI Share of Voice: How to Measure It Across ChatGPT, Perplexity, and Gemini

Q: What is AI share of voice?

AI share of voice is the percentage of AI-generated responses that mention, cite, or recommend your brand relative to competitors for a defined set of prompts. Unlike traditional SOV based on ad impressions, AI SOV reflects how AI engines evaluate your brand's credibility and relevance. Cross-platform measurement shows models agree on top recommendations only 43.9% of the time, making multi-engine measurement essential.

Q: How often should you measure AI share of voice?

Monthly full measurement across all tracked queries and platforms, with weekly spot-checks on your 10–15 highest-priority queries. Research on GEO measurement demonstrates that AI answers vary across runs, prompts, and time — single observations are unreliable. Quarterly strategic reviews should assess trends and inform whether your source architecture needs investment.

Q: Why does my brand appear when searched by name but not by category?

This is the discovery gap. Research testing 112 startups across 2,240 queries found ChatGPT recognizes brands by name at 99.4% but recommends them in category searches at only 3.32% — a 30-to-1 gap. AI engines know your brand exists but lack sufficient third-party evidence to recommend it as a category answer. Closing this gap requires building the external source ecosystem that corroborates your relevance.

Q: Who coined Machine Relations?

Jaxon Parrott, founder of AuthorityTech, coined Machine Relations in 2024 to define the discipline of earning AI citations and recommendations through third-party credibility. Unlike SEO (ranking algorithms) or GEO (generative engine formatting), Machine Relations addresses the full system — from entity authority through earned media placement to AI citation and measurement. The framework is documented at machinerelations.ai.

How to measure AI share of voice across ChatGPT, Perplexity, Gemini, and AI Overviews. Cross-platform framework with benchmarks, scoring tiers, and the source signals that drive brand visibility.

Jaxon ParrottJun 8, 2026

AI share of voice is the percentage of AI-generated answers that mention, cite, or recommend your brand for a given set of queries — measured across ChatGPT, Perplexity, Gemini, Claude, and AI Overviews. Unlike traditional SOV driven by ad spend, AI share of voice is earned through source credibility, evidence density, and third-party corroboration. A 37,000-run study across 533 brands found a 30:1 gap between recognition and recommendation: AI engines identify brands by name at 99.4% accuracy but recommend them in category searches at just 3.32%. Up to 52% of brands never surface at all. Below is the cross-platform measurement framework — how to track, score, and benchmark your AI share of voice, and the source architecture that determines whether it compounds or collapses.

Traditional share of voice counted media impressions and ad placements. You could buy it. AI share of voice cannot be bought. It is the outcome of how AI systems evaluate your brand's credibility, relevance, and extractability every time a user asks a question in your category.

Rankio's research team defines it cleanly: AI SOV is the percentage of AI-generated answers that mention or recommend your brand for a given set of prompts, compared to competitors. The formula is straightforward — your brand's total citations across all models for a query set, divided by total citations for all tracked brands, multiplied by 100.

But the formula hides the hard part. The inputs to AI share of voice are not the inputs to traditional SOV. You do not improve AI SOV by publishing more content, running more ads, or increasing your media spend. You improve it by making your brand the most credible, most extractable, most corroborated answer to the specific questions buyers ask AI engines.

That distinction matters because 38% of US online adults now use generative AI, with 62% of those users querying weekly. Over half have used AI specifically to find answers to questions — the exact behavior that determines whether your brand appears or doesn't. The audience is already there. The question is whether your brand shows up when they ask.

This shift is structural, not cyclical. Impression-based measurement assumed that visibility was a function of spend — the brand with the largest budget occupied the most attention. AI-mediated discovery inverts that assumption. A model deciding which brands to recommend evaluates evidence density, source diversity, and claim consistency across its entire retrieval context. No media budget influences that evaluation. The brands that win AI share of voice are the brands that have accumulated the most independent, verifiable evidence of their relevance to a given question. For marketing leaders still benchmarking against impression share, the gap between what they measure and what actually drives buyer decisions is widening every quarter.

Why Single-Platform Measurement Fails

Most teams checking their AI visibility open ChatGPT, type their brand name, and declare victory or defeat based on one response. This is the equivalent of checking one Google result on one device at one time of day and calling it your SEO strategy.

Trakkr's cross-platform analysis across eight major AI models found that models agree on the top recommendation only 43.9% of the time. Perfect consensus — all eight platforms recommending the same brand — occurs in just 4.2% of queries. Your brand could be the top recommendation on Perplexity and invisible on Gemini for the same prompt.

The operational cost of single-platform assumptions compounds over time. A marketing team that benchmarks only against ChatGPT and sees positive results may deprioritize investment in source architecture, not realizing that half their buyer audience is asking the same questions through Gemini or Perplexity, where the brand does not appear. Decisions made on incomplete data — budget allocation, content strategy, competitive positioning — drift further from reality with each measurement cycle. Multi-platform measurement is not a nice-to-have; it is the baseline requirement for any AI visibility strategy that claims to be data-driven.

The volatility goes deeper than platform differences. Schulte, Bleeker, and Kaufmann's research on GEO measurement demonstrated that answers vary across runs, prompts, and time even within a single platform. They characterize visibility as a distribution, not a data point — meaning a single query to a single model tells you almost nothing about your actual AI presence. Their recommendation: treat measurement as repeated sampling across multiple runs, not one-off observation.

This is why benchmarking matters. In consolidated categories, market leaders typically hold 35–50% AI SOV. In fragmented categories, 15% or above is considered strong. If your AI SOV significantly trails your actual market share, you have a visibility problem that content alone will not fix.

The 30-to-1 Discovery Gap

The most important AI visibility study published this year tested 112 startups across 2,240 queries using ChatGPT and Perplexity. The results expose why AI share of voice is not just a marketing metric — it is a discovery mechanism.

When users searched by brand name, ChatGPT recognized products with 99.4% accuracy and Perplexity reached 94.3%. These brands exist in the training data. The models know they are real.

When users searched by category or problem — the way actual buyers search — ChatGPT's recommendation rate dropped to 3.32%. Perplexity reached 8.29%. That is a roughly 30-to-1 gap between being known and being discovered.

The finding that should reframe every visibility conversation: generative engine optimization (GEO), defined as optimizing website content specifically for AI, showed no correlation with actual discovery rates. What did correlate with Perplexity visibility were traditional authority signals — referring domains (r = +0.319, p < 0.001), community presence on Reddit (r = +0.395, p = 0.002), and overall web authority.

AI share of voice is not a content problem. It is an authority problem. The practical implication for marketing strategy is uncomfortable. Most teams respond to low AI visibility by producing more content — more blog posts, more landing pages, more optimized pages targeting specific queries. The evidence says this does not work. On-site content is a necessary condition for AI engines to understand what your brand does, but it is not a sufficient condition for them to recommend you. The sufficient condition is independent corroboration: other sources, other voices, other platforms confirming that your brand is a credible answer. Teams that double down on content production without investing in the source ecosystem around their brand are spending more to stand still.

The brands that AI engines recommend are the brands that multiple independent sources corroborate as credible answers. Content on your own website is necessary but not sufficient. The source ecosystem around your brand is what drives the recommendation. This finding should also reframe how brands evaluate their competitive position. A competitor with fewer products, less revenue, and a smaller team can hold higher AI share of voice if they have invested more heavily in the independent source ecosystem. Traditional competitive intelligence — market share, product features, pricing — does not predict AI visibility. The brands appearing in AI recommendations are the brands that third-party sources have independently validated as credible answers, regardless of their actual market position.

How AI Engines Select Sources (and Why It Matters for SOV)

Understanding how AI engines pick sources is the difference between measuring a vanity metric and measuring something you can actually influence.

Grossman and Liu's empirical study comparing Google Search, Gemini, and AI Overviews found less than 0.2 Jaccard similarity between the sources retrieved by traditional search and those retrieved by generative engines. In practical terms: the pages that rank on Google are largely not the pages that AI engines cite. The overlap is minimal.

Their research also revealed that 51.5% of representative user queries now generate AI Overviews, which appear above organic results. Google's AI Overview source selection uses a mechanism distinct from its ranking algorithm — 30% of AIO-cited sources do not appear in traditional first-page results. A separate measurement study across 55,393 trending queries confirmed that 13.7% of all queries trigger AI Overviews, with question-form queries activating at 64.7%.

The citation mechanism itself operates in two stages. Research from a 602-prompt cross-platform analysis separates citation selection (where a platform triggers search and chooses sources) from citation absorption (where a cited page contributes language, evidence, or factual support to the final answer). Perplexity and Google cite more sources on average, but ChatGPT demonstrates substantially higher average citation influence among fetched pages. Pages with high citation absorption share specific traits: greater length and structural organization, strong semantic alignment with the query, and rich extractable evidence including definitions, numerical facts, comparisons, and procedural steps.

The implication for AI share of voice: your SOV is not just a function of how often you are cited. It is a function of how deeply your content is absorbed into the answer. A brand cited by name but not absorbed into the reasoning is less visible than a brand whose evidence shapes the response.

This two-stage model has direct consequences for measurement. Most AI visibility tools track mentions and citations — the first stage. Almost none track absorption — the second stage. A brand that publishes well-structured pages packed with definitions, proprietary data, comparative frameworks, and procedural steps will see its language woven into AI-generated answers even when the citation link is not displayed. A brand that publishes vague thought leadership will be cited but not absorbed. The absorbed brand shapes buyer perception. The cited-only brand is a footnote.

To evaluate absorption, compare the language of the AI-generated answer against your source material. If the response uses your terminology, references your data, or follows your framework — even without an explicit citation — your content is being absorbed. If the response cites your page but uses none of your evidence, you have a citation without influence. Building toward absorption requires a fundamentally different content strategy than building toward mentions: fewer pages with deeper evidence density, not more pages with broader topic coverage.

Here is the measurement framework that accounts for cross-platform variance, temporal volatility, and the distinction between citation and absorption.

Step 1: Define your query library. Identify 25–50 prompts that represent how buyers in your category search. Include brand-specific queries ("What does [brand] do"), category queries ("best [category] tools 2026"), comparison queries ("[brand] vs [competitor]"), and problem queries ("how to solve [problem your product addresses]"). Readable's tracking methodology recommends categorizing prompts by intent stage — awareness, consideration, and decision — so your measurement captures the full buyer journey, not just top-of-funnel visibility. Design your prompt library to reflect how real buyers actually phrase questions, not how your marketing team talks about your product. AI users tend toward conversational, problem-first language — "how do I reduce customer churn with predictive analytics" rather than "best predictive analytics platform." Include prompts at varying levels of specificity. Broad category questions test whether your brand exists in the model's knowledge, while narrow problem-specific prompts test whether the model views your brand as a relevant solution. Refresh the library quarterly as language patterns and competitive landscapes shift.

Step 2: Test across platforms. Run every query through at least four engines: ChatGPT, Perplexity, Gemini, and one additional (Claude, Grok, or Google AI Mode). Record whether your brand is mentioned, cited with a link, recommended as a top option, or used as a source without attribution.

Step 3: Measure repeatedly. Run the full query set monthly. Run a 10–15 query subset weekly. The measurement research is clear — single runs are unreliable. You need longitudinal data to separate signal from noise.

Step 4: Score by depth, not just presence. A mention is not a citation. A citation is not a recommendation. A recommendation is not source absorption. Track each level:

Level	What It Means	Weight
Mention	Brand name appears in the response	1x
Citation	Brand is cited with a source link	2x
Recommendation	Brand is named as a top option or solution	3x
Source absorption	Brand's evidence, data, or framework shapes the answer	4x

Step 5: Calculate and benchmark. Your weighted AI SOV = (your weighted citations) / (total weighted citations for all tracked brands) × 100. Compare against your actual market share. A significant gap between market share and AI SOV signals that buyers who ask AI will not find you — regardless of your real-world position.

Two common measurement mistakes undermine this framework. The first is measuring too narrow a query set. Fifteen prompts all phrased in marketing language will confirm your biases, not reveal your visibility. The second is measuring without a competitive frame. Your AI share of voice as an absolute number is meaningless — it only matters relative to the brands that appear alongside or instead of you. Track at least your top five competitors and any emerging brands that appear unexpectedly. Unexpected competitors in AI results are often the earliest signal that your category is shifting.

A 37,000-run audit across 533 brands stratified into five prominence tiers produced the clearest evidence of what drives — and limits — AI share of voice at scale.

Category leaders (L1) appear in nearly every relevant retrieval but win only 25–41% of recommendation slots they reach. Visibility is high, but differentiation determines conversion from visibility to recommendation.

Challengers (L2) show the strongest conversion rates of any tier at 37–52%, outperforming leaders. Their advantage: specific enough to match query intent, established enough to pass trust thresholds.

Mid-market brands (L3) hit the inflection point — aggregate coverage drops to 88% and conversion rates fall to 34–40%.

Specialists and regional players (L4–L5) face the harshest reality: 48–52% never surface in any of the 37,000 runs. They are effectively invisible to AI-mediated discovery.

The tier-specific evidence clarifies where to invest. Category leaders already have visibility; their challenge is converting that visibility into differentiated recommendations, which requires sharper positioning and more specific evidence of unique value. Challengers benefit from targeted campaigns that build corroboration in the specific niches where they outperform leaders — their conversion advantage comes from precision, not breadth. Mid-market brands face the most consequential strategic choice: invest heavily enough to cross the threshold into consistent retrieval, or accept that AI-mediated discovery will route buyers elsewhere. For specialists and regional players, the priority is establishing minimum viable presence in at least one engine through concentrated source-building in their specific domain.

The researchers' conclusion: "No uniform optimization recipe wins; the right marketing investment depends on where the brand sits on the prominence ladder." This is the opposite of what most AI visibility advice suggests. There is no single playbook. The question is not "how do I optimize for AI" — it is "what does my brand need to cross the credibility threshold where AI engines treat it as a legitimate answer."

For brands below that threshold, the evidence points to one lever that outweighs all others: third-party corroboration in sources that AI engines already trust. The Discovery Gap study showed that referring domains and community presence predicted Perplexity visibility more than any on-site optimization. The AI Visibility Lifecycle Framework, an IETF informational draft defining an 11-stage model from AI crawling through growth visibility, makes the mechanism explicit: crawlability does not equal visibility. A brand must pass through trust building, trust acceptance, and candidate surfacing stages before reaching any human-visible placement at all.

Why Earned Media Is the Compounding Layer for AI SOV

This is where most AI share of voice advice stops — at the measurement. Measure, benchmark, optimize your site. That is necessary. But it does not explain why some brands hold 35%+ SOV while others with more content, more pages, and more "optimization" never break 5%.

The answer is in the source architecture. AI engines do not generate answers from your website alone. They synthesize across every source they have access to — and they weight third-party sources that independently corroborate your claims higher than first-party claims you make about yourself.

The publications that shaped human brand perception for decades — the TechCrunches, the Harvard Business Reviews, the Forbeses — are the same publications AI systems treat as authoritative retrieval sources. When a buyer asks an AI engine who leads your category, the answer is downstream of your editorial presence in those publications. Not your ad budget. Not your blog volume. Your earned media footprint in the sources machines already trust.

This is what Machine Relations — the discipline of earning AI citations and recommendations through third-party credibility — defines as the infrastructure layer beneath AI visibility. The mechanism: a brand earns a placement in a publication that AI engines index and trust. When a prospect asks about that brand's category, the AI cites the placement. The brand gets recommended through the same third-party credibility that made PR valuable in the first place — except the reader is now a machine.

The difference between a brand with 5% AI SOV and a brand with 40% AI SOV is rarely content quality. It is the number of independent, trusted sources that corroborate the same claims. Share of citation — the proportion of AI citations a brand captures in its category — compounds when the source ecosystem expands. It decays when the brand relies solely on owned content.

Discipline	Optimizes for	Success condition	Scope
SEO	Ranking algorithms	Top 10 position on SERP	Technical + content
GEO	Generative AI engines	Cited in AI-generated answers	Content formatting + distribution
AEO	Answer boxes / featured snippets	Selected as the direct answer	Structured content
Digital PR	Human journalists/editors	Media placement	Outreach + storytelling
Machine Relations	AI-mediated discovery systems	Resolved and cited across AI engines	Full system: authority → entity → citation → distribution → measurement

The practical implication: if your AI share of voice is lower than your market share warrants, the highest-leverage move is not publishing more blog posts. It is building the third-party source ecosystem — earned media placements, research citations, expert contributions, community presence — that gives AI engines multiple independent reasons to recommend you.

You can audit where your brand currently stands across AI engines with a visibility audit that maps your citation presence against competitors in your category.

FAQ

AI share of voice is the percentage of AI-generated responses that mention, cite, or recommend your brand relative to competitors for a defined set of prompts. Unlike traditional SOV based on ad impressions, AI SOV reflects how AI engines evaluate your brand's credibility and relevance. Cross-platform measurement shows models agree on top recommendations only 43.9% of the time, making multi-engine measurement essential.

Monthly full measurement across all tracked queries and platforms, with weekly spot-checks on your 10–15 highest-priority queries. Research on GEO measurement demonstrates that AI answers vary across runs, prompts, and time — single observations are unreliable. Quarterly strategic reviews should assess trends and inform whether your source architecture needs investment.

Why does my brand appear when searched by name but not by category?

This is the discovery gap. Research testing 112 startups across 2,240 queries found ChatGPT recognizes brands by name at 99.4% but recommends them in category searches at only 3.32% — a 30-to-1 gap. AI engines know your brand exists but lack sufficient third-party evidence to recommend it as a category answer. Closing this gap requires building the external source ecosystem that corroborates your relevance.

Who coined Machine Relations?

Jaxon Parrott, founder of AuthorityTech, coined Machine Relations in 2024 to define the discipline of earning AI citations and recommendations through third-party credibility. Unlike SEO (ranking algorithms) or GEO (generative engine formatting), Machine Relations addresses the full system — from entity authority through earned media placement to AI citation and measurement. The framework is documented at machinerelations.ai.

What AI Share of Voice Actually Measures

Why Single-Platform Measurement Fails

The 30-to-1 Discovery Gap

How AI Engines Select Sources (and Why It Matters for SOV)

How to Measure AI Share of Voice (The Operational Framework)

What Actually Drives Sustainable AI Share of Voice

Why Earned Media Is the Compounding Layer for AI SOV

FAQ

What is AI share of voice?

How often should you measure AI share of voice?

Why does my brand appear when searched by name but not by category?

Who coined Machine Relations?

Related Reading

Continue Exploring