Schema Markup and AI Citations: What the Data Actually Shows (2026)

Rahul Marthak
May 12, 2026
12 min read

Every SEO agency I've spoken with is selling the same thing right now: "Add schema, get cited by AI." It sounds clean. It sounds like a system. And if you're a founder who just got burned by $5K/month PDF reports, it's exactly what you want to hear. But the relationship between schema markup and AI citations is far more complicated than any agency pitch deck will admit.

TL;DR
  • An Ahrefs study of 1,885 pages that added schema found no meaningful increase in AI citations in Google AI Overviews, AI Mode, or ChatGPT.
  • Google AI Overviews citations actually dropped 4.6% after schema was added.
  • Schema still has value, but it doesn't make a page worth citing.
  • AI visibility is earned through content quality, topical authority, and sourcing.

Does Schema Markup Actually Help With AI Citations?

Let me give you the honest answer before someone else sells you the comfortable one.

Ahrefs tracked 1,885 pages that added JSON-LD schema against 4,000 matched control pages, monitoring changes between August 2025 and March 2026. The results on schema markup and AI citations were not what most agencies are telling you:

Ahrefs schema citation study
  • Google AI Overviews: -4.6%. Statistically significant drop after schema was added.
  • Google AI Mode: +2.4%. Not statistically significant. Effect indistinguishable from noise.
  • ChatGPT: +2.2%. Not statistically significant. No proven citation lift.
The only statistically significant movement went in the wrong direction.

The only movement that actually cleared the statistical significance bar was the drop in AI Overviews. The gains in AI Mode and ChatGPT were indistinguishable from noise.
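
To see why a small lift can be indistinguishable from noise, here is a minimal sketch of a two-proportion z-test in Python. The counts are invented for illustration and are not the Ahrefs data, and the study's own statistical method isn't described here, so treat this as one common way to compare a treated group with a control group rather than a reconstruction of their analysis.

```python
import math

def two_proportion_z_test(hits_a, n_a, hits_b, n_b):
    """Return (z, two-sided p-value) for the difference between two proportions."""
    p_a = hits_a / n_a
    p_b = hits_b / n_b
    # Pooled proportion under the null hypothesis that both groups share one rate.
    pooled = (hits_a + hits_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value

# Hypothetical counts of pages cited at least once (treated vs control).
small_z, small_p = two_proportion_z_test(230, 1000, 225, 1000)
large_z, large_p = two_proportion_z_test(300, 1000, 225, 1000)

print(f"small gap: z={small_z:.2f}, p={small_p:.3f}")
print(f"large gap: z={large_z:.2f}, p={large_p:.5f}")
```

With these made-up numbers, the half-point gap does not clear the conventional 0.05 threshold while the larger gap does. That is the sense in which a gain of a couple of percent can still be noise.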

This doesn't mean structured data is worthless. It means schema is not the lever your agency claims it is for AI citation rates. Schema helps Googlebot parse your pages, supports rich results in traditional search, and can improve how your content maps to a knowledge graph. Those are real, secondary benefits.

But large language models like the ones powering Google's AI Overviews, ChatGPT, and Perplexity AI don't pull citations because a page has FAQPage or HowTo schema attached to it. They pull citations because the content answers the question better than anything else available, is clearly sourced, and demonstrates genuine expertise.

The distinction matters enormously for where you spend your time. If you're an early-stage startup with limited engineering bandwidth and a fractional content budget, adding schema is a one-time task. Building content that deserves to be cited is ongoing work, and it's the only work that actually compounds.

Schema describes the page. It doesn't make the page worth referencing. That's not a subtle point. That's the entire point.

Why Isn't My Schema Markup Getting Cited by AI?

Because citation in AI answers is an information retrieval problem, not a metadata problem.

When retrieval-augmented generation systems pull sources, they're running semantic search against indexed content. They're asking: "Which document best satisfies this query with the most credible, specific, clearly structured answer?" Named entity recognition and natural language processing are doing the heavy lifting. Your JSON-LD is not part of that retrieval calculation.
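
As a rough illustration of ranking by text relevance, here is a toy sketch in Python that scores passages against a query with bag-of-words cosine similarity. Production retrieval systems use learned embeddings and many more signals, and nothing here reflects how any specific AI product works. The passages and the `rank` helper are hypothetical. The only point is that the score is computed from the text of the passage itself.

```python
import math
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    shared = set(a) & set(b)
    dot = sum(a[t] * b[t] for t in shared)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def rank(query, passages):
    """Return (score, name) pairs sorted from most to least similar."""
    q = Counter(tokenize(query))
    scored = [
        (cosine(q, Counter(tokenize(text))), name)
        for name, text in passages.items()
    ]
    return sorted(scored, reverse=True)

# Hypothetical passages: one answers the query, one is generic sales copy.
passages = {
    "direct_answer": "Schema markup did not increase AI citations in the study.",
    "thin_page": "Welcome to our agency. Contact us for a free audit today.",
}
ranking = rank("does schema markup increase AI citations", passages)
print(ranking)
```

In this toy setup the passage that addresses the question ranks first, and the generic page shares no terms with the query at all.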

Here's what the structured data inflation problem looks like in practice: agencies add schema types that don't match the actual content, inflate markup to cover as many entity types as possible, and present that activity as AI optimisation. When citations don't follow, they blame the algorithm.

What's actually missing is almost always one of these:

Schema can improve how search engines understand entity relationships on your page. But if the underlying ontology of your content is thin, no amount of markup changes that. You're decorating an empty room.


Schema Markup for AI Visibility: What Actually Works

Schema has a legitimate role. I'm not here to tell you to strip it all out. What I'm telling you is to understand what it actually does.

Structured data communicates page context to machines. It tells Googlebot that this page is a product, this one is a recipe, this one is an article with a named author and a publish date. That disambiguation supports E-E-A-T signals, helps with rich results in traditional SERP features, and aids knowledge graph entity association.
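
For reference, here is a minimal sketch of the kind of Article markup described above, built as a Python dictionary and serialised to JSON-LD. The property names come from the schema.org vocabulary, the values are taken from this article's own byline, and the `build_article_jsonld` helper is a hypothetical name used only for this example.

```python
import json

def build_article_jsonld(headline, author_name, date_published):
    """Build a minimal schema.org Article object as a JSON-LD string."""
    data = {
        "@context": "https://schema.org",
        "@type": "Article",
        "headline": headline,
        "author": {"@type": "Person", "name": author_name},
        "datePublished": date_published,  # ISO 8601 date
    }
    return json.dumps(data, indent=2)

markup = build_article_jsonld(
    "Schema Markup and AI Citations: What the Data Actually Shows (2026)",
    "Rahul Marthak",
    "2026-05-12",
)
print(markup)
```

The resulting string is typically embedded in the page inside a `<script type="application/ld+json">` tag. It tells a parser what the page is and who wrote it, and nothing more.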

Those are real outcomes. They're just not the AI citation outcomes agencies are promising.

AI citation mechanism
  • Metadata layer (schema describes context): article type, author and date, entity relationships, rich result eligibility.
  • Retrieval layer (AI cites useful answers): original data, direct answers, topical authority, credible sourcing.
Schema can support retrieval clarity. It does not replace answer quality.

Here's what actually moves the needle for AI visibility:

Schema can support all of this. But it doesn't replace any of it. If you're doing the real work, schema is a useful coat of paint. If you're not, it's lipstick.


My Agency Added Schema but ChatGPT Still Ignores My Site

I hear this constantly from founders who signed on to SEO retainers promising AI citation growth. Let me be direct about how SEO agencies cherry-pick metrics to protect this narrative.

When schema is added and citations don't improve, the agency will show you something that did improve. Rich snippet impressions in Google Search Console. Crawl efficiency. Schema validation scores in Bing Webmaster Tools. These are real metrics, and some of them matter. But they're not AI citations, and the agency knows that.

The Ahrefs data makes this uncomfortable to ignore: schema markup and AI citations have no proven positive correlation. The study tracked nearly 6,000 pages across multiple AI systems over seven months. The result is a flat line with one statistically significant movement, and it went in the wrong direction for AI Overviews.

ChatGPT, Perplexity AI, and Google's AI systems are not using schema as a citation signal. They're using content quality, entity authority, sourcing credibility, and retrieval relevance. If your site is being ignored, the question to ask is: "Is our content genuinely the best available answer to this query?" Not: "Have we added enough schema types?"

If you're a founder burning through runway and someone is telling you that JSON-LD is going to unlock AI visibility, push back. Ask them to show you a controlled study. Ask them what the citation mechanism actually is. The honest answer is that no one has cracked this yet, and anyone who says they have is selling traffic graphs that don't prove results.


Does Structured Data Help Get Mentioned in AI Answers?

Contextually, yes. Causally, not in the way being sold.

Structured data contributes to a well-understood page. A well-understood page can more easily map to knowledge graph nodes, build entity salience, and reduce ambiguity in how AI systems classify the content. Those things may, over time, influence whether a page is in the candidate pool for AI citations at all.

But the Ahrefs study is telling us something specific: adding schema to an existing page does not reliably increase that page's citation rate across the major AI systems. The mechanism doesn't work the way it was theorised to work.

The conditions that actually get pages mentioned in AI answers are harder to package and sell. They look like this: a page answers a specific question with a degree of accuracy and clarity that isn't matched elsewhere in the index. The source has published multiple pieces on the topic, building genuine topical authority. The information is recent, citable, and has been referenced by other pages. The author or publication has a verifiable identity and track record.

None of that is schema. All of that is content strategy.

I've built organic systems for 12 startups that now drive over 2 million monthly visitors and 23,000+ monthly leads. The pattern isn't schema implementation. It's building content that deserves to be referenced and an authority structure that supports it.

That's the lesson of the search volume trap: surface metrics feel like progress while the underlying content problem compounds.


Is Schema Markup Worth It for AI Search in 2026?

Yes. But only with accurate expectations.

Here's how I actually think about it:

Schema value by use case

Use case                           | Schema value | AI citation impact
Rich results: FAQ, HowTo, Product  | High         | None direct
Entity disambiguation              | Medium       | Indirect, long-term
Knowledge graph association        | Medium       | Indirect, long-term
AI citation rate increase          | None proven  | Neutral to negative
Page parsing by Googlebot          | High         | None direct

Schema is infrastructure. It makes your pages machine-readable, which is foundational. Implement it correctly, keep it accurate, and don't inflate it with types that don't match your actual content. The schema inflation problem is real: over-tagged pages can actually erode trust signals rather than build them.
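
One way to check for inflated markup is to list the schema types a page actually declares. The sketch below, in Python with only the standard library, extracts top-level `@type` values from JSON-LD script tags in an HTML string. The sample HTML, the `declared_types` helper, and the review threshold of three types are assumptions for illustration, not a rule from Google or Ahrefs. Nested types and `@graph` arrays are deliberately ignored to keep it short.

```python
import json
from html.parser import HTMLParser

class JsonLdCollector(HTMLParser):
    """Collect the raw text of every <script type="application/ld+json"> block."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self.blocks.append("")

    def handle_endtag(self, tag):
        if tag == "script":
            self._in_jsonld = False

    def handle_data(self, data):
        # Script content may arrive in several chunks, so accumulate it.
        if self._in_jsonld:
            self.blocks[-1] += data

def declared_types(html):
    """Return the sorted top-level @type values declared in JSON-LD blocks."""
    collector = JsonLdCollector()
    collector.feed(html)
    types = set()
    for raw in collector.blocks:
        try:
            parsed = json.loads(raw)
        except json.JSONDecodeError:
            continue  # skip malformed blocks
        items = parsed if isinstance(parsed, list) else [parsed]
        for item in items:
            declared = item.get("@type") if isinstance(item, dict) else None
            if isinstance(declared, str):
                types.add(declared)
            elif isinstance(declared, list):
                types.update(t for t in declared if isinstance(t, str))
    return sorted(types)

# Hypothetical page with two JSON-LD blocks.
sample_html = """
<html><head>
<script type="application/ld+json">{"@context": "https://schema.org", "@type": "Article", "headline": "Example"}</script>
<script type="application/ld+json">{"@context": "https://schema.org", "@type": "FAQPage"}</script>
</head><body><p>Example page</p></body></html>
"""

types = declared_types(sample_html)
MAX_TYPES = 3  # arbitrary review threshold (an assumption, not a standard)
needs_review = len(types) > MAX_TYPES
print(types, needs_review)
```

A count alone doesn't prove a mismatch. Each declared type still has to be compared by hand against what the page visibly contains.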

Where schema earns its keep: product pages with accurate Product and Offer markup, article pages with correct Article schema including author and date, FAQ content that maps directly to real questions users ask. These support rich results, which support click-through rates, which compound over time.

What schema doesn't do: change the content quality calculus that AI systems use when deciding whether to cite you. That decision is downstream of whether your content is genuinely authoritative, original, well-sourced, and clearly written. Schema markup and AI citations remain decoupled at the retrieval layer.


What Actually Makes AI Tools Cite Your Website as a Source

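The honest list. No shortcuts, no magic switches.

  • A page that answers a specific question directly and completely, with accuracy and clarity that isn't matched elsewhere in the index.
  • Original research or data that other pages can cite.
  • Topical authority built by publishing multiple pieces on the same topic.
  • Recent, clearly sourced information that other pages reference, including third-party mentions earned through digital PR.
  • An author or publication with a verifiable identity and track record.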

Schema supports several of these by improving page clarity and entity disambiguation. But it's one input, not the system.


Conclusion

The Ahrefs study of 1,885 pages gives a clear result: adding schema did not produce a meaningful lift in AI citations. Google AI Overviews citations dropped 4.6%. AI Mode and ChatGPT showed no statistically significant movement.

Key takeaways:

Build the page that deserves to be cited. The rest follows.


Frequently Asked Questions

Does adding JSON-LD schema directly increase AI citations?

No, based on current data. The Ahrefs study tracking 1,885 pages found no statistically significant increase in ChatGPT or AI Mode citations after schema was added, and AI Overviews citations dropped 4.6%. Schema helps machines parse pages but doesn't influence the content quality signals AI citation systems prioritise.

What does structured data actually do for SEO?

Structured data improves machine readability, supports rich results in traditional search, aids knowledge graph entity association, and helps Googlebot correctly classify your content. These are real benefits. They're just separate from AI citation rates, which are driven by content quality and authority signals.

Why did my AI Overviews citations drop after adding schema?

The Ahrefs data shows a 4.6% drop in AI Overviews citations across pages that added schema, the only statistically significant result in the study. The exact mechanism isn't confirmed, but over-tagging or mismatched schema types may reduce content trust signals. Accurate, minimal schema outperforms inflated markup.

How do startups actually get cited in ChatGPT and Perplexity?

By publishing original research, answering specific questions directly and completely, building topical authority through content clusters, and earning third-party mentions via digital PR. These content quality signals drive retrieval in large language model systems. Schema supports this work but doesn't substitute for it.

Is my SEO agency overselling schema for AI visibility?

Possibly. If the pitch is "add schema and get cited by AI," ask them to show a controlled study demonstrating that mechanism. The Ahrefs data shows it doesn't work that way. Schema is infrastructure. AI citation comes from content that genuinely earns the reference.

Should I remove schema markup if it's not driving AI citations?

No. Remove inaccurate or inflated schema types that don't match your actual content. Keep schema that correctly describes your pages, supports rich results, and aids entity disambiguation. The issue isn't schema existing. It's schema being sold as an AI citation strategy when the data doesn't support that claim.

Rahul Marthak

Founder, fSEO & sneo.ai

Rahul Marthak is a pioneering SEO strategist with over seven years of experience in transforming startups into revenue-generating powerhouses. As the founder of fSEO, a cutting-edge fractional SEO service, he specializes in implementing innovative, new-age SEO strategies that elevate organic visibility across both search engines and LLM citations. Rahul's expertise has been instrumental in driving over 2 million monthly visitors and generating more than 23,000 leads per month for his clients. Additionally, he is the visionary behind sneo.ai, a groundbreaking SEO software that empowers users to make data-driven decisions with unprecedented speed and efficiency. With a proven track record of scaling 12 startups, Rahul Marthak is not just an SEO expert; he is a catalyst for growth and a thought leader in the digital marketing arena.
