How do startups get cited in ChatGPT and Perplexity?

By publishing original research, answering specific questions directly, building topical authority through content clusters, and earning third-party mentions through digital PR.

Should I remove schema markup if it is not driving AI citations?

No. Remove inaccurate or inflated schema types that do not match your content. Keep schema that correctly describes your pages, supports rich results, and aids entity disambiguation.

Schema Markup and AI Citations: What the Data Actually Shows (2026)

Q: Does adding JSON-LD schema directly increase AI citations?

No, based on current data. An Ahrefs study tracking 1,885 pages found no statistically significant increase in ChatGPT or AI Mode citations after schema was added, and AI Overviews citations dropped 4.6%.

Q: Why did AI Overviews citations drop after adding schema?

The exact mechanism is not confirmed, but over-tagging or mismatched schema types may reduce content trust signals. Accurate, minimal schema is safer than inflated markup.

Q: Is my SEO agency overselling schema for AI visibility?

Possibly. If the pitch is simply add schema and get cited by AI, ask for a controlled study and the citation mechanism. Current data does not support that claim.

Every SEO agency I've spoken with is selling the same thing right now: "Add schema, get cited by AI." It sounds clean. It sounds like a system. And if you're a founder who just got burned by $5K/month PDF reports, it's exactly what you want to hear. But the relationship between schema markup and AI citations is far more complicated than any agency pitch deck will admit.

TL;DR

A real Ahrefs study across 1,885 pages found schema did not meaningfully increase AI citations in Google AI Overviews, AI Mode, or ChatGPT.
Google AI Overviews citations actually dropped 4.6% after schema was added.
Schema still has value, but it doesn't make a page worth citing.
AI visibility is earned through content quality, topical authority, and sourcing.

Does Schema Markup Actually Help With AI Citations?

Let me give you the honest answer before someone else sells you the comfortable one.

Ahrefs tracked 1,885 pages that added JSON-LD schema against 4,000 matched control pages, monitoring changes between August 2025 and March 2026. The results on schema markup and AI citations were not what most agencies are telling you:

Google AI Overviews: down 4.6% and statistically significant
Google AI Mode: up 2.4% and not statistically significant
ChatGPT: up 2.2% and not statistically significant

Ahrefs schema citation study

Google AI Overviews

-4.6%

Statistically significant drop after schema was added.

Google AI Mode

+2.4%

Not statistically significant. Effect indistinguishable from noise.

ChatGPT

+2.2%

Not statistically significant. No proven citation lift.

The only statistically significant movement went in the wrong direction.

The only movement that actually cleared the statistical significance bar was the drop in AI Overviews. The gains in AI Mode and ChatGPT were indistinguishable from noise.

This doesn't mean structured data is worthless. It means schema is not the lever your agency claims it is for AI citation rates. Schema helps Googlebot parse your pages, supports rich results in traditional search, and can improve how your content maps to a knowledge graph. Those are real, secondary benefits.

But large language models like the ones powering Google's AI Overviews, ChatGPT, and Perplexity AI don't pull citations because a page has FAQPage or HowTo schema attached to it. They pull citations because the content answers the question better than anything else available, is clearly sourced, and demonstrates genuine expertise.

The distinction matters enormously for where you spend your time. If you're an early-stage startup with limited engineering bandwidth and a fractional content budget, adding schema is a one-time task. Building content that deserves to be cited is ongoing work, and it's the only work that actually compounds.

Schema describes the page. It doesn't make the page worth referencing. That's not a subtle point. That's the entire point.

Why Isn't My Schema Markup Getting Cited by AI?

Because citation in AI answers is an information retrieval problem, not a metadata problem.

When retrieval-augmented generation systems pull sources, they're running semantic search against indexed content. They're asking: "Which document best satisfies this query with the most credible, specific, clearly structured answer?" Named entity recognition and natural language processing are doing the heavy lifting. Your JSON-LD is not part of that retrieval calculation.

Here's what the structured data inflation problem looks like in practice: agencies add schema types that don't match the actual content, inflate markup to cover as many entity types as possible, and present that activity as AI optimisation. When citations don't follow, they blame the algorithm.

What's actually missing is almost always one of these:

No clear, direct answer: The page talks around the topic instead of answering it in the first 200 words.
Weak sourcing: Claims are unattributed, generic, or lack primary data.
Zero original information: The page is a synthesis of other syntheses. There's nothing here that doesn't already exist elsewhere.
Thin topical authority: One article on a topic doesn't establish authority. A cluster of deeply interconnected content does.
Poor entity salience: The page doesn't clearly signal what the content is about through consistent use of semantic entities.

Schema can improve how search engines understand entity relationships on your page. But if the underlying ontology of your content is thin, no amount of markup changes that. You're decorating an empty room.

Schema Markup for AI Visibility: What Actually Works

Schema has a legitimate role. I'm not here to tell you to strip it all out. What I'm telling you is to understand what it actually does.

Structured data communicates page context to machines. It tells Googlebot that this page is a product, this one is a recipe, this one is an article with a named author and a publish date. That disambiguation supports E-E-A-T signals, helps with rich results in traditional SERP features, and aids knowledge graph entity association.

Those are real outcomes. They're just not the AI citation outcomes agencies are promising.

AI citation mechanism

Metadata layer

Schema describes context

Article typeAuthor and dateEntity relationshipsRich result eligibility

Retrieval layer

AI cites useful answers

Original dataDirect answersTopical authorityCredible sourcing

Schema can support retrieval clarity. It does not replace answer quality.

Here's what actually moves the needle for AI visibility:

Clear, direct answers in the body of the page: AI systems reward pages that answer questions without burying the answer under three paragraphs of scene-setting. Get to it in the first 100 words.
Original data and primary sourcing: Perplexity AI and ChatGPT's browsing systems actively prefer pages that cite primary research, publish original studies, or include expert quotes that can't be found elsewhere.
Digital PR and third-party citations: Being mentioned in authoritative publications builds the kind of off-page credibility that actually influences AI citation patterns.
Topical authority over breadth: Fifty shallow articles won't outperform five deeply researched ones. Build clusters, not catalogues.
Page structure that supports information retrieval: Use headers as genuine navigational landmarks. Write answers as answers, not essays. This is what semantic search actually rewards.

Schema can support all of this. But it doesn't replace any of it. If you're doing the real work, schema is a useful coat of paint. If you're not, it's lipstick.

My Agency Added Schema but ChatGPT Still Ignores My Site

I hear this constantly from founders who signed on to SEO retainers promising AI citation growth. Let me be direct about how SEO agencies cherry-pick metrics to protect this narrative.

When schema is added and citations don't improve, the agency will show you something that did improve. Rich snippet impressions in Google Search Console. Crawl efficiency. Schema validation scores in Bing Webmaster Tools. These are real metrics, and some of them matter. But they're not AI citations, and the agency knows that.

The Ahrefs data makes this uncomfortable to ignore: schema markup and AI citations have no proven positive correlation. The study tracked nearly 6,000 pages across multiple AI systems over seven months. The result is a flat line with one statistically significant movement, and it went in the wrong direction for AI Overviews.

ChatGPT, Perplexity AI, and Google's AI systems are not using schema as a citation signal. They're using content quality, entity authority, sourcing credibility, and retrieval relevance. If your site is being ignored, the question to ask is: "Is our content genuinely the best available answer to this query?" Not: "Have we added enough schema types?"

If you're a founder burning through runway and someone is telling you that JSON-LD is going to unlock AI visibility, push back. Ask them to show you a controlled study. Ask them what the citation mechanism actually is. The honest answer is that no one has cracked this yet, and anyone who says they have is selling traffic graphs that don't prove results.

Does Structured Data Help Get Mentioned in AI Answers?

Contextually, yes. Causally, not in the way being sold.

Structured data contributes to a well-understood page. A well-understood page can more easily map to knowledge graph nodes, build entity salience, and reduce ambiguity in how AI systems classify the content. Those things may, over time, influence whether a page is in the candidate pool for AI citations at all.

But the Ahrefs study is telling us something specific: adding schema to an existing page does not reliably increase that page's citation rate across the major AI systems. The mechanism doesn't work the way it was theorised to work.

The conditions that actually get pages mentioned in AI answers are harder to package and sell. They look like this: a page answers a specific question with a degree of accuracy and clarity that isn't matched elsewhere in the index. The source has published multiple pieces on the topic, building genuine topical authority. The information is recent, citable, and has been referenced by other pages. The author or publication has a verifiable identity and track record.

None of that is schema. All of that is content strategy.

I've built organic systems for 12 startups that now drive over 2 million in monthly traffic and 23,000+ monthly leads. The pattern isn't schema implementation. It's building content that deserves to be referenced and an authority structure that supports it.

That's what the search volume trap teaches you to ignore: surface metrics feel like progress while the underlying content problem compounds.

Is Schema Markup Worth It for AI Search in 2026?

Yes. But only with accurate expectations.

Here's how I actually think about it:

Schema value by use case

Use case	Schema value	AI citation impact
Rich results: FAQ, HowTo, Product	High	None direct
Entity disambiguation	Medium	Indirect, long-term
Knowledge graph association	Medium	Indirect, long-term
AI citation rate increase	None proven	Neutral to negative
Page parsing by Googlebot	High	None direct

Schema is infrastructure. It makes your pages machine-readable, which is foundational. Implement it correctly, keep it accurate, and don't inflate it with types that don't match your actual content. The schema inflation problem is real: over-tagged pages can actually erode trust signals rather than build them.

Where schema earns its keep: product pages with accurate Product and Offer markup, article pages with correct Article schema including author and date, FAQ content that maps directly to real questions users ask. These support rich results, which support click-through rates, which compound over time.

What schema doesn't do: change the content quality calculus that AI systems use when deciding whether to cite you. That decision is downstream of whether your content is genuinely authoritative, original, well-sourced, and clearly written. Schema markup and AI citations remain decoupled at the retrieval layer.

What Actually Makes AI Tools Cite Your Website as a Source

The honest list. No shortcuts, no magic switches.

Answer the question directly and completely: AI systems reward content that provides the full answer within the page, not scattered across internal links or buried under introductory padding.
Publish original research and data: First-party data is the highest-value citation signal available. One original study outperforms fifty curated roundups.
Build topical depth, not breadth: A site that has published 20 deeply researched articles on one topic will outperform a site with 200 shallow articles across many. SERP feature cannibalization is what happens when you try to be everywhere without depth.
Earn third-party mentions: Digital PR, expert quotes in major publications, and backlinks from authoritative domains all signal that your content is worth referencing.
Use E-E-A-T signals explicitly: Name the author, show credentials, link to primary sources, and publish dates visibly. These are trust signals AI systems can parse.
Write for retrieval, not just ranking: Clear subheadings, direct answers in the first paragraph of each section, and consistent use of the target semantic entities all improve how your pages perform in information retrieval contexts.

Schema supports several of these by improving page clarity and entity disambiguation. But it's one input, not the system.

Conclusion

The Ahrefs study across 1,885 pages is a clean result: schema markup and AI citations are not causally linked in any meaningful way. Google AI Overviews citations dropped 4.6%. AI Mode and ChatGPT showed no significant movement.

Key takeaways:

Schema is useful infrastructure, not an AI citation lever.
The gains agencies are promising don't have data behind them.
AI visibility is earned through content quality, original data, topical authority, and strong sourcing.
If your agency is selling schema as an AI citation strategy, ask for the mechanism.

Build the page that deserves to be cited. The rest follows.

Frequently Asked Questions

Does adding JSON-LD schema directly increase AI citations?

No, based on current data. The Ahrefs study tracking 1,885 pages found no statistically significant increase in ChatGPT or AI Mode citations after schema was added, and AI Overviews citations dropped 4.6%. Schema helps machines parse pages but doesn't influence the content quality signals AI citation systems prioritise.

What does structured data actually do for SEO?

Structured data improves machine readability, supports rich results in traditional search, aids knowledge graph entity association, and helps Googlebot correctly classify your content. These are real benefits. They're just separate from AI citation rates, which are driven by content quality and authority signals.

Why did my AI Overviews citations drop after adding schema?

The Ahrefs data shows a 4.6% drop in AI Overviews citations across pages that added schema, the only statistically significant result in the study. The exact mechanism isn't confirmed, but over-tagging or mismatched schema types may reduce content trust signals. Accurate, minimal schema outperforms inflated markup.

How do startups actually get cited in ChatGPT and Perplexity?

By publishing original research, answering specific questions directly and completely, building topical authority through content clusters, and earning third-party mentions via digital PR. These content quality signals drive retrieval in large language model systems. Schema supports this work but doesn't substitute for it.

Is my SEO agency overselling schema for AI visibility?

Possibly. If the pitch is "add schema and get cited by AI," ask them to show a controlled study demonstrating that mechanism. The Ahrefs data shows it doesn't work that way. Schema is infrastructure. AI citation comes from content that genuinely earns the reference.

Should I remove schema markup if it's not driving AI citations?

No. Remove inaccurate or inflated schema types that don't match your actual content. Keep schema that correctly describes your pages, supports rich results, and aids entity disambiguation. The issue isn't schema existing. It's schema being sold as an AI citation strategy when the data doesn't support that claim.