What is schema markup?
Schema markup is structured data added to a page in a format that machines can read. The most common implementation is JSON-LD - a small block of JSON inside a script tag in the page head.
For traditional SEO, schema enables rich results in Google: star ratings, FAQ accordions, product cards. For GEO, schema does something different. It gives AI systems a clean, unambiguous answer to questions like "what is this company called?", "what does this product do?" and "who wrote this article?". When the structured answer matches the on-page content, AI tools are far more likely to cite it confidently.
Does schema markup actually help AI cite you?
Yes, but only for a narrow set of schema types and only when the content on the page genuinely matches the structured data. Schema is not a magic wand. It is a clarification layer.
The schema types that move the needle for GEO answer three questions an AI system needs settled before it cites you: who are you, what do you sell, and what does this specific page assert? Everything else is noise.
Which schema types should every B2B SaaS implement?
If you only do three things, do these:
Organization schema on your homepage and About page. This is the entity record for your company. Include name, URL, logo, description and a sameAs array linking to your verified profiles on LinkedIn, G2, Capterra, GitHub, X and any other authoritative platform. The sameAs property is one of the most direct signals AI systems use to disambiguate you from companies with similar names.
{
"@context": "https://schema.org",
"@type": "Organization",
"name": "Acme Analytics",
"url": "https://acmeanalytics.com",
"logo": "https://acmeanalytics.com/logo.png",
"description": "Product analytics for B2B SaaS teams",
"sameAs": [
"https://www.linkedin.com/company/acme-analytics",
"https://twitter.com/acmeanalytics",
"https://www.g2.com/products/acme-analytics"
]
}Product or SoftwareApplication schema on your product and pricing pages. This tells AI what you actually sell, what category you compete in and how you're priced. For SaaS, SoftwareApplication is usually the more specific choice.
FAQPage schema on any page with question-and-answer content. This is the schema type with the most direct GEO payoff because the question / answer format is exactly what AI systems extract from. Only mark up genuine FAQs that appear on the visible page. Fake FAQs added only for schema will eventually get you penalised.
{
"@context": "https://schema.org",
"@type": "FAQPage",
"mainEntity": [{
"@type": "Question",
"name": "What does Acme Analytics integrate with?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Acme Analytics has native integrations with Segment, Snowflake, BigQuery, HubSpot and Salesforce."
}
}]
}What about Article schema?
Every blog post and guide on your site should have Article or BlogPostingschema. The fields that matter most for GEO are headline, author, datePublished and publisher. Author signals are especially important - AI systems weigh author authority when deciding whether to cite a piece of content.
Link the author to a real Person schema with a URL pointing to their author page on your site, and include relevant credentials in their bio. Ghost-written articles attributed to a generic "team" account are weaker for both SEO and GEO.
Which schema types are a waste of time?
HowTo schema had its moment around 2019-2021. Google has since deprecated rich results for most HowTo content, and AI tools don't preferentially cite it. If your content is instructional, structure the page well and let the prose do the work.
Speakable schema was built for voice assistants that never gained traction in B2B research. Current AI tools don't use it. Skip.
Standalone ImageObject schema rarely improves citations. Use descriptive alt text, meaningful filenames and surrounding context instead.
How do I know if my schema is actually working?
Validate every schema block in two places:
- Google's Rich Results Test tells you whether your markup is valid and which rich results it's eligible for.
- Schema.org's validator catches structural issues the Google tool misses, especially nested types.
Schema that doesn't validate is worse than no schema at all. AI systems will quietly ignore malformed blocks and you'll have wasted the effort.
What about llms.txt?
llms.txt is a proposed convention for telling AI crawlers which pages on your site you'd most like them to use. It is not schema markup, it is not currently honoured by the major AI tools, and it is no substitute for getting your on-page schema right. Add one if you want - it costs nothing - but don't expect it to move citations on its own.
What is the minimum viable schema setup?
If you do nothing else, ship these four blocks this week:
- Organization schema on your homepage, with a thorough
sameAsarray. - SoftwareApplication schema on your main product page.
- FAQPage schema on at least one page targeting a high-intent question.
- Article schema on every blog post, with a real author.
That gets you 80% of the GEO value of schema with about 20% of the effort. Everything beyond this is fine-tuning.
