[Technical Foundation]

Schema markup that actually helps AI cite you

Category: Technical Foundation · Reading time: 8 min

Schema markup is one of the most over-hyped and most misunderstood parts of GEO. Most schema types do nothing for AI citations. A small handful do most of the work. This guide shows you which schema types to implement, which to skip, and how to add them correctly.

Clean desk scene with a laptop showing structured data, representing schema markup for AI

What is schema markup?

Schema markup is structured data added to a page in a format that machines can read. The most common implementation is JSON-LD - a small block of JSON inside a script tag in the page head.

For traditional SEO, schema enables rich results in Google: star ratings, FAQ accordions, product cards. For GEO, schema does something different. It gives AI systems a clean, unambiguous answer to questions like "what is this company called?", "what does this product do?" and "who wrote this article?". When the structured answer matches the on-page content, AI tools are far more likely to cite it confidently.

Does schema markup actually help AI cite you?

Yes, but only for a narrow set of schema types and only when the content on the page genuinely matches the structured data. Schema is not a magic wand. It is a clarification layer.

The schema types that move the needle for GEO answer three questions an AI system needs settled before it cites you: who are you, what do you sell, and what does this specific page assert? Everything else is noise.

Schema type
Verdict
Why
Organization
Use
Anchors who you are. Foundational for entity recognition.
Product / SoftwareApplication
Use
Tells AI what you actually sell, plus pricing and category.
FAQPage
Use
Question / answer pairs are the format AI most readily extracts.
Article / BlogPosting
Use
Author, publish date and headline help AI weigh freshness and authority.
BreadcrumbList
Use
Helps AI understand site structure and the topic hierarchy of a page.
Review / AggregateRating
Use
Useful when ratings are real and verifiable. Misuse triggers Google penalties.
HowTo
Skip
Google has deprecated rich results for most HowTo content. Low GEO value.
Speakable
Skip
Built for voice assistants that never took off. Not used by current AI tools.
ImageObject (standalone)
Skip
Rarely cited. Use alt text and surrounding copy instead.
Schema markup scorecard for GEO

Which schema types should every B2B SaaS implement?

If you only do three things, do these:

Organization schema on your homepage and About page. This is the entity record for your company. Include name, URL, logo, description and a sameAs array linking to your verified profiles on LinkedIn, G2, Capterra, GitHub, X and any other authoritative platform. The sameAs property is one of the most direct signals AI systems use to disambiguate you from companies with similar names.

Organization schema example
{
  "@context": "https://schema.org",
  "@type": "Organization",
  "name": "Acme Analytics",
  "url": "https://acmeanalytics.com",
  "logo": "https://acmeanalytics.com/logo.png",
  "description": "Product analytics for B2B SaaS teams",
  "sameAs": [
    "https://www.linkedin.com/company/acme-analytics",
    "https://twitter.com/acmeanalytics",
    "https://www.g2.com/products/acme-analytics"
  ]
}

Product or SoftwareApplication schema on your product and pricing pages. This tells AI what you actually sell, what category you compete in and how you're priced. For SaaS, SoftwareApplication is usually the more specific choice.

FAQPage schema on any page with question-and-answer content. This is the schema type with the most direct GEO payoff because the question / answer format is exactly what AI systems extract from. Only mark up genuine FAQs that appear on the visible page. Fake FAQs added only for schema will eventually get you penalised.

FAQPage schema example
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [{
    "@type": "Question",
    "name": "What does Acme Analytics integrate with?",
    "acceptedAnswer": {
      "@type": "Answer",
      "text": "Acme Analytics has native integrations with Segment, Snowflake, BigQuery, HubSpot and Salesforce."
    }
  }]
}

What about Article schema?

Every blog post and guide on your site should have Article or BlogPostingschema. The fields that matter most for GEO are headline, author, datePublished and publisher. Author signals are especially important - AI systems weigh author authority when deciding whether to cite a piece of content.

Link the author to a real Person schema with a URL pointing to their author page on your site, and include relevant credentials in their bio. Ghost-written articles attributed to a generic "team" account are weaker for both SEO and GEO.

Which schema types are a waste of time?

HowTo schema had its moment around 2019-2021. Google has since deprecated rich results for most HowTo content, and AI tools don't preferentially cite it. If your content is instructional, structure the page well and let the prose do the work.

Speakable schema was built for voice assistants that never gained traction in B2B research. Current AI tools don't use it. Skip.

Standalone ImageObject schema rarely improves citations. Use descriptive alt text, meaningful filenames and surrounding context instead.

How do I know if my schema is actually working?

Validate every schema block in two places:

  1. Google's Rich Results Test tells you whether your markup is valid and which rich results it's eligible for.
  2. Schema.org's validator catches structural issues the Google tool misses, especially nested types.

Schema that doesn't validate is worse than no schema at all. AI systems will quietly ignore malformed blocks and you'll have wasted the effort.

What about llms.txt?

llms.txt is a proposed convention for telling AI crawlers which pages on your site you'd most like them to use. It is not schema markup, it is not currently honoured by the major AI tools, and it is no substitute for getting your on-page schema right. Add one if you want - it costs nothing - but don't expect it to move citations on its own.

What is the minimum viable schema setup?

If you do nothing else, ship these four blocks this week:

  1. Organization schema on your homepage, with a thorough sameAs array.
  2. SoftwareApplication schema on your main product page.
  3. FAQPage schema on at least one page targeting a high-intent question.
  4. Article schema on every blog post, with a real author.

That gets you 80% of the GEO value of schema with about 20% of the effort. Everything beyond this is fine-tuning.