What is a technical GEO audit?
A technical GEO audit is a structured inspection of the parts of your site AI tools touch: how crawlers fetch it, how the HTML is rendered, how content is structured, and what machine-readable signals (schema, metadata, llms.txt) are attached. It is not a content audit and it is not a brand audit - it answers a single question: can an AI crawler arrive, read, and extract your facts cleanly?
How do I run a GEO audit, step by step?
- Crawl your site. Run Screaming Frog or Sitebulb across your full URL set. Export the report.
- Check raw HTML. For your homepage, top 3 product pages and pricing page, view-source or
curl -A "GPTBot" <url>. Confirm content is in the source, not injected by JS. - Validate schema. Paste each key URL into Google's Rich Results Test and the Schema.org validator. Note what's missing or invalid.
- Check robots.txt and sitemap. Verify GPTBot, ClaudeBot, PerplexityBot and Google-Extended aren't blocked, and that your sitemap is current.
- Run an AI legibility test. Paste each key URL into ChatGPT, Claude and Perplexity. Ask: "what does this page say, and what's missing?" Capture the gaps.
- Check Core Web Vitals. Run PageSpeed Insights on the same key URLs.
- Score and prioritise. Sort every finding into "fix this week / fix this month / nice to have" using the framework below.
Budget 3-4 hours for a first audit on a typical SaaS site. The first one is the slowest - subsequent ones run in under an hour once you have a template.
What should I actually check?
Eight areas, in roughly the order they matter for AI citations.
If any single row in this table is failing on your homepage or core product pages, fix it before doing anything else - including content work. A broken foundation invalidates every on-site and off-site move you make later.
Which tools should I use?
You can run a full first audit with free tools. The paid options become useful when you want ongoing monitoring or you're auditing 1000+ URLs.
Can AI help me run the audit itself?
Yes - and this is the part of GEO auditing that's changed most in the last twelve months. ChatGPT, Claude and Perplexity are now the fastest way to test how AI actually reads your site. Five prompts worth running on any key URL:
- "Summarise the content of this page in three sentences." - tests basic legibility.
- "What category of product is this, and who is it for?" - tests entity and positioning clarity.
- "List every feature mentioned on this page." - exposes whether features are extractable or buried in marketing copy.
- "How much does this product cost?" - exposes pricing pages hidden behind tabs or JS.
- "Who founded this company and where are they based?" - tests Organization schema and About-page signal.
If the answers are wrong, vague or include "I couldn't find that on the page", you have a specific, fixable problem - usually a rendering, structure or schema issue. Paste the failing output into ChatGPT or Claude and ask: "what change to the page would have let you answer correctly?" The response is often a usable rewrite brief.
You can also paste your raw HTML or rendered DOM into Claude and ask it to flag missing schema, weak headings or ambiguous entity references. It's the closest thing to an on-demand technical reviewer.
How do I prioritise the fixes?
Sort every finding into one of three tiers. Ship tier one before you touch tiers two or three.
- AI bots blocked in robots.txt (GPTBot, ClaudeBot, PerplexityBot, Google-Extended).
- Key pages rendered client-side only, returning an empty HTML shell.
- Missing or broken Organization schema on the homepage.
- No sitemap, or a sitemap returning 404s and old URLs.
- Thin or duplicate H1s, no semantic structure on key landing pages.
- FAQ and pricing content trapped inside accordions or tabs that hide it from the HTML.
- Missing FAQPage, Product or Article schema on pages that warrant it.
- Slow TTFB or heavy JS that blocks the first render.
- Publishing a llms.txt file pointing AI tools at your best pages.
- Adding speakable schema and breadcrumbs.
- Consolidating near-duplicate URLs with canonicals.
- Tidying internal linking so canonical pages get the most internal mentions.
What's the single most important fix?
Almost always: making sure the raw HTML response contains your content. A surprising number of B2B SaaS sites still render homepage and pricing copy entirely client-side, which means AI crawlers that don't execute JavaScript - and most don't - see an empty shell. Fix that one issue and you've already lifted the ceiling on every other GEO move.
Second-most-important: a valid Organization schema block on your homepage. It's the single biggest entity-recognition signal you can ship in an afternoon, and it underpins every branded prompt buyers run in AI tools.
How often should I re-audit?
Full audit once a quarter. A lighter pass (raw HTML check, schema validation, AI legibility prompts on your top 5 pages) every month. Re-run the audit immediately after any major site redesign or framework migration - those are the moments rendering and schema break.
For the layers that sit on top of the audit, see schema markup that actually helps AI cite you and how to make your SaaS site crawlable for AI.
