How to Run an AI Readiness Assessment on Your Website: A Practical Audit for Marketing Teams
The pressure to respond to AI has led many marketing teams to add generative tools to the content workflow, automate a few processes, and call it progress. That’s useful work, but it doesn’t answer a more fundamental question about the website itself: whether AI tools like ChatGPT, Perplexity, and Google’s AI Overviews can find your content, understand it, and cite it back to the people asking them questions.
Creating content with AI is a production question. Being findable by AI is a structural question about how the site is built, and it’s the one that tends to get skipped. An honest AI readiness assessment is meant to surface that gap. What follows is a practical framework built around three pillars: schema markup, content structure, and internal linking.

The Biggest Myth About AI Readiness
The most common misconception about AI readiness is that it’s about the authoring experience. Teams invest in content generation tools, translation workflows, and editorial automation, all of which help the people inside the organization. What often gets overlooked is the external-facing side: whether the content those tools produce can be extracted and cited by AI systems once it’s published.
That’s a structural problem, and it shows up in specific places. More than half of all websites have no structured data at all, which means AI tools have to guess at what a page is about instead of reading it directly. High-value content often lives inside a single rich-text field with no labeled pieces for a model to pull from. Internal linking between related pages is weak or inconsistent, so the connections between a program, the classes inside it, and the people who teach it are invisible to anything trying to map them.
Thankfully, none of these are exotic problems. They’re the kind of thing a basic audit surfaces quickly, which is where the three pillars come in.
Audit Your Schema Markup First
Schema markup tells AI what your content means rather than just what it says. It’s the difference between a page that reads “Dr. Sarah Chen, Director of Admissions” and a page that explicitly labels that string as a Person with a job title, an organization, and a verified profile link. The first version leaves the model guessing. The second version hands it a fact it can cite with confidence.
Without schema, AI systems fill in the blanks probabilistically, and they often skip pages that are harder to parse in favour of sources that made the work easier. Google itself recommends structured data for AI search, which is about as clear a signal as the industry gets. Schema is also one of the cheapest items on this list to implement, and a basic rollout on the pages that matter most can usually be done inside a single sprint.
How to Check If Your Site Has Any Schema at All
You don’t need a specialist to run this check. Three tools handle most of what you need:
- Screaming Frog for a full crawl that shows every page and flags whether it contains structured data
- Google’s Rich Results Test for a quick single-page check
- org validator for detailed debugging when something looks off
Run the crawl, then spot-check the pages that carry the most weight: the homepage, the top service or program pages, and the highest-traffic articles. If nothing comes back, that’s the answer, and it’s a common one.
The Minimum Schema Every Site Should Have
For a basic implementation that makes a site legible to AI, four schema types do most of the work:

Author attribution is worth taking seriously here. AI models use signals like credentials, linked profiles, and connections to professional associations when deciding whether a source is safe to cite, which is why a practical guide to schema markup treats Person markup as part of the foundation rather than a finishing touch.
Check Whether Your Content Is Structured or Trapped
The second pillar is what’s sometimes called the blob body field problem, and it shows up in almost every content audit. Open a high-value page inside the CMS to see how it’s built, and the entire thing, from the hero headline down to the final call to action, lives inside one rich-text field. To an AI model trying to parse the page, the whole thing reads as one undifferentiated paragraph of HTML.
When all page content lives inside a single rich-text field, AI systems struggle to identify relationships between pieces of information like pricing, outcomes, instructors, or prerequisites. Structured content solves this by separating content into labeled fields that can be independently understood, reused, and surfaced across channels.

What Structured Content Looks Like
Structured content splits a page into labeled fields that each hold one meaningful piece of information. A program page, for instance, might have separate fields for title, summary, duration, cost, outcomes, prerequisites, and instructor. Each field maps to something an AI tool can identify and pull independently, so the content can adapt to new contexts without being rewritten, which is what makes structured content stay flexible across channels.
The AI benefit is the obvious one, but the operational benefits tend to be even larger in practice. Editors work faster because they’re filling in labeled fields instead of wrestling with formatting inside a wall of text. Pages stay visually consistent because the template handles the heavy lifting. And the same content becomes reusable across contexts, so one program summary can feed the program page, a comparison table, a search result card, and a syndication feed without anyone rewriting anything.
How to Spot Unstructured Pages on Your Site
The fastest way to run this check is to open your top five traffic pages inside the CMS and look at the edit screen. A single WYSIWYG field containing everything is a blob. Multiple labeled fields with specific purposes are structured content. Counting how many of your highest-value pages fall into each category is a reasonable first pass at an AI website audit, and the results usually point directly to the next piece of work.
Look at How Your Pages Connect to Each Other
The third pillar is internal linking, and it matters more for AI than it did for traditional SEO. AI tools build their understanding of a brand by crawling the relationships between pages, so a strong internal link structure teaches the model that an admissions page connects to programs, programs connect to faculty, and faculty connect to student outcomes. A weak one leaves the model looking at a collection of loosely related pages without a clear story tying them together.
Large enterprise sites often suffer from fragmented taxonomies, inconsistent tagging, and broken internal links, making it difficult for crawlers to understand how content relates.
The cumulative effect is a site that reads as a confusing mess to any crawler trying to make sense of it, whether that crawler belongs to Google or to an AI assistant.
How to Audit Your Entity Relationships
Start with the most important entity on the site, whether that’s a flagship program, a headline service, or a core product offering, and map out every page that should connect to it. Then check whether those connections exist in the form of real internal links, and whether the anchor text describes the relationship in natural language rather than generic phrases like “click here” or “learn more.”
Let’s use an example from higher education.
A graduate program page should link to the individual courses inside it, each course should link to the instructor who teaches it, and each instructor’s bio should link back to the program and to any relevant student outcomes. That web of connections is the beginning of a knowledge graph, and it’s what gives AI systems the context to explain your organization at the level of relationships rather than individual pages.
Drupal makes this kind of work considerably easier once a content model supports how to connect related content through entity references.
Build a Phased Roadmap You Can Realistically Follow
The temptation when tackling something like this is to try everything at once, which usually means nothing gets finished. A phased plan works better, and for teams that want to know how to make a website discoverable by AI without overwhelming the roadmap, breaking the work into time horizons is the most reliable approach.
First 30 days: quick wins
- Implement basic schema on the homepage and top ten pages
- Audit the three highest-intent pages for the blob body field problem
- Set up GA4 tracking for AI referral traffic so there’s a baseline to measure against
Next 90 days: structural improvements
- Restructure content models for the most important content types
- Build internal linking between key entities on priority pages
- Expand schema coverage across article, product, or program pages
Longer-term: capacity building
- Map out full knowledge graph relationships for the most critical entities
- Make CMS-level improvements that keep ongoing audits easy for editors to run
- Treat the CMS like a product, with a steady loop of publish, measure, and improve, rather than a publishing tool that ships once and gets left alone
How Drupal Makes This Work Out of the Box
Most of this audit applies regardless of the CMS underneath the site, and the principles translate to any platform that takes content seriously. Drupal happens to be built in a way that makes each piece of the audit easier to act on, which is part of why it holds up well for complex, content-heavy projects.
Drupal supports many of the structural requirements behind AI readiness out of the box. Structured fields, entity references, and Schema.org integrations make it easier to organize content in ways AI systems can understand and connect.
Modules like Schema Metatag help automate structured data implementation, while Drupal’s content modeling capabilities make internal relationships between programs, people, services, and content easier to maintain over time.
Where to Go From Here
A good audit comes back to three questions:
- Does the site use schema markup, and does it cover the pages that matter?
- Is the content structured into labeled fields, or trapped in a single body field?
- Does the internal linking teach AI tools how the key entities on the site connect to each other?
Honest answers to those three questions point directly to the next piece of work. For a deeper walkthrough, the ImageX team recently ran a Search Engine Journal webinar called Drupal AI Audit: How CMOs Can Assess Whether Their Stack Is Built for What’s Coming, covering citation readiness, content structure, and a technical roadmap for Drupal AI. View now at your own pace.
