seojuice

AI Agent Readiness Checker

Run a full scan for discovery, crawl control, markdown, MCP, OAuth, and agent-facing standards. Then keep the score, the evidence, and the before-versus-after history.

No signup required 5 free scans / IP / day Public share URL Same engine SEOJuice runs continuously

What it means

What 'agent-ready' actually means

A site that scores fine on a traditional SEO audit can still be invisible to AI agents. Different reading patterns expose different gaps.

A search engine crawler and an AI agent read your site for different reasons. Googlebot wants to rank a page in a results list. ChatGPT browse, Perplexity, Claude's web tools, and Gemini want to answer a question — often by quoting you, citing you, or completing a task on a user's behalf. Those reading patterns expose different gaps.

This tool checks the four things that determine whether an AI agent can find, read, trust, and act on your site. Discoverability covers robots.txt, sitemap.xml, and the /.well-known/ entries agents now look for first. AI readability is whether your HTML is clean enough for a language model to parse without JavaScript, whether you publish a markdown alternate, and whether your structured data maps to your visible content. Policy and identity tells agents which bots you welcome and how your content can be used. Action surfaces is whether agents can do anything beyond reading: an MCP server, an API catalog, schema.org actions, an OAuth-protected programmatic surface.

We grade each category independently and combine them into a 0–100 readiness score, then map that to a 0–5 readiness level so the next move is clear. Most sites land at Level 1 or Level 2 on first scan, regardless of brand size — agent-readiness is genuinely new ground.

How it works

The four categories we score

Each weighted by how much it affects whether AI agents can discover, read, and act on your site.

  1. 1 20%

    Discoverability

    Checks whether agents can find your site-level instructions and machine-readable entry points quickly.

  2. 2 15%

    Content Accessibility

    Looks at whether agents can consume your pages in a cleaner representation than raw rendered HTML.

  3. 3 20%

    Bot Access Control

    Measures how clearly you express AI crawler policy and downstream usage preferences.

  4. 4 45%

    API, Auth, MCP & Skill Discovery

    Measures whether an agent can discover actual machine interfaces instead of stopping at static content.

Who uses this

Three patterns where the agent-ready scan earns its keep

Agency account managers

Run a public scan before a sales call, screenshot the score, drop the share URL into the proposal. Treats agent-readiness as a measurable retainer line item.

In-house SEO & PMs

Use the score history to prove that bot-infrastructure work moved the needle. Re-scan after each shipped fix and watch the level climb.

Founders & technical PMs

Get a one-page snapshot of where AI agents currently rank you, with a prioritized fix list and an honest assessment of how big each fix is.

Reading your report

Six panels per scan, here's what each one is for

1

Overall score & level

0–100 score with a 0–5 level mapping. Level is the number you reference when discussing progress.

2

Category breakdown

Per-category bars. A red category is where the next sprint should focus, regardless of overall score.

3

Top recommended fixes

Click-to-expand. Each fix shows what's broken, why it matters, and developer-ready instructions. One-button "Copy developer brief".

4

Full audit evidence

Every check we ran with the actual HTTP responses, headers, and snippets. Open this when you want to verify a finding.

5

Score history

Once you have two scans, this becomes a trend chart. Use it to prove an infra change moved the score.

6

Benchmark

Where the scanned domain sits relative to other completed scans for the same site type.

What good looks like

Three reference points for interpreting your score

Level 0–1 · 0–24

Effectively invisible to AI agents

Missing robots.txt or sitemap, no structured data, JavaScript-rendered content, no /.well-known/ entries. ChatGPT browse and Perplexity will skip the site or use stale Google cache.

Level 2–3 · 25–69

Readable, but not yet identified

Agents can fetch and parse the site cleanly. What's missing is bot-specific identity (LLM policy, content-use signals) and discovery surfaces (API catalog, MCP). Most sites land here.

Level 4–5 · 70–100

Agent-transactable

Discovery, auth, and action surfaces are live. Agents can identify themselves, read your policy, and call your APIs on a user's behalf. A small percentage of sites are here today.

Methodology & scoring details

Each check returns one of four states: pass (full credit), fail (zero credit), warn (partial credit, usually because we found a signal but it was incomplete), or neutral (informational, not scored). Within a category, the category score is the weighted average of its individual checks; an optional check we couldn't reach is dropped from the denominator rather than counted as a failure.

The four category scores combine into the overall 0–100 score using the weights shown in the table above. We deliberately avoid a single-pass / single-fail gate: a low Discoverability score caps everything that depends on it, but a strong Action Surfaces score still pulls the overall up because it represents real work that helps real agents.

Level mapping: 0 (0–9, invisible), 1 (10–24, indexed only), 2 (25–49, readable), 3 (50–69, identified), 4 (70–84, transactable), 5 (85+, agent-native).

We re-fetch the site fresh for each scan, including robots.txt, sitemap, and a sample of pages from the sitemap. We do not rely on cached or third-party data. The sample is small (typically 1–3 pages) so the scan is fast and the cost is bounded.

FAQ

Frequently asked questions

What does “agent-ready” actually mean?

It means an AI system can discover your rules, read your content in a clean format, and find the machine-facing interfaces behind the site without reverse-engineering the frontend.

Why is robots.txt alone not enough?

robots.txt is only one part of the picture. A site can be crawlable and still be hard for agents to use if it lacks markdown delivery, discovery headers, API metadata, OAuth discovery, or MCP-style machine entry points.

Why does this matter for AI search and ecommerce?

More discovery starts with AI systems and conversational interfaces. If your site is hard for machines to read or act on, you are less likely to be cited, routed to, or used in agent-led buying flows.

Are the reports public, and does a rerun overwrite the last one?

Reports are public at /ar/<domain>. A rerun does not overwrite the last completed scan. The public page keeps the latest finished report visible while a new run is still in progress, and same-day reruns stay distinct in history.

Does this replace a traditional SEO audit?

No, it complements it. Traditional SEO audits check whether Googlebot can crawl and rank your pages. Agent-readiness checks whether the new generation of AI agents can discover, read, and act on your site. A site can pass one and fail the other; in practice you want both.

How is this different from llms.txt?

llms.txt is a single-file proposal for surfacing AI-readable content. It's one signal we check (under AI Readability), but agent-readiness covers a much wider surface — robots policy, structured data, action interfaces, identity. A perfect llms.txt alone gets partial credit in one category.

Does my site need an MCP server to score well?

No. MCP is one of several action-surface signals and only matters for sites that want agents to take actions (book, buy, query). A documentation site, blog, or marketing site can score 70+ without an MCP server.

How often should I re-scan?

After every shipped change to robots.txt, sitemap, structured data, or any of the discovery surfaces. The score history will show whether the change actually moved the needle. For passive monitoring, monthly is fine.

Can I get my report removed?

Yes. Email vadim@seojuice.io with the domain and we'll delete the public report within one business day.

Why does the score change between scans even with no code change?

A few causes: a third-party service we depend on returned an error, the sample of pages we scanned was different, your CDN served slightly different bytes, or a check we recently calibrated weighed signals differently. If you see a swing larger than ~5 points without a code change, compare the audit evidence — the diff is usually visible there.

What does "site type" do?

Site type adjusts the benchmark we compare your score against and which optional checks are weighted higher. An ecommerce site benefits more from product schema and a robust API catalog; a content site benefits more from clean markdown alternates.

Is the report data shared with anyone?

The report URL is publicly accessible (it has to be, for the share/benchmark feature to work). The raw HTTP fetches we ran during the scan are stored only as much as the report needs to show the evidence — we don't retain page bodies beyond the previews shown in the audit.

Glossary
/.well-known/
Standardized URL prefix where machines look for site-level metadata (security policy, OAuth config, change-password, agent discovery).
llms.txt
Proposed single-file convention for surfacing AI-readable content alongside the human-facing site.
MCP (Model Context Protocol)
An open protocol for letting AI assistants call your APIs and tools securely.
Structured data
JSON-LD blocks (schema.org) that describe the meaning of your content to machines.
Markdown alternate
A plain-text Markdown version of an HTML page, served via Link: rel="alternate" type="text/markdown".
Agent
An AI system that fetches and reasons over web content (ChatGPT browse, Perplexity, Claude tools, Gemini, etc.).
Bot policy
The set of rules in robots.txt + AI-specific extensions describing which automated agents you allow and how content may be used.
Action surface
A machine-callable interface (API, MCP server, schema.org Action) that lets agents do something on a user's behalf.
View all →