An AI visibility audit is a structured evaluation of how AI engines like ChatGPT, Perplexity, Gemini, and Google AI Overviews currently mention, cite, and recommend your brand. It answers a question that traditional analytics can't: when potential customers ask AI engines about your category, what do they hear about you, and how does that compare to competitors?
Audits are diagnostic, not strategic. They tell you the current state across engines and prompts, where the gaps are, and which inputs (schema, citations, content) are likely driving the gaps. The strategy comes after, informed by what the audit reveals.
What an AI visibility audit measures
A useful audit covers five dimensions and outputs a snapshot of where you sit on the AI readiness maturity model, from invisible to cited. Skip any one and the picture is incomplete.
1. Brand presence across engines
The first layer: are you mentioned at all, in which engines, and for which prompts. A brand mentioned by Perplexity but absent from ChatGPT has a different problem than one absent from all engines.
Audit checklist:
- Per-engine presence: Across the same prompt set, which engines mention you?
- Per-prompt coverage: Which prompts surface your brand and which don't?
- Engine-specific gaps: Which engines never mention you, even when others do?
2. Mention quality and citation depth
Presence isn't enough. A mention as a top recommendation differs operationally from a passing reference, and a citation that points to your URL is more durable than one that names you without linking.
- Mention type distribution: What percentage of mentions are direct recommendations, comparative references, or passing mentions?
- Citation rate: How often is your URL cited as a source, especially in engines like Perplexity and Google AI Overviews that surface citations explicitly?
- Description depth: When mentioned, is your brand described with substance (features, value, audience) or named without context?
3. Competitive share-of-voice
Visibility is relative. Your performance only makes sense alongside competitor performance for the same prompts. The audit should generate share metrics that translate raw mention counts into competitive position.
- Share of mentions: Your brand's percentage of total brand mentions across the prompt set.
- Recommendation share: Of all direct recommendations, what percentage went to your brand?
- Citation share: Among cited URLs in your category, what percentage are yours?
4. Sentiment and accuracy
AI engines describe brands with a tone (positive, neutral, critical) and sometimes with errors. The audit should flag both: a recommendation with caveats reads differently to potential buyers, and factual errors are correctable through schema and content updates.
Issues to flag:
- Hallucinated features (claims your product has capabilities it doesn't).
- Wrong pricing or outdated plan information.
- Misattributed quotes or statements.
- Negative framing based on outdated information.
- Contradictions across engines (one says X, another says not-X).
5. Technical readiness signals
Future visibility depends on present infrastructure. The audit should check whether the technical foundations AI engines reward are actually in place on your site.
- Schema markup coverage: Are Organization, Product, FAQ, and Article schemas present on the right pages?
- Indexability: Are critical pages crawlable, or hidden behind client-side rendering or login walls?
- Citation infrastructure: Do you have presence on the third-party sites AI engines cite for your category?
- Community footprint: Are you discussed in subreddits and forums that AI engines retrieve from?
How to run an AI visibility audit manually
A manual audit is feasible for a single brand and a focused prompt set. The structure:
- Define 10 prompts that match how your audience phrases questions in your category. Avoid hyper-branded queries that aren't representative.
- Identify 3 to 5 competitors to benchmark against.
- Run each prompt across ChatGPT, Perplexity, Gemini, and Google AI Overviews. Save the full response, not just yes/no.
- Tag each response: which brands mentioned, mention type, sentiment, citation URLs.
- Compute share-of-mentions and share-of-recommendations across the prompt set.
- Cross-check schema, indexability, and citation footprint as separate technical checks.
- Document findings as a punch list of fixable issues sorted by effort and impact.
Plan on 4 to 8 hours for a thorough first audit. Subsequent audits go faster because the prompt set and competitor list are stable.
When to use an automated audit tool
Manual audits work for one brand and one cycle. They break down quickly when scale enters the picture.
Switch to an automated tool when:
- You're auditing more than 20 prompts (manual logging becomes inconsistent).
- You need to audit more than 4 engines on a regular cadence.
- You want trend data: weekly or monthly audits compared over time.
- You're tracking 3+ competitors (the prompt-times-engines-times-brands matrix grows fast).
- You manage multiple brands or clients, where workspace separation and per-client reporting matter.
Automated audits don't replace human interpretation: the tool gathers and structures data, but understanding what the data means and what to do about it still requires judgment.
Engine-by-engine: what to look for in each
Different engines reveal different signals. Tailor the audit to extract what each one offers.
- ChatGPT: Watch for both browsing-on and browsing-off responses; they can differ significantly. Note training-data baseline (no browsing) as a long-term anchor.
- Perplexity: Capture every cited URL. Perplexity is the most diagnostic engine because it shows you exactly which sources informed each answer.
- Gemini: Strong correlation with Google search ranking. If your brand ranks for relevant terms, it usually shows in Gemini.
- Google AI Overviews: AI-generated answers in Google search results. Critical channel because of Google's reach. Citation patterns mirror AI Overviews' source preferences.
- Claude: Less commonly audited but increasingly relevant in enterprise. Claude tends to favor depth and reasoning quality, so substantive content lands well.
- Grok: Real-time X integration makes Grok useful for brands with active social conversation. Static brand presence often doesn't surface here.
Practical takeaways
If you're running your first AI visibility audit:
- Start with 10 prompts and 3 engines. Don't aim for completeness on cycle one.
- Always include 3 to 5 competitor names. Without benchmarks, the data has no context.
- Capture full response text, not just yes/no. Sentiment and detail matter as much as presence.
- Tag findings by effort (easy/medium/hard) and impact (high/medium/low). Sort the action list by easy-and-high first.
- Re-audit quarterly with the same prompt set so changes are comparable.
- Switch to automated tooling when manual cycles cross 4 hours of data gathering per audit.
An audit alone doesn't change visibility. The value comes from acting on what the audit reveals: schema, citations, communities, and owned content are the levers, and the audit just tells you which to pull first. Pair it with ongoing monitoring so the next quarter's audit measures real progress, not random fluctuation. Brands that audit and don't act spend hours building reports that gather dust.
If you'd rather not run the audit manually, Appearly's free AI visibility audit runs the cross-engine analysis automatically and surfaces a prioritized action list.
Frequently asked questions
What is an AI visibility audit?
An AI visibility audit is a structured evaluation of how AI engines (ChatGPT, Perplexity, Gemini, Google AI Overviews, Claude, Grok) currently mention, cite, and recommend your brand. The audit measures presence across engines, mention quality, competitive share-of-voice, sentiment and accuracy, and the technical readiness signals (schema, indexability) that drive future visibility.
How long does an AI visibility audit take?
A manual audit covering one brand across 10 prompts and 3 engines typically takes 2 to 4 hours of focused work to gather data, plus another 2 to 3 hours to analyze and document findings. An automated audit tool reduces the data-gathering portion to minutes; the analysis still benefits from human interpretation.
How often should I audit my AI visibility?
A full audit makes sense quarterly. Between full audits, lighter weekly monitoring of a fixed prompt set captures movement without the overhead of a full review. Full audits are useful after major product changes, competitor launches, or changes in AI engine behavior, all of which can shift the picture meaningfully.