robots.txt

A plain-text file at /robots.txt that tells web crawlers (search engines and AI bots) which pages they may access. Mis-configuring it is the #1 way brands accidentally block themselves from AI engines.

By 2025, the major AI crawlers each have their own user agent: GPTBot (OpenAI), ClaudeBot (Anthropic), PerplexityBot (Perplexity), Google-Extended (Gemini training), Bytespider (ByteDance/Doubao). Many sites block these by default after a 2024 wave of “protect our content” advice — which has the unintended effect of removing the brand from AI answers entirely.

For most B2B brands, you want these bots to crawl you. Add explicit User-agent: GPTBot / Allow: / rules to your robots.txt. VibecodeAEO audits this automatically.

Related terms

Audit your brand against this concept

VibecodeAEO scans your site for all AEO factors weekly and tells you exactly what to fix.