Back to TILs
Today I Learned

Blocking AI Crawlers is the New 'noindex'

January 21, 2026
1 min read
---

# TIL: Blocking AI Crawlers is the New "noindex"

If you're blocking GPTBot, Anthropic, Perplexity, Gemini — you're trading future reach for short-term control.

The Math

AI search traffic today: ~1% AI search traffic tomorrow: 25–35%

Let them crawl. Train the discovery layer. Be early.

Common AI Crawler User Agents

CrawlerCompany
GPTBotOpenAI
ClaudeBot / Anthropic-AIAnthropic
PerplexityBotPerplexity
Google-ExtendedGoogle (Gemini)

The Robots.txt Decision

Blocking these crawlers:

bash
User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

Feels like control. Actually it's invisibility.

Why This Matters

When someone asks an AI "how do I do X" and your content isn't in the training data, you don't exist in that conversation.

The sites that trained the discovery layer early will own the AI search results later.

Visibility > invisibility.

More TILs You Might Like

TIL: Blocking AI Crawlers is the New 'noindex' | Harshit Luthra