AI Bot Crawl Statistics

Transparency dashboard — geolocus.ai, rolling 30 days.

Updated:

29
Crawls — 7 days
992
Crawls — 30 days
1,000
Crawls — all time
16.6%
AI bot share (30d)

AI Crawlers — Last 30 Days

165 crawls — 10 AI bot types.

Bot Crawls Share Last Seen
GPTBot (OpenAI) AI 98 59.4%
ClaudeBot (Anthropic) AI 26 15.8%
ChatGPT (OpenAI) AI 22 13.3%
PerplexityBot AI 7 4.2%
Meta AI (Llama) AI 4 2.4%
Amazon Alexa AI 4 2.4%
ByteSpider (TikTok) AI 1 0.6%
Common Crawl AI 1 0.6%
ChatGPT Search (OpenAI) AI 1 0.6%
You.com Bot AI 1 0.6%
Total AI165100%

Search & Other Crawlers — Last 30 Days

827 crawls — 10 bot types.

Bot Crawls Share Last Seen
Googlebot Search 777 94.0%
unknown_bot SEO/Other 17 2.1%
GoogleOther Search 12 1.5%
Bingbot (Microsoft) Search 8 1.0%
Facebook SEO/Other 8 1.0%
bot SEO/Other 1 0.1%
Baiduspider Search 1 0.1%
SemrushBot SEO/Other 1 0.1%
Majestic SEO/Other 1 0.1%
spider SEO/Other 1 0.1%
Total827100%

Top Crawled Paths — Last 30 Days

Path Crawls Share of Total
/ 171 17.2%
/services 139 14.0%
/about 135 13.6%
/whitepaper 134 13.5%
/multi-site-survey 108 10.9%
/sitemap.xml 79 8.0%
/research/whitepapers/whitepaper-5-1 46 4.6%
/case-studies 17 1.7%
/methodology/sitemap-benchmark 17 1.7%
/press 16 1.6%
/crawl-stats 16 1.6%
/privacy 15 1.5%
/contact 15 1.5%
/faq 15 1.5%
/press/before-ai-can-understand-your-site-it-must-translate-it 14 1.4%

Collection method: Every bot request to geolocus.ai is intercepted at the Cloudflare edge by the translation-layer CF Pages Function (functions/[[path]].js). 34 bot user-agent patterns are classified via vendor/aryah-shared/src/bot-classification.ts; matched requests are logged to bot_crawl_logs (Supabase Postgres). Cache-HIT requests are also logged via a direct PostgREST insert from the CF Pages Function so no crawl is silently missed.

Raw counts only. No rounding. This is a transparency dashboard — exact numbers are the point. Cross-check against Cloudflare Analytics at any time.

Dogfood: This page demonstrates GEO Signal 3 in action — bot crawl logging on our own property, measured exactly as we measure client properties.

Reference property: Top10Lists.us/crawl-stats — established property with 3M+ crawls/30d using the same methodology.