# Non Sequitur Publishing — robots.txt
# Belt-and-suspenders with Cloudflare "Block AI training bots" zone setting.
#
# General crawl: allow human-facing search engines to index pages.
# AI/ML training: disallow across all paths.
#
# Last updated: 2026-04-19

# Default policy — search engines, indexers, archive bots
User-agent: *
Allow: /
Crawl-delay: 10

# ----------------------------------------------------------------------------
# AI training bots — disallow everything
# Upstream list tracked at https://darkvisitors.com/agents and robotstxt.com
# ----------------------------------------------------------------------------

User-agent: GPTBot
Disallow: /

User-agent: ChatGPT-User
Disallow: /

User-agent: OAI-SearchBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: Claude-Web
Disallow: /

User-agent: anthropic-ai
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: GoogleOther
Disallow: /

User-agent: PerplexityBot
Disallow: /

User-agent: Perplexity-User
Disallow: /

User-agent: Bytespider
Disallow: /

User-agent: Amazonbot
Disallow: /

User-agent: cohere-ai
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: Omgilibot
Disallow: /

User-agent: FacebookBot
Disallow: /

User-agent: Meta-ExternalAgent
Disallow: /

User-agent: Meta-ExternalFetcher
Disallow: /

User-agent: Diffbot
Disallow: /

User-agent: Applebot-Extended
Disallow: /

User-agent: ImagesiftBot
Disallow: /

User-agent: DuckAssistBot
Disallow: /

User-agent: PetalBot
Disallow: /

User-agent: MistralAI-User
Disallow: /

User-agent: Timpibot
Disallow: /

User-agent: YouBot
Disallow: /

User-agent: AI2Bot
Disallow: /

User-agent: AI2Bot-Dolma
Disallow: /

# Sitemap (Hugo-generated)
Sitemap: https://nonsequitur.tech/sitemap.xml