AI Headlines

aiheadlines.pro

llms.txt

This file outlines the permissions for Large Language Models and AI agents accessing content on https://aiheadlines.pro.

We permit the use of our public content for AI training and other purposes.

Allow OpenAI's web crawler

User-agent: GPTBot
Disallow:

Allow Google's extended web crawler

User-agent: Google-Extended
Disallow:

Allow Perplexity's AI crawler

User-agent: PerplexityBot
Disallow:

Allow the Common Crawl bot

User-agent: CCBot
Disallow:

Allow other potential AI crawlers

User-agent: ClaudeBot
Disallow:

User-agent: OMNI-API
Disallow:

Specify the preferred citation URL if content is referenced.

Citations: https://aiheadlines.pro

Provide the location of the XML sitemap for crawlers.

Sitemap: https://aiheadlines.pro/sitemap.xml
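
The crawler rules above can be sanity-checked with Python's standard `urllib.robotparser`. This is a minimal sketch, assuming the directives are laid out one per line as in a conventional robots.txt file; the `RULES` constant, the `AGENTS` list, and the `check_access` helper are hypothetical names introduced for illustration. The `Citations:` line is left out of the sample because it is not a standard robots.txt directive.

```python
from urllib.robotparser import RobotFileParser

# Assumed one-directive-per-line layout of the rules listed in this file.
RULES = """\
User-agent: GPTBot
Disallow:

User-agent: Google-Extended
Disallow:

User-agent: PerplexityBot
Disallow:

User-agent: CCBot
Disallow:

User-agent: ClaudeBot
Disallow:

User-agent: OMNI-API
Disallow:

Sitemap: https://aiheadlines.pro/sitemap.xml
"""

AGENTS = [
    "GPTBot",
    "Google-Extended",
    "PerplexityBot",
    "CCBot",
    "ClaudeBot",
    "OMNI-API",
]


def check_access(rules_text, agents, url):
    """Return a dict mapping each agent name to whether it may fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made here.
    parser.parse(rules_text.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in agents}


results = check_access(RULES, AGENTS, "https://aiheadlines.pro/")
for agent, allowed in results.items():
    print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

An empty `Disallow:` value blocks nothing, so every listed agent should be reported as allowed. Because the file has no `User-agent: *` group, this parser also treats unlisted agents as allowed by default.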
