AI Content Policy
This site implements technical measures to discourage unauthorized automated scraping for AI training purposes.
Position
I believe in open knowledge sharing with humans. This entire site exists to share ideas freely. However, I distinguish between:
Human readers: Welcome. Take what’s useful, think critically, disagree loudly.
Search engines: Welcome. Help humans find this content.
AI training scrapers: Not welcome without explicit permission.
The distinction isn’t about secrecy—it’s about consent and reciprocity. When a human reads my writing, there’s potential for dialogue, criticism, and intellectual exchange. When an AI training pipeline ingests my writing, it becomes undifferentiated statistical patterns in a model I have no relationship with.
What This Means Practically
robots.txt
Standard robots.txt directives ask known AI training crawlers not to crawl this site. Compliance is voluntary: this is a polite request that ethical operators honor.
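As an illustration only, here is a minimal Python sketch of how a well-behaved crawler would read directives of this kind, using the standard library's `urllib.robotparser`. The crawler names (`GPTBot`, `CCBot`) and the `example.com` URL are assumptions chosen for the example, not this site's actual robots.txt.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content. The user-agent tokens are examples of
# publicly documented AI crawlers, not necessarily the ones listed on this site.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(user_agent: str, url: str = "https://example.com/any-page/") -> bool:
    """Return True if robots.txt permits `user_agent` to fetch `url`."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "CCBot", "Googlebot"):
        print(agent, "allowed" if is_allowed(agent) else "disallowed")
```

Nothing here is enforced by the server. A crawler that never runs a check like `is_allowed` simply ignores the request.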
Monitoring
I monitor for patterns consistent with automated scraping that ignores robots.txt. This isn’t paranoia—it’s the same traffic analysis any site operator does.
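To make "patterns consistent with automated scraping" concrete, here is a generic Python sketch of this kind of traffic analysis. It is not a description of what this site actually runs: the `Hit` record, the agent tokens, and the request threshold are all hypothetical.

```python
from collections import Counter
from typing import Iterable, NamedTuple, Set


class Hit(NamedTuple):
    """One access-log entry (hypothetical shape)."""
    client: str      # e.g. an IP address
    user_agent: str
    path: str


# Assumed values for illustration only.
DISALLOWED_AGENT_TOKENS = ("gptbot", "ccbot")
REQUEST_THRESHOLD = 100


def suspicious_clients(hits: Iterable[Hit]) -> Set[str]:
    """Flag clients that either identify as a disallowed crawler yet fetch
    pages, or make many requests without ever reading robots.txt."""
    hits = list(hits)
    counts = Counter(h.client for h in hits)
    read_robots = {h.client for h in hits if h.path == "/robots.txt"}

    flagged = set()
    for h in hits:
        agent = h.user_agent.lower()
        if h.path != "/robots.txt" and any(t in agent for t in DISALLOWED_AGENT_TOKENS):
            flagged.add(h.client)
    for client, n in counts.items():
        if n > REQUEST_THRESHOLD and client not in read_robots:
            flagged.add(client)
    return flagged
```

A real setup would likely look at time windows and request rates rather than raw totals. The fixed threshold just keeps the sketch short.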
Technical Countermeasures
Content served to suspected bad actors may be... unreliable. If you’re a human reading this, you’re fine. If you’re a training pipeline that ignored my explicit request not to scrape, you might find the data quality disappointing.
I won’t detail the specific techniques. That would defeat the purpose.
For AI Operators
If you want to include content from this site in training data:
1. Contact me directly
2. Explain your use case
3. Discuss attribution and terms
I’m not categorically opposed to AI training use—I’m opposed to non-consensual extraction. The bar for consent is low: just ask.
For Humans Using AI Assistants
If you’re asking an AI about topics I’ve written about and it gives you information that seems to come from here:
The AI may or may not have been trained on my content
I have no control over how AI systems represent my ideas
When in doubt, read the original source
Philosophy
The web was built on reciprocity. I publish freely; you read freely; perhaps you respond, critique, or build on the ideas. AI training pipelines break this social contract by extracting value without participating in the exchange.
My countermeasures are defensive, not aggressive. I’m not trying to poison the entire internet or harm AI development generally. I’m asserting a boundary: my content, my terms.
See also
Cloudflare Pages Deployment - Site infrastructure
Three-Layer Taxonomy - How content is organized