Data collected with Scrapeer

About

What is this?

A searchable index of AI usage disclosures from across the web. We collect the free-form texts that creators submit when their products contain AI-generated content, and we make them searchable, filterable, and trackable over time.

Why?

Platforms require AI disclosures but don't make them searchable or comparable. We aggregate disclosures from multiple platforms into one place so researchers, journalists, and anyone curious can explore the data.

Data sources

Currently indexing: Steam (game AI disclosures). Planned: Adobe Stock, app stores, music platforms, and more. All data comes from public sources. No personal information is collected.

Methodology

We monitor platform APIs and product information systems for products that declare AI content usage. When a new disclosure is detected, we scrape the product page to extract the full text and metadata. Changes are tracked over time, so we catch updates and revisions too.

Update frequency

New disclosures are detected in near-real-time. Product pages are scraped within hours.

Built with Scrapeer

The entire data pipeline runs on Scrapeer flows, a visual web scraping tool. No code required. scrapeer.com