Who Wrote This?
The data from this search engine and study are made publicly available.
Licensing and Publication Information
These data are released under the Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license and were gathered using several open source projects released as part of Who Wrote This (Data Driven Empathy LLC).
Downloads
This dataset is available as a SQLite database containing 48,997 unique articles. By clicking the download link below, you agree to the Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license bundled in the zip archive.

Released June 3, 2019 by A Samuel Pottinger.
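Because the dataset ships as a SQLite database, a reasonable first step after unzipping is to list its tables and row counts. The sketch below uses only Python's standard sqlite3 module and reads the table names from the database itself, since the schema is not described on this page. The file name in the usage comment is hypothetical; substitute whatever the archive actually contains.

```python
import sqlite3


def describe_database(path):
    """Return {table_name: row_count} for every table in a SQLite file."""
    connection = sqlite3.connect(path)
    try:
        # sqlite_master is SQLite's built-in catalog of tables and indexes.
        cursor = connection.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        )
        table_names = [row[0] for row in cursor.fetchall()]

        counts = {}
        for name in table_names:
            # Names come from the database itself; quote them defensively.
            quoted = '"' + name.replace('"', '""') + '"'
            counts[name] = connection.execute(
                f"SELECT COUNT(*) FROM {quoted}"
            ).fetchone()[0]
        return counts
    finally:
        connection.close()


# Usage (hypothetical file name, not taken from the archive):
# print(describe_database("who_wrote_this.db"))
```

Discovering the schema at runtime avoids guessing at table or column names that this page does not document.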

RSS Feeds Used
The following RSS feeds are used per agency:
Organization | Feeds
BBC News | Home
Breitbart | Feedburner
CNN | Top Stories
Daily Mail | Home
Drudge Report | Drudge Report Feed via Feedpress
Fox News | Latest Headlines
New York Times | Home Page
NPR | All Things Considered
Vox | All
Wall Street Journal | World News and US Business (Dow Jones)
Compliance and Removal Requests
This study uses RSS feeds, which are machine-parseable documents that, among many uses, often inform search engines of new content availability. These data were collected in compliance with the Robots Exclusion Protocol by observing robots.txt files, the industry-standard instructions that tell systems like this crawler what content they are permitted to process. Requests for removal of content can be emailed to the corresponding author: inquiry [at] datadrivenempathy.com. Requestors may need to conduct physical correspondence to verify their identity. Note that the crawler is no longer active.
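To make the two mechanisms described above concrete, here is a minimal sketch of how a crawler could read items from an RSS feed and keep only the links that a site's robots.txt permits. It is not the study's crawler; it uses Python's standard urllib.robotparser and xml.etree.ElementTree, and the user agent name, URLs, and function names are hypothetical.

```python
import urllib.robotparser
import xml.etree.ElementTree as ET


def is_allowed(robots_txt, user_agent, url):
    """Check a URL against the text of a robots.txt file."""
    parser = urllib.robotparser.RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


def parse_rss_items(rss_xml):
    """Extract title and link from each <item> in an RSS 2.0 document."""
    root = ET.fromstring(rss_xml)
    items = []
    for item in root.iter("item"):
        items.append({
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
        })
    return items


def permitted_items(rss_xml, robots_txt, user_agent):
    """Keep only the feed items whose links robots.txt allows."""
    return [
        item for item in parse_rss_items(rss_xml)
        if is_allowed(robots_txt, user_agent, item["link"])
    ]


# Hypothetical inputs for illustration only.
SAMPLE_ROBOTS = "User-agent: *\nDisallow: /private/\n"
SAMPLE_FEED = (
    '<rss version="2.0"><channel><title>Example</title>'
    "<item><title>Public story</title>"
    "<link>https://example.com/news/story</link></item>"
    "<item><title>Private story</title>"
    "<link>https://example.com/private/story</link></item>"
    "</channel></rss>"
)
```

Calling `permitted_items(SAMPLE_FEED, SAMPLE_ROBOTS, "ExampleCrawler")` on the sample inputs keeps the public story and drops the one under `/private/`. A real crawler would fetch the feed and robots.txt over the network and would also need to handle feed formats other than RSS 2.0.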