Skip to Main Content
George Mason University | University Libraries
See Updates and FAQs for the latest library services updates. Subject Librarians are available for online appointments, and Virtual Reference has extended hours.

Text & Data Mining Sources

Access text and data mining sources and text analysis tools.

Current News Sources

Restrictions on Mining Current News Databases

Many popular current news sources do not allow automated searches. The terms of use for these sources prohibits downloading large volumes of text to be stored and analyzed.

Below is a selected list of library subscription news sources which prohibit text and data mining through their search interface:

  • EBSCO
  • Factiva
  • NexisUni
  • ProQuest*

Violating a resource's terms of use is a violation of the University's Responsible Use of Computing policy. For more details consult the Statement on Appropriate Use of Electronic Resources.

When using any resource remember to consult its Terms of Use. For more help, please contact datahelp@gmu.edu, or talk to your subject librarian.

See Access Text Collections for a selected list of news sources that allow text and data mining.


Minable Current News Data

ProQuest TDM Studio

TDM Studio is ProQuest's text data mining platform that enables TDM on licensed ProQuest content. TDM Studio consists of two components: the Workbench is designed for experienced researchers who use their own coding methodologies, and Visualization is designed for users of all levels to quickly spot trends and generate insights.

New York Times Developer's Network
"All the API's to fit to post." NYTimes APIs are for non-commercial use. You will need to set up an account. Consult their FAQ for more details.