Skip to Main Content
It looks like you're using Internet Explorer 11 or older. This website works best with modern browsers such as the latest versions of Chrome, Firefox, Safari, and Edge. If you continue with this browser, you may see unexpected results.
| University Libraries
See Updates and FAQs for the latest library services updates. Subject Librarians are available for online appointments, and Virtual Reference has extended hours.

Text & Data Mining Sources

Access text and data mining sources and text analysis tools.

Social Media Data Sources

Twitter Data Sets

Social Media Data from Multiple Sources (Twitter, Facebook, etc.)

Apply for Data Access for Academic Research

  • Twitter Data
    • Accessing Twitter data for analysis depends on your question and time period. For Twitter searches (keyword or hashtag), data older than around seven days typically cannot be retrieved. You can collect data going forward in time. If you need to extract tweets from an account, you can collect data going back and going forward in time. Twitter's Developer Agreement and Policy has details about what you can do with the data.
    • Unless you are using an existing dataset or resource such as Social Media Macroscope, you will need to apply for access to collect Twitter data through the Twitter for Academic Research site.
  • CrowdTangle for Facebook and Instagram
  • Social Science One

Additional Guides

Tools for Extracting Social Media Data

Tools that collect social media data

  • Social Media Macroscope
    • A project by University of Illinois Technology Services and the National Center for Supercomputing Applications (NCSA) with the goal of making social media data, analytics, and visualization tools accessible to researchers and students of all levels of expertise. You will need to set up an account.
  • Social Feed Manager (SFM) (see the tutorial on our Text Analysis guide)
    • Developed by George Washington University Libraries, SFM was designed to help researchers extract data through Twitter, Flickr, Sina Weibo, and Tumblr. SFM is available through the Mason Libraries' Digital Scholarship Center.
  • Twitter Archiver or TAGS add-ons for Google Sheets.
    • This add-on is a quick way to get started with collecting Twitter data for your research.
  • NCapture Plug-in for NVivo
    • NCapture captures data from Twitter, YouTube, LinkedIn as well as Web pages and online PDFs. You need to use NCapture with NVivo in order to extract the data. The DiSC Lab in Fenwick Library has NVivo installed on its computers.
  • MAXQDA

Advanced Collection Methods — coding required