Skip to Main Content
| University Libraries
See Updates and FAQs for the latest library services updates. Subject Librarians are available for online appointments, and Virtual Reference has extended hours.

Social Media Data and Tools

A guide discussing how to acquire, extract, and use social media data and tools

Social Media Data Sources

Download social media data from multiple sources (e.g., from Twitter and Facebook):

  • Dataverse
    • The Dataverse Project is an open source research data repository software. Dataverse includes social media datasets contributed by academic researchers.
  • Facebook Data from the Humanitarian Data Exchange
    • The Humanitarian Data Exchange (HDX) is an open platform for sharing data across crises and organizations. Launched in July 2014, the goal of HDX is to make humanitarian data easy to find and use for analysis. As of November 2021, there are over 200 Facebook datasets available. 
  • Kaggle 
    • Kaggle includes code and data you need for your data science work. Kaggle contains over 50,000 public datasets and 400,000 public notebooks for analysis. 
  • SMILE via Social Media Macroscope
    • Social Media Maacroscope is a project by University of Illinois Technology Services and the National Center for Supercomputing Applications (NCSA) with the goal of making social media data, analytics, and visualization tools accessible to researchers and students of all levels of expertise. SMILE is an open-source social media analytics tool that allows researchers to collect and analyze social media data. SMILE can perform functions such as text-preprocessing, phrase mining, sentiment analysis, network analysis, and machine learning text classification.
    • You need to create an account before you can launch SMILE. 
  • SOMAR: Social Media Archive at ICPSR
    • Coming soon. See their announcement for more information and for other social media archives. 
  • Stanford Large Network Dataset Collection: SNAP
    • Stanford Network Analysis Platform (SNAP) is a general purpose network analysis and graph mining library. SNAP includes various datasets from multiple social media platforms for network analysis. 

Apply to access social media data for academic research:

  • CrowdTangle for Facebook and Instagram
    • CrowdTangle is the leading content discovery and social monitoring platform for publishers and brands.
    • You have to apply to be part of CrowdTangle’s Academics & Researchers program to gain access to the CrowdTangle interface and API, as well as training and resources.

  • Social Science One
    • Social Science One was originally launched to pilot a specific model of industry-academic partnerships, seeking to share Facebook data with academics as a test case. We eventually succeeded in creating, and providing academic access to, the “Facebook URLs dataset” and streaming APIs on Facebook’s page views and political ads library.
    • You will need to complete a Request for Proposal to gain access to their data.