George Mason University | University Libraries

Find Data for Analysis

Popular sources for accessing research data sets for dissertations or class projects

Data for Analytics

Public Data

Collections by Statistical Method

Classic Practice Data 

Suggested Data Science Projects

Crowdsourced Data Collections

  • Kaggle Datasets - (registration required) user-contributed open data with previews; see also Competition Data
  • Data World - (registration required) tagged and searchable user-contributed data with previews
  • AWS Data Exchange - Open Data - many large datasets on a variety of topics (see also the Registry of Open Data on AWS)
  • OpenML - AI-ready datasets that serve as training data for machine learning models. They are uniformly formatted, carry rich, consistent metadata, and can be loaded directly into common analysis environments such as Python or R.
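
As a rough illustration of what "loaded directly" can look like for OpenML, the sketch below wraps scikit-learn's `fetch_openml` function. The wrapper name `load_openml_dataset` and its `fetch` parameter are hypothetical, introduced only for this example; the default path assumes scikit-learn and pandas are installed and that openml.org is reachable.

```python
from typing import Callable, Optional


def load_openml_dataset(name: str, version: int = 1,
                        fetch: Optional[Callable] = None):
    """Return (features, target) for a named OpenML dataset.

    `fetch` is a hypothetical hook for this sketch so the download step
    can be swapped out; by default it is scikit-learn's fetch_openml,
    which retrieves the dataset from openml.org and caches it locally.
    """
    if fetch is None:
        # Imported lazily so scikit-learn is only needed for real downloads.
        from sklearn.datasets import fetch_openml as fetch
    # as_frame=True asks for pandas objects instead of NumPy arrays.
    bunch = fetch(name=name, version=version, as_frame=True)
    return bunch.data, bunch.target
```

Under those assumptions, a call such as `X, y = load_openml_dataset("iris")` would be expected to return a table of features and a column of labels that can go straight into a model-fitting step, though the dataset names and versions available should be checked on the OpenML site.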

Specific Datasets

Interesting General Data

Data for Specific Activities

Networks and Spatial Relations

General Related Data

Language and Media

Data for Replication & Teaching

Data Generators

Teaching with Data

Organized by Analysis

Variety of Data Types

Real Data

Data for Replication