This page has advice and practically-focused getting started resources for those who need help getting started with a project.
Some thoughts from Charlie Kufs, the writer of Stats For Cats
What can go wrong AND what you should do about it.
Some thoughts from Karen Grace-Martin at The Analysis Factor
It takes time to collect good data, from designing and pilot testing the questionnaire to recruiting a large and representative sample. If you are doing a data analysis project in one semester, time is the one thing you don't have. Thus, you will probably be doing convenience sampling, which restricts generalizability.
If you are doing a longer project and need to collect data, check out our longer Collect Data with a Survey guide.
See our recommended video tutorials for Learning Qualtrics. Tutorials from the company require you to sign in first.
Experienced data collectors produce the cleanest and easiest to use data files. Always download and read the documentation that comes with it to find variables of interest (note the variable name) and to learn about the population that was surveyed. Also, look for information about how missing or implausible values are represented.
Expect to take several hours to complete.
See also Data Skills module videos on the Data Service YouTube Channel
Interactive text and video tutorials
These suggest completing the surveys module first.
If you are doing a practice project and do not have a pre-existing research question, be nice to yourself and use a dataset provided by your professor (if possible) or from one of the sources listed on the Find Data & Statistics InfoGuide, such as these sources of small-ish datasets on interesting topics:
Go to the Data Center. The "Study Page" has links to the data, codebook, questionnaire, and other information. The Time Series Study is for one year's data and the Cumulative Data File contains multiple years. | |
Data Format: | [Zipped] SPSS (SAV, POR), Stata (DTA), ASCII |
Registration: | Some special surveys are restricted. |
Focus Fast: | Choose the top most "Time Series Study" to get the most recent year. |
Datasets also available through The Roper Center.
Go to their Download Datasets page then select a topical area. Log in to your account (or create one), then pick a survey and click the "Download" arrow on the right. Papers that used the data are also listed. | |
Data Format | [Zipped] SPSS, converting SPSS to Stata, |
Registration | Create an account by providing your name and email and accepting a use agreement and terms of use, which includes agreeing only to use the data for research purposes and not trying to identify individuals. |
Focus Fast | Look at the top few studies within each topic, which are the most recent. |
(Software-independent) A Question-and-Answer format helps make this book a great reference for those who need to do multiple or other multivariate regression, especially those doing clinical research or biostatistics. It also includes chapters on Propensity scores and Correlated observations.
Ask a Librarian | Hours & Directions | Mason Libraries Home
Copyright © George Mason University