Skip to Main Content
George Mason University InfoGuides

QUANTitative Analysis & Statistics: More: Guides for Projects

This page has advice and practically-focused getting started resources for those who need help getting started with a project. 

Before You Begin

Some thoughts from Charlie Kufs, the writer of Stats For Cats

  • Statistics is math
  • Statistics requires a lot of data
  • Data are dependable
  • Statistics provides unique solutions
  • Statistics provides unambiguous results
  • It is easy to lie with statistics
  • Everything is Uncertain
  • Statistics Models
  • Measurement Scales shape Analyses
  • Everything Starts with a Matrix
  • Statistics is More than Description and Testing

What can go wrong AND what you should do about it. 

  • Invalid Data
  • Missing Data
  • Dirty Data
  • Out-of Spec Data
  • Messy Data
  • Corrupted Data
  • Mismatched Data

Some thoughts from Karen Grace-Martin at The Analysis Factor

  • A belief that you can do it
  • A commitment to good practice
  • Statistical knowledge
  • Proficiency in using the tools
  • Experience applying your skills in different situations
  • A mentor who understands the big picture goal and the steps to get there
  • A good resource library
  • Ongoing learning
  • Not allowing enough time to implement and learn
  • Not using a system for keeping track of files and steps
  • Not making it easy to replicate what you’ve done on each step
  1. Write out research questions
  2. Design the study
  3. Choose the variables and determine their measurement levels
  4. Estimate Sample Size
  5. Collect, code, enter, and clean data
  6. Create new variables
  7. Run univariate and bivariate descriptives
  8. Run an initial model
  9. Refine predictors and check model fit
  10. Check assumptions
  11. Check and resolve data issues
  12. Interpret results
  13. Communicate results

Conducting a Survey

It takes time to collect good data, from designing and pilot testing the questionnaire to recruiting a large and representative sample. If you are doing a data analysis project in one semester, time is the one thing you don't have. Thus, you will probably be doing convenience sampling, which restricts generalizability.

If you are doing a longer project and need to collect data, check out our longer Collect Data with a Survey guide.

Use Qualtrics

  • It is a high-quality research-focused survey platform available to all GMU students and faculty for free.
    • You must request an account through your department. Ask your professor for help, or check for a person in your area on the list of Qualtrics Administrators.
  • Data can be downloaded with both numeric values AND labels to be used in any statistical software
    • Always download the data in SPSS format and open it directly in the software you will use (each can import it). Do NOT open/edit it in Excel.
  •  If you are not required to use any particular software, you can use Qualtrics' built-in analysis and visualization features.
    • If you need a free full-featured statistical software, consider jamovi.

See our recommended video tutorials for Learning Qualtrics. Tutorials from the company require you to sign in first.

Secondary Data Analysis

Experienced data collectors produce the cleanest and easiest to use data files. Always download and read the documentation that comes with it to find variables of interest (note the variable name) and to learn about the population that was surveyed. Also, look for information about how missing or implausible values are represented.

Expect to take several hours to complete.

See also Data Skills module videos on the Data Service YouTube Channel

Interactive text and video tutorials

  • Survey data with SPSS - Learn about how to find survey data from the UK and around the world, what you need to know about it before you can start analyzing it, and how to produce simple tables and graphs for your research or reports.
  • Aggregate data - Learn where to find these data, how to produce summary statistics using them and how to create a choropleth map.

These suggest completing the surveys module first.

  • Longitudinal data - Learn what longitudinal studies are available, key features and issues with using longitudinal data and how to start some basic analyses.
  • Exploring crime surveys with R (Beta version) - Learn where to find survey data to explore crime and how to access it. This module also covers getting started with survey data using R Studio including basic functions, exploratory analysis, visualisations and conducting weighted analyses.

If you are doing a practice project and do not have a pre-existing research question, be nice to yourself and use a dataset provided by your professor (if possible) or from one of the sources listed on the Find Data & Statistics InfoGuide, such as these sources of small-ish datasets on interesting topics: 

Multiple Regression and Other Models

(Software-independent)  A Question-and-Answer format helps make this book a great reference for those who need to do multiple or other multivariate regression, especially those doing clinical research or biostatistics. It also includes chapters on Propensity scores and Correlated observations.