Skip to Main Content
George Mason University | University Libraries
See Updates and FAQs for the latest library services updates. Subject Librarians are available for online appointments, and Virtual Reference has extended hours.

Big Data & Geovisualization

Learn more about big spatial temporal data and geovisualization. This guide is a gateway to graduate research in geospatial informatics, earth observing, remote sensing, and geoinformation science

Tools for Storage and Computing

  • Argo Cluster Center: from Mason Office and Research (OCR) Computing Center. Mason faculty and staff, or a Ph.D student use only. Store and process your big data sets.
  • Globus: store and manage your big data (terabyte or petabyte). Choose Mason as an institution to proceed the site.
  • Spatiotemporal Hybrid Cloud Service Centers (stc): Big data analysis services hosted by Mason cloud service center (sponsored by NSF).
  • Apache Hadoop: an open source software framework for storing and processing data sets distributed across industry standard servers. See HDFS (Hadoop Distributed File System) and MapReduce to collect, aggregate, summarize all nodes. See the Hands-On Big Data Workshop presented at IASSIST 2015 Annual Conference (June 2, 2015).
  • Apache Hive: an open source software  facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. 
  • Apache Spark: a fast engine for large-scale data processing.
  • Data APIs resources: a list of resources that provide an API to download big data in various formats (numeric, textual, images).
  • Tools for Collecting Twitter or other Social Media Data: Social Feed Manager (via GWU), CARTO, Getting Started with Twitter Data Collection with Python and Twitter APIs (via Mason Libraries), and NVivo Ncapture, etc. 

Online Visualization Tools

Related Tutorials