Learn more about big spatiotemporal data and geovisualization. This guide is a gateway to graduate research in geospatial informatics, Earth observation, remote sensing, and geoinformation science.
Argo Cluster Center: from Mason's Office of Research Computing (ORC). Store and process your big data sets. For use by Mason faculty, staff, and Ph.D. students only.
Globus: store and manage your big data (terabyte or petabyte scale). Select Mason as your institution to proceed to the site.
Apache Hadoop: an open source software framework for storing and processing data sets distributed across clusters of industry-standard servers. See HDFS (Hadoop Distributed File System) for storage and MapReduce for collecting, aggregating, and summarizing data across all nodes. See also the Hands-On Big Data Workshop presented at the IASSIST 2015 Annual Conference (June 2, 2015).
Apache Hive: open source software that facilitates reading, writing, and managing large datasets residing in distributed storage using SQL.
Apache Spark: a fast engine for large-scale data processing.
Data APIs resources: a list of resources that provide an API to download big data in various formats (numeric, textual, images).
Learn R (via Mason Libraries): an InfoGuide to resources for learning R.
Tableau.com: data mining, statistical analysis, and visualization. Also lets you share your results with the public.
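To make the MapReduce idea mentioned under Apache Hadoop more concrete, here is a minimal sketch in plain Python. It does not use Hadoop or any Hadoop API; it only imitates the map, shuffle, and reduce steps on a single machine. The function names, the "region" keys, and the sample readings are all hypothetical and chosen for illustration.

```python
from collections import defaultdict


def map_phase(record):
    """Map step: emit one (key, value) pair per input record."""
    region, reading = record  # hypothetical record layout: (region, reading)
    yield region, reading


def shuffle(pairs):
    """Shuffle step: group all emitted values by key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: summarize all values that share one key."""
    return key, {"count": len(values), "total": sum(values)}


def run_job(partitions):
    """Run map, shuffle, and reduce over every partition in turn."""
    pairs = [
        pair
        for partition in partitions
        for record in partition
        for pair in map_phase(record)
    ]
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Each inner list stands in for the data held on one node (made-up values).
partitions = [
    [("north", 3), ("south", 5)],
    [("north", 4), ("east", 1)],
]

summary = run_job(partitions)
print(summary)
```

On a real cluster the partitions would live on different servers in HDFS and the framework would run the map and reduce steps in parallel; this sketch runs them one after another only to show the flow of data.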
Related Tutorials
LinkedIn Learning (formerly Lynda.com): many tutorials related to big data processing, including Hadoop Fundamentals, Python, R, Up and Running with NoSQL Databases, MongoDB, etc. Access is limited to GMU affiliates.