Skip to Main Content

Data Analysis

Quantitative Analysis Tools

Microsoft Excel

(Supported by Library Staff)

Microsoft Excel is a popular spreadsheet application that is a part of the Microsoft Office Suite. Excel can be used for both data analysis and visualization. To get started, Microsoft has video tutorials on their website and there are many tutorials on LinkedIn Learning

Microsoft Excel can be downloaded for free via the UIS webstore, or accessed on library computers.

R

(Supported by Library Staff)

R is a free, open-source coding program that can be used for data cleaning, analysis (including statistics), and visualization. R's uses are broad, with over 20,000 R packages to choose from, and can also include analysis of geospatial data and text manipulation. To get started, check out this guide from Modern Dive and the R for Data Science textbook. There are also many video tutorials for R on LinkedIn Learning. The Big Book of R is a collection of mostly free R textbooks and tutorials which can be a great way to find books and tutorials on a wide variety of topics. 

R is available for free download here. RStudio, a popular interface for coding in R, is available for free download here. Note that most beginners will want to install both R and RStudio to run R, and they must be installed sequentially (R first, RStudio second). You then click the RStudio icon to launch the program. 

Stata

(Supported by Library Staff)

Stata is a proprietary statistics program. Stata can also support data manipulation, visualization, and automated reporting. Users can also run statistical tests through the graphical user interface and save the log and syntax. To get started, Stata has a resource guide for new users, as well as video tutorials on their website. 

A Stata license can be requested for free via the UIS webstore, accessed on library computers, or accessed remotely through the Frame Virtual Computing Lab.

SPSS

(Supported by Library Staff)

SPSS is a proprietary statistics and complex data management program. SPSS can be appealing for beginners because there is the option to run analysis either using the point-and-click user interface, or by writing code using SPSS syntax. To get started, check out the tutorial on LinkedIn Learning and explore UCLA's SPSS guide

SPSS can be accessed on library computers or remotely through the Frame Virtual Computing Lab. Faculty can request a free SPSS License for research and teaching via the UIS webstore

SAS

(Supported by Library Staff)

SAS is a proprietary statistics software suite that can be used for data analysis, data management, and artificial intelligence. SAS may be a good tool for you if you are working with very large datasets, or if you have familiarity with SQL.

SAS offers free tutorials to students via the SAS SkillBuilder for Students and educators via the SAS Educator Portal. At limited times throughout the year, SAS also offers free online live certification exam prep courses for students and educators - keep an eye on this site for when these courses open up. You can also check out LinkedIn Learning for on-demand tutorials. 

A SAS license can be requested for free via the UIS webstore, accessed on library computers, or accessed remotely through the Frame Virtual Computing Lab.

Python

(Supported by Library Staff)

Python is a free, flexible programming language used for a variety of tasks, including data analysis. Popular packages for numeric computing include SciPy, Pandas, and IPython. To get started, Anaconda has some tutorials on their website and there are many tutorials on LinkedIn Learning

Python can be downloaded for free here. You can code in Python in many different development environments, some of which are listed here:

  • The Anaconda distribution, an all-in-one installer for Python that is good for data science. Anaconda can be accessed on library computers.
  • Miniconda, a smaller version of the Anaconda distribution, is recommended for those who are unfamiliar with installing software and who want to learn coding fundamentals in a simpler environment. 
  • Google Colaboratory, a web-based integrated development environment, can be accessed for free via your Georgetown Google account. Coding in CoLab requires no software installation. 

Mathematica

Mathematica is a software system for technical computing, most commonly used in mathematical, scientific, engineering, and computing fields. To get started, check out the product training offered by Wolfram U, or this tutorial on LinkedIn Learning.

A Mathematica license can be requested for free via the UIS webstore, accessed on library computers, or accessed through the Frame Virtual Computing Lab.

Matlab

Matlab is a programming and numeric computing platform most commonly used by engineers and scientists. To get started, check out the free tutorials on the MathWorks website or a tutorial on LinkedIn Learning.

A Matlab license can be requested for free via the UIS webstore, accessed on library computers, or accessed remotely through the Frame Virtual Computing Lab.

Minitab

Minitab is a statistics software package. To get started, check out the Getting Started guide from the Minitab website or a tutorial on LinkedIn Learning.

A Minitab license can be requested for free via the UIS webstore, accessed on library computers, or accessed remotely through the Frame Virtual Computing Lab.

Set Up a Consultation

Not sure what tool would work best for you? Want help getting started in a tool? Email digitalscholarship@georgetown.edu to set up a consultation. 

Creative Commons   This work is licensed under a Creative Commons Attribution NonCommercial 4.0 International License. | Details of our policy