The Gale Digital Scholar Lab is a single research platform where you can apply natural language processing tools to raw text data (OCR) from Georgetown's Gale Primary Sources holdings, or from uploaded OCR. The Lab is organized in three broad steps: Build, Clean, and Analyze. These steps support newcomers and experienced users alike as they interpret both Gale Primary Sources and their own documents. An integrated Learning Center provides instructional tutorial videos and explanations throughout. The six built-in analysis tools are: Ngrams, Sentiment Analysis, Topic Modeling, Named Entity Recognition, Document Clustering, Parts of Speech.
Click here to access the Gale Digital Scholar Lab
Note: To access the Lab off-campus, you must use this exact link. A link from Google will not let you to log in with your Georgetown credentials.
We recommend logging in through the "Institution Credentials" option. Click on that box and then log in with your Georgetown NetID and Duo authentication. Once you are in the Lab, click "Log In / Create Account" and select "Use University Credentials" to create an account.
Your first time using the Lab, you will be prompted to use your "Personal" workspace. If you are planning a research project, you will want to create a new workspace for that project with the flexibility to share your project with group project members, your research team, or your professor.
The Gale Digital Scholar Lab can be used to analyze full-text data from Gale Primary Sources, or from text files uploaded by the user. The Lab empowers the researcher to build their own collections of text for analysis, walks through the steps for cleaning text data, and runs powerful text analysis, all in the same platform. You do not need to write or run code to analyze data in the Lab.
To see examples of research using the Gale Digital Scholar Lab, check out the Gale Research Showcase on their website.
Not sure where to get started? Email digitalscholarship@georgetown.edu to set up a consultation.
This work is licensed under a Creative Commons Attribution NonCommercial 4.0 International License. | Details of our policy