Options for Processing Big Data for the Social Sciences and Humanities

by PICSciE/Research Computing

Training/Workshop Programming Languages Research & Data Analysis Undergraduate Research

Mon, Feb 13, 2023

4:30 PM – 6 PM EST (GMT-5)

Add to Calendar

Private Location (register to display)




Are you working with data that is too much for your personal computer to handle? This workshop will introduce you to the computing resources that you can turn to at Princeton.

This workshop begins by covering the relevant computer terminology and concepts. We then give an overview of the computing options at Princeton with a focus on the systems provided by Research Computing. Using only a web browser, participants will learn how to work with their files and run programs on a high-performance computing (HPC) cluster called Adroit. More advanced topics will be demonstrated such as running batch jobs using the job scheduler and transferring large files.

Workshop format: Interactive presentation, with hands-on activities.

Target audience: Researchers in the social sciences and humanities with no experience using high-performance computing clusters.

Knowledge prerequisites: Basic computer concepts such as files, directories and the ability to use a web browser (e.g., Chrome, Firefox, Safari). Some knowledge of Python or R would be beneficial.

Hardware/software prerequisites: Participants must have an account on the Adroit cluster at least 48 hours in advance of this workshop. To request an account, complete this form: https://bit.ly/3wicSaH (VPN required if off-campus). Participants will also need a laptop and power cable. The entire workshop can be done using a web browser (e.g., Chrome, Firefox, Safari).

Learning objectives: Attendees will come away with an overview of their computing options at Princeton, a basic understanding of how HPC clusters work, and how to start using HPC clusters at Princeton.


Jonathan Halverson's profile photo

Jonathan Halverson

Research Software and Computing Training Lead

Princeton University

Jonathan Halverson is the Research Software and Computing Training Lead with PICSciE and Research Computing. He has an expertise in data science and he is a founding organizer of the TensorFlow & PyTorch User Group at Princeton. Prior to his current position, Jonathan performed polymer physics research at the Max Planck Institute for Polymer Research and nanoscience research at Brookhaven National Laboratory. He holds a Ph.D. in Chemical Engineering from CUNY.

Hosted By

PICSciE/Research Computing | View More Events
Co-hosted with: GradFUTURES

Contact the organizers