CANCELLED: Data Management on the Research Computing Systems
Thu, Mar 26, 2026
2:30 PM – 4 PM EDT (GMT-4)
Details
This workshop covers data management throughout the research lifecycle on Princeton's Research Computing (RC) systems. It is designed for members of the University community who work with large datasets (on the order of terabytes) and need to move, verify, and store data across the RC systems. Topics include data transfer (rsync and Globus), compression, verification (checksums at source and destination), and archiving (using TigerData). We will also demonstrate example data flows on the RC systems, from producing data on /scratch, to analyzing it on /projects, to archiving it on /tigerdata.
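As an illustration of the verification step (checksums at source and destination), here is a minimal Python sketch. It is not taken from the workshop materials. The function names `sha256_of`, `manifest`, and `compare` are hypothetical, and the choice of SHA-256 is an assumption, since the description does not name a checksum algorithm.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large files never sit fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def manifest(root: Path) -> dict[str, str]:
    """Map each file's path (relative to root) to its SHA-256 digest."""
    return {
        p.relative_to(root).as_posix(): sha256_of(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def compare(source: Path, destination: Path) -> list[str]:
    """Return relative paths that are missing or differ at the destination."""
    src = manifest(source)
    dst = manifest(destination)
    return sorted(name for name, digest in src.items() if dst.get(name) != digest)
```

Calling `compare(Path(src_dir), Path(dst_dir))` after a transfer, where `src_dir` and `dst_dir` are placeholders for the two directory trees, returns an empty list when every source file has a byte-identical copy at the destination. Files that exist only at the destination are not reported in this sketch.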
Knowledge prerequisites: Basic familiarity with Linux command line tools
Hardware/software prerequisites: None
Workshop format: Demonstration
Target audience: Students, researchers, faculty, staff
Speakers
Pedro Espino
I am a Research Software Engineer II at the Department of Geosciences and the Princeton Institute for Computational Science and Engineering (PICSciE). My academic research has centered on leveraging high-performance computing (HPC) systems to conduct advanced simulations of astrophysical fluids under extreme gravitational conditions. I apply my expertise in HPC to support climate researchers within the Geosciences department.