Version control of scientific datasets with DataLad
Abstract:
While Git is a fantastic tool for version control of software, written content, and other collections of small files, it can become unmanageable for large, multi-GB data files. Over the years, several Git extensions have been developed to reduce the impact of large files on a Git repository, e.g. git-annex and Git LFS. DataLad builds on this history and is a modern implementation of a free and open-source data management system that supports versioning, collaboration, distributed storage, and analysis. In this workshop, we will teach you the basics of installing and using DataLad on your computer.
Prerequisites:
Bring your own computer.
Presenter:
Alex Razoumov earned his Ph.D. in computational astrophysics from the University of British Columbia and held postdoctoral positions in Urbana-Champaign, San Diego, Oak Ridge, and Halifax. He spent five years as an HPC Analyst at SHARCNET and in 2014 moved back to Vancouver to focus on scientific visualization and on training researchers to use advanced computing tools. Alex is currently based at Simon Fraser University.
- Date: Friday, February 24, 2023
- Time: 1:00pm - 2:30pm
- Categories: Data, Digital Scholarship, Research Commons, Research Data Management
- Presenter(s): Alex Razoumov, Digital Research Alliance of Canada