Description
Create, maintain, and contribute to a long-living dataset that will update itself automatically across projects, using git and DVC as versioning systems.
Summary
- Problems emerging from data are common in research as well as in the industry.
- I will show you how to create, maintain, and contribute to a long-living dataset that will update itself automatically across projects, using git and DVC as versioning systems, and DAGsHub as a host for the datasets.
- Repository B - AKA the machine learning project, is where I want to use the files stored in my living-dataset.
- This will specifically download the directories images and annotations from inside my dataset repository, and keep information on how to continue tracking the changes made in it.