Running the Tools
The data import and integration process is scripted by DVC. The top-level dvc.yaml
pipeline depends on all required steps for the the core data, so to import the data, just run:
dvc repro
The import process will take approximately 2–3 hours on a reasonably fast computer.
There are some additional useful outputs that the main pipeline does not invoke; you can generate these with:
dvc repro --all-pipelines
If you have configured a remote to store your data files, you can then run dvc push
to push the files to the remote to share with others on your team, copy to another computer, or import into another project.