Fork me at github

Data Version Control

Git extension for data scientists – manage your code and data together

Define
ML steps
Iterate faster with
reproducibility
Share code
and trained models
Get Started
Track code and data
$ git add train.py
$ dvc add images.zip
1
Connect code and data by commands
$ dvc run -d images.zip -o images/ unzip -q images.zip
$ dvc run -d images/ -d train.py -o model.p python train.py
2
Make changes and reproduce
$ vi train.py
$ dvc repro
3
Share code
$ git commit -m 'The baseline model'
$ git push
4
Share data and ML models
$ dvc config AWS.StoragePath mybucket/image_cnn
$ dvc push
5
How DVC works?
Want to know more? Please subscribe

Up to 2 emails per month. No spam.

TwitterLinkedinFacebook

© 2018 Iterative, Inc.

[email protected]