Well-defined workflows in software engineering have made code deployments reproducible and reliable. Data deployments, on the other hand, remain haphazard and unpredictable. In this talk we’ll explore open source techniques for packaging, versioning, and deploying data in reproducible units that resemble containers, data packages.