π Git-like Version Control for Data with Nessie, Iceberg, and Spark
 distributed-systems apache-spark etl s3 data-engineering minio dataops block-storage time-travel data-pipelines data-versioning etl-pipeline spark-etl apache-iceberg git-for-data data-lakehouse apache-nessie atomic-etl table-format branch-based-development 
 - 
 Updated
 Jan 21, 2025 
- Jupyter Notebook