Commit 02f79c9

DVC: track raw datasets via pointers; unignore .dvc and dvc.lock
1 parent 58614a1 commit 02f79c9

4 files changed

Lines changed: 37 additions & 6 deletions

.gitignore

Lines changed: 19 additions & 6 deletions
@@ -213,17 +213,30 @@ $RECYCLE.BIN/
 .Trash-*
 .nfs*
 
-# Data directories (DVC tracked, not git tracked)
-/data/raw/
-/data/processed/
-/data/derived/
+# ignore big data
+data/**
+data/raw/**
+data/processed/**
+data/derived/**
+
+# but DO track DVC metadata and tiny samples
+!data/
+!data/raw/
+!data/processed/
+!data/derived/
+!data/raw/*.dvc
+!data/processed/*.dvc
+!data/derived/*.dvc
+!data/**/*.dvc
+!data/.gitignore
+!data/samples/**
+!dvc.lock
 
 # MLflow tracking
 mlruns/
 mlartifacts/
 
-# DVC files
-/dvc.lock
+
 
 # Temporary files and caches
 *.log

data/raw/lcl.dvc

Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+outs:
+- md5: e9af3d9ba558b4c348ca0da417987ae1.dir
+  size: 9344493197
+  nfiles: 4
+  hash: md5
+  path: lcl

data/raw/ssen.dvc

Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+outs:
+- md5: 3c34d0ce8ebdee189cdb975f8efb583f.dir
+  size: 36675232
+  nfiles: 3
+  hash: md5
+  path: ssen

data/raw/ukdale.dvc

Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+outs:
+- md5: 1159ee37ce5afc67c486ec1e4e378f27.dir
+  size: 11796580238
+  nfiles: 4
+  hash: md5
+  path: ukdale
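
The three pointer files above each record one tracked directory: the `.dir` suffix on the `md5` value marks a directory manifest, and `size` and `nfiles` describe the data that stays outside git. As a rough illustration of what the commit puts under DVC control, the sketch below parses the pointer contents exactly as they appear in this diff and totals them. It is not part of the commit: `parse_pointer` and `human_size` are hypothetical helpers, and the hand-rolled parser only handles this flat single-output layout (a real project would more likely load the files with a YAML library).

```python
# Hypothetical helper script, not part of commit 02f79c9.
# Pointer contents are copied from the diff above.
POINTERS = {
    "data/raw/lcl.dvc": """\
outs:
- md5: e9af3d9ba558b4c348ca0da417987ae1.dir
  size: 9344493197
  nfiles: 4
  hash: md5
  path: lcl
""",
    "data/raw/ssen.dvc": """\
outs:
- md5: 3c34d0ce8ebdee189cdb975f8efb583f.dir
  size: 36675232
  nfiles: 3
  hash: md5
  path: ssen
""",
    "data/raw/ukdale.dvc": """\
outs:
- md5: 1159ee37ce5afc67c486ec1e4e378f27.dir
  size: 11796580238
  nfiles: 4
  hash: md5
  path: ukdale
""",
}


def parse_pointer(text):
    """Parse the flat, single-output pointer layout shown in this commit."""
    fields = {}
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("- "):
            line = line[2:]  # drop the YAML list marker
        key, sep, value = line.partition(":")
        value = value.strip()
        if sep and value:  # skips the bare "outs:" line
            fields[key.strip()] = value
    fields["size"] = int(fields["size"])
    fields["nfiles"] = int(fields["nfiles"])
    return fields


def human_size(num_bytes):
    """Format a byte count using binary units, capped at GiB."""
    size = float(num_bytes)
    for unit in ("B", "KiB", "MiB", "GiB"):
        if size < 1024 or unit == "GiB":
            return f"{size:.2f} {unit}"
        size /= 1024


summary = {name: parse_pointer(text) for name, text in POINTERS.items()}
total_bytes = sum(p["size"] for p in summary.values())
total_files = sum(p["nfiles"] for p in summary.values())

for name, p in summary.items():
    print(f"{name}: {p['path']} -> {p['nfiles']} files, {human_size(p['size'])}")
print(f"total: {total_files} files, {human_size(total_bytes)}")
```

The sizes are read straight from the `size` fields, so the totals describe what DVC recorded when the data was added, not what is currently on disk.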
