Skip to content

Commit df863f9

Browse files
Add configuration for parsing Wehe data (#421)
* Add wehe parsing for scamper1 & annotation types * Restore public archive source in staging
1 parent 27ccc01 commit df863f9

File tree

2 files changed

+26
-3
lines changed

2 files changed

+26
-3
lines changed

apply-cluster.sh

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -18,10 +18,9 @@ CLUSTER=${CLOUDSDK_CONTAINER_CLUSTER:?Please provide cluster name: $USAGE}
1818
DATE_SKIP=${DATE_SKIP:-"0"} # Number of dates to skip between each processed date (for sandbox).
1919
TASK_FILE_SKIP=${TASK_FILE_SKIP:-"0"} # Number of files to skip between each processed file (for sandbox).
2020

21-
# Use sandbox in sandbox, staging in staging, measurement-lab in oti.
21+
# Use sandbox in sandbox, measurement-lab in staging & oti.
2222
SOURCE_PROJECT=${PROJECT_ID/mlab-oti/measurement-lab}
23-
# TODO(soltesz): restore or remove.
24-
#SOURCE_PROJECT=${SOURCE_PROJECT/mlab-staging/measurement-lab}
23+
SOURCE_PROJECT=${SOURCE_PROJECT/mlab-staging/measurement-lab}
2524
sed -i \
2625
-e 's/{{ANNOTATION_SOURCE_PROJECT}}/'${SOURCE_PROJECT}'/g' \
2726
config/config.yml

config/config.yml

Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,7 @@ monitor:
66
polling_interval: 1m
77
sources:
88
# NOTE: It now matters what order these are in.
9+
## NDT
910
- bucket: archive-{{ANNOTATION_SOURCE_PROJECT}}
1011
experiment: ndt
1112
datatype: annotation2
@@ -66,3 +67,26 @@ sources:
6667
raw: raw_ndt
6768
join: ndt
6869
daily_only: true
70+
## WEHE
71+
- bucket: archive-{{ANNOTATION_SOURCE_PROJECT}}
72+
experiment: wehe
73+
datatype: annotation2
74+
target_datasets:
75+
tmp: tmp_wehe
76+
raw: raw_wehe
77+
daily_only: true
78+
- bucket: archive-{{ANNOTATION_SOURCE_PROJECT}}
79+
experiment: wehe
80+
datatype: hopannotation2
81+
target_datasets:
82+
tmp: tmp_wehe
83+
raw: raw_wehe
84+
daily_only: true
85+
- bucket: archive-{{ANNOTATION_SOURCE_PROJECT}}
86+
experiment: wehe
87+
datatype: scamper1
88+
target_datasets:
89+
tmp: tmp_wehe
90+
raw: raw_wehe
91+
join: wehe
92+
daily_only: true

0 commit comments

Comments
 (0)