BUBA - duplicated files in source-data
Some data files are duplicated in BUBA source-data. Example:
cepremap@eros:~/fetchers-envs/buba/buba-source-data$ find -iname bbk01.cefec3.xml
./topics/GESAMT/time series/BBK01/bbk01.cefec3.xml
./time series/BBK01/bbk01.cefec3.xml
cepremap@eros:~/fetchers-envs/buba/buba-source-data$ md5sum './topics/GESAMT/time series/BBK01/bbk01.cefec3.xml' './time series/BBK01/bbk01.cefec3.xml'
be8b4a583a25369a9145f13266923181 ./topics/GESAMT/time series/BBK01/bbk01.cefec3.xml
be8b4a583a25369a9145f13266923181 ./time series/BBK01/bbk01.cefec3.xml
Before starting
-
pairing with @pdi to have a first venue visit -
check if it's difficult to change this, if not it may be not worth the time to change it (total source-data is 3.6Go)