ECB triggers false positive commits
Looking at recent commits in ecb-json-data, I noticed that some series are just moved around series.jsonl
file, causing false positive commits.
As a consequence, the repository ecb-json-data
has ~30Gb of .git
whereas the biggest dataset weights ~300Mb, so there is a big overhead for version history.
The task to do is to sort the JSON-lines of series.jsonl
by series code, by using the write_series_jsonl
function provided by dbnomics-fetcher-toolbox which takes care of this.