DARES - demandeurs d'emploi - données nationales - CVS - sheet 2
- As system
- I need to get data from sheet 2 from Excel file
- in order to store DARES data
Acceptance criteria:
- visual inspection of data and metadata
-
dataset code MUST be
2-1-csv-cjo-categories
-
EACH
series.json
andobservations.tsv
files for dataset MUST be written with the correct dimensions - dataset MUST be integrated in the directory structure following the hierarchy of top categories and subcategories
Resources
- dataset name: Demandeurs d'emploi inscrits en fin de mois à Pôle emploi par catégorie (A, B, C, D, E)
- dataset filename:
ser_nat_cvs-1disbal.xls
- dimensions:
- concept: row 10
- categorie: row 9
- region: France Métropolitaine (nothing in file), France entière(France in file)
- Adjustment: 'CVS-CJO'
- Units: milliers
- Séries names: row 9 + row 10 + region
Designs
- dbnomics-fetchers/dares-fetcher !2
Activity
-
Newest first Oldest first
-
Show all activity Show comments only Show history only
- Michel Juillard mentioned in issue #79 (closed)
mentioned in issue #79 (closed)
- Michel Juillard added Priority 1 Must label
added Priority 1 Must label
- Christophe Benz added Doing label
added Doing label
- Christophe Benz changed the description
changed the description
- Christophe Benz assigned to @c24b
assigned to @c24b
- Constance de Quatrebarbes changed the description
changed the description
I don't understand one things in Resources in bold
dimensions:
-
concept: row 10
-
categorie: row 9
-
region: France Métropolitaine (nothing in file), France entière(France in file)
-
Adjustment: 'CVS-CJO'
-
Units: milliers
-
What is adjustement? dimension filter? serie name?
-
For france métropolitaine it is said nothing in file does it mean:
-
- no label defined for the distinction 'France Metropolitaine' unlike 'France' in the next serie
-
- no data in the file
-
- no need to extract data from France Métropolitaine I finally understood that it just means no "label" or text info for "France Metropiltain" geo dimension
@MichelJuillard could you please precise your tought?
-
- Owner
@MichelJuillard I'm going to explain that because we spoke about it together.
In short: remove the dimension "Adjustment" because we de-prioritized the Brut files.
But should we keep a mono-valuated dimension ("Adjustment" with the only possible value "CVS-CJO")?
Edited by Christophe Benz1 - Author Maintainer
@cbenz as a general rule, I would keep dimensions that have a single value if they are used as such in the original data. In addition, in this particular case, we don't want to touch the CVS-CJO file if we add later the Brut file
- Owner
Understood, @MichelJuillard . This makes sense.
- Author Maintainer
- Christophe Benz mentioned in issue #105 (closed)
mentioned in issue #105 (closed)
- Constance de Quatrebarbes changed the description
changed the description
Some remarks:
Acceptance criteria "visual inspection of data and metadata"
is a bit vague in my opinion. I propose to precise exactly the delivery:
-
EACH series.json and observations.tsv files for dataset CVS-CJO must be written with the correct dimensions
-
dataset CVS-CJO must be integrated in the directory structure following the hierarchy of top categories and subcategories
-
dataset.json must list the CSV-CJO dataset
Edited by Constance de Quatrebarbes-
Some remarks:
Acceptance criteria "visual inspection of data and metadata"
is a bit vague in my opinion. I propose to precise exactly the delivery:
- EACH series.json and observations.tsv files for dataset CVS-CJO must be written with the correct dimensions
- dataset CVS-CJO must be integrated in the directory structure following the hierarchy of top categories and subcategories
- dataset.json must list the CSV-CJO dataset
Some questions:
Series that mention "France" are for "France entière". Series that don't mention anything are for "France métropolitaine" and should be coded as such
Does it means that dimension_value are coded as such:
"region": { "fx": "France métropolitaine", "fr": "France entière" },
or
"region": { "fx": "France métropolitaine", "fr": "France" },
Some slight details:
I have for now not found a way to declare properly the
unknow_value
for dataset, the validator is mentionning explicitly that it is an excepted valueEdited by Constance de Quatrebarbes- Constance de Quatrebarbes mentioned in merge request dares-fetcher!2 (merged)
mentioned in merge request dares-fetcher!2 (merged)
- Constance de Quatrebarbes changed the description
changed the description
- Christophe Benz marked the checklist item visual inspection of data and metadata as completed
marked the checklist item visual inspection of data and metadata as completed
- Christophe Benz added ~121 and removed Doing labels
added ~121 and removed Doing labels
- Christophe Benz added Doing and removed ~143 labels
added Doing and removed ~143 labels
- Christophe Benz marked the checklist item visual inspection of data and metadata as incomplete
marked the checklist item visual inspection of data and metadata as incomplete
- Owner
Error: dimensions values FR and FX seem to be inverted!
Example: http://next.db.nomics.world/dares/CVS-CJO?dimensions=category:categorie-a;region:fx
- Please register or sign in to reply
- Owner
@c24b Technical sub-task:
- Concatenate the strings like "Catégorie A" to the strings like "Actes positifs de recherche d'emploi, sans emploi" in the dimensions labels, like "Catégorie A – Actes positifs de recherche d'emploi, sans emploi"
(use the "–" and not the "-"
) Edited by Christophe Benz
- Christophe Benz marked the checklist item visual inspection of data and metadata as completed
marked the checklist item visual inspection of data and metadata as completed
- Christophe Benz added ~121 and removed Doing labels
added ~121 and removed Doing labels
- Owner
@c24b I just noticed you seem to have forgotten the following dimension in
dataset.json
:{ "dimensions_labels": {"adjustment": "Adjustment"}, "dimensions_values_labels": {"cvs-cjo": "CVS-CJO"} }
and hard-code it for all series of the sheet 2.
Collapse replies - Owner
In
series.json
we must find:{ "dimensions": { "adjustment": "cvs-cjo" } }
- Christophe Benz added Doing and removed ~143 labels
added Doing and removed ~143 labels
- Christophe Benz changed the description
changed the description
- Christophe Benz marked the checklist item dataset code MUST be
2.1-csv-cjo-categories
as completedmarked the checklist item dataset code MUST be
2.1-csv-cjo-categories
as completed - Christophe Benz changed the description
changed the description
- Owner
Deployed in http://next.db.nomics.world/dares/2-1-CVS-CJO-categories
Bravo @c24b
- Christophe Benz added ~143 and removed Doing labels
added ~143 and removed Doing labels
- Christophe Benz closed
closed