INSEE - series updated but no mention in update feed
Let's take an example: the dataset IPI-2015.
In the RSS feed we see that it was published on 2019-03-08, via the <pubDate> XML element:
<item>
<title>[IPI-2015] Indice de la production industrielle</title>
<link>https://bdm.insee.fr/series/sdmx/dataflow/FR1/IPI-2015/1.0</link>
<description>Mise à jour de données pour le dataflow IPI-2015</description>
<pubDate>Fri, 08 Mar 2019 07:45:00 GMT</pubDate>
<guid>https://www.insee.fr/fr/statistiques/series/108695026</guid>
</item>
But some time series have been updated since this date.
For example, the series idBank=010537967 was updated on 2019-03-29, as we can see in the IPI-2015 SDMX via the LAST_UPDATE XML attribute:
<Series AUTRES_REGROUPEMENTS="MIG_CAG" BASIND="2015" CORRECTION="CVS-CJO" REF_AREA="FM" NAF2="SO" NATURE="INDICE" UNIT_MULT="0" UNIT_MEASURE="SO" INDICATEUR="IPI" FREQ="M" IDBANK="010537967" TITLE_FR="Indice CVS-CJO de la production industrielle (base 100 en 2015) - Biens d'investissement (MIG, poste MIG_CAG)" TITLE_EN="SA-WDA industrial production index (base 100 in 2015) - Capital goods (MIG, item CAG)" LAST_UPDATE="2019-03-29" DECIMALS="2">
<Obs TIME_PERIOD="2019-01" OBS_VALUE="109.7" OBS_STATUS="A" OBS_QUAL="DEF" OBS_TYPE="A"/>
<Obs TIME_PERIOD="2018-12" OBS_VALUE="106.18" OBS_STATUS="A" OBS_REV="1" OBS_QUAL="DEF" OBS_TYPE="A"/>
<Obs TIME_PERIOD="2018-11" OBS_VALUE="108.9" OBS_STATUS="A" OBS_REV="1" OBS_QUAL="DEF" OBS_TYPE="A"/>
So we have a problem: the DBnomics INSEE fetcher can't rely on the RSS feed to know when a dataset has been updated. And I haven't found a way to detect these updates, other than downloading the whole database every time.
For now, the DBnomics INSEE fetcher downloads the datasets mentioned in the RSS feed, but I'm pretty sure that many datasets not mentioned in the feed contain series that continue to be updated.
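To make the mismatch concrete, here is a minimal Python sketch that compares the feed's <pubDate> with each series' LAST_UPDATE. It is not the fetcher's actual code: the function names `feed_date` and `series_newer_than` are hypothetical, the XML is abridged from the excerpts above, and the `<DataSet>` wrapper (plus the closing `</Series>`) is an assumption added so the fragment parses on its own. Real SDMX responses may also use XML namespaces, which this sketch ignores.

```python
import xml.etree.ElementTree as ET
from datetime import date
from email.utils import parsedate_to_datetime

# RSS item from the feed, abridged to the fields used here.
RSS_ITEM = """\
<item>
<title>[IPI-2015] Indice de la production industrielle</title>
<pubDate>Fri, 08 Mar 2019 07:45:00 GMT</pubDate>
</item>"""

# SDMX fragment, abridged. The <DataSet> wrapper is an assumption of this sketch.
SDMX_DATA = """\
<DataSet>
<Series IDBANK="010537967" LAST_UPDATE="2019-03-29">
<Obs TIME_PERIOD="2019-01" OBS_VALUE="109.7"/>
</Series>
</DataSet>"""


def feed_date(item_xml):
    """Return the calendar date of an RSS item's <pubDate> (RFC 822 format)."""
    pub_date = ET.fromstring(item_xml).findtext("pubDate")
    return parsedate_to_datetime(pub_date).date()


def series_newer_than(sdmx_xml, cutoff):
    """List (IDBANK, LAST_UPDATE) for series updated strictly after `cutoff`."""
    root = ET.fromstring(sdmx_xml)
    newer = []
    for series in root.iter("Series"):
        last_update = date.fromisoformat(series.get("LAST_UPDATE"))
        if last_update > cutoff:
            newer.append((series.get("IDBANK"), last_update))
    return newer


published = feed_date(RSS_ITEM)
for idbank, last_update in series_newer_than(SDMX_DATA, published):
    print(f"{idbank}: updated {last_update}, but feed says {published}")
```

Note that this check still requires downloading the dataset's SDMX first, so it only confirms the problem; it does not avoid the full download.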
See also:
- documentation Utilisation du service web SDMX