
ELSTAT Fetcher

Indicators Score
Robustness index 0.5/5
Completeness index 30%

Download

Convert

  • Create the category_tree from the webpage
  • Only the following dataset codes are converted: "SEL84", "SEL96", "DKT15", "DKT18"
    • SEL84 has 2 formats across multiple Excel files:
      • files 3 to 6 => SEL84_3:
        • table_title = (row1, col0)
        • table_unit = (row2, col0)
        • series_name = file_nb + " - " + sheet_name + cell(col0, row5:rowmax)
        • periods = (row4, [col1:colmax])
        • values = [(col1, colmax), (row5, rowmax)]
      • files 7 to 9 => multiple sheets, all in the same format:
        • sheet_name
        • dataset_name = (col0, row1)+sheet_name+file_nb
        • dataset_unit = (col0, row2)
        • headers = [(row4:row6), (col1, colmax)]
        • periods = [row7:maxrow, col0]
        • values = [(row7, rowmax),(col1, colmax)]
    • SEL96 has 1 Excel file:
      • first sheet has a special format:
        • sheet_name
        • dataset_name = (col0, row1)+(col0, row2)+sheet_name+file_nb
        • dataset_unit = (col0, row3)
        • headers = [(row6:rowmax), (col0)] + detect_tree
        • periods = [(col2,colmax), (row6, row7)]
        • values = [(row7, rowmax),(col1, colmax)]
        • series_codes = (col0, (row7, rowmax))
      • the remaining sheets are skipped (conversion not possible)
    • DKT15 has 2 Excel files with multiple sheets, in the special DK format:
      • sheets filter = Not "INFO" and not "Φύλλο1"
      • dataset_code = sheetname
    • DKT18 has 2 Excel files with multiple sheets, in the special DK format:
      • sheets filter = Not "INFO" and not "Φύλλο1"
      • dataset_code = sheetname
      • title = row6
      • year = row6.split("-")
      • unit = row7
      • series_codes = [col0, (row10, rowmax)]
      • series_names = [col1, (row10, rowmax)]
      • period = [(col3,colmax), row9]
      • values = [(col3, colmax, row10, rowmax)]
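
The DK layout described above can be sketched in Python as follows. This is only an illustration, not the fetcher's actual code: the names `keep_sheet`, `parse_dkt18_sheet` and `SKIPPED_SHEETS` are hypothetical, the sheet is assumed to be already loaded as a list of rows, row and column numbers are assumed to be 0-based, the title and unit are assumed to sit in col0, and rows without a series code are assumed to be skippable.

```python
from typing import Any, Dict, List

Grid = List[List[Any]]  # one sheet as a list of rows, each row a list of cell values

# Sheets excluded by the "sheets filter" rule.
SKIPPED_SHEETS = {"INFO", "Φύλλο1"}


def keep_sheet(sheet_name: str) -> bool:
    """Return True for sheets that should be converted."""
    return sheet_name not in SKIPPED_SHEETS


def parse_dkt18_sheet(sheet_name: str, grid: Grid) -> Dict[str, Any]:
    """Read one DK-format sheet using the cell ranges listed for DKT18."""
    title = str(grid[6][0])   # title = row6 (col0 is an assumption)
    # year = row6.split("-"); stripping the parts is an assumption
    year_parts = [part.strip() for part in title.split("-")]
    unit = grid[7][0]         # unit = row7 (col0 is an assumption)
    periods = grid[9][3:]     # period = [(col3, colmax), row9]

    series = []
    for row in grid[10:]:     # row10 to rowmax
        code = row[0]         # series_codes = col0
        if code in (None, ""):
            continue          # assumption: rows without a code carry no series
        series.append({
            "code": code,
            "name": row[1],   # series_names = col1
            # values = col3..colmax of this row, paired with the periods
            "values": dict(zip(periods, row[3:])),
        })

    return {
        "dataset_code": sheet_name,  # dataset_code = sheetname
        "title": title,
        "year": year_parts,
        "unit": unit,
        "periods": periods,
        "series": series,
    }
```

A caller would loop over the sheets of each Excel file, keep those for which `keep_sheet` returns True, and pass each remaining sheet's rows to `parse_dkt18_sheet`.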