|
|
# CBO
|
|
|
|
|
|
## Provider
|
|
|
|
|
|
* **provider_name**: CBO
|
|
|
* **provider_longname**: Congressional Budget Office
|
|
|
* **provider URL**: https://www.cbo.gov
|
|
|
* **region**: US
|
|
|
* **terms of use**: https://www.cbo.gov/about/privacy (see Copyright)
|
|
|
* **approximate number of datasets**: 10
|
|
|
|
|
|
|
|
|
## Data accessibility
|
|
|
|
|
|
* **SDMX: (If yes, links and SDMX releases implemented**: NO
|
|
|
* **REST API**: NO
|
|
|
* **Bulk Download: (if yes, links and formats**: XLSX
|
|
|
* **Account**: NO
|
|
|
|
|
|
## Desired datasets
|
|
|
|
|
|
* **Description**: (brief description of subset that we want on Widukind
|
|
|
or list if less than 10)
|
|
|
- Historical Budget Data
|
|
|
|
|
|
|
|
|
## Data tree
|
|
|
|
|
|
* **Existence of a hierachy of datasets on web site**: NO
|
|
|
* **How to recover the information**: linear list
|
|
|
|
|
|
## Datasets
|
|
|
|
|
|
### Download
|
|
|
|
|
|
1. Read HTML page https://www.cbo.gov/about/products/budget-economic-data
|
|
|
2. Find first file (most recent one) URL for each of 10 sections,
|
|
|
except Estimates of Automatic Stabilizers
|
|
|
3. Download the file using the first 5 digit code of file name as name
|
|
|
of stored file. For dbnomics-source-data, we want to use generic names and not names
|
|
|
including date references
|
|
|
|
|
|
### Historical Budget Data
|
|
|
* **datasetCode**: 51134 (from the file name)
|
|
|
* **how to get release date**: From Excel file property Modified
|
|
|
* **dataset docHref**: yes/no
|
|
|
* **dataset notes**: yes/no
|
|
|
* **dimension_list**: (provided or to be made up from the series)
|
|
|
* **use of attributes**: yes/no
|
|
|
* **attribute_list**: (provided or to be made up from the series)
|
|
|
* **available frequencies**: (across all datasets)
|
|
|
* **availability of previous updates**: if yes, provide URL
|
|
|
* **existence of real time datasets**:
|
|
|
|
|
|
## Series
|
|
|
|
|
|
* **Series key**: (provided or to be made up, suggest scheme if necessary)
|
|
|
* **Series name**: (provided or to be made up from dimensions)
|
|
|
* **Series docHref**: yes/no
|
|
|
* **Series notes**: yes/no
|
|
|
* **missing values**: code for missing values or way to detect them
|
|
|
* **date format**:
|
|
|
* **mixed frequencies in the same dataset**: yes/no
|
|
|
|
|
|
## Updates
|
|
|
|
|
|
* **calendar of future updates**: if yes, provide URL
|
|
|
* **summary of previous updates**: if yes, provide URL
|
|
|
* **regular updates**: date and time
|
|
|
* **RSS flow**:
|
|
|
* **best way to monitor updates**:
|
|
|
|
|
|
## Special problems
|
|
|
|
|
|
(like variable names in ESRI)
|
|
|
|
|
|
## Other remarks
|
|
|
|
|
|
## Data samples
|
|
|
* **location**: in test_xxx file or light files in dlstats/dlstats/tests/resources
|
|
|
* **description**: a very brief description of the data samples
|
|
|
|
|
|
|