This repository contains scripts that process income statements from the Brazilian Universal Healthcare System. I collected and preprocessed the data to make it easier for the data community to work with. The scripts perform the following steps:
- Import files
- Extract from file header the year of reference
- Extract the IBGE city code
- Convert object columns to numeric (in this dataset, some columns use a period as the decimal separator and others use a comma)
- Export a CSV to output_income_balances/health_income_balance_<year>.csv
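The numeric conversion and export path from the steps above can be sketched as follows. This is a minimal illustration, not the notebook's actual code: the helper names `parse_decimal` and `output_path` are hypothetical, and the sketch assumes that when a value contains both a period and a comma, the one appearing last is the decimal separator.

```python
from pathlib import Path


def parse_decimal(text: str) -> float:
    """Parse a number whose decimal separator may be a period or a comma.

    Assumption (not from the notebook): if both separators appear,
    the one that occurs last is treated as the decimal separator.
    """
    cleaned = text.strip()
    last_comma = cleaned.rfind(",")
    last_period = cleaned.rfind(".")
    if last_comma > last_period:
        # Comma is the decimal separator; periods are thousands separators.
        cleaned = cleaned.replace(".", "").replace(",", ".")
    else:
        # Period is the decimal separator (or there is none); drop commas.
        cleaned = cleaned.replace(",", "")
    return float(cleaned)


def output_path(year: int) -> Path:
    """Build the export path for a given reference year (hypothetical helper)."""
    return Path("output_income_balances") / f"health_income_balance_{year}.csv"
```

A value with a single separator and exactly three trailing digits, such as `1.234`, is ambiguous; this sketch reads it as a decimal number, which may not match the source files.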
To run the pipeline, open Dataprep Notebook.ipynb and follow the instructions in the notebook.
NOTE
This data pipeline was tested only with files from 2009 to 2019. It might not work on data from earlier years.
I update this data on Kaggle monthly. It is available at https://www.kaggle.com/jairofreitas/brazilian-universal-health-care-data