This repository contains a Python script for processing and analyzing data, with examples focused on COVID-19 case data from the World Health Organization (WHO).
- Data extraction
- Data cleaning
- Generation of summary output files
- Python 3.x
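- Required libraries:
  - pandas
  - os (part of the Python standard library)
  - datetime (part of the Python standard library)
Only pandas needs to be installed separately. You can install it using:
pip install pandas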
- Clone the repository:
git clone https://github.com/marcelinaszcz95/who-covid-cases-analysis.git
- Navigate to the project directory:
cd who-covid-cases-analysis
This project uses data provided by the World Health Organization (WHO). The data is freely available and can be accessed here.
Attribution: World Health Organization. License: CC BY 4.0.
- Ensure the data file is located in the data/raw directory.
- Run the analysis script: python src/covid_data_cleaning.py
This processes the raw data, performs the analysis, and generates output CSV files with clean, sorted data for further visualization in Tableau. The script generates both World and EU data. To generate only World data, comment out the EU-related part of the code.
Cleaned data files in data/processed
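
For orientation, here is a minimal sketch of the kind of cleaning and World/EU split described above. It is not the code in src/covid_data_cleaning.py. The column names (`Date_reported`, `Country`, `New_cases`), the helper functions `clean_cases` and `split_world_eu`, and the shortened `EU_COUNTRIES` set are all assumptions made for illustration, so adjust them to match your raw file.

```python
import pandas as pd

# Assumed column names; change these to match the raw file in data/raw.
DATE_COL = "Date_reported"
COUNTRY_COL = "Country"
CASES_COL = "New_cases"

# Illustrative subset only, NOT the full list of EU member states.
EU_COUNTRIES = {"France", "Germany", "Poland"}


def clean_cases(raw: pd.DataFrame) -> pd.DataFrame:
    """Return a cleaned copy of the data, sorted by country and date."""
    df = raw.copy()
    # Parse dates; values that cannot be parsed become NaT and are dropped below.
    df[DATE_COL] = pd.to_datetime(df[DATE_COL], errors="coerce")
    # Treat missing or non-numeric case counts as zero reported cases.
    df[CASES_COL] = (
        pd.to_numeric(df[CASES_COL], errors="coerce").fillna(0).astype(int)
    )
    # Drop rows without a usable date or country, then exact duplicates.
    df = df.dropna(subset=[DATE_COL, COUNTRY_COL]).drop_duplicates()
    return df.sort_values([COUNTRY_COL, DATE_COL]).reset_index(drop=True)


def split_world_eu(clean: pd.DataFrame):
    """Return (world, eu), where eu keeps only rows for EU_COUNTRIES."""
    eu = clean[clean[COUNTRY_COL].isin(EU_COUNTRIES)].reset_index(drop=True)
    return clean, eu
```

Each returned frame can then be saved with `DataFrame.to_csv(path, index=False)` to a file under data/processed. Skipping the EU frame corresponds to generating World data only.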