Data Analyst - Impact Analysis of Monkeypox Case Study

Features

Data Preprocessing: Loading, cleaning, and transforming raw data for analysis.
Descriptive Statistics: Overview and statistical description of Monkeypox cases by country and region.
Data Visualization: Time-series plots, Bar charts, Line plots, Annotated visualizations, and Tables to visualize trends and distributions.

Technologies Used

Python (3.x)
Jupyter Notebook
Pandas
Matplotlib
Seaborn

How to Use the Program

Clone the Repository:

git clone https://github.com/RyanGA09/DataAnalyst-ImpactAnalysisOfMonkeypoxCaseStudy.git

Create a Virtual Environment:
```
python -m venv venv
```
Activate the Virtual Environment:
- On Window:
```
venv\Scripts\activate
```
- On macOS and Linux:
```
source venv/bin/activate
```
Install Required Packages:
```
pip install -r requirements.txt
```
Filter Data

Before working on the notebooks, filter the datasets using the provided Python script. You can execute the script by running:
```
python filter_monkeypox_data.py
```
This will process the raw data and generate the necessary filtered datasets for further analysis.
Open the Jupyter Notebook:
- To perform business understanding, gather data, and data cleaning, open the notebook located in the analysis_processing directory, which focuses on data processing tasks:
```
jupyter notebook notebooks/analysis_processing/{start_year}_{start_month}_to_{end_year}_{end_month}/Notebook.ipynb
```
- To conduct Exploratory Data Analysis (EDA) and data visualization, open the notebook in the visualization subdirectory, which is focused on further analysis and visual representation of the data:
```
jupyter notebook notebooks/visualization/Notebook_visualization_{start_year}_{start_month}_to_{end_year}_{end_month}.ipynb
```
Run the Cells:
- In the analysis_processing notebooks, execute each cell sequentially to perform data cleaning, data preparation, and business understanding steps.
- In the visualization notebooks, run each cell to conduct exploratory data analysis (EDA), and create data visualizations based on the processed data.

Dataset Information

Data Source

The dataset used for this project is available in the data/raw/original directory or can be downloaded from the Monkeypox Data Source.

Downloading and Placing the Data

Download Data: If you choose to download the data, you can obtain it from the Monkeypox Data Source.
Placing the Data: After downloading, ensure that the data file is placed in the data/raw/original/ directory. This is where the original, unprocessed data should be stored before any filtering or processing takes place.

Data Filtering Process

Once the original data file is stored in the data/raw/original directory, it will be processed and filtered as per the requirements of the analysis. The filtered data will be saved in the data/raw/filtered directory. The filtering process includes:

Removing irrelevant or incomplete data
Selecting relevant subsets of data for further analysis
Optimizing the data format and quality to meet the needs of the project

Data Processing and Analysis

After the filtered data is prepared in the data/filtered directory, it will undergo further processing and analysis. The processed data, which is used for Exploratory Data Analysis (EDA) and visualization, will be stored in the data/processed directory. This stage includes:

Conducting exploratory data analysis (EDA) to uncover patterns, trends, and insights
Cleaning the data further, if needed, for visualization and statistical analysis
Generating data visualizations to better understand the trends and relationships in the dataset

Project Structure

ImpactAnalysisOfMonkeypoxCaseStudy/
│
├── data/                                                      # Contains the datasets
│   ├── processed/                                             # Contains the processed data, used for EDA and visualization.
│   └── raw/                                                   # Contains the original and filtered data directory
│        ├── filtered/                                         # Contains the filtered data, ready for analysis.
│        └── original/                                         # Contains the original, unfiltered & unprocessed data.
├── notebooks/                                                 # Contains the jupyter notebooks code
├── filter_monkeypox_data.py                                   # Python code for filtering dataset
├── README.md                                                  # Project documentation and usage instructions
└── requirements.txt                                           # List of required Python libraries

Support Me

License

MIT LICENSE

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Analyst - Impact Analysis of Monkeypox Case Study

Features

Technologies Used

How to Use the Program

Dataset Information

Data Source

Downloading and Placing the Data

Data Filtering Process

Data Processing and Analysis

Project Structure

Read More

Support Me

License

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
.vscode		.vscode
data		data
notebooks		notebooks
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
filter_monkeypox_data.py		filter_monkeypox_data.py
requirements.txt		requirements.txt

License

RyanGA09/DataAnalyst-ImpactAnalysisOfMonkeypoxCaseStudy

Folders and files

Latest commit

History

Repository files navigation

Data Analyst - Impact Analysis of Monkeypox Case Study

Features

Technologies Used

How to Use the Program

Dataset Information

Data Source

Downloading and Placing the Data

Data Filtering Process

Data Processing and Analysis

Project Structure

Read More

Support Me

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages