This project analyzes COVID-19 data to explore factors affecting mortality rates, focusing on age and gender demographics. It includes data cleaning, descriptive statistics, hypothesis testing, and data visualization using R.
-
Clone the repository: git clone https://github.com/your-username/covid19-data-analysis.git
-
Install required packages: install.packages("Hmisc")
-
Download the COVID-19 dataset (
COVID19_line_list_data.csv
) and place it in the project directory.
-
Open the R script
covid_analysis.R
in your preferred R environment. -
Run the script to perform data analysis: source("covid_analysis.R")
-
The script will read the dataset, clean the data, perform statistical analysis, and generate visualizations.
-
Explore the generated visualizations and analysis results to understand the relationship between age, gender, and COVID-19 mortality rates.
Contributions to this project are welcome! If you have suggestions for improvements, new analysis techniques, or bug fixes, please follow these steps:
- Fork the repository.
- Create a new branch (
git checkout -b feature-branch
). - Make your changes and commit them (
git commit -am 'Add new feature'
). - Push to the branch (
git push origin feature-branch
). - Create a new pull request.
For support or inquiries, please open an issue on GitHub or contact [email protected]