This is the repository for Team 7 for 6103 Introduction to Data Mining class Project.
Data science combines multiple disciplines like math, statistics, specialized programming, advanced analytics, artificial intelligence, and machine learning which helps to uncover actionable insights hidden in data. According to the Harvard Business Review (2022), the amount of data, analytics, and AI in business and society seem unlikely to decline, therefore, data scientist jobs will only continue to grow in its importance. Hence, knowing the factors that affect the salary estimate will be beneficial for professionals working in this field.
- Does salary estimate vary between the states in the USA and data science roles (job titles, description)?
- Is the salary estimate in the US correlated with revenue of the company?
- Does the salary estimate in the US depend on the industry and the sector?
For modeling, we will try to predict the average estimated salary in the data science roles.
Dataset used: https://github.com/picklesueat/data_jobs_data