Skip to content

Latest commit

 

History

History
16 lines (10 loc) · 1.18 KB

README.md

File metadata and controls

16 lines (10 loc) · 1.18 KB

Team 7 (Tanmay Kshirsagar, Sunisha Harish, Kunal Inglunkar, Alejandra Mejia)

This is the repository for Team 7 for 6103 Introduction to Data Mining class Project.

Salary Prediction for Data Science roles

Data science combines multiple disciplines like math, statistics, specialized programming, advanced analytics, artificial intelligence, and machine learning which helps to uncover actionable insights hidden in data. According to the Harvard Business Review (2022), the amount of data, analytics, and AI in business and society seem unlikely to decline, therefore, data scientist jobs will only continue to grow in its importance. Hence, knowing the factors that affect the salary estimate will be beneficial for professionals working in this field.

Our smart questions:

  1. Does salary estimate vary between the states in the USA and data science roles (job titles, description)?
  2. Is the salary estimate in the US correlated with revenue of the company?
  3. Does the salary estimate in the US depend on the industry and the sector?

For modeling, we will try to predict the average estimated salary in the data science roles.

Dataset used: https://github.com/picklesueat/data_jobs_data