5.Results
6.Licensing, Authors, and Acknowledgements
No libraries are needed to run the code beyond those in the Anaconda distribution of Python. The code should run without issues on any Python 3.* version.
Stack Overflow survey data from 2017 was used to better understand developers.
The questions I was interested in were:
1. Is collaborating with remote peers harder than collaborating with those on site?
2. What is the most used IDE?
3. Which factors matter for career satisfaction?
There is only one notebook, named 'Know more about developers'. It contains the work related to the above questions, organized in sections. Markdown cells and inline comments are used to walk through the thought process behind individual steps.
For this project, I used the pandas and numpy libraries to manipulate the data for analysis. In addition, sklearn was used to split the data, fit a model, and calculate various metrics for the final model.
Matplotlib was mainly used for drawing graphs to present the results.
The collections module was used for data structures that help with manipulating the data.
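As an illustration of the kind of manipulation where collections helps, here is a minimal sketch of tallying a multi-select survey answer, such as the IDE question. The sample responses, the semicolon delimiter, and the helper name `count_choices` are assumptions made for this example and are not taken from the notebook.

```python
from collections import Counter


def count_choices(responses, sep=";"):
    """Tally individual choices from multi-select answers.

    Each response is one string holding several choices joined by `sep`.
    Missing answers (None or empty strings) are skipped.
    """
    counts = Counter()
    for response in responses:
        if not response:
            continue
        # Split on the delimiter and strip whitespace around each choice.
        choices = [choice.strip() for choice in response.split(sep)]
        counts.update(choice for choice in choices if choice)
    return counts


# Made-up sample answers, not real survey data.
sample = ["Vim; Atom", "Atom", None, "Visual Studio; Atom; Vim", ""]
ide_counts = count_choices(sample)
print(ide_counts.most_common(2))  # [('Atom', 3), ('Vim', 2)]
```

When the answers come from a pandas column, missing values are NaN rather than None, so it is probably simplest to call `dropna()` on the column before passing it to a helper like this.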
The main findings of the code can be found at the post available here.
Credit goes to Stack Overflow for the data.
You can find the licensing for the data and other descriptive information at the Kaggle link available here. Otherwise, feel free to use the code here as you would like!