- Understand a new dataset.
- Analyze the time series and study its characteristics.
- Train a model to predict the amount of water in different areas.
Follow the instructions below:
- Create a new repository based on machine learning project by clicking here.
- Open the newly created repository in Codespace using the Codespace button extension.
- Once the Codespace VSCode has finished opening, start your project by following the instructions below.
Once you have finished solving the exercises, be sure to commit your changes, push to your repository and go to 4Geeks.com to upload the repository link.
This project is going to be done at Kaggle. Kaggle is known for organizing data science competitions in which individuals and teams can compete to create the best model on a variety of tasks.
Specifically, we will explore a competition that has now ended, which rewarded the top performers with $25,000, distributed to the top four solutions.
The first step to participate in a competition is to have a Kaggle account. Registration is very simple and versatile.
Although it is closed, you can still upload your solutions and compare yourself with the rest of the competitors in the ranking. By clicking here you can access the competition where the purpose is to predict whether a customer will default in the future.
Read the information about the problem description, evaluation metrics, timeline and prizes. Once you have an overview of the competition, prepare the notebook with the code development and upload it, compare yourself with the other developers.
Follow these tips:
- Look at the solutions proposed by other developers. Some use time series, others use supervised algorithms such as linear regression, decision trees, etc. Investigate which model might work best for this use case and explore all options.
- Whatever model you choose, be sure to process and explore the data as studied in past modules.
- Trial and error! Don't pretend to get the best model the first time, keep trying.