In this capstone project, we will predict whether the Falcon 9 first stage will land successfully. SpaceX advertises Falcon 9 rocket launches on its website at a cost of 62 million dollars per launch, while other providers charge upwards of 165 million dollars each. Much of SpaceX's savings are due to their ability to reuse the first stage of the rocket. Therefore, determining if the first stage will land successfully can help predict the cost of a launch. This information can be valuable for an alternate company that wants to bid against SpaceX for a rocket launch.
- Develop Python code to manipulate data in a Pandas DataFrame.
- Convert a JSON file into a Pandas DataFrame.
- Create a Jupyter notebook and make it shareable using GitHub.
- Utilize data science methodologies to define and formulate a real-world business problem.
- Use data analysis tools to load a dataset, clean it, and extract interesting insights.
This project is divided into several modules, each focusing on different aspects of data science and machine learning. You will be provided with an overview of the problem and the tools you need to complete the course.
- Learn how to collect and preprocess data.
- Convert a JSON file into a Pandas DataFrame.
- Clean the data and handle missing values.
- Load the dataset into a Jupyter notebook.
- Perform exploratory data analysis to uncover insights and patterns.
- Visualize the data using various plotting libraries.
- Identify and create new features that can improve the prediction model.
- Encode categorical variables using techniques like one-hot encoding.
- Develop machine learning models to predict the landing outcome of the Falcon 9 first stage.
- Evaluate the performance of different models using metrics like accuracy and confusion matrix.
- Fine-tune the model for better performance.
- Deploy the model and make it accessible for real-time predictions.
- Launch Jupyter Notebook:
jupyter notebook
- Open the notebook:
- Navigate to the directory where you cloned the repository.
- Open the Jupyter notebook file (
.ipynb
) and follow the instructions to complete each module.
If you wish to contribute to this project, please follow these steps:
- Fork the repository.
- Create a new branch (
git checkout -b feature-branch
). - Make your changes.
- Commit your changes (
git commit -m 'Add some feature'
). - Push to the branch (
git push origin feature-branch
). - Open a pull request.
This project is licensed under the MIT License - see the LICENSE file for details.
- IBM Data Science Professional Certificate
- SpaceX for providing the data