Skip to content

Performed EDA and feature engineering on flight dataset consisting of 5.8 million records. Developed classification and regression models to predict flight delays and cancellations from various features by building PySpark pipelines.

Notifications You must be signed in to change notification settings

surjits254/Flights-Delay-Cancellation-Prediction

Repository files navigation

Filght-Delay-Cancellation-Prediction

  • Performed EDA and feature engineering on flight dataset consisting of 5.8 million records
  • Developed classification and regression models to predict flight delays and cancellations using pyspark pipelines
  • Trained logistic regression, Random Forest using Pyspark ML

About

Performed EDA and feature engineering on flight dataset consisting of 5.8 million records. Developed classification and regression models to predict flight delays and cancellations from various features by building PySpark pipelines.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published