Skip to content

All my Kaggle kernels (public jupyter notebooks) as a competition contributor - COMPUTER VISION, N.L.P and more

Notifications You must be signed in to change notification settings

obrunet/Kaggle_kernels_2019

Repository files navigation

Kaggle_kernels

Here are all the challenges i've made so far. I've started several weeks ago, but now i clean and make those kernels public. Side note: some of my solutions require write permission on the hard drive, so very few kernel could have been released on Kaggle (because write permissions aren't allowed)...


COMPUTER VISION

Fashion MNIST: Kaggle - Notebook - PDF

  • Use case: use a neural network on a classification task of clothes' images
  • Data: 28x28 grayscale images associated with a label from 10 classes
  • Concepts: Multi Layers Perceptron with Tensorflow .

Image classification: Notebook - PDF

  • Use case: Distinguish images of dogs from cats
  • Data: binary classification of images in color, 25,000 labeled and 12,500 unlabeled for the submission purpose
  • Concepts: deep learning / computer vision using CNN with Tensorflow

Handwritten Digit Generation: Notebook - PDF

  • Use case: generation of new digits
  • Data: the famous MNIST data intended to learn computer vision fundamentals
  • Concepts: unsupervised deep learning with G.A.N - training of a Generator and a Discriminator

N.L.P

Quora Insincere Questions Classification - PART 1/2 : Kaggle - Notebook - PDF

  • Use case: Predict whether a question asked on Quora is sincere or not
  • Data: 1.3M labelled questions text
  • Concepts: supervised ML, 1st part : using N.L.T.K, tokenization, stemming, TF-IDF and CountVectorizer

Quora Insincere Questions Classification - PART 2/2 - COMING SOON : Kaggle - Notebook - PDF

  • Concepts: Word embedding, Word2Vec, R.N.N

RECOMMENDATION SYSTEM

Hybrid Recommendation Engine: Kaggle - Notebook - PDF

  • Use case: playlist generators for video and music services like Netflix, YouTube and Spotify...
  • Data: 100,000 ratings from 1000 users on 1700 movies (MovieLens 100K Dataset)
  • Concepts: supervised ML, Hybrid recommender system (mix collaborative/content-based filtering) with lightFM

DATA SCIENCE

Bike Sharing Demand: Kaggle - Notebook - PDF

  • Use case: forecast rentals of a city bikeshare system
  • Data: datetime, weather infos, rentals number
  • Concepts: supervised ML regression (__GradientBoosting Reg, Ridge/Lasso), metric: RMSLE

Adult Census Income: Kaggle - Notebook - PDF

  • Use case: predict whether income exceeds $50K/yr based on census, define people profiles
  • Data: age, workclass, education, marital-status, occupation, race, sex, capital-gain/loss,hours-per-week, country.
  • Concepts: supervised ML binary classification, model explanation / feature analysis, GridsearchCV.

Customer Segmentation: Kaggle - Notebook - PDF

  • Use case: customer segmentation, target customers with whom you can start marketing strategy
  • Data: customerID, gender, age, annual income (k$) & spending score from a supermarket mall customers
  • Concepts: unsupervised ML (KMeans Clustering)

Fraud Detection: Kaggle - Notebook - PDF

  • Use case: anomalie / fraud detection
  • Data: anonymized credit card transactions labeled as fraudulent or genuine, recorded over 2 days
  • Concepts: supervised ML binary classification, classes highly imbalanced, metric: Area Under the Precision-Recall Curve (AUPRC), PCA, Synthetic Minority Over-sampling Technique (SMOTe) & LOF model (LocalOutlierFactor)

Real Estate Price: Kaggle - Notebook - PDF

  • Use case: predict a real estate price
  • Data: Median house prices for California districts derived from the 1990 census.
  • Concepts: supervised ML regression, analysis of geospatial data

Home Credit Default Risk: Kaggle - Notebook - PDF

  • Use case: predict whether or not an applicant will be able to repay a loan
  • Data: previous credits, POS (point of sales), cash loans, previous applications and repayment history
  • Concepts: supervised ML binary classification (LightGBM, XGBoost), imbalanced classes, metric: area under the ROC curve

Customer Churn: Kaggle - Notebook - PDF

  • Use case: predict behavior to retain customers
  • Data: the ones who left, services, account infos, contract, payment method, charges, demographic info
  • Concepts: supervised ML binary classification, metric: F1 score, hyperparameters tuning, pipelines of models

House Prices: Kaggle - Notebook - PDF

  • Use case: predict sales prices
  • Data: area, shape, condition, construction year...
  • Concepts: supervised ML regression, practice feature engineering, RFs, and gradient boosting

About

All my Kaggle kernels (public jupyter notebooks) as a competition contributor - COMPUTER VISION, N.L.P and more

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published