Polish-language spam classification and EDA of survey data

  1. data_science: spam classification using two approaches:

    • word2vec embeddings fed into LSTMs
    • Morfeusz-lemmatized text vectorized with TF-IDF and classified with Multinomial Naive Bayes
  2. analytical: exploratory data analysis (EDA) of noisy survey data
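
The TF-IDF and Multinomial Naive Bayes approach listed above can be sketched roughly as follows. This is a minimal, standard-library-only illustration and not the code from this repository: the function names (`tfidf_vectors`, `train_nb`, `predict`) and the toy messages are hypothetical, and the tokens are assumed to be already lemmatized (the step Morfeusz would perform).

```python
import math
from collections import Counter, defaultdict


def tfidf_vectors(docs):
    """Turn tokenized documents into {term: tf-idf weight} dicts."""
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    # Smoothed IDF, so a term present in every document keeps a positive weight.
    idf = {t: math.log((1 + n_docs) / (1 + df)) + 1 for t, df in doc_freq.items()}
    vectors = [{t: c * idf[t] for t, c in Counter(doc).items()} for doc in docs]
    return vectors, idf


def train_nb(vectors, labels, alpha=1.0):
    """Fit Multinomial Naive Bayes on weighted term counts (Laplace smoothing)."""
    vocab = {t for vec in vectors for t in vec}
    class_counts = Counter(labels)
    term_totals = {c: defaultdict(float) for c in class_counts}
    for vec, label in zip(vectors, labels):
        for term, weight in vec.items():
            term_totals[label][term] += weight
    model = {}
    for c, n in class_counts.items():
        denom = sum(term_totals[c].values()) + alpha * len(vocab)
        log_like = {t: math.log((term_totals[c][t] + alpha) / denom) for t in vocab}
        model[c] = (math.log(n / len(labels)), log_like)
    return model


def predict(model, idf, tokens):
    """Return the class with the highest log-posterior; unseen terms are ignored."""
    weights = {t: c * idf[t] for t, c in Counter(tokens).items() if t in idf}
    scores = {
        c: prior + sum(w * log_like[t] for t, w in weights.items())
        for c, (prior, log_like) in model.items()
    }
    return max(scores, key=scores.get)


# Hypothetical, already-lemmatized Polish messages (illustrative data only).
train_docs = [
    ["wygrać", "darmowy", "nagroda", "kliknąć"],
    ["darmowy", "kredyt", "kliknąć", "teraz"],
    ["spotkanie", "jutro", "biuro"],
    ["raport", "spotkanie", "wysłać", "jutro"],
]
train_labels = ["spam", "spam", "ham", "ham"]

vectors, idf = tfidf_vectors(train_docs)
model = train_nb(vectors, train_labels)

print(predict(model, idf, ["darmowy", "nagroda", "teraz"]))
print(predict(model, idf, ["spotkanie", "raport"]))
```

Feeding TF-IDF weights to Multinomial Naive Bayes treats them as fractional counts, which is a common practical shortcut rather than a strict use of the multinomial model.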