Expert Systems

This repository contains the Python code and data to reproduce the results presented in the paper: A. Occhipinti*, L. Rogers*, C. Angione, "A pipeline and comparative study of 12 machine learning models for text classification", Expert Systems with Applications, 201 (2022): 117193

How to run

The following steps are required to run the code:

Python 3.6.x is required, a check is specific put into the code before it continues.
Jupyter notebook server is required
Enron spam corpus dataset is used for this paper, included is the tar zip folders containing the spam emails.
- AV application's will flag some emails as malicious/virus or a scam, this is fine and restore where necessary.
Ensure all pip dependencies are installed as listed in requirements.txt
Run through the steps laid out in the notebook.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.idea		.idea
data		data
notebooks		notebooks
r_scripts		r_scripts
results		results
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Expert Systems

How to run

About

Releases

Packages

Contributors 2

Languages

Angione-Lab/12-machine-learning-models-for-text-classification

Folders and files

Latest commit

History

Repository files navigation

Expert Systems

How to run

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages