Predicting primary sub-categories of statistics arXiv papers

ehan03/arxiv-stat-nlp

CPSC 477 Final Project

Predicting Primary Sub-Categories of Statistics ArXiv Papers

Team Members: Eugene Han, Ali Aldous, Elder Veliz

Setup

  1. Clone the repository.
  2. Create a new conda environment with the following command:
     conda env create -f env.yml

Please refer to the YAML file to see all dependencies/libraries used and their versions.

  3. Download the baseline and fine-tuned RoBERTa models from this link.
  4. Place the downloaded models in the models directory.
  5. Run the cells of full_pipeline.ipynb in the notebooks folder. We recommend skipping the training cells, since the models are already trained and saved in the models directory. To replicate the results in the report, go to the evaluation section.
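
Before opening the notebook, it can help to confirm that the downloaded models actually landed in the models directory. The following is a minimal sketch, not part of the project itself: the helper name find_model_files is hypothetical, and it assumes the script is run from the repository root so that the relative path "models" resolves correctly. It makes no assumption about the model file names or formats and only lists whatever files are present.

```python
from pathlib import Path
from typing import List


def find_model_files(models_dir: str = "models") -> List[Path]:
    """Return every file under models_dir (recursively), sorted by path.

    Returns an empty list if the directory does not exist.
    Hypothetical helper for illustration; not part of the repository.
    """
    root = Path(models_dir)
    if not root.is_dir():
        return []
    return sorted(p for p in root.rglob("*") if p.is_file())


if __name__ == "__main__":
    # Assumes the current working directory is the repository root.
    files = find_model_files()
    if not files:
        print("No files found in 'models'; download the models first.")
    for path in files:
        print(path)
```

If the script reports no files, repeat the download and placement steps before running the evaluation cells.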
