A Template of Organizing NLP/ML Projects

We provide an example directory of NLP/ML projects filled with data sets, source files, evaluation scripts, and model checkpoints. The hierarchy has been practiced in our previous projects, and it shows reasonable flexibility for the model development process (e.g, tunning hyperparameters, stacking different model components, integrating third-party libraries while keeping the project separated).

Overview

There are several principles we will follow,

Keeping preprocessing and postprocessing steps clear.

Separating data preparation with model source files.

Separating evaluation scripts from model source files.

The subdirectories are

data contains datasets, their preprocessing scripts, and intermediate results.
modules are basic blocks for building your model. (encoders, decoders, our fancy modification of the Transformer).
models are glued modules which accomplish the NLP task.
configs hosts configuration files.
eval contains evaluation scripts.
util are helping tools.
ckpts contains model checkpoints.
train.py the training script.
run.py the deploying script.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Template of Organizing NLP/ML Projects

Overview

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
ckpts		ckpts
configs		configs
data		data
eval		eval
models		models
modules		modules
utils		utils
README.md		README.md
run.py		run.py
train.py		train.py

AntNLP/antnlp-tproj

Folders and files

Latest commit

History

Repository files navigation

A Template of Organizing NLP/ML Projects

Overview

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages