THUMT is an open-source toolkit for neural machine translation developed by the Tsinghua Natural Language Processing Group. This page describes the document of THUMT-PyTorch.
- CUDA 10.0
- PyTorch
- TensorFlow-2.0 (CPU version)
pip install --upgrade pip
pip install thumt
- Multi-GPU training & decoding
- Multi-worker distributed training
- Mixed precision training & decoding
- Model ensemble & averaging
- Gradient aggregation
- TensorBoard for visualization
We provide a step-by-step walkthrough with a running example: WMT 2018 Chinese-English news translation shared task.
We provide benchmarks on several datasets. See here.