Release 0.5.2
- Split sequence data is now stored in the ONNX file instead of being hardcoded: #21
- Added `verbose` argument to the `split(..)` method of the Python bindings to display a progress bar
- Retrained Chinese model with properly removed punctuation
- Retrained German model with compound splitting as an additional split level
- docs.rs documentation now has all features enabled
- Added methods to get the levels of the current models:
  - Python: `splitter.get_levels()`
  - JS: `splitter.getLevels()`
  - Rust: `splitter.logic().split_sequence().get_levels()`
- NNSplit now has a website with demo, benchmarks and metrics! https://bminixhofer.github.io/nnsplit/