Chineses Long Text NLP

These techniques enable us to swiftly extract key information from Chinese text and classify similar texts, significantly enhancing the efficiency of our subsidiary in information retrieval and analysis. In the field of natural language processing, handling long texts remains a major challenge for language models, and we have proposed a two-stage model approach to address the difficulties posed by Chinese long texts.

Installation

Environment

python >= 3.８

Setting up the Python Environment

Install miniconda, you can refer to the official documentation:

wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh

Create a Python environment:

conda create -n <name> python=3.8

Python Package Installation

GPU Version：

pip install -r requirements.txt

Quickstart

Navigate to the project directory and set PYTHONPATH:

cd 
export PYTHONPATH="$PWD/src"

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
ci		ci
data		data
docs		docs
models		models
notebooks		notebooks
reports		reports
src		src
tests		tests
.gitignore		.gitignore
AUTHORS.rst		AUTHORS.rst
CHANGELOG.rst		CHANGELOG.rst
Dockerfile		Dockerfile
MANIFEST.in		MANIFEST.in
README.md		README.md
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements-cpu.txt		requirements-cpu.txt
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chineses Long Text NLP

Installation

Environment

Setting up the Python Environment

Python Package Installation

Quickstart

Data Preparation

Label Studio Data

Inference Data

Modeling

About

Releases

Packages

Languages

vic4code/chinese-long-text-nlp

Folders and files

Latest commit

History

Repository files navigation

Chineses Long Text NLP

Installation

Environment

Setting up the Python Environment

Python Package Installation

Quickstart

Data Preparation

Label Studio Data

Inference Data

Modeling

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages