These techniques enable us to swiftly extract key information from Chinese text and classify similar texts, significantly enhancing the efficiency of our subsidiary in information retrieval and analysis. In the field of natural language processing, handling long texts remains a major challenge for language models, and we have proposed a two-stage model approach to address the difficulties posed by Chinese long texts.
- python >= 3.8
- Install miniconda, you can refer to the official documentation:
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh
- Create a Python environment:
conda create -n <name> python=3.8
GPU Version:
pip install -r requirements.txt
- Navigate to the project directory and set
PYTHONPATH
:
cd
export PYTHONPATH="$PWD/src"
- Information Extraction
- Text Classification:
- UTC:
- UIE (Optional):
Coming soon...