This is a simple implementation of semantic search into Elasticsearch:
-
In this demo, we will use PhoBert, a pre-trained language models made specially for Vietnamese language.
-
Including in this demo is also a simple Flask server for you to have a quick demo on what the results would be.
- Ubuntu (Recommended)
- Docker
If you are planning to run the Flask app locally, you also need
The final image may take up to ~7GB and it can take some time to finish building.
# Without CUDA
sh cmd.run.all.sh
# With CUDA
sh cmd.run.all-cuda.sh
sh cmd.stop.all.sh
sh cmd.run.core.sh
poetry install
python src/utils/loadmodel.py
This process can take a long time as it is indexing over 100000, you can try reducing the file size manually.
python src/index_es.py
python3 -m flask --app=app run --host=0.0.0.0
sh cmd.run.clean.sh
Access the site at http://127.0.0.1:5000
Email: [email protected]
Facebook: fb.com/tu.nguyenquang01
Linkedin: linkedin.com/in/quangtudng