pip install -r requirements.txt
streamlit run app.py
👉 Check out the YouTube demo.
- Input an image from a local file or a URL.
- Leverage deep learning models to extract text from the image:
  - Use VNPF's site as the source of the collected data.
  - Apply models based on the results of NomNaOCR.
- Interactive mode using streamlit-drawable-canvas:
  - Drawing mode: draw rectangular boxes on image regions containing characters.
  - Editing mode: rotate, skew, scale, or move any box on the canvas on demand.
  - Undo, Redo or Delete canvas contents.
- Saving OCR results:
- Translate using APIs from:
  - VNUHCM University of Science: https://www.clc.hcmus.edu.vn/?page_id=3039
  - Sino-Nôm dictionary: https://hvdic.thivien.net/transcript.php#trans
(*) Note: In Editing mode, double-click a box to remove it.
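
To illustrate how boxes drawn in Drawing mode could be turned into crop regions for the OCR models, here is a minimal sketch. It assumes the canvas state is available as Fabric.js-style JSON (streamlit-drawable-canvas exposes this through the `json_data` attribute of its result), with each rectangle carrying `left`, `top`, `width`, `height`, `scaleX` and `scaleY`. The helper name `rects_to_boxes` is hypothetical and not part of this app, and rotation or skew applied in Editing mode is ignored here.

```python
from typing import Any


def rects_to_boxes(canvas_json: dict[str, Any]) -> list[tuple[int, int, int, int]]:
    """Convert Fabric.js-style rect objects to (left, top, right, bottom) pixel boxes.

    Hypothetical helper: assumes unrotated rectangles; non-rect objects are skipped.
    """
    boxes = []
    for obj in canvas_json.get("objects", []):
        if obj.get("type") != "rect":
            continue
        # Fabric.js keeps the unscaled size and separate scale factors,
        # so the on-canvas size is width * scaleX by height * scaleY.
        width = obj["width"] * obj.get("scaleX", 1)
        height = obj["height"] * obj.get("scaleY", 1)
        left, top = obj["left"], obj["top"]
        boxes.append((round(left), round(top), round(left + width), round(top + height)))
    return boxes


# Sample canvas state: one scaled rectangle and one freehand path (ignored).
sample = {
    "objects": [
        {"type": "rect", "left": 10, "top": 20, "width": 100, "height": 40,
         "scaleX": 1.5, "scaleY": 1},
        {"type": "path"},
    ]
}
boxes = rects_to_boxes(sample)
print(boxes)
```

Each resulting tuple matches the box format accepted by `PIL.Image.crop`, so the regions could be cropped from the input image before recognition, provided the canvas and the image share the same pixel size.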
My Vietnamese Sino-Nôm digitalization series: