You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am trying to use DocTR for a document which as texts arranged in two columns and has dense texts. I noticed that the text detection is incorrect. It identified multiple overlapping text blocks because of which the text output is also incorrect.
Here's the original document:
Here's the OCR plot:
Here's the segmentation result:
How to address it?
Code snippet to reproduce the bug
from doctr.io import DocumentFile
# PDF
pdf_doc = DocumentFile.from_pdf("path/to/your/doc.pdf")
Error traceback
No error but the output is incorrect
Environment
python 3.10
Deep Learning backend
Torch
The text was updated successfully, but these errors were encountered:
It's already planned to retrain all detection models with our new augmentation pipeline and an extended dataset for pretraining to make them more robust.
Could you please give "db_mobilenet_v3_large" as detection arch a try (this model is already pretrained with our new augmentation pipeline).
Bug description
I am trying to use DocTR for a document which as texts arranged in two columns and has dense texts. I noticed that the text detection is incorrect. It identified multiple overlapping text blocks because of which the text output is also incorrect.
Here's the original document:
Here's the OCR plot:
Here's the segmentation result:
How to address it?
Code snippet to reproduce the bug
Error traceback
No error but the output is incorrect
Environment
python 3.10
Deep Learning backend
Torch
The text was updated successfully, but these errors were encountered: