The addresses of dgrl or image files need to be specified in the code.
weight of hwdb2:Baidu(78j0)
You can refer to decode_page_with_box.py for data processing of HWDB2.
The format of the label: x1 y1 x2 y2 x3 y3 x4 y4 label
page_imgs
│ 0.png
│ 2.png
│ ...
gt
│ gt_0.txt
│ gt_1.txt
| ...