hiertext

A Pytorch implementation of Unified_Detector for scenetext detection and layout analysis.

i am still working on it on my spare time and any Advising/discussion is welcome ! 🤯:

update

2023-8-24 📆 the training on hiertext was not successful, model outputs are random strip-like shapes. using coco panoptic2017 dataset to train on the Maxdeeplab(from Max Deeplab) was not successful. will working on it.

purpose of this project

This project is for personal interests and selfstudy 🤡

The Unified Detctor model is originally from Google's Tensorflow project Unified Detector. This project is trying to build a similar model on Pytorch for further study on my spare time, inspired by a torch implementation of Max Deeplab.

TaskList

added a paragraph head to original maxdeeplab net
defined a paragraph grouping loss
modified loss computing code to support both raw and balanced style loss.
modified original maxdeeplab code to solve some OOM issue
defined a simple Hiertext dataset for torch dataloader
verrify the model
train the model on Hiertext dataset

reference

remark

the dims in original project config needs a huge number of Memory/FrameBuffer. eg. the mask instance output has shape [256, 384, 1024, 1024], for fp32 it takes 256 x 384 x 1024 x 1024 x 4 = 384 (GB) along. since the best device I can access is a Nvidia GPU with 32G fb, I have to reduce the num of masks under 40 and limit the batchsize to 4.
there are some code changes to reduce memory consumption. hopefully they will lead to same results. eg. use matmul to replace reducesum(expanddim elementwise multiply), use indexing for matched mask instead of multiply full matching matrix.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
max_deeplab		max_deeplab
modeling		modeling
.gitignore		.gitignore
README.md		README.md
datasets.py		datasets.py
run_inference.py		run_inference.py
train.py		train.py
unified_detector.py		unified_detector.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

hiertext

update

purpose of this project

TaskList

reference

remark

About

Releases

Packages

Languages

jaysontree/hiertext

Folders and files

Latest commit

History

Repository files navigation

hiertext

update

purpose of this project

TaskList

reference

remark

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages