[detection / classification] Train upcoming detection architecture & orientation models #1459

Open
7 of 8 tasks
Tracked by #1752
felixdittrich92 opened this issue Feb 9, 2024 · 13 comments

Comments

@felixdittrich92
Contributor

felixdittrich92 commented Feb 9, 2024

🚀 The feature

FAST - Detection model

PyTorch:

  • fast_tiny
  • fast_small
  • fast_base

TF:

  • fast_tiny
  • fast_small
  • fast_base

Orientation models:

Training scripts:
https://github.com/mindee/doctr/blob/main/references/classification/train_pytorch_orientation.py
https://github.com/mindee/doctr/blob/main/references/classification/train_tensorflow_orientation.py

TF & PT Crop: (with the recognition dataset)

  • mobilenet_v3_small_orientation (running)

TF & PT Page: (with the detection dataset)

@felixdittrich92 felixdittrich92 added type: enhancement Improvement module: models Related to doctr.models framework: pytorch Related to PyTorch backend framework: tensorflow Related to TensorFlow backend topic: text detection Related to the task of text detection topic: character classification Related to the task of character classification labels Feb 9, 2024
@felixdittrich92 felixdittrich92 added this to the 0.9.0 milestone Feb 9, 2024
@SkaarFacee
Contributor

Is anyone working on this? I would love to contribute in any way possible. This is something I look forward to learning as well :)

@felixdittrich92
Contributor Author

Hi @SkaarFacee :)
This issue can only be solved by @odulcy-mindee because he is the only one with access to the machines and the Mindee internal datasets :)

@SkaarFacee
Contributor

Oh, understood. Is there any other similar issue I can work on instead? If not, can you suggest some issues?

@felixdittrich92
Contributor Author

felixdittrich92 commented Feb 13, 2024

Hey @SkaarFacee sure :)

#1375 should be super easy and fast to solve :)

Or if you want to work on something bigger and more deep learning specific:
#1411

@SkaarFacee
Contributor

Hey, I'll take a look at them. Thanks again

@TomekPro

Hi @SkaarFacee and @felixdittrich92,
could you please share the status here? I checked the "fast" detection models and it seems that they are not trained yet. Is there any chance you will add them in the near future?

@felixdittrich92
Contributor Author

> Hi @SkaarFacee and @felixdittrich92, could you please share the status here? I checked the "fast" detection models and it seems that they are not trained yet. Is there any chance you will add them in the near future?

Hey @TomekPro, yep. We finished some fixes and experiments today and will spin up the final runs now, so we will publish the checkpoints one by one over the next few weeks (one model takes about one week to train).

@TomekPro

Great, thanks! Will those be for PyTorch or TF?

@felixdittrich92
Contributor Author

> Great, thanks! Will those be for PyTorch or TF?

We will start with PyTorch :)

@TomekPro

@felixdittrich92 any update on this? Thanks!

@felixdittrich92
Contributor Author

> @felixdittrich92 any update on this? Thanks!

Yep, after crashing two machines it is running now. I think we can publish the first checkpoint on Monday 😅

@odulcy-mindee
Collaborator

Yeah, we got a bit (too) unlucky this time 😅 Sorry for the delay, the model checkpoint will be released soon.

@felixdittrich92
Contributor Author

@TomekPro The first model is available. Reparameterization is applied by default when it is used through ocr_predictor(det_arch="fast_tiny", ..) :)
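
For anyone who wants to try the released checkpoint, here is a minimal sketch of how it might be loaded through `ocr_predictor`. It is an illustration, not an official recipe: the helper names `build_fast_predictor` and `run_ocr` are hypothetical, the recognition architecture `crnn_vgg16_bn` and the image path are assumptions that this thread does not specify, and only `fast_tiny` is confirmed above as published.

```python
# Detection architecture names listed in this issue for the FAST family.
FAST_ARCHS = ("fast_tiny", "fast_small", "fast_base")


def build_fast_predictor(det_arch: str = "fast_tiny", reco_arch: str = "crnn_vgg16_bn"):
    """Hypothetical helper: build a docTR OCR predictor with a FAST detector.

    reco_arch is an assumption; the thread only mentions det_arch.
    """
    if det_arch not in FAST_ARCHS:
        raise ValueError(f"unknown FAST architecture: {det_arch!r}")
    # Imported lazily so this module can be loaded without docTR installed.
    from doctr.models import ocr_predictor

    # pretrained=True asks docTR for the published weights of each model.
    return ocr_predictor(det_arch=det_arch, reco_arch=reco_arch, pretrained=True)


def run_ocr(image_path: str, det_arch: str = "fast_tiny") -> str:
    """Hypothetical helper: run OCR on one image and return the plain text."""
    from doctr.io import DocumentFile

    predictor = build_fast_predictor(det_arch)
    pages = DocumentFile.from_images(image_path)  # image_path is a placeholder
    return predictor(pages).render()
```

Calling `run_ocr("sample.jpg")` (with a real image in place of the placeholder path) should return the recognized text. Per the comment above, reparameterization is handled inside `ocr_predictor`, so the sketch does not call it explicitly.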

4 participants