Option to cutout documents from images before further processing #554

felixdittrich92 · 2021-10-28T08:51:24Z

what do you think about an option in Documentfile.from_images(.., try_cutout=True) which does the following:
Example
I have currently a modified, more stable version running in our company :)

Use Case for example mobile phone images from documents

Would be nice if i can implement this in doctr also :)
What do you think ?

fg-mindee · 2021-10-28T09:34:15Z

Hey there 👋

Actually we have tackled this internally a few weeks back and it will be integrated into docTR soon 😄
But this solution involves a DL model for segmentation. If you think this could benefit from a classic CV approach, we could discuss that option as well!

Cheers!

felixdittrich92 · 2021-10-28T09:41:09Z

Hi 👋,

in this case i would say if you are ready with your model lets compare both ways :)
I can prepare a notebook if you want which can be used for test purpose !? 😃

fg-mindee · 2021-10-28T09:45:31Z

That's a good idea indeed, if you could have a runnable Colab notebook so that we can compare this 👍 (not opening a PR, just sharing it here)

felixdittrich92 · 2021-10-28T12:05:55Z

Colab Example

Only a very basic example but for test purpose it should be enough :)
Let me know if you need anything else 👍

PS: i have also faiced that it works much better if it is resized much smaller and than before _four_point_transform calculate the points back in relation to the original size (not in the colab example)

fg-mindee · 2021-10-29T21:57:54Z

Thanks a lot, it looks promising for single page docs!
To make sure this matches the same specs, could you illustrate a situation where there are several pages on the same image? (the image segmentation does process it correctly)

felixdittrich92 · 2021-10-30T10:32:20Z

@fg-mindee
I think in this case it is really much more accurate to use the segmentation model do you have some benchmarks for this ? Or also a short colab ? 🤗

Have a nice weekend

fg-mindee · 2021-10-30T10:56:11Z

Well sure, but perhaps we could change your colab to make it work for multiple pages?

Regarding the segmentation option, no colab but it will be integrated into docTR within a week or 2!

felixdittrich92 · 2021-11-01T11:52:13Z

@fg-mindee
yes sure we can do it but i think this will not work very well 😄 let me prepare a sec colab for this :)

felixdittrich92 · 2021-11-02T18:08:30Z

@fg-mindee
will share the other notebook tomorrow

felixdittrich92 · 2021-11-03T12:17:05Z

@fg-mindee
2. example now also multipage images:
Colab Example Multipage

BUT: this works great but for prod it would be need many checks
I'm really excited how accurate and fast your segmentation solution is 🤗
Have you tested also slightly overlapping documents ?

fg-mindee · 2021-11-03T15:57:35Z

Nice 👍

I'm only concerned about the color filtering that seems to be key to the performances of this method. It's usually not robust in bad lightning conditions or any degrading conditions.

For the segmentation-based approach, I'll have to check and will let you know next week 👍

felixdittrich92 · 2021-11-04T07:22:19Z

@fg-mindee
That can be tackled with blurring.
with this method the main problems are:

finding the right treshold value
if other rectangular objects in image
overlapping documents or if 4 corners can´t be detected

fg-mindee · 2021-11-04T09:24:52Z

For sure, we need to conduct some thorough evaluations now to ensure that this method is robust (or can be made robust)! We'll check next week with @charlesmindee, in the meantime, if you have any idea to make it more robust, feel free to iterate on this approach 👍

felixdittrich92 · 2021-11-04T09:56:30Z

@fg-mindee
yes but it would be great to have your seg model to compare between a DL approach and this CV approach 😄

felixdittrich92 · 2021-11-23T14:34:22Z

@fg-mindee
any update on this ? :)
Have you been able to successfully test your segmentation or does it make sense to stick with my approach here ? 😃

fg-mindee · 2021-11-23T21:59:58Z

We should be able to have something in December but for now there is already a lot on our plate 😅

fg-mindee · 2022-03-10T14:45:35Z

@charlesmindee would you mind taking a look at integrating your implementation in docTR for release 0.6.0? 🙏
(no hurry for now)

felixdittrich92 · 2022-04-28T21:11:05Z

@charlesmindee @frgfm any update if we will keep it for 0.6.0 ? :)

frgfm · 2022-05-07T12:59:10Z

This is more up to @charlesmindee for the integration 👍

Generally speaking:

releases A.B.0 include new features
releases A.B.N should be used to add fixes to features brought in A.B.0

So in this case, that should be kept for 0.6.0 yes :)

felixdittrich92 · 2022-09-30T20:59:37Z

@frgfm @charlesmindee
If you want we could also include my model (worked on document segmentation for my company will be finished until end of the year) it is a mobilenet_v3_small with Pyramid Attention Network as segmentation head. Runs currently with 30 Fps on mobile devices (CPU) on i7 it takes ~1-2ms and reaches 96% mIoU (custom dataset). Works fine with onnxruntime or opencv's dnn. The only disadvantage would be i can only share the onnx model + inference code (not the pure model code because it's company internal stuff) wdyt ?

frgfm · 2022-10-15T12:12:10Z

Mmmh, I think we should consider document edge segmentation as a separate task that can be handled by docTR. That way, people could pass it to the corresponding model without making the core pipeline too complex for now

felixdittrich92 · 2022-10-15T20:14:19Z

Sounds good to me 👍

felixdittrich92 · 2024-05-22T13:58:26Z

Topic for contrib module

fg-mindee added this to the 0.5.0 milestone Oct 28, 2021

fg-mindee added the topic: edge spotting Related to the task of document edge spotting label Oct 28, 2021

fg-mindee modified the milestones: 0.5.0, 0.6.0 Dec 26, 2021

fg-mindee assigned charlesmindee Mar 10, 2022

fg-mindee added the module: models Related to doctr.models label Mar 10, 2022

frgfm modified the milestones: 0.6.0, 0.7.0 Jun 28, 2022

frgfm mentioned this issue Jun 28, 2022

Release tracker - v0.6.0 #791

Closed

85 tasks

felixdittrich92 mentioned this issue Sep 26, 2022

Release tracker - v0.9.0 #1074

Closed

6 tasks

felixdittrich92 modified the milestones: 0.9.0, 2.0.0 Feb 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Option to cutout documents from images before further processing #554

Option to cutout documents from images before further processing #554

felixdittrich92 commented Oct 28, 2021

fg-mindee commented Oct 28, 2021

felixdittrich92 commented Oct 28, 2021

fg-mindee commented Oct 28, 2021 •

edited

Loading

felixdittrich92 commented Oct 28, 2021 •

edited

Loading

fg-mindee commented Oct 29, 2021

felixdittrich92 commented Oct 30, 2021

fg-mindee commented Oct 30, 2021

felixdittrich92 commented Nov 1, 2021

felixdittrich92 commented Nov 2, 2021

felixdittrich92 commented Nov 3, 2021

fg-mindee commented Nov 3, 2021

felixdittrich92 commented Nov 4, 2021

fg-mindee commented Nov 4, 2021

felixdittrich92 commented Nov 4, 2021

felixdittrich92 commented Nov 23, 2021

fg-mindee commented Nov 23, 2021

fg-mindee commented Mar 10, 2022

felixdittrich92 commented Apr 28, 2022

frgfm commented May 7, 2022

felixdittrich92 commented Sep 30, 2022 •

edited

Loading

frgfm commented Oct 15, 2022

felixdittrich92 commented Oct 15, 2022

felixdittrich92 commented May 22, 2024

Option to cutout documents from images before further processing #554

Option to cutout documents from images before further processing #554

Comments

felixdittrich92 commented Oct 28, 2021

fg-mindee commented Oct 28, 2021

felixdittrich92 commented Oct 28, 2021

fg-mindee commented Oct 28, 2021 • edited Loading

felixdittrich92 commented Oct 28, 2021 • edited Loading

fg-mindee commented Oct 29, 2021

felixdittrich92 commented Oct 30, 2021

fg-mindee commented Oct 30, 2021

felixdittrich92 commented Nov 1, 2021

felixdittrich92 commented Nov 2, 2021

felixdittrich92 commented Nov 3, 2021

fg-mindee commented Nov 3, 2021

felixdittrich92 commented Nov 4, 2021

fg-mindee commented Nov 4, 2021

felixdittrich92 commented Nov 4, 2021

felixdittrich92 commented Nov 23, 2021

fg-mindee commented Nov 23, 2021

fg-mindee commented Mar 10, 2022

felixdittrich92 commented Apr 28, 2022

frgfm commented May 7, 2022

felixdittrich92 commented Sep 30, 2022 • edited Loading

frgfm commented Oct 15, 2022

felixdittrich92 commented Oct 15, 2022

felixdittrich92 commented May 22, 2024

fg-mindee commented Oct 28, 2021 •

edited

Loading

felixdittrich92 commented Oct 28, 2021 •

edited

Loading

felixdittrich92 commented Sep 30, 2022 •

edited

Loading