When the text extractor needs the OCR function instead of the simple one, it writes portions of the PDF to the same bucket as the source PDF. Unfortunately, those are never cleaned up because, for good reasons, there are no lifecycle rules on that bucket.
We don't want to keep those intermediate PDFs, so we either need to change the bucket that holds PDFs during ingest and include lifecycle rules, or figure out how to attach lifecycle rules to the intermediate PDFs.
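For the second option, here is a rough sketch of what a scoped rule could look like. It assumes the bucket is an S3 bucket and that the OCR path can be changed to write its intermediate PDFs under a dedicated key prefix. The prefix `ocr-intermediate/`, the rule ID, the one-day expiry, and the function name are all hypothetical, not taken from the current code. S3 lifecycle rules are set on the bucket, but a rule with a prefix filter only affects keys under that prefix, so the source PDFs would stay untouched.

```python
from typing import Any, Dict


def intermediate_pdf_lifecycle(
    prefix: str = "ocr-intermediate/",  # hypothetical prefix for OCR page splits
    days: int = 1,                      # hypothetical retention period
) -> Dict[str, Any]:
    """Build an S3-style lifecycle configuration that expires only
    objects stored under `prefix`."""
    if not prefix:
        # An empty prefix would match every object in the bucket,
        # including the source PDFs we want to keep.
        raise ValueError("prefix must not be empty")
    if days < 1:
        raise ValueError("days must be at least 1")
    return {
        "Rules": [
            {
                "ID": "expire-ocr-intermediate-pdfs",  # hypothetical rule name
                "Filter": {"Prefix": prefix},
                "Status": "Enabled",
                "Expiration": {"Days": days},
            }
        ]
    }


if __name__ == "__main__":
    # With boto3, this dict could be passed as the LifecycleConfiguration
    # argument of s3.put_bucket_lifecycle_configuration(Bucket=...).
    # That call replaces the bucket's whole lifecycle configuration,
    # so any existing rules would have to be merged in first.
    print(intermediate_pdf_lifecycle())
```

This only works if the intermediate files are distinguishable by key. If they currently land next to the source PDF with no common prefix, the writer would need to change first, or the objects could be tagged and the rule filtered by tag instead.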