Skip to content

Releases: alephdata/ingest-file

3.19.2

29 Aug 08:30
3.19.2
b70123d
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: 3.18.4...3.19.2

3.19.2-rc1

24 Jul 12:40
38ea7a1
Compare
Choose a tag to compare
3.19.2-rc1 Pre-release
Pre-release

What's Changed

Full Changelog: 3.18.4...3.19.2-rc1

3.19.1

28 Jun 18:39
00aefd8
Compare
Choose a tag to compare

What's Changed

Full Changelog: 3.18.4...3.19.1

3.19.0

28 Jun 11:57
3.19.0
3d834cc
Compare
Choose a tag to compare

What's Changed

Full Changelog: 3.18.4...3.19.0

3.18.4

04 May 13:29
1ba0f49
Compare
Choose a tag to compare

What's Changed

Major PDF library change

We are hereby deprecating pdflib, replacing it with a well maintained, performant library: pymupdf. This enables local development on hardware with Apple Silicon CPUs. This also enables support for JBIG2 images in PDF files.

License change

Because of the above dependency as of this release ingest-file is licensed under the terms of the AGPLv3+ license.

Integrating convert-document into ingest-file

Smaller changes

Dependency upgrades

Full Changelog: 3.18.2...3.18.4

3.18.4-rc4

06 Apr 11:42
af7b1f3
Compare
Choose a tag to compare
3.18.4-rc4 Pre-release
Pre-release
  • Hotfix for the image path where full page images get extracted to (when ingesting PDFs with Type3 fonts)

Full Changelog: 3.18.4-rc3...3.18.4-rc4

3.18.4-rc3

06 Apr 07:28
ee4311a
Compare
Choose a tag to compare
3.18.4-rc3 Pre-release
Pre-release

What's Changed

  • Do full page OCR for PDF pages with Type3 fonts by @stchris in #449

Dependency upgrades

Full Changelog: 3.18.4-rc1...3.18.4-rc3

3.18.4-rc1

24 Mar 11:14
d94c44d
Compare
Choose a tag to compare
3.18.4-rc1 Pre-release
Pre-release

What's Changed

  • Use PyMuPDF instead of pikepdf + pdfminer.six for PDF ingestion (text and image extraction). #441

Dependency upgrades

Full Changelog: 3.18.2...3.18.4-rc1

3.18.3-rc2

13 Mar 10:31
2afe740
Compare
Choose a tag to compare
3.18.3-rc2 Pre-release
Pre-release

What's Changed

Dependency upgrades

Full Changelog: 3.18.2...3.18.3-rc2

3.18.2

19 Jan 10:14
79bc5d2
Compare
Choose a tag to compare

IMPORTANT NOTE: this release was pulled. At this time 3.17.1 is the latest release.

What's Changed

  • Update public error message for password protected PDFs by @catileptic in #422

Dependency upgrades

New Contributors

Full Changelog: 3.18.0...3.18.2