Releases: UB-Mannheim/ocr-fileformat
Releases · UB-Mannheim/ocr-fileformat
v0.7.0
What's Changed
- Add transformation from hOCR to TEI and update transformation matrix by @stweil in #170
- update textract2page to include slub/textract2page#13 by @kba in #171
- update vendor/page-to-alto v1.2.0 -> v1.3.0 by @kba in #172
- Update Dockerfile, fix #173 by @kba in #174
- update textract2page by @bertsky in #177
- update textract2page (for valid @conf ranges) by @bertsky in #180
- update textract2page (v 0.2 - full LAYOUT etc.) by @bertsky in #186
Full Changelog: v0.6.0...v0.7.0
v0.6.0
What's Changed
- Add CodeQL workflow for GitHub code scanning by @lgtm-com in #155
- gcv__page: use -source-json instead of -source-xml by @bertsky in #156
- make install: use newline in sed c cmd by @bertsky in #158
- Add textract2page by @bertsky in #160
- ensure venv for Python tools by @bertsky in #162
- add PRImA converter for GCV→ALTO by @bertsky in #163
- Update Makefile to support macOS by @stweil in #165
- update textract2page, hOCR-to-ALTO and alto-schema by @kba in #166
- Fix two issues reported by CodeQL CI by @stweil in #161
- Fix broken conversions from hOCR to ALTO by @stweil in #167
- Replace broken Travis CI by GitHub action by @stweil in #168
- Use first bash from PATH (allows running on macOS) by @stweil in #169
New Contributors
- @lgtm-com made their first contribution in #155
Full Changelog: v0.5.0...v0.6.0
v0.5.0
What's Changed
- ⬆️ Update JPageConverter to 1.5.05 by @mikegerber in #131
- update hocr2alto to include filak/hOCR-to-ALTO#23 by @kba in #130
- page schemas: use github not primaresearch.org by @kba in #132
- Page to alto python by @kba in #134
- [doc][fix] clear README cli links by @M3ssman in #141
- Add ImageWare MyBib to ALTO conversion by karkraeg, fix #139 by @kba in #140
- page__alto: process all arguments by @bertsky in #142
- when converting to PAGE, always use latest schema by @bertsky in #146
- docker: unlimit POST upload size, #136 by @kba in #137
- Update Saxon-HE by @stweil in #144
- Use git submodules by @stweil in #148
- update page-to-alto by @bertsky in #152
- page to text: rewrite by @bertsky in #151
- Update SaxonHE to version 11.2 by @stweil in #149
- vendor/Makefile: page-to-alto is phony by @bertsky in #154
New Contributors
- @mikegerber made their first contribution in #131
- @M3ssman made their first contribution in #141
- @bertsky made their first contribution in #142
Full Changelog: v0.4.0...v0.5.0
v0.4.0
v0.3.2
v0.3.1
v0.3.0
v0.2.3
Add transformation gcv2hocr and fixes some issues with web interface
- Support new transformation from google cloud vision format to hocr
- Fix format switching in transform web interface
- Produce valid HTML
- Use eslint for JS code style checking
- Use best practices for Dockerfile