Our current PDF generation process for the thymeleaf-docs repository starts from Markdown documents and works like this:

1. Convert Markdown to HTML using Pandoc. Pandoc applies an HTML template (which includes our own CSS and JavaScript calls) and transforms the Markdown into HTML.
2. Convert HTML to PDF using wkhtmltopdf. wkhtmltopdf launches a headless Qt-based WebKit browser and renders the HTML page, including its CSS (applied with the print media type) and JavaScript code. The result of rendering is then printed to PDF.
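The two steps above can be sketched as a small Python wrapper that builds and runs both command lines. This is only an illustration, not the actual build script of the repository: the function names and any file paths passed to them are hypothetical, and only widely documented options are used (`--standalone`, `--template` and `--output` for Pandoc, `--print-media-type` for wkhtmltopdf).

```python
import subprocess
from pathlib import Path
from typing import List


def pandoc_command(markdown: Path, template: Path, html: Path) -> List[str]:
    """Step 1: Markdown -> standalone HTML, rendered through an HTML template."""
    return [
        "pandoc",
        "--from=markdown",
        "--to=html",
        "--standalone",
        f"--template={template}",  # hypothetical template path
        f"--output={html}",
        str(markdown),
    ]


def wkhtmltopdf_command(html: Path, pdf: Path) -> List[str]:
    """Step 2: HTML -> PDF, applying the CSS rules for the print media type."""
    return [
        "wkhtmltopdf",
        "--print-media-type",
        str(html),
        str(pdf),
    ]


def build_pdf(markdown: Path, template: Path, out_dir: Path) -> Path:
    """Run both steps in order; a non-zero exit code raises CalledProcessError."""
    html = out_dir / (markdown.stem + ".html")
    pdf = out_dir / (markdown.stem + ".pdf")
    subprocess.run(pandoc_command(markdown, template, html), check=True)
    subprocess.run(wkhtmltopdf_command(html, pdf), check=True)
    return pdf
```

Calling `build_pdf(...)` requires both tools on the `PATH`. Keeping command construction separate from execution makes it easy to log or inspect the exact invocations when a conversion misbehaves.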
This whole process should run on the Thymeleaf releases machine, currently CentOS 6.x Linux on Amazon EC2. It should be completely headless and automated with Gradle.
The problem is that wkhtmltopdf outputs plain PDF, not PDF/A-1, and it would be nice to have PDF/A-1 docs. The current version of wkhtmltopdf (0.12.1-61cda93) does not support outputting this standard PDF format.
A possible solution for converting the current PDF output into PDF/A-1 has already been tried, based on Ghostscript (versions 8.70 and 9.14) and a command like:
...but results aren't correct. With gs 8.70, many links inside the PDF raise an error for lacking a required annotation. With gs 9.14, these errors are reported as warnings and auto-fixed, but some pages are rendered with a blue background instead of white. In both cases, the resulting PDFs do not pass a PDF/A-1 validator test.
LibreOffice has also been tested, by sending the PDFs to a headless LibreOffice installation and asking it to convert them to PDF/A-1. Results are really bad: the output PDF files are no longer legible.