KejawenLab/pdfparser

 
 


PdfParser

PdfParser, a standalone PHP library, provides various tools to extract data from a PDF file.


Website: http://www.pdfparser.org

Test the API on our demo page.

This project is supported by Actualys.

Features

Features included:

  • Load and parse objects and headers
  • Extract metadata (author, description, ...)
  • Extract text from ordered pages
  • Support for compressed PDF files
  • Support for the Mac OS Roman character encoding
  • Handling of hexadecimal and octal encoding in text sections
  • PSR-0 compliant (autoloader)
  • PSR-1 compliant (code styling)
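
To illustrate what the hexadecimal and octal handling in the list refers to: PDF text can be written as a literal string with backslash-octal escapes (for example `\110\151`) or as a hexadecimal string between angle brackets (for example `<48656C6C6F>`). The sketch below is not the library's own code; the function names `decodeOctalEscapes` and `decodeHexString` are hypothetical and only show the general idea of turning both forms back into raw bytes.

```php
<?php
// Illustrative helpers only (hypothetical names, not part of PdfParser).

// Replace each backslash followed by 1-3 octal digits with the matching byte.
function decodeOctalEscapes($literal)
{
    return preg_replace_callback('/\\\\([0-7]{1,3})/', function ($m) {
        // Values above 255 overflow; keep only the low-order byte.
        return chr(octdec($m[1]) & 0xFF);
    }, $literal);
}

// Turn a PDF hexadecimal string such as <48656C6C6F> into raw bytes.
function decodeHexString($hex)
{
    // Drop the angle brackets, whitespace, and anything else that is not a hex digit.
    $hex = preg_replace('/[^0-9A-Fa-f]/', '', $hex);
    // An odd number of digits is treated as if a trailing 0 were present.
    if (strlen($hex) % 2 === 1) {
        $hex .= '0';
    }
    return pack('H*', $hex);
}

echo decodeOctalEscapes('\\110\\151'), "\n"; // Hi
echo decodeHexString('<48656C6C6F>'), "\n";  // Hello
```

The decoded bytes still have to be interpreted in the character encoding used by the font (Mac OS Roman, for instance), which is a separate step not shown here.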

Currently, secured documents are not supported.

This library is still under active development. As a result, users should expect backward-compatibility (BC) breaks when using the master branch.

Documentation

Read the documentation on the website.
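
As a quick orientation, here is a minimal usage sketch. It assumes the library was installed with Composer and that this repository keeps the upstream `Smalot\PdfParser` namespace with its `Parser::parseFile()`, `getDetails()`, `getText()` and `getPages()` methods; check the documentation for the version you use. The wrapper function `summarizePdf` and the file name `document.pdf` are hypothetical.

```php
<?php
// Assumption: Composer's autoloader lives next to this script.
$autoload = __DIR__ . '/vendor/autoload.php';
if (is_file($autoload)) {
    require_once $autoload;
}

// Hypothetical helper: collect metadata, full text, and per-page text.
function summarizePdf($path)
{
    // Assumption: the parser class is \Smalot\PdfParser\Parser, as upstream.
    $parser = new \Smalot\PdfParser\Parser();
    $pdf = $parser->parseFile($path);

    $pages = array();
    foreach ($pdf->getPages() as $page) {
        $pages[] = $page->getText(); // text of a single page, in page order
    }

    return array(
        'details' => $pdf->getDetails(), // metadata such as author or title
        'text'    => $pdf->getText(),    // text of the whole document
        'pages'   => $pages,
    );
}
```

Calling `summarizePdf('document.pdf')` should then return an array whose `details` entry holds the metadata and whose `text` entry holds the extracted text. Secured documents are not supported, so expect the parser to fail on them.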

The original PDF Reference files can be downloaded from this URL: http://www.adobe.com/devnet/pdf/pdf_reference_archive.html

License

This library is under the LGPLv3 license.
