Skip to content

This simple Symfony 2 bundle allows you to convert an input PDF file into plain text.

Notifications You must be signed in to change notification settings

webgriffe/pdftotext-bundle

Repository files navigation

PDF to Text Symfony2 Bundle

This Symfony2 bundle allows you to convert an input PDF file into plain text.

Conversion is made through pdftotext command-line utilty (http://en.wikipedia.org/wiki/Pdftotext). pdftotext is part of Xpdf software suite, is included in many Linux distributions and that should be available also for Mac OS X and Windows platforms.

Installation

Install this bundle as any other Symfony2 bundle.

Symfony >= 2.1.x

Add the following requirement to your composer.json:

"require": {
	…
	"webgriffe/pdftotext-bundle": "dev-master"
}

Install the bundle with the following command:

$ composer update webgriffe/pdftotext-bundle

Register the bundle in the AppKernel:

public function registerBundles()
{
	…
	new Webgriffe\PdfToTextBundle\WebgriffePdfToTextBundle(),
}

Symfony 2.0.x

Add the following requirement in your deps file:

…
[WebgriffePdfToTextBundle]
	git=git://github.com/webgriffe/pdftotext-bundle.git
	target=bundles/Webgriffe/PdfToTextBundle

Install the bundle with the following command:

$ bin/vendors install

Register the bundle in the AppKernel:

public function registerBundles()
{
	…
	new Webgriffe\PdfToTextBundle\WebgriffePdfToTextBundle(),
}

Usage

Simply, you can get the PdfToTextConverter from DIC and get the plain text string.

// Acme\MyBundle\Controller\MyController

public function myAction()
{
	$pdfFile = '/path/to/file.pdf';
	$pdfToTextConverter = $this->get('webgriffe_pdf_to_text.converter');
	$pdfText = $pdfToTextConverter->convert($pdfFile);
	
	return new \Symfony\Component\HttpFoundation\Response($pdfText);
}

You can also specify the output encoding (default is UTF-8).

$pdfText = $pdfToTextConverter->convert($pdfFile, 'ISO-8859-1');

Specify pdftotext binary path

You can specify the pdftotext binary path in your config.yml:

webgriffe_pdf_to_text:
    bin_path: /usr/local/bin/pdftotext

Credits

This bundle has been developed by Webgriffe®. Please, report to us any bug or suggestion by GitHub issues.

About

This simple Symfony 2 bundle allows you to convert an input PDF file into plain text.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages