GitHub - akumathedyn123/python-pdf-extractor-pdf2txt: This Python script efficiently extracts text content from multiple PDF files within a designated folder and saves the extracted text as separate TXT files with the same name as the original PDFs (excluding the ".pdf" extension).

pdf_extractor-pdf2txt

This Python script efficiently extracts text content from multiple PDF files within a designated folder and saves the extracted text as separate TXT files with the same name as the original PDFs (excluding the ".pdf" extension).

Features

- Processes multiple PDFs in a single run.
- Names each output TXT file after its source PDF, so results are easy to match to their originals.
- Uses the well-established PyPDF2 library for PDF handling.
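The behavior described above can be sketched roughly as follows. This is an illustrative outline, not the contents of main.py: the function names (`find_pdfs`, `txt_path_for`, `extract_text`, `convert_folder`) are hypothetical, and it assumes PyPDF2 2.x or later, where `PdfReader` and `extract_text()` are available.

```python
from pathlib import Path


def find_pdfs(folder):
    """Return the PDF files directly inside `folder`, sorted by name."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() == ".pdf"
    )


def txt_path_for(pdf_path):
    """Map e.g. report.pdf to report.txt in the same folder."""
    return Path(pdf_path).with_suffix(".txt")


def extract_text(pdf_path):
    """Concatenate the text of every page in one PDF."""
    # Imported here so the path helpers above work without PyPDF2 installed.
    from PyPDF2 import PdfReader  # assumption: PyPDF2 2.x or later

    reader = PdfReader(str(pdf_path))
    # Scanned (image-only) pages may yield no text, hence the `or ""`.
    return "\n".join((page.extract_text() or "") for page in reader.pages)


def convert_folder(folder):
    """Write one TXT file per PDF and return the list of files written."""
    written = []
    for pdf_path in find_pdfs(folder):
        out_path = txt_path_for(pdf_path)
        out_path.write_text(extract_text(pdf_path), encoding="utf-8")
        written.append(out_path)
    return written
```

Calling `convert_folder("path/to/folder")` would then write one TXT file next to each PDF. Keeping the name mapping in its own small function makes the "same name, different extension" rule easy to check without opening any PDF.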

Prerequisites

- Python 3.x (https://www.python.org/downloads/)
- PyPDF2 library (install with: pip install PyPDF2)

Installation

Clone the Repository:

git clone https://github.com/akumathedyn123/python-pdf-extractor-pdf2txt.git

Navigate to the Project Directory:
```
cd python-pdf-extractor-pdf2txt
```

Usage

Set the PDF Folder Path:
- Open the main.py file in a text editor.
- Locate the line that defines the pdf_folder variable (usually near the beginning).
- Replace "path/to/folder" with the absolute path to the directory containing your PDF files.
Example: If your PDFs are in a folder named my_pdfs on your desktop, you would change the line to:
```
# Requires "import os" at the top of main.py.
pdf_folder = os.path.join(os.path.expanduser('~'), 'Desktop', 'my_pdfs')
```
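A mistyped folder path is an easy mistake at this step, so it can help to check the path before running the extraction. The sketch below is an optional addition and not part of main.py. `resolve_pdf_folder` and `desktop_pdfs` are hypothetical names, and the pathlib expression is simply another way to write the `os.path.join` example above.

```python
import os
from pathlib import Path


def resolve_pdf_folder(path_str):
    """Expand `~`, make the path absolute, and confirm it is a folder."""
    folder = Path(path_str).expanduser().resolve()
    if not folder.is_dir():
        raise NotADirectoryError(f"PDF folder not found: {folder}")
    return folder


# pathlib equivalent of the os.path.join example above.
desktop_pdfs = Path.home() / "Desktop" / "my_pdfs"
```

Passing the configured path through a check like this turns a typo into an immediate, readable error instead of a failure partway through the run.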
Run the Script:
- From the project directory (where main.py is located), execute the script using the following command:
```
python main.py
```
Note: On some systems (typically Linux and macOS), the Python 3 interpreter is invoked as python3 rather than python. If python is not found or starts Python 2, run python3 main.py instead.

License

This project is licensed under the MIT License (see LICENSE file for details).

Contributing

We encourage contributions to this project. Feel free to submit pull requests for bug fixes, enhancements, or new features.

Contact

For any questions or feedback, please feel free to create an issue on the project's GitHub repository.
