Skip to content

Commit

Permalink
update with info on multiple images
Browse files Browse the repository at this point in the history
  • Loading branch information
JasonGellis committed Sep 4, 2024
1 parent e09bd9d commit 66efe88
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,8 @@ Table Reader is a Python command-line interface (CLI) application designed to ex
- Optical character recognition (OCR) Processing: Utilizing the powerful Tesseract OCR engine, Table Reader accurately extracts text from images, including tables and tabular data.
- Data Extraction: Table Reader processes images to identify and extract tabular data, preserving the structure of tables found in the input images.
- Data Cleaning: Table Reader includes functionality to clean and pre-process extracted data, removing special characters while preserving decimal values and English letters, and ensuring consistent formatting.
- Data Export: Once the data is extracted and cleaned, Table Reader enables users to export the data to a structured format, such as CSV files, for further analysis in statistical software or spreadsheet applications.
- Data import: Table Reader can process one or multiple images.
- Data Export: Once the data is extracted and cleaned, Table Reader enables users to export the data to a CSV file for further analysis in statistical software or spreadsheet applications.

## Why Use Table Reader?

Expand Down Expand Up @@ -70,7 +71,6 @@ Optimizing images to meet these criteria before OCR processing can significantly
## Future updates

- Webapp interface
- Upload multiple images
- Ability to select/deselect image and OCR processing
- Support for special and non-English characters
- Ability to handle numbers with a "mid-dot" or "decimal comma" (e.g., writing 2.9 as 2·9).

0 comments on commit 66efe88

Please sign in to comment.