This project explores the application of ensemble learning for textual image classification across multiple languages. By leveraging MobileNetV2 and ResNet50, it aims to enhance generalization, accuracy, and robustness. The ensemble model achieved a validation accuracy of 77%, outperforming individual models and demonstrating its potential in solving complex classification tasks.
- Textual Image Classification: Classifies images based on textual content in multiple languages.
- Ensemble Learning: Combines MobileNetV2 and ResNet50 for improved performance.
- Efficient Training: Optimized architecture for faster training and inference.
- Multi-language Support: Works on textual images from 12 Indian languages, including Gujarati, Odia, Punjabi, Tamil, and others.
The dataset includes images containing text from 12 Indian languages, collected and preprocessed for effective training and evaluation.
- Clone the repository:
git clone https://github.com/yourusername/repo-name.git
- Navigate to the project directory:
cd repo-name
- Install dependencies:
pip install -r requirements.txt
This project is licensed under the MIT License.
For queries or collaboration, feel free to reach out: Email: [email protected]