Welcome to Docx2Dashboard! This Streamlit web application allows you to preprocess Word documents and PDF files and visualize the data in an interactive dashboard, designed specifically for NFWP by World Bank Social Sustainability and Inclusion unit.
To successfully launch the Streamlit web application, ensure all required Python libraries are installed, meeting the specified versions. You can install the dependencies by running:
pip install -r requirements.txt
- Initiate the Streamlit web application by executing the following command in your terminal in 'streamlit' directory:
streamlit run home.py
-
Navigate to the "docx2dashboard" page and proceed as follows:
- Choose the CSV file you intend to update.
- Select your preferred input method: either upload a zip file containing Word documents and PDF files, or manually input the data.
- Provide a name for the new CSV file.
- Click on the submit button to initiate the process.
-
Upon completion of the automated preprocessing procedures, the interactive dashboard will be displayed on your screen, ready for exploration and analysis.
-
Navigate to the "visualization" page to access the interactive visualization dashboard without processing unstructured documents.
-
Please be sure to replace the 'Financial_Diaries_final.csv' with the real dataset. The current 'Financial_Diaries_final.csv' is a dummy dataset that is randomly generated by us to protect any information of respondents.
If you encounter any issues or have any questions, feel free to reach out at [email protected]
Enjoy exploring your data with Docx2Dashboard!🪄