This repository contains the implementation of a data fusion framework that combines satellite images and tabular data for dengue prediction and socioeconomic analysis. The framework leverages variational autoencoders (VAE) for generating embeddings and employs an LSTM-based data fusion model to integrate time series of embeddings and metadata. The experiments in this repository demonstrate the effectiveness of the proposed approach in predicting dengue outbreaks and analyzing socioeconomic indicators like poverty and access to water.
Data fusion of satellite images and tabular data has shown promising results in various domains. In this project, we propose a multimodal data fusion framework that combines the visual information from satellite images with contextual tabular data to predict dengue outbreaks and analyze socioeconomic indicators. The framework consists of two main components: (1) Variational Autoencoders (VAE) for generating embeddings from satellite images and (2) an LSTM-based data fusion model to integrate the embeddings with temporal metadata.
To use the code in this repository, follow these steps:
- Clone the repository:
git clone https://github.com/dsrestrepo/MIT_Multimodal_Satellite_Images_Fusion.git
cd MIT_Multimodal_Satellite_Images_Fusion
- Set up the Python environment. It is recommended to use a virtual environment:
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
- Install the required dependencies:
pip install -r requirements.txt
Once you have installed the required dependencies, you can use the code for generating embeddings, training the fusion model, and conducting experiments on dengue prediction and socioeconomic analysis.
This repository includes the following experiments:
- Embedding Generation: Implementation of variational autoencoders (VAE) for generating embeddings from satellite images.
- Data Fusion Model: Implementation of the LSTM-based data fusion model that integrates the time series of embeddings with temporal metadata.
- Dengue Prediction: Using the fused embeddings to predict dengue outbreaks based on the multimodal data.
- Socioeconomic Analysis: Utilizing the fused embeddings to analyze socioeconomic indicators like poverty and access to water.
The results of the experiments are provided in the results
directory. We include evaluation metrics and visualizations that demonstrate the performance of the data fusion framework for dengue prediction and socioeconomic analysis.
We welcome contributions to this repository! If you find any issues or have improvements to suggest, please open an issue or a pull request.
This project is licensed under the MIT License.
For any questions or inquiries, please feel free to reach out to David Restrepo at [email protected].