Note: The complete code will be released after the paper is accepted.
[Paper]
This is the repository for ASMVQA, a unified assistant system that enables users to conveniently carry out diverse medical tasks. A demonstration video of our system can be found at https://youtu.be/o16xNuHK1ao. The repository contains:
- Overview
- Med-VQA Data for Pre-training and Fine-tuning
- Install
We demonstrate ASMVQA, an Assistant System for Medical Visual Question Answering. Given unstructured clinical data, our system first provides a useful tool to automatically build medical visual question answering (VQA) datasets for users. Given a dataset, users can then conveniently select state-of-the-art (SOTA) models from our model library to perform extensive experiments. With simple configurations, our system automatically trains and evaluates the selected models on the dataset and reports comprehensive results to users. In particular, we develop a robot assistant that guides users without a professional background through medical tasks, including model evaluation, online clinical consulting, and electronic medical record (EMR) retrieval.
Due to data copyright and privacy concerns, users are required to obtain the datasets (VQA-RAD, PathVQA, SLAKE, VQA-Med, OVQA) independently via the provided links. Users should then use the preprocessing code we supply to prepare the data for analysis.
- Clone this repository and navigate to Demo/VQASystem:
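A minimal sketch of this step (the repository URL and local directory name are placeholders, since they are not given here):
# Placeholder URL and directory name; use the actual repository location
git clone <repo_url> ASMVQA
cd ASMVQA/Demo/VQASystem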
- Install Package: Please refer to the envs_files folder to create the conda environments for the models we provide.
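For example, assuming each file in envs_files is a standard conda environment spec (the file name below is a placeholder):
# Hypothetical spec name; repeat for each environment file in envs_files
conda env create -f envs_files/<model_env>.yml
conda activate <model_env>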
- We use MySQL for data storage; you can restore the database by referring to db.sql:
sudo mysql
create database vqa;
use vqa;
source /your_path/db.sql;
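Equivalently, the database can be restored non-interactively from the shell (keeping the same /your_path placeholder):
sudo mysql -e "CREATE DATABASE IF NOT EXISTS vqa;"
sudo mysql vqa < /your_path/db.sql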
- Configure model config: Please deploy the downloaded datasets and modify the model config in /Demo/VQASystem/model_info.py. An example config entry:
MiniGPT4 = {
    # Paper link and title for this model
    "url": "https://arxiv.org/abs/2304.10592",
    "title": "MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models",
    # Run script used for each dataset
    "path": {
        "VQA-RAD": "/home/coder/projects/Med-VQA/run_vqarad.sh",
        "PATHVQA": "/home/coder/projects/Med-VQA/run_path.sh",
        "SLAKE": "/home/coder/projects/Med-VQA/run_slake.sh",
        "MED-VQA": "/home/coder/projects/Med-VQA/run_medvqa.sh"
    },
    # Local path of each deployed dataset
    "dataset": {
        "VQA-RAD": "data_RAD",
        "PATHVQA": "data_PathVQA",
        "SLAKE": "/home/coder/projects/Med-VQA/data_SLAKE",
        "MED-VQA": "/home/coder/projects/MEVF/MICCAI19-MedVQA/data_Med/VQA-Med-2019"
    }
}
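After deploying the datasets, a quick sanity check (a hypothetical snippet, not part of the repository) can confirm that the script and dataset paths referenced in model_info.py exist on disk:
# Hypothetical check: list any configured paths that are missing
for p in /home/coder/projects/Med-VQA/run_vqarad.sh /home/coder/projects/Med-VQA/data_SLAKE; do
  [ -e "$p" ] || echo "missing: $p"
done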
- Run Application: We use the Streamlit framework to develop our app.
pip install streamlit
streamlit run Home.py
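Streamlit's standard CLI flags can be used to choose a port and make the app reachable from other machines, for example:
# Optional: run on a specific port and listen on all interfaces
streamlit run Home.py --server.port 8501 --server.address 0.0.0.0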
If you are interested in any of the models provided in our model library, you can check the README file in the corresponding model folder.