Skip to content

Genie-Kim/PintheMemory

Repository files navigation

Pin the Memory (CVPR 2022): Official Project Webpage

This repository provides the official PyTorch implementation of the following paper:

Pin the Memory: Learning to Generalize Semantic Segmentation
Jin Kim (Yonsei Univ.), Jiyoung Lee (NAVER AI Lab), Jungin Park (Yonsei Univ.)
Dongbo Min* (Ewha Womans Univ.), Kwanghoon Sohn* (Yonsei Univ.) (*: co-corresponding author)
CVPR 2022, Accepted as Poster

Paper: arxiv

Abstract: The rise of deep neural networks has led to several breakthroughs for semantic segmentation. In spite of this, a model trained on source domain often fails to work properly in new challenging domains, that is directly concerned with the generalization capability of the model. In this paper, we present a novel memory-guided domain generalization method for semantic segmentation based on meta-learning framework. Especially, our method abstracts the conceptual knowledge of semantic classes into categorical memory which is constant beyond the domains. Upon the meta-learning concept, we repeatedly train memory-guided networks and simulate virtual test to 1) learn how to memorize a domain-agnostic and distinct information of classes and 2) offer an externally settled memory as a class-guidance to reduce the ambiguity of representation in the test data of arbitrary unseen domain. To this end, we also propose memory divergence and feature cohesion losses, which encourage to learn memory reading and update processes for category-aware domain generalization. Extensive experiments for semantic segmentation demonstrate the superior generalization capability of our method over state-of-the-art works on various benchmarks.

Concept Video

Click the figure to watch the video of our paper.

Youtube Video

Pytorch Implementation

Installation Requirements

Please refer to environment.yml

Installation from Dockerfile

Clone this repository.

git clone https://github.com/Genie-Kim/PintheMemory.git
cd PintheMemory

Build docker image from Dockerfile

cd docker
sudo docker build --tag pinmem:1.0 .

Run container

sudo docker run -it --gpus all -h pinmem --name pinmem -v <your workspace>/PintheMemory:/root/PintheMemory -v <your datasetpath>/dg_seg_dataset:/root/dg_seg_dataset -v <your workspace>/experiment_pinmem:/root/experiment_pinmem --ipc=host pinmem:1.0 script -q -c "/bin/bash" /dev/null
(in container)
conda activate pinmem
imageio_download_bin freeimage

How to Run

We evaludated our method on Cityscapes, Mapillary Vistas, BDD-100K,Synthia (SYNTHIA-RAND-CITYSCAPES), GTAV, IDD

  1. For Cityscapes dataset, download "leftImg8bit_trainvaltest.zip" and "gtFine_trainvaltest.zip" from https://www.cityscapes-dataset.com/downloads/
cityscapes
 └ leftImg8bit_trainvaltest
   └ leftImg8bit
     └ train
     └ val
     └ test
 └ gtFine_trainvaltest
   └ gtFine
     └ train
     └ val
     └ test
bdd-100k
 └ images
   └ train
   └ val
   └ test
 └ labels
   └ train
   └ val
mapillary
 └ training
   └ images
   └ labels
 └ validation
   └ images
   └ labels
 └ test
   └ images
   └ labels

We used GTAV_Split to split GTAV dataset into training/validation/test set. Please refer the txt files.

GTAV
 └ images
   └ train
     └ folder
   └ valid
     └ folder
   └ test   
     └ folder
 └ labels
   └ train
     └ folder
   └ valid
     └ folder
   └ test   
     └ folder

We Synthia dataset into train/val set. Please refer the txt files.

synthia
 └ RGB
   └ train
   └ val
 └ GT
   └ COLOR
     └ train
     └ val
   └ LABELS
     └ train
     └ val
  1. You should modify the dataset path to "~/dg_seg_dataset/".
datasetroot = os.path.expanduser('~/dg_seg_dataset/')

Pretrained Models

All models trained for our paper

You can download pretrined parameters evaluated in our paper at Drive. For compatibility with scripts file, put the folder downloaded from the drive in "PintheMemory/pretrained_model".

Train and Evaluation scripts

Please refer to the train, evaluation, tsne plot and memory activation scripts in the PintheMemory/pinmem_xx_scripts folders.

Usage Examples

./pinmem_eval_abla_scripts/eval_custumfolder_pinmem_DR50V3P.sh "<absolute path of any image folder>"

Then the qualitative result images saved in the same folder with pth snapshot file. For T-sne plot, install tsnecuda library.

T-sne plots

Citations

@inproceedings{kim2022pin,
  title={Pin the Memory: Learning to Generalize Semantic Segmentation},
  author={Kim, Jin and Lee, Jiyoung and Park, Jungin and Min, Dongbo and Sohn, Kwanghoon},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={4350--4360},
  year={2022}
}

Acknowledgments

Our implementation is heavily derived from RobustNet and TSMLDG. Thanks to the RobustNet and TSMLDG implementations.