This repository compiles a list of papers related to Mamba and state space models (SSMs).
The list is updated continually. If you come across a relevant paper that should be included, please submit a pull request (PR) or open an issue.
(Arxiv 23.12.01) Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper Code
(Arxiv 24.01.08) MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts Paper
(Arxiv 24.01.24) MambaByte: Token-free Selective State Space Model Paper Code
(Arxiv 24.01.31) LOCOST: State-Space Models for Long Document Abstractive Summarization Paper Code
(Arxiv 24.02.01) BlackMamba: Mixture of Experts for State-Space Models Paper Code
(Arxiv 24.02.06) Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks Paper
(Arxiv 24.02.08) Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data Paper
(Arxiv 24.02.15) Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling Paper Code
(Arxiv 24.02.19) Pan-Mamba: Effective pan-sharpening with State Space Model Paper Code
(Arxiv 24.02.23) State Space Models for Event Cameras Paper
(Arxiv 24.02.26) DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models Paper Code
(Arxiv 24.03.03) The Hidden Attention of Mamba Models Paper Code
(Arxiv 24.03.08) MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models Paper
(Arxiv 24.03.11) MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational Pathology Paper Code
(Arxiv 24.03.12) Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM Paper Code
(Arxiv 24.03.13) ClinicalMamba: A Generative Clinical Language Model on Longitudinal Clinical Notes Paper
(Arxiv 24.03.21) ZigMa: Zigzag Mamba Diffusion Model Paper Code
(Arxiv 24.03.26) State Space Models as Foundation Models: A Control Theoretic Overview Paper
(Arxiv 24.03.27) Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation Paper
(Arxiv 24.03.28) Jamba: A Hybrid Transformer-Mamba Language Model Paper Code
(Arxiv 24.03.29) HARMamba: Efficient Wearable Sensor Human Activity Recognition Based on Bidirectional Selective SSM Paper
(Arxiv 24.01.17) Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model Paper Code
(Arxiv 24.01.18) VMamba: Visual State Space Model Paper Code
(Arxiv 24.02.05) Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining Paper Code
(Arxiv 24.02.06) U-shaped Vision Mamba for Single Image Dehazing Paper Code
(Arxiv 24.02.23) MambaIR: A Simple Baseline for Image Restoration with State-Space Model Paper Code
(Arxiv 24.02.24) Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning Paper Code
(Arxiv 24.03.04) MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection Paper Code
(Arxiv 24.03.15) EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba Paper Code
(Arxiv 24.03.15) On the low-shot transferability of [V]-Mamba Paper
(Arxiv 24.03.26) PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition Paper Code
(Arxiv 24.03.26) Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion Paper
(Arxiv 24.03.27) Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction Paper
(Arxiv 24.03.27) ReMamber: Referring Image Segmentation with Mamba Twister Paper
(Arxiv 24.03.28) RSMamba: Remote Sensing Image Classification with State Space Model Paper Code
(Arxiv 24.01.25) Vivim: a Video Vision Mamba for Medical Video Object Segmentation Paper Code
(Arxiv 24.03.11) VideoMamba: State Space Model for Efficient Video Understanding Paper Code
(Arxiv 24.03.11) SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding Paper
(Arxiv 24.03.25) VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting Paper Code
(Arxiv 24.01.09) U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation Paper Code
(Arxiv 24.01.24) SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation Paper Code
(Arxiv 24.01.25) Vivim: a Video Vision Mamba for Medical Video Object Segmentation Paper Code
(Arxiv 24.01.25) MambaMorph: a Mamba-based Backbone with Contrastive Feature Learning for Deformable MR-CT Registration Paper Code
(Arxiv 24.02.04) VM-UNet: Vision Mamba UNet for Medical Image Segmentation Paper Code
(Arxiv 24.02.05) nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model Paper Code
(Arxiv 24.02.09) FD-Vision Mamba for Endoscopic Exposure Correction Paper
(Arxiv 24.02.16) Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation Paper Code
(Arxiv 24.03.06) MedMamba: Vision Mamba for Medical Image Classification Paper Code
(Arxiv 24.03.08) LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation Paper Code
(Arxiv 24.03.12) Large Window-based Mamba UNet for Medical Image Segmentation: Beyond Convolution and Self-attention Paper Code
(Arxiv 24.03.20) H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation Paper Code
(Arxiv 24.03.20) ProMamba: Prompt-Mamba for polyp segmentation Paper
(Arxiv 24.03.25) CMViM: Contrastive Masked Vim Autoencoder for 3D Multi-modal Representation Learning for AD classification Paper
(Arxiv 24.03.26) Rotate to Scan: UNet-like Mamba with Triplet SSM Module for Medical Image Segmentation Paper
(Arxiv 24.03.29) UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion Segmentation Paper Code
(Arxiv 24.04.01) T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation Paper
(Arxiv 24.03.14) TimeMachine: A Time Series is Worth 4 Mambas for Long-term Forecasting Paper Code
(Arxiv 24.03.17) Is Mamba Effective for Time Series Forecasting? Paper Code
(Arxiv 24.03.22) SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series Paper Code
(Arxiv 24.02.01) Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces Paper Code
(Arxiv 24.02.13) Graph Mamba: Towards Learning on Graphs with State Space Models Paper Code
(Arxiv 24.03.19) STG-Mamba: Spatial-Temporal Graph Learning via Selective State Space Model Paper
(Arxiv 24.02.16) PointMamba: A Simple State Space Model for Point Cloud Analysis Paper Code
(Arxiv 24.03.01) Point Cloud Mamba: Point Cloud Learning via State Space Model Paper Code
(Arxiv 24.02.26) DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models Paper Code
(Arxiv 24.03.20) VL-Mamba: Exploring State Space Models for Multimodal Learning Paper Code
(Arxiv 24.03.22) Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference Paper Code
(Arxiv 24.03.14) MAMBA: an Effective World Model Approach for Meta-Reinforcement Learning Paper Code
(Arxiv 24.03.25) Uncovering Selective State Space Model's Capabilities in Lifelong Sequential Recommendation Paper Code
(Arxiv 24.03.29) Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces Paper
(NeurIPS 2020 Spotlight) HiPPO: Recurrent Memory with Optimal Polynomial Projections Paper Code
(ICLR 2022) S4: Efficiently Modeling Long Sequences with Structured State Spaces Paper Code
(ICLR 2023) H3: Hungry Hungry Hippos: Toward Language Modeling with State Space Models Paper Code
Mamba_State_Space_Model_Paper_List
Awesome State-Space Resources for ML
Video-of-Mamba-and-S4-Explained
A Visual Guide to Mamba and State Space Models
If you find this repository useful, please consider citing our paper:
@misc{tang2024vmrnn,
title={VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting},
author={Yujin Tang and Peijie Dong and Zhenheng Tang and Xiaowen Chu and Junwei Liang},
year={2024},
eprint={2403.16536},
archivePrefix={arXiv},
primaryClass={cs.CV}
}