Updated on 2024.11.24

Usage instructions: here

Table of Contents

Depth Estimation
Semactic Segmentation

Depth Estimation

Publish Date	Title	Authors	PDF	Code
2024-11-21	StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart	Jian Shi et.al.	2411.14295	null
2024-11-20	DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild	Weicai Ye et.al.	2411.13291	null
2024-11-20	OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater Imaging	Rajini Makam et.al.	2411.13230	null
2024-11-15	SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction	Yutao Tang et.al.	2411.12592	link
2024-11-18	Towards Degradation-Robust Reconstruction in Generalizable NeRF	Chan Ho Park et.al.	2411.11691	null
2024-11-18	MGNiceNet: Unified Monocular Geometric Scene Understanding	Markus Schön et.al.	2411.11466	null
2024-11-18	The ADUULM-360 Dataset -- A Multi-Modal Dataset for Depth Estimation in Adverse Weather	Markus Schön et.al.	2411.11455	null
2024-11-18	GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views	Boyao Zhou et.al.	2411.11363	null
2024-11-18	Scalable Autoregressive Monocular Depth Estimation	Jinhong Wang et.al.	2411.11361	null
2024-11-16	MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation	Ansh Shah et.al.	2411.10886	link
2024-11-19	EVT: Efficient View Transformation for Multi-Modal 3D Object Detection	Yongjin Lee et.al.	2411.10715	null
2024-11-15	Efficient Depth Estimation for Unstable Stereo Camera Systems on AR Glasses	Yongfan Liu et.al.	2411.10013	null
2024-11-14	Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting	Yian Wang et.al.	2411.09823	null
2024-11-14	Adversarial Attacks Using Differentiable Rendering: A Survey	Matthew Hull et.al.	2411.09749	null
2024-11-14	Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching	Yuran Wang et.al.	2411.09151	null
2024-11-13	OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances	Youqi Liao et.al.	2411.08665	null
2024-11-13	Scaling Properties of Diffusion Models for Perceptual Tasks	Rahul Ravishankar et.al.	2411.08034	null
2024-11-11	$SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation	Yinshuang Xu et.al.	2411.07326	null
2024-11-08	Enhancing Depth Image Estimation for Underwater Robots by Combining Image Processing and Machine Learning	Quang Truong Nguyen et.al.	2411.05344	null
2024-11-08	SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection	Yun Zhao et.al.	2411.05292	null
2024-11-07	D $^3$ epth: Self-Supervised Depth Estimation with Dynamic Mask in Dynamic Scenes	Siyu Chen et.al.	2411.04826	null
2024-11-06	Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation	Teppei Kurita et.al.	2411.04714	null
2024-11-07	Enhancing Bronchoscopy Depth Estimation through Synthetic-to-Real Domain Adaptation	Qingyao Tian et.al.	2411.04404	null
2024-11-04	PMPNet: Pixel Movement Prediction Network for Monocular Depth Estimation in Dynamic Scenes	Kebin Peng et.al.	2411.04227	null
2024-11-06	Adaptive Stereo Depth Estimation with Multi-Spectral Images Across All Lighting Conditions	Zihan Qin et.al.	2411.03638	null
2024-11-05	Monocular Event-Based Vision for Obstacle Avoidance with a Quadrotor	Anish Bhattacharya et.al.	2411.03303	null
2024-11-05	Correlation of Object Detection Performance with Visual Saliency and Depth Estimation	Matthias Bartolo et.al.	2411.02844	link
2024-11-05	FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training	Ruihong Yin et.al.	2411.02229	null
2024-11-05	Improving Domain Generalization in Self-supervised Monocular Depth Estimation via Stabilized Adversarial Training	Yuanqi Yao et.al.	2411.02149	null
2024-11-01	MultiDepth: Multi-Sample Priors for Refining Monocular Metric Depth Estimations in Indoor Scenes	Sanghyun Byun et.al.	2411.01048	null
2024-11-01	On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDAR	Li Li et.al.	2411.00600	link
2024-10-31	Optical Lens Attack on Monocular Depth Estimation for Autonomous Driving	Ce Zhou et.al.	2411.00192	null
2024-10-31	ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images	Timing Yang et.al.	2410.24001	link
2024-10-30	Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe	Songyu Xu et.al.	2410.23154	null
2024-10-29	Active Event Alignment for Monocular Distance Estimation	Nan Cai et.al.	2410.22280	null
2024-10-29	PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting	Sunghwan Hong et.al.	2410.22128	link
2024-10-27	Unlocking Comics: The AI4VA Dataset for Visual Understanding	Peter Grönquist et.al.	2410.20459	link
2024-10-27	Depth Attention for Robust RGB Tracking	Yu Liu et.al.	2410.20395	link
2024-10-21	YOLO11 and Vision Transformers based 3D Pose Estimation of Immature Green Fruits in Commercial Apple Orchards for Robotic Thinning	Ranjan Sapkota et.al.	2410.19846	null
2024-10-25	MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors	Fanqi Pu et.al.	2410.19590	null
2024-10-24	Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction	Hongxin Peng et.al.	2410.18433	null
2024-10-24	Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images	Dong-Guw Lee et.al.	2410.18340	link
2024-10-25	UnCLe: Unsupervised Continual Learning of Depth Completion	Suchisrit Gangopadhyay et.al.	2410.18074	null
2024-10-21	TIPS: Text-Image Pretraining with Spatial Awareness	Kevis-Kokitsi Maninis et.al.	2410.16512	null
2024-10-22	DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain	Kun Wang et.al.	2410.14980	link
2024-10-17	DepthSplat: Connecting Gaussian Splatting and Depth	Haofei Xu et.al.	2410.13862	link
2024-10-16	DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning	Jiabao Wei et.al.	2410.12501	null
2024-10-16	Depth Estimation From Monocular Images With Enhanced Encoder-Decoder Architecture	Dabbrata Das et.al.	2410.11610	null
2024-10-16	CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction	Pranav Gupta et.al.	2410.11211	link
2024-10-14	When Does Perceptual Alignment Benefit Vision Representations?	Shobhita Sundaram et.al.	2410.10817	null
2024-10-14	Depth Any Video with Scalable Synthetic Data	Honghui Yang et.al.	2410.10815	link
2024-10-15	Improved Depth Estimation of Bayesian Neural Networks	Bart van Erp et.al.	2410.10395	link
2024-10-10	Color-Guided Flying Pixel Correction in Depth Images	Ekamresh Vasudevan et.al.	2410.08084	null
2024-10-09	Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models	Ange Lou et.al.	2410.07434	null
2024-10-09	Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation	Runze Chen et.al.	2410.06982	null
2024-10-09	Analysis of different disparity estimation techniques on aerial stereo image datasets	Ishan Narayan et.al.	2410.06711	null
2024-10-08	Vision Transformer based Random Walk for Group Re-Identification	Guoqing Zhang et.al.	2410.05808	null
2024-10-08	CUBE360: Learning Cubic Field Representation for Monocular 360 Depth Estimation for Virtual Reality	Wenjie Chang et.al.	2410.05735	null
2024-10-07	PhotoReg: Photometrically Registering 3D Gaussian Splatting Models	Ziwen Yuan et.al.	2410.05044	null
2024-10-10	Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy	Pengcheng Chen et.al.	2410.04041	null
2024-10-04	Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering	Laura Fink et.al.	2410.03861	null
2024-10-03	RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions	Ziyao Zeng et.al.	2410.02924	null
2024-10-02	Depth Pro: Sharp Monocular Metric Depth in Less Than a Second	Aleksei Bochkovskii et.al.	2410.02073	link
2024-10-10	Towards Full-parameter and Parameter-efficient Self-learning For Endoscopic Camera Depth Estimation	Shuting Zhao et.al.	2410.00979	null
2024-10-01	Radar Meets Vision: Robustifying Monocular Metric Depth Prediction for Mobile Robotics	Marco Job et.al.	2410.00736	null
2024-10-06	Drone Stereo Vision for Radiata Pine Branch Detection and Distance Measurement: Utilizing Deep Learning and YOLO Integration	Yida Lin et.al.	2410.00503	null
2024-10-01	Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance	Hongchao Shu et.al.	2410.00386	null
2024-09-30	CCDepth: A Lightweight Self-supervised Depth Estimation Network with Enhanced Interpretability	Xi Zhang et.al.	2409.19933	null
2024-09-30	EndoDepth: A Benchmark for Assessing Robustness in Endoscopic Depth Prediction	Ivan Reyes-Amezcua et.al.	2409.19930	link
2024-09-29	fCOP: Focal Length Estimation from Category-level Object Priors	Xinyue Zhang et.al.	2409.19641	null
2024-09-29	KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation	Soofiyan Atar et.al.	2409.19490	null
2024-09-27	Speckle-illumination spatial frequency domain imaging with a stereo laparoscope for profile-corrected optical property mapping	Anthony A. Song et.al.	2409.19153	null
2024-09-26	Self-supervised Monocular Depth Estimation with Large Kernel Attention	Xuezhi Xiang et.al.	2409.17895	null
2024-09-26	Self-Distilled Depth Refinement with Noisy Poisson Fusion	Jiaqi Li et.al.	2409.17880	null
2024-09-27	A New Dataset for Monocular Depth Estimation Under Viewpoint Shifts	Aurel Pjetri et.al.	2409.17851	null
2024-09-26	Event-based Stereo Depth Estimation: A Survey	Suman Ghosh et.al.	2409.17680	null
2024-09-26	CAMOT: Camera Angle-aware Multi-Object Tracking	Felix Limanta et.al.	2409.17533	null
2024-09-25	Optical Lens Attack on Deep Learning Based Monocular Depth Estimation	Ce Zhou et.al.	2409.17376	null
2024-09-25	Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation	Richard D. Paul et.al.	2409.17085	null
2024-09-25	EventHDR: from Event to High-Speed HDR Videos and Beyond	Yunhao Zou et.al.	2409.17029	null
2024-09-25	3DDX: Bone Surface Reconstruction from a Single Standard-Geometry Radiograph via Dual-Face Depth Estimation	Yi Gu et.al.	2409.16702	null
2024-09-24	MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling	Yifang Men et.al.	2409.16160	null
2024-09-24	Benchmarking Robustness of Endoscopic Depth Estimation with Synthetically Corrupted Data	An Wang et.al.	2409.16063	link
2024-09-23	FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera	Guoyang Zhao et.al.	2409.15054	link
2024-09-23	DepthART: Monocular Depth Estimation as Autoregressive Refinement Task	Bulat Gabdullin et.al.	2409.15010	null
2024-09-23	Generalizing monocular colonoscopy image depth estimation by uncertainty-based global and local fusion network	Sijia Du et.al.	2409.15006	null
2024-09-23	GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth	Aurélien Cecille et.al.	2409.14850	null
2024-09-23	Robust and Flexible Omnidirectional Depth Estimation with Multiple 360° Cameras	Ming Li et.al.	2409.14766	null
2024-09-18	Panoptic-Depth Forecasting	Juana Valeria Hurtado et.al.	2409.12008	null
2024-09-17	Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think	Gonzalo Martin Garcia et.al.	2409.11355	link
2024-09-15	GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion	Vitor Guizilini et.al.	2409.09896	null
2024-09-15	Towards Single-Lens Controllable Depth-of-Field Imaging via All-in-Focus Aberration Correction and Monocular Depth Estimation	Xiaolong Qian et.al.	2409.09754	link
2024-09-13	PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage	Denis Zavadski et.al.	2409.09144	link
2024-09-25	Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding	Rania Hossam et.al.	2409.08695	link
2024-09-12	Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor	Andrea Conti et.al.	2409.08277	null
2024-09-12	LED: Light Enhanced Depth Estimation at Night	Simon de Moreau et.al.	2409.08031	link
2024-09-12	Real-time Multi-view Omnidirectional Depth Estimation System for Robots and Autonomous Driving on Real Scenes	Ming Li et.al.	2409.07843	null
2024-09-12	Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy	Bojian Li et.al.	2409.07723	null
2024-09-12	FIReStereo: Forest InfraRed Stereo Dataset for UAS Depth Perception in Visually Degraded Environments	Devansh Dhrafani et.al.	2409.07715	null
2024-09-10	Deep Neural Networks: Multi-Classification and Universal Approximation	Martín Hernández et.al.	2409.06555	null
2024-09-10	EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation	Nischal Khanal et.al.	2409.06183	link
2024-09-11	EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels	Qingyao Tian et.al.	2409.05442	null
2024-09-09	Spontaneous magnetic field and disorder effects in BaPtAs_1-x_Sb_x_ with honeycomb network	T. Adachi et.al.	2409.05266	null
2024-09-08	TanDepth: Leveraging Global DEMs for Metric Monocular Depth Estimation in UAVs	Horatiu Florea et.al.	2409.05142	null
2024-09-12	Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective	Tim Bader et.al.	2409.04086	link
2024-09-08	Estimating Indoor Scene Depth Maps from Ultrasonic Echoes	Junpei Honma et.al.	2409.03336	null
2024-09-04	iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation	Hayeon Jo et.al.	2409.02838	null
2024-09-02	GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling	Huawei Sun et.al.	2409.02720	null
2024-09-04	Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects	Kyungmin Jo et.al.	2409.02653	null
2024-09-04	UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching	Soomin Kim et.al.	2409.02545	null
2024-09-04	SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction	Sumin Son et.al.	2409.02513	null
2024-09-04	Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation	Li Liu et.al.	2409.02494	null
2024-09-04	Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization	Cho-Ying Wu et.al.	2409.02486	null
2024-09-04	GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving	Huasong Han et.al.	2409.02382	null
2024-09-03	DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos	Wenbo Hu et.al.	2409.02095	null
2024-09-02	Large Language Models Can Understanding Depth from Monocular Images	Zhongyi Xia et.al.	2409.01133	null
2024-08-30	DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model	Mona Sheikh Zeinoddin et.al.	2408.17433	null
2024-08-30	Enhancing Underwater Imaging with 4-D Light Fields: Dataset and Method	Yuji Lin et.al.	2408.17339	null
2024-08-30	Synthetic Lunar Terrain: A Multimodal Open Dataset for Training and Evaluating Neuromorphic Vision Algorithms	Marcus Märtens et.al.	2408.16971	null
2024-08-29	EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More	Kanghao Chen et.al.	2408.16254	null
2024-08-30	Revisiting 360 Depth Estimation with PanoGabor: A New Fusion Perspective	Zhijie Shen et.al.	2408.16227	link
2024-08-27	Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack	Naufal Suryanto et.al.	2408.14879	null
2024-08-26	NimbleD: Enhancing Self-supervised Monocular Depth Estimation with Pseudo-labels and Large-scale Video Pre-training	Albert Luginov et.al.	2408.14177	null
2024-08-26	Pixel-Aligned Multi-View Generation with Depth Guided Decoder	Zhenggang Tang et.al.	2408.14016	null
2024-08-25	TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers	Chuanrui Zhang et.al.	2408.13770	null
2024-08-25	InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth	Cho-Ying Wu et.al.	2408.13708	null
2024-08-25	SeeBelow: Sub-dermal 3D Reconstruction of Tumors with Surgical Robotic Palpation and Tactile Exploration	Raghava Uppuluri et.al.	2408.13699	null
2024-08-27	Sapiens: Foundation for Human Vision Models	Rawal Khirodkar et.al.	2408.12569	null
2024-08-21	LiFCal: Online Light Field Camera Calibration via Bundle Adjustment	Aymeric Fleith et.al.	2408.11682	null
2024-08-19	Structure-preserving Image Translation for Depth Estimation in Colonoscopy Video	Shuxian Wang et.al.	2408.10153	null
2024-08-19	SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition	Wiktor Mucha et.al.	2408.10037	link
2024-08-19	P3P: Pseudo-3D Pre-training for Scaling 3D Masked Autoencoders	Xuechao Chen et.al.	2408.10007	null
2024-08-14	Enhanced Scale-aware Depth Estimation for Monocular Endoscopic Scenes with Geometric Modeling	Ruofeng Wei et.al.	2408.07266	null
2024-08-12	Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces	Junrui Zhang et.al.	2408.06083	null
2024-08-08	Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation	Daniele Rege Cambrin et.al.	2408.04523	link
2024-08-08	Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework	Subhasis Dasgupta et.al.	2408.04360	null
2024-08-08	Design and Implementation of Smart Infrastructures and Connected Vehicles in A Mini-city Platform	Daniel Vargas et.al.	2408.04195	null
2024-08-07	Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach	Benedikt W. Hosp et.al.	2408.03591	null
2024-08-06	BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications	G. Manni et.al.	2408.03078	link
2024-08-05	Gaussian Mixture based Evidential Learning for Stereo Matching	Weide Liu et.al.	2408.02796	null
2024-08-05	Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining	Dongyang Liu et.al.	2408.02657	link
2024-08-03	MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas	Feng Qiao et.al.	2408.01653	null
2024-08-02	Self-Supervised Depth Estimation Based on Camera Models	Jinchang Zhang et.al.	2408.01565	null
2024-08-01	MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection	Youjia Fu et.al.	2408.00438	null
2024-08-01	High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior	Wencheng Han et.al.	2408.00361	null
2024-07-31	Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching	Pengjie Zhang et.al.	2407.21735	null
2024-07-29	BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation	Kieran Saunders et.al.	2407.20437	null
2024-07-29	Analysis and Improvement of Rank-Ordered Mean Algorithm in Single-Photon LiDAR	William C. Yau et.al.	2407.20399	null
2024-07-29	Improving 2D Feature Representations by 3D-Aware Fine-Tuning	Yuanwen Yue et.al.	2407.20229	null
2024-07-27	Revisit Self-supervised Depth Estimation with Local Structure-from-Motion	Shengjie Zhu et.al.	2407.19166	null
2024-07-27	RePLAy: Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry	Shengjie Zhu et.al.	2407.19154	null
2024-07-26	HybridDepth: Robust Depth Fusion for Mobile AR by Leveraging Depth from Focus and Single-Image Priors	Ashkan Ganj et.al.	2407.18443	link
2024-07-26	Enhanced Depth Estimation and 3D Geometry Reconstruction using Bayesian Helmholtz Stereopsis with Belief Propagation	Razieh Azizi et.al.	2407.18195	null
2024-07-25	BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation	Xiang Zhang et.al.	2407.17952	null
2024-07-25	UMono: Physical Model Informed Hybrid CNN-Transformer Framework for Underwater Monocular Depth Estimation	Jian Wang et.al.	2407.17838	null
2024-07-24	DarSwin-Unet: Distortion Aware Encoder-Decoder Architecture	Akshaya Athwale et.al.	2407.17328	null
2024-07-24	Physical Adversarial Attack on Monocular Depth Estimation via Shape-Varying Patches	Chenxing Zhao et.al.	2407.17312	null
2024-07-23	SINDER: Repairing the Singular Defects of DINOv2	Haoqi Wang et.al.	2407.16826	link
2024-07-23	Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions	Fabio Tosi et.al.	2407.16698	link
2024-07-23	ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation	Zhenhua Wu et.al.	2407.16508	null
2024-07-19	Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation	Jinfeng Liu et.al.	2407.14126	link
2024-07-18	Unveiling the purely young star formation history of the SMC's northeastern shell from colour-magnitude diagram fitting	Joanna D. Sakowska et.al.	2407.13876	null
2024-07-18	Many Perception Tasks are Highly Redundant Functions of their Input Data	Rahul Ramesh et.al.	2407.13841	null
2024-07-18	Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks	Antoni Kowalczuk et.al.	2407.12588	link
2024-07-16	Temporally Consistent Stereo Matching	Jiaxi Zeng et.al.	2407.11950	link
2024-07-15	IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation	Yuanhao Zhai et.al.	2407.10937	link
2024-07-15	OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection	Jinghua Hou et.al.	2407.10753	link
2024-07-15	Towards Scale-Aware Full Surround Monodepth with Transformers	Yuchen Yang et.al.	2407.10406	null
2024-07-12	ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion	Sungmin Woo et.al.	2407.09303	link
2024-07-11	ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation	Ruijie Zhu et.al.	2407.08187	link
2024-07-10	Controlling Space and Time with Diffusion Models	Daniel Watson et.al.	2407.07860	null
2024-07-07	SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning	Yi Feng et.al.	2407.05283	link
2024-07-05	A Physical Model-Guided Framework for Underwater Image Enhancement and Depth Estimation	Dazhao Du et.al.	2407.04230	null
2024-07-04	Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation	Laiyan Ding et.al.	2407.04041	null
2024-07-02	Parametric Modeling and Estimation of Photon Registrations for 3D Imaging	Weijian Zhang et.al.	2407.02712	null
2024-07-02	Depth-Aware Endoscopic Video Inpainting	Francis Xiatian Zhang et.al.	2407.02675	link
2024-07-04	Camera-LiDAR Cross-modality Gait Recognition	Wenxuan Guo et.al.	2407.02038	null
2024-07-07	CaFNet: A Confidence-Driven Framework for Radar Camera Depth Estimation	Huawei Sun et.al.	2407.00697	link
2024-06-28	Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey	Uchitha Rajapaksha et.al.	2406.19675	null
2024-07-05	360 in the Wild: Dataset for Depth Prediction and View Synthesis	Kibaek Park et.al.	2406.18898	null
2024-06-27	Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach	Yuxiang Huang et.al.	2406.18837	null
2024-06-26	DoubleTake: Geometry Guided Depth Estimation	Mohamed Sayed et.al.	2406.18387	null
2024-06-25	Depth-Guided Semi-Supervised Instance Segmentation	Xin Chen et.al.	2406.17413	null
2024-06-20	Uncertainty and Self-Supervision in Single-View Depth	Javier Rodriguez-Puigvert et.al.	2406.14226	null
2024-06-19	WaterMono: Teacher-Guided Anomaly Masking and Enhancement Boosting for Robust Underwater Self-Supervised Monocular Depth Estimation	Yilin Ding et.al.	2406.13344	link
2024-06-18	Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation	Ning-Hsu Wang et.al.	2406.12849	null
2024-06-21	GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models	Yongtao Ge et.al.	2406.12671	link
2024-06-17	DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features	Letian Wang et.al.	2406.12095	null
2024-06-17	MEDeA: Multi-view Efficient Depth Adjustment	Mikhail Artemyev et.al.	2406.12048	null
2024-06-16	3D Gaze Tracking for Studying Collaborative Interactions in Mixed-Reality Environments	Eduardo Davalos et.al.	2406.11003	null
2024-06-15	GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR	Bharat Singh et.al.	2406.10722	null
2024-06-14	The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences	Bria Long et.al.	2406.10447	null
2024-06-14	D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video	Moritz Kappel et.al.	2406.10078	null
2024-06-14	DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications	Li Li et.al.	2406.10068	link
2024-06-14	Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion	Runze Liu et.al.	2406.09782	null
2024-06-13	Depth Anything V2	Lihe Yang et.al.	2406.09414	null
2024-06-14	WonderWorld: Interactive 3D Scene Generation from a Single Image	Hong-Xing Yu et.al.	2406.09394	null
2024-06-13	Scale-Invariant Monocular Depth Estimation via SSI Depth	S. Mahdi H. Miangoleh et.al.	2406.09374	null
2024-06-13	Multiple Prior Representation Learning for Self-Supervised Monocular Depth Estimation via Hybrid Transformer	Guodong Sun et.al.	2406.08928	link
2024-06-13	ToSA: Token Selective Attention for Efficient Vision Transformers	Manish Kumar Singh et.al.	2406.08816	null
2024-06-11	Back to the Color: Learning Depth to Specific Color Transformation for Unsupervised Depth Estimation	Yufan Zhu et.al.	2406.07741	link
2024-06-11	PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow	Joshua Tokarsky et.al.	2406.07667	null
2024-06-11	RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks	Zhechao Wang et.al.	2406.07032	null
2024-06-10	PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation	Zhenyu Li et.al.	2406.06679	null
2024-06-09	Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks	Zhiyuan Cheng et.al.	2406.05857	link
2024-06-09	RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering	Rui Zhang et.al.	2406.05852	null
2024-06-07	Normal-guided Detail-Preserving Neural Implicit Functions for High-Fidelity 3D Surface Reconstruction	Aarya Patel et.al.	2406.04861	null
2024-06-07	UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection	Yuchao Wang et.al.	2406.04647	null
2024-06-06	MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation	Ionuţ Grigore et.al.	2406.04532	null
2024-06-06	Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image	Stanislaw Szymanowicz et.al.	2406.04343	null
2024-06-06	Neural Surface Reconstruction from Sparse Views Using Epipolar Geometry	Kaichen Zhou et.al.	2406.04301	null
2024-06-04	VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors	Markus Plack et.al.	2406.02552	null
2024-06-03	L-MAGIC: Language Model Assisted Generation of Images with Coherence	Zhipeng Cai et.al.	2406.01843	link
2024-06-04	Learning Temporally Consistent Video Depth from Video Diffusion Priors	Jiahao Shao et.al.	2406.01493	null
2024-06-03	Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry	Takayuki Kanai et.al.	2406.00929	null
2024-06-01	MoDGS: Dynamic Gaussian Splatting from Causually-captured Monocular Videos	Qingming Liu et.al.	2406.00434	null
2024-05-30	Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian	Wei Sun et.al.	2405.19657	null
2024-05-28	Hybrid Multi-Head Physics-informed Neural Network for Depth Estimation in Terahertz Imaging	Mingjun Xiang et.al.	2405.18317	null
2024-05-27	Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation	Amir El-Ghoussani et.al.	2405.17704	null
2024-05-27	Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving	Shaoyuan Xie et.al.	2405.17426	link
2024-05-27	All-day Depth Completion	Vadim Ezhov et.al.	2405.17315	null
2024-05-27	GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping	Junyoung Seo et.al.	2405.17251	null
2024-05-27	SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing	Yong-Qiang Mao et.al.	2405.17140	null
2024-05-27	DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge	Yifan Mao et.al.	2405.17102	null
2024-05-27	Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation	Steven Landgraf et.al.	2405.17097	null
2024-05-27	DCPI-Depth: Explicitly Infusing Dense Correspondence Prior to Unsupervised Monocular Depth Estimation	Mengtan Zhang et.al.	2405.16960	null
2024-05-27	ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection	Ziying Song et.al.	2405.16873	null
2024-05-27	Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations	Jingguo Liu et.al.	2405.16858	null
2024-05-26	Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians	Erik Sandström et.al.	2405.16544	null
2024-05-24	Transparent Object Depth Completion	Yifan Zhou et.al.	2405.15299	null
2024-05-24	MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method	Pan Liao et.al.	2405.15176	null
2024-05-23	EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian Splatting	Jiaxu Wang et.al.	2405.14959	link
2024-05-23	Ghost-Stereo: GhostNet-based Cost Volume Enhancement and Aggregation for Stereo Matching Networks	Xingguang Jiang et.al.	2405.14520	null
2024-05-23	Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning	Zhenyu Wei et.al.	2405.14195	null
2024-05-21	Cross-spectral Gated-RGB Stereo Depth Estimation	Samuel Brucker et.al.	2405.12759	null
2024-05-20	Depth Reconstruction with Neural Signed Distance Fields in Structured Light Systems	Rukun Qiao et.al.	2405.12006	null
2024-05-20	Depth Prompting for Sensor-Agnostic Depth Estimation	Jin-Hwi Park et.al.	2405.11867	null
2024-05-19	CRF360D: Monocular 360 Depth Estimation via Spherical Fully-Connected CRFs	Zidong Cao et.al.	2405.11564	null
2024-05-18	Dusk Till Dawn: Self-supervised Nighttime Stereo Depth Estimation using Visual Foundation Models	Madhu Vankadari et.al.	2405.11158	link
2024-05-17	FA-Depth: Toward Fast and Accurate Self-supervised Monocular Depth Estimation	Fei Wang et.al.	2405.10885	link
2024-05-17	Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory	Jonas Kälble et.al.	2405.10575	link
2024-05-16	Towards Task-Compatible Compressible Representations	Anderson de Andrade et.al.	2405.10244	link
2024-05-16	KPNDepth: Depth Estimation of Lane Images under Complex Rainy Environment	Zhengxu Shi et.al.	2405.09964	null
2024-05-14	CLIP with Quality Captions: A Strong Pretraining for Vision Tasks	Pavan Kumar Anasosalu Vasu et.al.	2405.08911	null

(back to top)

Semactic Segmentation

Publish Date	Title	Authors	PDF	Code
2024-11-21	Revisiting the Integration of Convolution and Attention for Vision Backbone	Lei Zhu et.al.	2411.14429	link
2024-11-21	CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation	Lin Sun et.al.	2411.13836	link
2024-11-21	Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals	Hussni Mohd Zakir et.al.	2411.13774	null
2024-11-20	FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting	Ola Shorinwa et.al.	2411.13753	null
2024-11-20	BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation	Umamaheswaran Raman Kumar et.al.	2411.13251	null
2024-11-20	XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation	Ziyi Wang et.al.	2411.13243	link
2024-11-20	Automating Sonologists USG Commands with AI and Voice Interface	Emad Mohamed et.al.	2411.13006	null
2024-11-19	A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation	Jiaqi Yang et.al.	2411.12615	link
2024-11-19	SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation	Ron Keuth et.al.	2411.12602	link
2024-11-19	ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator	Xiao Jiang et.al.	2411.12250	null
2024-11-18	ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements	M. Arda Aydın et.al.	2411.12044	link
2024-11-18	Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation	Hanieh Shojaei Miandashti et.al.	2411.11935	null
2024-11-18	MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models	Harshita Sharma et.al.	2411.11362	null
2024-11-18	Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications	Scarlett Raine et.al.	2411.11287	null
2024-11-16	Attention-based U-Net Method for Autonomous Lane Detection	Mohammadhamed Tangestanizadeh et.al.	2411.10902	null
2024-11-16	Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation	Jaisidh Singh et.al.	2411.10845	null
2024-11-19	Diffusion-Based Semantic Segmentation of Lumbar Spine MRI Scans of Lower Back Pain Patients	Maria Monzon et.al.	2411.10755	link
2024-11-15	Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images	Ammar Qammaz et.al.	2411.10334	null
2024-11-15	CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation	Dengke Zhang et.al.	2411.10086	null
2024-11-14	OneNet: A Channel-Wise 1D Convolutional U-Net	Sanghyun Byun et.al.	2411.09838	link
2024-11-14	Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks	Zengyi Yang et.al.	2411.09387	null
2024-11-14	Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation	Yuheng Shi et.al.	2411.09219	link
2024-11-14	Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery	Ashim Dahal et.al.	2411.09101	link
2024-11-13	CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation	Xuming Zhang et.al.	2411.09023	null
2024-11-14	Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation	Yangyang Li et.al.	2411.08756	null
2024-11-13	Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model	Jun Xie et.al.	2411.08592	null
2024-11-12	Isometric Transformations for Image Augmentation in Mueller Matrix Polarimetry	Christopher Hahne et.al.	2411.07918	link
2024-11-12	Semantic segmentation on multi-resolution optical and microwave data using deep learning	Jai G Singla et.al.	2411.07581	null
2024-11-11	SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation	Jiale Chen et.al.	2411.06991	null
2024-11-14	Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision	Yueyang Cang et.al.	2411.06727	null
2024-11-10	Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments	Deegan Atha et.al.	2411.06632	null
2024-11-09	Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing	Kaixuan Lu et.al.	2411.06091	null
2024-11-08	Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model	Shuchang Lyu et.al.	2411.05878	link
2024-11-08	Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation	Sien Li et.al.	2411.05307	link
2024-11-07	In the Era of Prompt Learning with Vision-Language Models	Ankit Jha et.al.	2411.04892	null
2024-11-11	ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset	Olaf Wysocki et.al.	2411.04865	link
2024-11-06	Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts	Zhitong Gao et.al.	2411.03829	link
2024-11-06	Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model	Yansong Qu et.al.	2411.03672	null
2024-11-05	Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation	Zhiling Yue et.al.	2411.03551	null
2024-11-05	SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture	Andrew Heschl et.al.	2411.03505	link
2024-11-05	Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need	Qishuai Wen et.al.	2411.03033	link
2024-11-05	Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation	Xavier Timoneda et.al.	2411.02969	null
2024-11-05	Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery	Mohammad Kakooei et.al.	2411.02935	null
2024-11-05	CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation	Jinchao Ge et.al.	2411.02715	null
2024-11-04	Deep Learning on 3D Semantic Segmentation: A Detailed Review	Thodoris Betsas et.al.	2411.02104	null
2024-11-04	Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models	Sharat Agarwal et.al.	2411.01925	null
2024-11-04	DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability	Bo Gao et.al.	2411.01819	null
2024-11-04	Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations	Thanh Nguyen Canh et.al.	2411.01816	null
2024-11-03	PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation	Xinyu Xu et.al.	2411.01624	null
2024-11-01	Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions	Lixiao Yang et.al.	2411.01039	null
2024-11-01	Event-guided Low-light Video Semantic Segmentation	Zhen Yao et.al.	2411.00639	null
2024-11-01	Cross-modal semantic segmentation for indoor environmental perception using single-chip millimeter-wave radar raw data	Hairuo Hu et.al.	2411.00499	null
2024-11-01	Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing	Naufal Suryanto et.al.	2411.00425	link
2024-10-31	A Recipe for Geometry-Aware 3D Mesh Transformers	Mohammad Farazi et.al.	2411.00164	null
2024-10-31	Federated Black-Box Adaptation for Semantic Segmentation	Jay N. Paranjape et.al.	2410.24181	null
2024-10-31	COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes	Muhammad Ali et.al.	2410.24139	link
2024-10-31	Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model	Hao Zhang et.al.	2410.23905	link
2024-10-30	S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving	Maciej K. Wozniak et.al.	2410.23085	null
2024-10-31	CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation	Ziyang Gong et.al.	2410.22629	link
2024-10-29	Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation	Zhaochong An et.al.	2410.22489	null
2024-10-29	Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation	Jintao Tong et.al.	2410.22135	null
2024-10-29	Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models	Imad Ali Shah et.al.	2410.22101	null
2024-10-29	Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation	Ruihao Xia et.al.	2410.21708	link
2024-10-28	Domain Adaptation with a Single Vision-Language Embedding	Mohammad Fahes et.al.	2410.21361	null
2024-10-28	IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks	Manjunath D et.al.	2410.20953	null
2024-10-27	A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models	Camilo Espinosa-Curilem et.al.	2410.20595	link
2024-10-27	Unlocking Comics: The AI4VA Dataset for Visual Understanding	Peter Grönquist et.al.	2410.20459	link
2024-10-27	Historical Test-time Prompt Tuning for Vision Foundation Models	Jingyi Zhang et.al.	2410.20346	null
2024-10-25	OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery	Philipe Dias et.al.	2410.19965	null
2024-10-25	IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation	Kaixian Qu et.al.	2410.19697	null
2024-10-25	Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation	Yao Wu et.al.	2410.19446	link
2024-10-25	Context-Based Visual-Language Place Recognition	Soojin Woo et.al.	2410.19341	link
2024-10-24	Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks	Alexander Jaus et.al.	2410.18684	null
2024-10-24	Unsupervised semantic segmentation of urban high-density multispectral point clouds	Oona Oinonen et.al.	2410.18520	null
2024-10-26	CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator	Stefanos Pasios et.al.	2410.18238	null
2024-10-23	Towards Safer Planetary Exploration: A Hybrid Architecture for Terrain Traversability Analysis in Mars Rovers	Achille Chiuchiarelli et.al.	2410.17738	null
2024-10-22	EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding	Zhiyi Pan et.al.	2410.17207	null
2024-10-22	SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments	Jumman Hossain et.al.	2410.16686	null
2024-10-21	TIPS: Text-Image Pretraining with Spatial Awareness	Kevis-Kokitsi Maninis et.al.	2410.16512	null
2024-10-21	GenGMM: Generalized Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation	Nazanin Moradinasab et.al.	2410.16485	null
2024-10-21	LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training	Thomas Kreutz et.al.	2410.15833	link
2024-10-21	TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight	Hyun-Kurl Jang et.al.	2410.15674	link
2024-10-21	Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications	Jintao Ren et.al.	2410.15584	null
2024-10-22	Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation	Fnu Neha et.al.	2410.15472	null
2024-10-18	On the Influence of Shape, Texture and Color for Learning Semantic Segmentation	Annika Mütze et.al.	2410.14878	null
2024-10-18	Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+	Arpan Mahara et.al.	2410.14836	null
2024-10-17	ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding	Guangda Ji et.al.	2410.13924	null
2024-10-17	Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks	Clément Playout et.al.	2410.13822	link
2024-10-22	EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything	Joonhyeon Song et.al.	2410.13621	link
2024-10-17	Day-Night Adaptation: An Innovative Source-free Adaptation Framework for Medical Image Segmentation	Ziyang Chen et.al.	2410.13472	null
2024-10-17	SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation in Remote Sensing	Bin Wang et.al.	2410.13471	link
2024-10-17	Railway LiDAR semantic segmentation based on intelligent semi-automated data annotation	Florian Wulff et.al.	2410.13383	null
2024-10-17	Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation	Houze Liu et.al.	2410.13099	null
2024-10-16	Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation	Wenbo Xu et.al.	2410.13094	null
2024-10-16	Risk Assessment for Autonomous Landing in Urban Environments using Semantic Segmentation	Jesús Alejandro Loera-Ponce et.al.	2410.12988	null
2024-10-16	VividMed: Vision Language Model with Versatile Visual Grounding for Medicine	Lingxiao Luo et.al.	2410.12694	link
2024-10-16	Cascade learning in multi-task encoder-decoder networks for concurrent bone segmentation and glenohumeral joint assessment in shoulder CT scans	Luca Marsilio et.al.	2410.12641	null
2024-10-16	SAM-Guided Masked Token Prediction for 3D Scene Understanding	Zhimin Chen et.al.	2410.12158	null
2024-10-15	WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation	Chenghao Qian et.al.	2410.12075	null
2024-10-15	Development and Testing of a Wood Panels Bark Removal Equipment Based on Deep Learning	Rijun Wang et.al.	2410.11913	null
2024-10-15	RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation	Anton Antonov et.al.	2410.11722	link
2024-10-15	InvSeg: Test-Time Prompt Inversion for Semantic Segmentation	Jiayi Lin et.al.	2410.11473	null
2024-10-15	MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation	Xianping Ma et.al.	2410.11160	link
2024-10-14	Locality Alignment Improves Vision-Language Models	Ian Covert et.al.	2410.11087	null
2024-10-14	Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes	Tim Broedermann et.al.	2410.10791	null
2024-10-14	UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation	Lihe Yang et.al.	2410.10777	link
2024-10-14	Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation	Daniel Fusaro et.al.	2410.10510	link
2024-10-14	LKASeg:Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections	Xuezhi Xiang et.al.	2410.10433	null
2024-10-14	V2M: Visual 2-Dimensional Mamba for Image Representation Learning	Chengkun Wang et.al.	2410.10382	link
2024-10-14	GlobalMamba: Global Image Serialization for Vision Mamba	Chengkun Wang et.al.	2410.10316	link
2024-10-13	AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model	Yuchen Li et.al.	2410.09714	null
2024-10-12	An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation	Wei Liang et.al.	2410.09443	null
2024-10-11	Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation	Varduhi Yeghiazaryan et.al.	2410.08946	null
2024-10-11	Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation	Hanieh Shojaei et.al.	2410.08687	null
2024-10-11	DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention	Nguyen Huu Bao Long et.al.	2410.08582	link
2024-10-10	Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving?	Samir Abou Haidar et.al.	2410.08365	null
2024-10-10	Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation	Zhiyi Pan et.al.	2410.08091	null
2024-10-10	Shift and matching queries for video semantic segmentation	Tsubasa Mizuno et.al.	2410.07635	null
2024-10-10	3D Vision-Language Gaussian Splatting	Qucheng Peng et.al.	2410.07577	null
2024-10-11	Bridge the Points: Graph-based Few-shot Segment Anything Semantically	Anqi Zhang et.al.	2410.06964	null
2024-10-09	Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation	Seungho Lee et.al.	2410.06893	null
2024-10-09	Rethinking the Evaluation of Visible and Infrared Image Fusion	Dayan Guan et.al.	2410.06811	link
2024-10-10	QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model	Fei Xie et.al.	2410.06806	link
2024-10-09	Transesophageal Echocardiography Generation using Anatomical Models	Emmanuel Oladokun et.al.	2410.06781	null
2024-10-09	Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy	Qinfeng Zhu et.al.	2410.06725	null
2024-10-09	Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments	Meng Yu et.al.	2410.06626	null
2024-10-09	Towards Natural Image Matting in the Wild via Real-Scenario Prior	Ruihao Xia et.al.	2410.06593	link
2024-10-08	Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions	Mateus Karvat et.al.	2410.06380	null
2024-10-08	Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading	Fang Gao et.al.	2410.05762	null
2024-10-07	Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation	Vince Zhu et.al.	2410.04689	null
2024-10-04	SpecSAR-Former: A Lightweight Transformer-based Network for Global LULC Mapping Using Integrated Sentinel-1 and Sentinel-2	Hao Yu et.al.	2410.03962	null
2024-10-04	Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features	Benyuan Meng et.al.	2410.03558	link
2024-10-04	Semantic Segmentation Based Quality Control of Histopathology Whole Slide Images	Abhijeet Patil et.al.	2410.03289	link
2024-10-04	HRVMamba: High-Resolution Visual State Space Model for Dense Prediction	Hao Zhang et.al.	2410.03174	null
2024-10-03	HiFiSeg: High-Frequency Information Enhanced Polyp Segmentation with Global-Local Vision Transformer	Jingjing Ren et.al.	2410.02528	null
2024-10-04	Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation	Muzhi Zhu et.al.	2410.02369	null
2024-10-03	RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds	Remco Royen et.al.	2410.02323	null
2024-10-03	Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network	Yangyang Qiu et.al.	2410.02224	null
2024-10-03	Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images	Qingyuan Liu et.al.	2410.02207	null
2024-10-02	SegEarth-OV: Towards Traning-Free Open-Vocabulary Segmentation for Remote Sensing Images	Kaiyu Li et.al.	2410.01768	link
2024-10-02	One-Shot Robust Imitation Learning for Long-Horizon Visuomotor Tasks from Unsegmented Demonstrations	Shaokang Wu et.al.	2410.01630	null
2024-10-02	Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation	Zhaofeng Shi et.al.	2410.01341	null
2024-10-02	VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings	Andrea Carrara et.al.	2410.01336	null
2024-10-01	RobustEMD: Domain Robust Matching for Cross-domain Few-shot Medical Image Segmentation	Yazhou Zhu et.al.	2410.01110	null
2024-10-01	Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer	Vlatko Spasev et.al.	2410.01092	null
2024-10-01	Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time	Chiao-An Yang et.al.	2410.01083	link
2024-10-01	DeepAerialMapper: Deep Learning-based Semi-automatic HD Map Creation for Highly Automated Vehicles	Robert Krajewski et.al.	2410.00769	null
2024-10-01	Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection	Pengxi Zeng et.al.	2410.00582	null
2024-10-01	Precise Workcell Sketching from Point Clouds Using an AR Toolbox	Krzysztof Zieliński et.al.	2410.00479	null
2024-09-30	AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation	Boyu Han et.al.	2409.20398	null
2024-09-30	Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation	Tillmann Rheude et.al.	2409.20287	link
2024-09-30	Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model	Fulong Ma et.al.	2409.20164	null
2024-09-30	Segmenting Wood Rot using Computer Vision Models	Roland Kammerbauer et.al.	2409.20137	null
2024-09-30	Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels	Heeseong Shin et.al.	2409.19846	null
2024-09-27	Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation	Raphael Hagmanns et.al.	2409.18788	null
2024-09-27	Learning from Pattern Completion: Self-supervised Controllable Generation	Zhiqiang Chen et.al.	2409.18694	link
2024-09-27	Reducing Semantic Ambiguity In Domain Adaptive Semantic Segmentation Via Probabilistic Prototypical Pixel Contrast	Xiaoke Hao et.al.	2409.18543	link
2024-10-01	Get It For Free: Radar Segmentation without Expert Labels and Its Application in Odometry and Localization	Siru Li et.al.	2409.18434	null
2024-09-26	Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning	Siyi Lu et.al.	2409.17659	null
2024-09-26	Global-Local Medical SAM Adaptor Based on Full Adaption	Meng Wang et.al.	2409.17486	null
2024-09-25	VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection	Liangyu Zhong et.al.	2409.17330	null
2024-09-25	2024 BRAVO Challenge Track 1 1st Place Report: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation	Tommie Kerssies et.al.	2409.17208	link
2024-09-25	WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks	Alberto Bacchin et.al.	2409.16999	link
2024-09-25	Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis	Illia Tsiporenko et.al.	2409.16940	null
2024-09-24	A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation	Avisha Kumar et.al.	2409.16441	null
2024-09-24	Instance Segmentation of Reinforced Concrete Bridges with Synthetic Point Clouds	Asad Ur Rahman et.al.	2409.16381	null
2024-09-24	Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation	Hannah Kerner et.al.	2409.16252	link
2024-09-24	Deep Learning for Precision Agriculture: Post-Spraying Evaluation and Deposition Estimation	Harry Rogers et.al.	2409.16213	link
2024-09-24	Potential Field as Scene Affordance for Behavior Change-Based Visual Risk Object Identification	Pang-Yuan Pao et.al.	2409.15846	null
2024-09-24	DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation	Soojin Jang et.al.	2409.15801	null
2024-09-24	Autonomous Hiking Trail Navigation via Semantic Segmentation and Geometric Analysis	Camndon Reed et.al.	2409.15671	null
2024-09-23	ZeroSCD: Zero-Shot Street Scene Change Detection	Shyam Sundar Kannan et.al.	2409.15255	null
2024-09-17	Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks	Edgar Heinert et.al.	2409.11373	null
2024-09-17	MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping	Amirreza Fateh et.al.	2409.11316	link
2024-09-17	Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark	Clifford Broni-Bediako et.al.	2409.11227	link
2024-09-17	HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios	Nick Theisen et.al.	2409.11205	link
2024-09-16	Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning	Amin Karimi Monsefi et.al.	2409.10362	null
2024-09-16	BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images	Wentao Wang et.al.	2409.10269	null
2024-09-15	Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation	Zhanteng Xie et.al.	2409.09899	null
2024-09-15	Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation	Qilong Zhangli et.al.	2409.09893	null
2024-09-15	High Definition Map Mapping and Update: A General Overview and Future Directions	Benny Wijaya et.al.	2409.09726	null
2024-09-14	Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation	Hugo Porta et.al.	2409.09497	null
2024-09-13	AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation	Zechao Sun et.al.	2409.08516	null
2024-09-13	VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation	Ezra MacDonald et.al.	2409.08461	link
2024-09-12	Bayesian Self-Training for Semi-Supervised 3D Segmentation	Ozan Unal et.al.	2409.08102	null
2024-09-12	Depth Matters: Exploring Deep Interactions of RGB-D for Semantic Segmentation in Traffic Scenes	Siyu Chen et.al.	2409.07995	null
2024-09-12	SURGIVID: Annotation-Efficient Surgical Video Object Discovery	Çağhan Köksal et.al.	2409.07801	null
2024-09-12	Lagrange Duality and Compound Multi-Attention Transformer for Semi-Supervised Medical Image Segmentation	Fuchen Zheng et.al.	2409.07793	link
2024-09-12	ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation	Fuchen Zheng et.al.	2409.07779	link
2024-09-12	Open-Vocabulary Remote Sensing Image Semantic Segmentation	Qinglong Cao et.al.	2409.07683	null
2024-09-11	Token Turing Machines are Efficient Vision Models	Purvish Jajal et.al.	2409.07613	null
2024-09-11	AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution	Wangduo Xie et.al.	2409.07171	null
2024-09-11	Brain-Inspired Stepwise Patch Merging for Vision Transformers	Yonghao Yu et.al.	2409.06963	null
2024-09-10	Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds	Mu Cai et.al.	2409.06827	link
2024-09-10	A Semantic Segmentation Approach on Sweet Orange Leaf Diseases Detection Utilizing YOLO	Sabit Ahamed Preanto et.al.	2409.06671	null
2024-09-10	PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation	Yin Hu et.al.	2409.06309	null
2024-09-10	EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation	Nischal Khanal et.al.	2409.06183	link
2024-09-09	SVS-GAN: Leveraging GANs for Semantic Video Synthesis	Khaled M. Seyam et.al.	2409.06074	null
2024-09-09	Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance	Quang-Huy Che et.al.	2409.06002	null
2024-09-09	Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features	Jacob Gildenblat et.al.	2409.05697	null
2024-09-09	ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions	Furqan Ahmed Shaik et.al.	2409.05327	null
2024-09-08	RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network	Zhiwei Lin et.al.	2409.04979	null
2024-09-06	Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation	Björn Michele et.al.	2409.04409	link
2024-09-05	Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution	Marga Don et.al.	2409.03754	link
2024-09-05	LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones	Moritz Nottebaum et.al.	2409.03460	link
2024-09-05	Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications	Tong Bu et.al.	2409.03368	null
2024-09-05	UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking	Md. Mahfuzur Rahman et.al.	2409.03245	null
2024-09-05	Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation	Xixi Jiang et.al.	2409.03228	link
2024-09-06	iSeg: An Iterative Refinement-based Framework for Training-free Segmentation	Lin Sun et.al.	2409.03209	link
2024-09-04	iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation	Hayeon Jo et.al.	2409.02838	null
2024-09-04	CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation	Minhee Cho et.al.	2409.02699	null
2024-09-04	SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction	Sumin Son et.al.	2409.02513	null
2024-09-03	K-Origins: Better Colour Quantification for Neural Networks	Lewis Mason et.al.	2409.02281	link
2024-09-03	AllWeatherNet:Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions	Chenghao Qian et.al.	2409.02045	null
2024-09-03	Segmenting Object Affordances: Reproducibility and Sensitivity to Scale	Tommaso Apicella et.al.	2409.01814	link
2024-09-03	Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation	Haodong Wang et.al.	2409.01662	null
2024-09-02	Semantic Segmentation from Image Labels by Reconstruction from Structured Decomposition	Xuanrui Zeng et.al.	2409.01472	link
2024-09-02	SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation	Alberto Bacchin et.al.	2409.01109	link
2024-09-02	Towards Robust Online Domain Adaptive Semantic Segmentation under Adverse Weather Conditions	Taorong Liu et.al.	2409.01072	null
2024-08-30	Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes	Li Zhang et.al.	2408.17421	link
2024-08-30	Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations	Ahmed Hammam et.al.	2408.17311	null
2024-08-30	Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training	Zizheng Huang et.al.	2408.17081	link
2024-08-30	Transient Fault Tolerant Semantic Segmentation for Autonomous Driving	Leonardo Iurada et.al.	2408.16952	link
2024-08-29	SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection	Rohit Venkata Sai Dulam et.al.	2408.16645	null
2024-08-29	MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation	Linyan Yang et.al.	2408.16478	null
2024-08-29	Multi-source Domain Adaptation for Panoramic Semantic Segmentation	Jing Jiang et.al.	2408.16469	null
2024-08-29	EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More	Kanghao Chen et.al.	2408.16254	null
2024-08-28	SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors	Zhiqing Zhang et.al.	2408.15887	null
2024-08-28	DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries	Yu Yang et.al.	2408.15813	null
2024-08-28	TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation	Junbao Zhou et.al.	2408.15657	link
2024-08-27	Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images	Silvia Seidlitz et.al.	2408.15373	link
2024-08-27	An Investigation on The Position Encoding in Vision-Based Dynamics Prediction	Jiageng Zhu et.al.	2408.15201	null
2024-08-27	Applying ViT in Generalized Few-shot Semantic Segmentation	Liyuan Geng et.al.	2408.14957	link
2024-08-27	Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack	Naufal Suryanto et.al.	2408.14879	null
2024-08-27	MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation	Yuanbing Zhu et.al.	2408.14776	null
2024-08-26	Physically Feasible Semantic Segmentation	Shamik Basu et.al.	2408.14672	link
2024-08-25	OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation	Muhammad Rameez ur Rahman et.al.	2408.13936	link
2024-08-25	Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation	Yuwen Pan et.al.	2408.13838	null
2024-08-25	TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather	Xiongwei Zhao et.al.	2408.13802	link
2024-08-25	ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation	Xin Zhang et.al.	2408.13771	null
2024-08-25	Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation	Zhaoyang Li et.al.	2408.13752	null
2024-08-24	ESA: Annotation-Efficient Active Learning for Semantic Segmentation	Jinchao Ge et.al.	2408.13491	link
2024-08-23	Accuracy Improvement of Cell Image Segmentation Using Feedback Former	Hinako Mitsuoka et.al.	2408.12974	null
2024-08-23	Image Segmentation in Foundation Model Era: A Survey	Tianfei Zhou et.al.	2408.12957	null
2024-08-23	Symmetric masking strategy enhances the performance of Masked Image Modeling	Khanh-Binh Nguyen et.al.	2408.12772	null
2024-08-22	Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets	Wolfgang Boettcher et.al.	2408.12489	null
2024-08-22	The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation	Tuyen Tran et.al.	2408.12447	null
2024-08-21	UNetMamba: Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images	Enze Zhu et.al.	2408.11545	null
2024-08-21	Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation	Chuandong Liu et.al.	2408.11280	null
2024-08-20	NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency	Valentinos Pariza et.al.	2408.11054	null
2024-08-20	CO2Wounds-V2: Extended Chronic Wounds Dataset From Leprosy Patients	Karen Sanchez et.al.	2408.10827	null
2024-08-20	Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?	Chen Liang et.al.	2408.10627	null
2024-08-20	Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation	Jiawei Han et.al.	2408.10537	link
2024-08-19	Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network	Rasha Alshawi et.al.	2408.10181	null
2024-08-19	Dynamic Label Injection for Imbalanced Industrial Defect Segmentation	Emanuele Caruso et.al.	2408.10031	link
2024-08-19	Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis	Kira Maag et.al.	2408.10021	null
2024-08-19	Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving	Jun Yan et.al.	2408.09839	link
2024-08-18	OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras	Muhammad Rameez Ur Rahman et.al.	2408.09424	link
2024-08-18	Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration	Hao Ai et.al.	2408.09336	null
2024-08-17	Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology	Junchao Zhu et.al.	2408.09278	link
2024-08-17	GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation	Weiming Zhang et.al.	2408.09115	null
2024-08-17	Depth-guided Texture Diffusion for Image Semantic Segmentation	Wei Sun et.al.	2408.09097	null
2024-08-15	5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks	Dongshuo Yin et.al.	2408.08345	link
2024-08-14	MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis	Nimeesha Chan et.al.	2408.07773	link
2024-08-15	MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation	Beoungwoo Kang et.al.	2408.07576	link
2024-08-15	MagicFace: Training-free Universal-Style Human Image Customized Synthesis	Yibin Wang et.al.	2408.07433	null
2024-08-14	Segment Using Just One Example	Pratik Vora et.al.	2408.07393	null
2024-08-14	Ensemble architecture in polyp segmentation	Hao-Yun Hsu et.al.	2408.07262	link
2024-08-14	Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks	Raghavendra Singh et.al.	2408.07243	null
2024-08-14	Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training	Ethan Kou et.al.	2408.07239	null
2024-08-13	ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation	Jingyun Wang et.al.	2408.06747	link
2024-08-10	Dilated Convolution with Learnable Spacings	Ismail Khalfaoui-Hassani et.al.	2408.06383	null
2024-08-12	Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images	Siladittya Manna et.al.	2408.06235	null
2024-08-12	A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting	Felix Assion et.al.	2408.06071	null
2024-08-12	Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning	Xinrong Hu et.al.	2408.05889	null
2024-08-11	Seg-CycleGAN : SAR-to-optical image translation guided by a downstream task	Hannuo Zhang et.al.	2408.05777	null
2024-08-11	MacFormer: Semantic Segmentation with Fine Object Boundaries	Guoan Xu et.al.	2408.05699	null
2024-08-10	Multimodal generative semantic communication based on latent diffusion model	Weiqi Fu et.al.	2408.05455	null
2024-08-09	In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation	Dahyun Kang et.al.	2408.04961	link
2024-08-09	ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation	Mengcheng Lan et.al.	2408.04883	link
2024-08-09	Extracting Signal Electron Trajectories in the COMET Phase-I Cylindrical Drift Chamber Using Deep Learning	Fumihiro Kaneko et.al.	2408.04795	null
2024-08-08	SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation	Jieming Yu et.al.	2408.04593	null
2024-08-08	SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios	Sriram Mandalika et.al.	2408.04482	null
2024-08-08	What could go wrong? Discovering and describing failure modes in computer vision	Gabriela Csurka et.al.	2408.04471	null
2024-08-07	CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications	Tianfang Zhang et.al.	2408.03703	link
2024-08-07	SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology	Mingya Zhang et.al.	2408.03651	link
2024-08-06	Post-Mortem Human Iris Segmentation Analysis with Deep Learning	Afzal Hossain et.al.	2408.03448	null
2024-08-06	Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression	Jonas Schmitt et.al.	2408.03046	link
2024-08-05	Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation	Sai Prasanna et.al.	2408.02297	null
2024-08-05	Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs	Jeongkee Lim et.al.	2408.02261	null
2024-08-05	Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders	Muhammad Abdullah Jamal et.al.	2408.02245	null
2024-08-04	Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation	Ye Du et.al.	2408.02039	null
2024-08-03	Bayesian Active Learning for Semantic Segmentation	Sima Didari et.al.	2408.01694	null
2024-08-03	A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection	Omkar Oak et.al.	2408.01692	null
2024-08-03	Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation	Balázs Opra et.al.	2408.01640	null
2024-08-02	Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans	Lukas Kratochvila et.al.	2408.01526	null
2024-08-02	Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation	Yuanzhi Su et.al.	2408.01356	null
2024-08-02	StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation	Bingyu Li et.al.	2408.01343	null
2024-08-02	Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach	Yabin Zhu et.al.	2408.00969	null
2024-08-01	Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation	Siyu Jiao et.al.	2408.00744	null
2024-08-01	Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function	Matias Oscar Volman Stern et.al.	2408.00707	null
2024-08-01	AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation	Asbjørn Munk et.al.	2408.00640	null
2024-08-01	SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation	Shengbo Tan et.al.	2408.00496	null
2024-07-31	Open-Vocabulary Audio-Visual Semantic Segmentation	Ruohao Guo et.al.	2407.21721	null
2024-07-31	MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment	Anurag Das et.al.	2407.21654	null
2024-07-31	Small Object Few-shot Segmentation for Vision-based Industrial Inspection	Zilong Zhang et.al.	2407.21351	null
2024-07-31	On-the-fly Point Feature Representation for Point Clouds Analysis	Jiangyi Wang et.al.	2407.21335	null
2024-07-31	Fine-grained Metrics for Point Cloud Semantic Segmentation	Zhuheng Lu et.al.	2407.21289	null
2024-07-30	PLANesT-3D: A new annotated dataset for segmentation of 3D plant point clouds	Kerem Mertoğlu et.al.	2407.21150	null
2024-07-30	Learning Ordinality in Semantic Segmentation	Rafael Cristino et.al.	2407.20959	null
2024-07-29	Improving 2D Feature Representations by 3D-Aware Fine-Tuning	Yuanwen Yue et.al.	2407.20229	null
2024-07-29	Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection With Sky-Annotated Dataset	Yimian Dai et.al.	2407.20078	link
2024-07-29	Language-driven Grasp Detection with Mask-guided Attention	Tuan Van Vo et.al.	2407.19877	null
2024-07-29	Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets	Muhammad Abdullah Jamal et.al.	2407.19714	null
2024-07-29	ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement	Ezequiel Perez-Zarate et.al.	2407.19708	link
2024-07-28	ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding	Zhen Chen et.al.	2407.19435	link
2024-07-27	Ensembling convolutional neural networks for human skin segmentation	Patryk Kuban et.al.	2407.19310	null
2024-07-27	Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network	Gang Pan et.al.	2407.19271	null
2024-07-26	Sparse Refinement for Efficient High-Resolution Semantic Segmentation	Zhijian Liu et.al.	2407.19014	null
2024-07-29	Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation	Jingjun Yi et.al.	2407.18568	null
2024-07-25	Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception	Julia Hindel et.al.	2407.18145	null
2024-07-25	TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework	Guanfeng Tang et.al.	2407.18038	null
2024-07-25	Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions	Jan Nikolas Morshuis et.al.	2407.18026	link
2024-07-24	Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation	Hyunwoo Yu et.al.	2407.17261	link
2024-07-24	Trans2Unet: Neural fusion for Nuclei Semantic Segmentation	Dinh-Phu Tran et.al.	2407.17181	null
2024-07-24	PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning	Mu Chen et.al.	2407.17101	null
2024-07-25	Enhancing Environmental Monitoring through Multispectral Imaging: The WasteMS Dataset for Semantic Segmentation of Lakeside Waste	Qinfeng Zhu et.al.	2407.17028	link
2024-07-24	Progressive Query Refinement Framework for Bird's-Eye-View Semantic Segmentation from Surrounding Images	Dooseop Choi et.al.	2407.17003	link
2024-07-23	Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving	Anam Manzoor et.al.	2407.16647	null
2024-07-23	Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging	Daniela L. Ramos et.al.	2407.16608	null
2024-07-23	Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision	Aditya Krishnan et.al.	2407.16102	null
2024-07-22	MILAN: Milli-Annotations for Lidar Semantic Segmentation	Nermin Samet et.al.	2407.15797	null
2024-07-22	Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond	Silvio Galesso et.al.	2407.15739	link
2024-07-22	MSSPlace: Multi-Sensor Place Recognition with Visual and Text Semantics	Alexander Melekhin et.al.	2407.15663	link
2024-07-22	Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling	Bo Yuan et.al.	2407.15429	link
2024-07-22	Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data	Junha Song et.al.	2407.15383	null
2024-07-21	Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation	Xiaoyang Wu et.al.	2407.15282	null
2024-07-20	Downstream-Pretext Domain Knowledge Traceback for Active Learning	Beichen Zhang et.al.	2407.14720	null
2024-07-19	Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model	Kun Zhao et.al.	2407.14326	null
2024-07-19	Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation	Zhengyuan Xie et.al.	2407.14142	link
2024-07-19	GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation	Florian Chabot et.al.	2407.14108	null
2024-07-18	Many Perception Tasks are Highly Redundant Functions of their Input Data	Rahul Ramesh et.al.	2407.13841	null
2024-07-18	GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model	Abdelrahman Shaker et.al.	2407.13772	link
2024-07-18	SegPoint: Segment Any Point Cloud via Large Language Model	Shuting He et.al.	2407.13761	null
2024-07-18	MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis	Ziming Zhong et.al.	2407.13675	link
2024-07-18	Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models	Xiaoyu Zhu et.al.	2407.13642	null
2024-07-18	FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures	Hao Lu et.al.	2407.13500	link
2024-07-18	FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions	Sohyun Lee et.al.	2407.13437	null
2024-07-18	Lightweight Uncertainty Quantification with Simplex Semantic Segmentation for Terrain Traversability	Judith Dijk et.al.	2407.13392	null
2024-07-18	Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation	Chang Liu et.al.	2407.13363	null
2024-07-18	Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation	Shoumeng Qiu et.al.	2407.13254	null
2024-07-18	OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird's-eye-view Vehicle Semantic Segmentation	Jian Sun et.al.	2407.13137	null
2024-07-16	Mitigating Background Shift in Class-Incremental Semantic Segmentation	Gilhan Park et.al.	2407.11859	link
2024-07-16	Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation	Juncheng Ma et.al.	2407.11820	null
2024-07-16	XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach	Truong Thanh Hung Nguyen et.al.	2407.11771	null
2024-07-16	OAM-TCD: A globally diverse dataset of high-resolution tree cover maps	Josh Veitch-Michaelis et.al.	2407.11743	null
2024-07-16	SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds	Yanbo Wang et.al.	2407.11569	link
2024-07-16	Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations	Yunya Gao et.al.	2407.11381	link
2024-07-16	Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities	Xu Zheng et.al.	2407.11351	null
2024-07-16	Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation	Xu Zheng et.al.	2407.11344	null
2024-07-16	TCFormer: Visual Recognition via Token Clustering Transformer	Wang Zeng et.al.	2407.11321	link
2024-07-15	Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding	Danish Nazir et.al.	2407.11224	null
2024-07-15	No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations	Walter Simoncini et.al.	2407.10964	link
2024-07-15	APC: Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2407.10649	null
2024-07-15	Automated Label Unification for Multi-Dataset Semantic Segmentation with GNNs	Rong Ma et.al.	2407.10534	null
2024-07-14	Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data	Tuo Feng et.al.	2407.10200	link
2024-07-14	RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation	Li Li et.al.	2407.10159	link
2024-07-14	HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation	Chengjie Jiang et.al.	2407.10047	null
2024-07-13	Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation	Anqi Zhang et.al.	2407.09838	null
2024-07-13	Enhancing Semantic Segmentation with Adaptive Focal Loss: A Novel Approach	Md Rakibul Islam et.al.	2407.09828	null
2024-07-13	3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance	Xiaoxu Xu et.al.	2407.09826	null
2024-07-13	TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation	Xiaopei Wu et.al.	2407.09751	null
2024-07-12	FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background	Muhammad Ali et.al.	2407.09379	link
2024-07-12	Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy	Julian Wyatt et.al.	2407.09192	null
2024-07-12	Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off	Levente Halmosi et.al.	2407.09150	link
2024-07-12	Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation	Wei Cong et.al.	2407.09047	null
2024-07-12	Textual Query-Driven Mask Transformer for Domain Generalized Segmentation	Byeonghyun Pak et.al.	2407.09033	null
2024-07-12	Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation	Zihao Li et.al.	2407.08994	null
2024-07-11	Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation	Tong Shao et.al.	2407.08268	null
2024-07-11	Enrich the content of the image Using Context-Aware Copy Paste	Qiushi Guo et.al.	2407.08151	null
2024-07-10	MambaVision: A Hybrid Mamba-Transformer Vision Backbone	Ali Hatamizadeh et.al.	2407.08083	link
2024-07-10	Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift	Elliot Vincent et.al.	2407.07616	link
2024-07-10	H-FCBFormer Hierarchical Fully Convolutional Branch Transformer for Occlusal Contact Segmentation with Articulating Paper	Ryan Banks et.al.	2407.07604	link
2024-07-11	Trainable Highly-expressive Activation Functions	Irit Chelly et.al.	2407.07564	null
2024-07-10	Deformable-Heatmap-Segmentation for Automobile Visual Perception	Hongyu Jin et.al.	2407.07493	null
2024-07-10	Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining	Tianfang Sun et.al.	2407.07465	null
2024-07-11	HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation	Guoan Xu et.al.	2407.07441	null
2024-07-09	ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation	Yuyuan Liu et.al.	2407.07171	link
2024-07-08	Training-free CryoET Tomogram Segmentation	Yizhou Zhao et.al.	2407.06833	link
2024-07-09	CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM	Aditya Murali et.al.	2407.06795	null
2024-07-09	LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration	Jiayi Liu et.al.	2407.06512	link
2024-07-08	Leveraging image captions for selective whole slide image annotation	Jingna Qiu et.al.	2407.06363	null
2024-07-08	Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots	Siva Krishna Ravipati et.al.	2407.06077	null
2024-07-08	Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts	Puzuo Wang et.al.	2407.06043	null
2024-07-08	RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation	Sarah Elmahdy et.al.	2407.06016	link
2024-07-07	Semantic Segmentation for Real-World and Synthetic Vehicle's Forward-Facing Camera Images	Tuan T. Nguyen et.al.	2407.05452	null
2024-07-07	Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness	Idris Hamoud et.al.	2407.05448	null
2024-07-06	A Study of Test-time Contrastive Concepts for Open-world, Open-vocabulary Semantic Segmentation	Monika Wysoczańska et.al.	2407.05061	null
2024-07-06	BlessemFlood21: Advancing Flood Analysis with a High-Resolution Georeferenced Dataset for Humanitarian Aid Support	Vladyslav Polushko et.al.	2407.05007	null
2024-07-05	Explainable Metric Learning for Deflating Data Bias	Emma Andrews et.al.	2407.04866	null
2024-07-05	LMSeg: A deep graph message-passing network for efficient and accurate semantic segmentation of large-scale 3D landscape meshes	Zexian Huang et.al.	2407.04326	null
2024-07-04	Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier	Prantik Howlader et.al.	2407.04036	link
2024-07-04	Relative Difficulty Distillation for Semantic Segmentation	Dong Liang et.al.	2407.03719	null
2024-07-04	POSTURE: Pose Guided Unsupervised Domain Adaptation for Human Body Part Segmentation	Arindam Dutta et.al.	2407.03549	null
2024-07-03	A Unified Framework for 3D Scene Understanding	Wei Xu et.al.	2407.03263	null
2024-07-03	ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation	Chang Li et.al.	2407.03033	null
2024-07-03	ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation	Yipin Guo et.al.	2407.02881	null
2024-07-03	Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation	Tao Chen et.al.	2407.02768	null
2024-07-02	Open Panoramic Segmentation	Junwei Zheng et.al.	2407.02685	null
2024-07-02	Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction	Tinghuai Wang et.al.	2407.02639	null
2024-07-02	Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather	Junsung Park et.al.	2407.02286	link
2024-07-02	MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders	Baijiong Lin et.al.	2407.02228	link
2024-07-02	Occlusion-Aware Seamless Segmentation	Yihong Cao et.al.	2407.02182	link
2024-07-02	VRBiom: A New Periocular Dataset for Biometric Applications of HMD	Ketan Kotwal et.al.	2407.02150	null
2024-07-02	Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts	Pasquale De Marinis et.al.	2407.02075	null
2024-07-02	Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning	Chengchao Shen et.al.	2407.02014	link
2024-07-01	Label-free Neural Semantic Image Synthesis	Jiayi Wang et.al.	2407.01790	null
2024-07-01	PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction	Xuan Yu et.al.	2407.01349	null
2024-07-01	CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes	Danial Qashqai et.al.	2407.01328	link
2024-06-29	SolarSAM: Building-scale Photovoltaic Potential Assessment Based on Segment Anything Model (SAM) and Remote Sensing for Emerging City	Guohao Wang et.al.	2407.00296	link
2024-07-01	Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding	Yifan Tang et.al.	2406.19791	null
2024-06-28	Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation	Junsung Park et.al.	2406.19638	link
2024-06-28	PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation	Deyi Ji et.al.	2406.19632	null
2024-06-27	Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model	Haobo Yuan et.al.	2406.19369	null
2024-06-27	ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation	Nazanin Moradinasab et.al.	2406.19225	null
2024-06-30	Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO	Fuseini Mumuni et.al.	2406.19057	null
2024-06-27	Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation	Tao Lian et.al.	2406.18809	null
2024-06-26	CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data	Nikolaos Dionelis et.al.	2406.18279	null
2024-06-26	The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval	Meinardus Boris et.al.	2406.18113	link
2024-06-26	Few-Shot Medical Image Segmentation with High-Fidelity Prototypes	Song Tang et.al.	2406.18074	link
2024-06-25	Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation	Xuming Zhang et.al.	2406.17679	null
2024-06-25	DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation	Ahmad Mohammadshirazi et.al.	2406.17591	link
2024-06-25	Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation	Felix Stillger et.al.	2406.17541	null
2024-06-25	Investigating Self-Supervised Methods for Label-Efficient Learning	Srinivasa Rao Nandam et.al.	2406.17460	null
2024-06-25	Pseudo Labelling for Enhanced Masked Autoencoders	Srinivasa Rao Nandam et.al.	2406.17450	null
2024-06-25	Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model	Zhuoyuan Li et.al.	2406.17442	null
2024-06-25	Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes	Qi Ma et.al.	2406.17438	link
2024-06-24	Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation	Yizheng Wu et.al.	2406.16776	link
2024-06-24	μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation	Pierangela Bruno et.al.	2406.16724	null
2024-06-24	GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection	Harnaik Dhami et.al.	2406.16625	null
2024-06-24	LOGCAN++: Local-global class-aware network for semantic segmentation of remote sensing images	Xiaowen Ma et.al.	2406.16502	link
2024-06-24	Cascade Reward Sampling for Efficient Decoding-Time Alignment	Bolian Li et.al.	2406.16306	null
2024-06-24	SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments	Neng Wang et.al.	2406.16279	link
2024-06-23	UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery	Pengfei Zhang et.al.	2406.16129	null
2024-06-22	Fine-grained Background Representation for Weakly Supervised Semantic Segmentation	Xu Yin et.al.	2406.15755	null
2024-06-20	Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery	Ilham Adi Panuntun et.al.	2406.14220	null
2024-06-20	Trusting Semantic Segmentation Networks	Samik Some et.al.	2406.14201	null
2024-06-20	EvSegSNN: Neuromorphic Semantic Segmentation for Event Data	Dalia Hareb et.al.	2406.14178	null
2024-06-20	Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images	Qinfeng Zhu et.al.	2406.14086	link
2024-06-19	Search-based DNN Testing and Retraining with GAN-enhanced Simulations	Mohammed Oualid Attaoui et.al.	2406.13359	null
2024-06-19	Deep Learning-Based 3D Instance and Semantic Segmentation: A Review	Siddiqui Muhammad Yasir et.al.	2406.13308	null
2024-06-18	Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation	Guoyu Yang et.al.	2406.12496	link
2024-06-18	Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble	Wang Liu et.al.	2406.12271	null
2024-06-17	OoDIS: Anomaly Instance Segmentation Benchmark	Alexey Nekrasov et.al.	2406.11835	link
2024-06-17	Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT	Maximilian E. Tschuchnig et.al.	2406.11650	null
2024-06-17	SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation	Zhenchao Lin et.al.	2406.11441	link
2024-06-17	Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding	Yunsong Wang et.al.	2406.11283	null
2024-06-17	Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation	Bingfeng Zhang et.al.	2406.11189	null
2024-06-16	$α$ -SSC: Uncertainty-Aware Camera-based 3D Semantic Scene Completion	Sanbao Su et.al.	2406.11021	null
2024-06-16	PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery	Libo Wang et.al.	2406.10828	link
2024-06-15	GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR	Bharat Singh et.al.	2406.10722	null
2024-06-15	A Late-Stage Bitemporal Feature Fusion Network for Semantic Change Detection	Chenyao Zhou et.al.	2406.10678	link
2024-06-14	ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers	Narges Norouzi et.al.	2406.09936	null
2024-06-14	Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions	Aldi Piroli et.al.	2406.09906	null
2024-06-14	Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation	Brunó B. Englert et.al.	2406.09896	link
2024-06-14	Open-Vocabulary Semantic Segmentation with Image Embedding Balancing	Xiangheng Shan et.al.	2406.09829	link
2024-06-13	Instance-level quantitative saliency in multiple sclerosis lesion segmentation	Federico Spagnolo et.al.	2406.09335	null
2024-06-13	APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation	Weizhao He et.al.	2406.08372	null
2024-06-12	Dataset Enhancement with Instance-Level Augmentations	Orest Kupyn et.al.	2406.08249	link
2024-06-13	A $^{2}$ -MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder	Lixian Zhang et.al.	2406.08079	null
2024-06-12	OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding	Yinan Deng et.al.	2406.08009	link
2024-06-12	SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation	Chanda Grover Kamra et.al.	2406.07986	link
2024-06-12	Small Scale Data-Free Knowledge Distillation	He Liu et.al.	2406.07876	link
2024-06-11	Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph	Sergey Linok et.al.	2406.07113	null
2024-06-11	PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving	Yining Shi et.al.	2406.07037	null
2024-06-12	LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection	Jiahua Xu et.al.	2406.07023	null
2024-06-10	Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation	Dong Zhao et.al.	2406.06813	link
2024-06-09	Transforming Heart Chamber Imaging: Self-Supervised Learning for Whole Heart Reconstruction and Segmentation	Abdul Qayyum et.al.	2406.06643	null
2024-06-10	Merlin: A Vision Language Foundation Model for 3D Computed Tomography	Louis Blankemeier et.al.	2406.06512	null
2024-06-10	UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving	Daniel Bogdoll et.al.	2406.06370	null
2024-06-09	Scaling Graph Convolutions for Mobile Vision	William Avery et.al.	2406.05850	link
2024-06-09	Solution for CVPR 2024 UG2+ Challenge Track on All Weather Semantic Segmentation	Jun Yu et.al.	2406.05837	null
2024-06-09	Convolution and Attention-Free Mamba-based Cardiac Image Segmentation	Abbas Khan et.al.	2406.05786	null
2024-06-09	Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language	Mark Hamilton et.al.	2406.05629	link
2024-06-08	A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+	Jianzhao Wang et.al.	2406.05513	null
2024-06-08	Layered Image Vectorization via Semantic Simplification	Zhenyu Wang et.al.	2406.05404	null
2024-06-08	1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation	Qingfeng Liu et.al.	2406.05352	null
2024-06-07	USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation	Xiaoqi Wang et.al.	2406.05271	null
2024-06-07	Semantic Segmentation on VSPW Dataset through Masked Video Consistency	Chen Liang et.al.	2406.04979	null
2024-06-07	Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment	Venkanna Babu Guthula et.al.	2406.04949	null
2024-06-06	Characterizing segregation in blast rock piles a deep-learning approach leveraging aerial image analysis	Chengeng Liu et.al.	2406.04149	null
2024-06-06	Frequency-based Matcher for Long-tailed Semantic Segmentation	Shan Li et.al.	2406.03917	link
2024-06-07	Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge	Nan Zhang et.al.	2406.03799	link
2024-06-06	DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation	Zilu Guo et.al.	2406.03702	link
2024-06-05	Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation	Maximilian Zenk et.al.	2406.03323	null
2024-06-05	Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy	Yunho Kim et.al.	2406.02989	null
2024-06-04	W-RIZZ: A Weakly-Supervised Framework for Relative Traversability Estimation in Mobile Robotics	Andre Schreiber et.al.	2406.02822	link
2024-06-04	Window to Wall Ratio Detection using SegFormer	Zoe De Simone et.al.	2406.02706	link
2024-06-04	Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning	Heather Doig et.al.	2406.01932	null
2024-06-03	EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding	Thanh-Dat Truong et.al.	2406.01429	null
2024-06-03	TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation	Antonio Santo et.al.	2406.01395	link
2024-06-03	ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds	Ka Lung Cheung et.al.	2406.01337	link
2024-06-03	LSKSANet: A Novel Architecture for Remote Sensing Image Semantic Segmentation Leveraging Large Selective Kernel and Sparse Attention Mechanism	Miao Fu et.al.	2406.01228	null
2024-06-04	GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer	Ding Jia et.al.	2406.01210	link
2024-06-03	S-CycleGAN: Semantic Segmentation Enhanced CT-Ultrasound Image-to-Image Translation for Robotic Ultrasonography	Yuhan Song et.al.	2406.01191	null
2024-06-02	Diffusion Features to Bridge Domain Gap for Semantic Segmentation	Yuxiang Ji et.al.	2406.00777	null
2024-06-02	Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation	Yunheng Li et.al.	2406.00670	null
2024-06-02	Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024	Biao Wu et.al.	2406.00587	null
2024-05-31	Uncertainty Quantification for Bird's Eye View Semantic Segmentation: Methods and Benchmarks	Linlin Yu et.al.	2405.20986	null
2024-05-31	Revisiting and Maximizing Temporal Knowledge in Semi-supervised Semantic Segmentation	Wooseok Shin et.al.	2405.20610	link
2024-05-30	P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation	Qi Zhang et.al.	2405.20443	null
2024-05-30	SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow	Chaoyang Wang et.al.	2405.20282	link
2024-05-30	MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion	Angel Villar-Corrales et.al.	2405.19921	link
2024-05-30	Open-Set Domain Adaptation for Semantic Segmentation	Seun-An Choe et.al.	2405.19899	link
2024-05-30	DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation	Ron Keuth et.al.	2405.19746	link
2024-05-30	Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes	Yong-Qiang Mao et.al.	2405.19735	null
2024-05-30	CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation	Ankush Gajanan Arudkar et.al.	2405.19672	null
2024-05-29	Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation	Lianlei Shan et.al.	2405.19568	null
2024-05-29	Enabling Visual Recognition at Radio Frequency	Haowen Lai et.al.	2405.19516	null
2024-05-29	Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models	Tianrun Chen et.al.	2405.19326	null
2024-05-29	A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation	Niclas Vödisch et.al.	2405.19035	link
2024-05-29	Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation	Zelin Peng et.al.	2405.18840	null
2024-05-28	Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation	JuneHyoung Kwon et.al.	2405.18148	null
2024-05-28	Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images	Lianlei Shan et.al.	2405.18078	null
2024-05-28	RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields	Mihnea-Bogdan Jurca et.al.	2405.18033	null
2024-05-28	DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture	Shentong Mo et.al.	2405.17995	null
2024-05-28	The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention	Xingyu Ding et.al.	2405.17776	null
2024-05-27	Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation	Steven Landgraf et.al.	2405.17097	null
2024-05-27	DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking	Hongtao Wang et.al.	2405.16980	null
2024-05-27	Collective Perception Datasets for Autonomous Driving: A Comprehensive Review	Sven Teufel et.al.	2405.16973	null
2024-05-27	Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models	Qian Wang et.al.	2405.16947	null
2024-05-27	A re-calibration method for object detection with multi-modal alignment bias in autonomous driving	Zhihang Song et.al.	2405.16848	null
2024-05-25	BOLD: Boolean Logic Deep Learning	Van Minh Nguyen et.al.	2405.16339	null
2024-05-25	Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation	Huizhou Chen et.al.	2405.16099	null
2024-05-25	Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality	Hakim Ikebayashi et.al.	2405.16008	null
2024-05-24	Visualize and Paint GAN Activations	Rudolf Herdt et.al.	2405.15636	null
2024-05-24	Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets	Hoàng-Ân Lê et.al.	2405.15394	null
2024-05-24	U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation	Bingyu Li et.al.	2405.15365	link
2024-05-24	Cross-Domain Few-Shot Semantic Segmentation via Doubly Matching Transformation	Jiayi Chen et.al.	2405.15265	null
2024-05-23	Mamba-R: Vision Mamba ALSO Needs Registers	Feng Wang et.al.	2405.14858	null
2024-05-23	Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation	Daniel Kienzle et.al.	2405.14467	null
2024-05-23	MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models	Jiuming Liu et.al.	2405.14338	null
2024-05-23	Tuning-free Universally-Supervised Semantic Segmentation	Xiaobo Yang et.al.	2405.14294	null
2024-05-23	SCMix: Stochastic Compound Mixing for Open Compound Domain Adaptation in Semantic Segmentation	Kai Yao et.al.	2405.14278	null
2024-05-23	Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations	Mohammed Baharoon et.al.	2405.14239	null
2024-05-24	Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification	Taylor Archibald et.al.	2405.14162	null
2024-05-23	Skip-SCAR: A Modular Approach to ObjectGoal Navigation with Sparsity and Adaptive Skips	Yaotian Liu et.al.	2405.14154	null
2024-05-22	TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System	Diogo Lavado et.al.	2405.13989	null
2024-05-22	Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer	Qihang Fan et.al.	2405.13337	null
2024-05-21	Transparency Distortion Robustness for SOTA Image Segmentation Tasks	Volker Knauthe et.al.	2405.12864	null
2024-05-20	A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation	Sushmita Sarker et.al.	2405.11903	null
2024-05-20	Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments	Jooyong Park et.al.	2405.11855	null
2024-05-20	Universal Organizer of SAM for Unsupervised Semantic Segmentation	Tingting Li et.al.	2405.11742	null
2024-05-19	Interpreting a Semantic Segmentation Model for Coastline Detection	Conor O'Sullivan et.al.	2405.11500	null
2024-05-17	CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation	Mushui Liu et.al.	2405.10530	link
2024-05-16	Towards Task-Compatible Compressible Representations	Anderson de Andrade et.al.	2405.10244	link
2024-05-16	A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance	Andrea Matteazzi et.al.	2405.10046	null
2024-05-16	Towards Realistic Incremental Scenario in Class Incremental Semantic Segmentation	Jihwan Kwak et.al.	2405.09858	null
2024-05-15	Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation	Guo Yachan et.al.	2405.09682	null

(back to top)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Updated on 2024.11.24

Depth Estimation

Semactic Segmentation

Files

README.md

Latest commit

History

README.md

File metadata and controls

Updated on 2024.11.24

Depth Estimation

Semactic Segmentation