GitHub - ZhuYingJessica/cv-daily: This is an Arxiv paper collection

Updated on 2024.11.24

Usage instructions: here

Table of Contents

Depth Estimation
Semactic Segmentation

Depth Estimation

Publish Date	Title	Authors	PDF	Code
2024-11-21	StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart	Jian Shi et.al.	2411.14295	null
2024-11-20	DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild	Weicai Ye et.al.	2411.13291	null
2024-11-20	OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater Imaging	Rajini Makam et.al.	2411.13230	null
2024-11-15	SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction	Yutao Tang et.al.	2411.12592	link
2024-11-18	Towards Degradation-Robust Reconstruction in Generalizable NeRF	Chan Ho Park et.al.	2411.11691	null
2024-11-18	MGNiceNet: Unified Monocular Geometric Scene Understanding	Markus Schön et.al.	2411.11466	null
2024-11-18	The ADUULM-360 Dataset -- A Multi-Modal Dataset for Depth Estimation in Adverse Weather	Markus Schön et.al.	2411.11455	null
2024-11-18	GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views	Boyao Zhou et.al.	2411.11363	null
2024-11-18	Scalable Autoregressive Monocular Depth Estimation	Jinhong Wang et.al.	2411.11361	null
2024-11-16	MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation	Ansh Shah et.al.	2411.10886	link
2024-11-19	EVT: Efficient View Transformation for Multi-Modal 3D Object Detection	Yongjin Lee et.al.	2411.10715	null
2024-11-15	Efficient Depth Estimation for Unstable Stereo Camera Systems on AR Glasses	Yongfan Liu et.al.	2411.10013	null
2024-11-14	Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting	Yian Wang et.al.	2411.09823	null
2024-11-14	Adversarial Attacks Using Differentiable Rendering: A Survey	Matthew Hull et.al.	2411.09749	null
2024-11-14	Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching	Yuran Wang et.al.	2411.09151	null
2024-11-13	OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances	Youqi Liao et.al.	2411.08665	null
2024-11-13	Scaling Properties of Diffusion Models for Perceptual Tasks	Rahul Ravishankar et.al.	2411.08034	null
2024-11-11	$SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation	Yinshuang Xu et.al.	2411.07326	null
2024-11-08	Enhancing Depth Image Estimation for Underwater Robots by Combining Image Processing and Machine Learning	Quang Truong Nguyen et.al.	2411.05344	null
2024-11-08	SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection	Yun Zhao et.al.	2411.05292	null
2024-11-07	D $^3$ epth: Self-Supervised Depth Estimation with Dynamic Mask in Dynamic Scenes	Siyu Chen et.al.	2411.04826	null
2024-11-06	Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation	Teppei Kurita et.al.	2411.04714	null
2024-11-07	Enhancing Bronchoscopy Depth Estimation through Synthetic-to-Real Domain Adaptation	Qingyao Tian et.al.	2411.04404	null
2024-11-04	PMPNet: Pixel Movement Prediction Network for Monocular Depth Estimation in Dynamic Scenes	Kebin Peng et.al.	2411.04227	null
2024-11-06	Adaptive Stereo Depth Estimation with Multi-Spectral Images Across All Lighting Conditions	Zihan Qin et.al.	2411.03638	null
2024-11-05	Monocular Event-Based Vision for Obstacle Avoidance with a Quadrotor	Anish Bhattacharya et.al.	2411.03303	null
2024-11-05	Correlation of Object Detection Performance with Visual Saliency and Depth Estimation	Matthias Bartolo et.al.	2411.02844	link
2024-11-05	FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training	Ruihong Yin et.al.	2411.02229	null
2024-11-05	Improving Domain Generalization in Self-supervised Monocular Depth Estimation via Stabilized Adversarial Training	Yuanqi Yao et.al.	2411.02149	null
2024-11-01	MultiDepth: Multi-Sample Priors for Refining Monocular Metric Depth Estimations in Indoor Scenes	Sanghyun Byun et.al.	2411.01048	null
2024-11-01	On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDAR	Li Li et.al.	2411.00600	link
2024-10-31	Optical Lens Attack on Monocular Depth Estimation for Autonomous Driving	Ce Zhou et.al.	2411.00192	null
2024-10-31	ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images	Timing Yang et.al.	2410.24001	link
2024-10-30	Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe	Songyu Xu et.al.	2410.23154	null
2024-10-29	Active Event Alignment for Monocular Distance Estimation	Nan Cai et.al.	2410.22280	null
2024-10-29	PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting	Sunghwan Hong et.al.	2410.22128	link
2024-10-27	Unlocking Comics: The AI4VA Dataset for Visual Understanding	Peter Grönquist et.al.	2410.20459	link
2024-10-27	Depth Attention for Robust RGB Tracking	Yu Liu et.al.	2410.20395	link
2024-10-21	YOLO11 and Vision Transformers based 3D Pose Estimation of Immature Green Fruits in Commercial Apple Orchards for Robotic Thinning	Ranjan Sapkota et.al.	2410.19846	null
2024-10-25	MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors	Fanqi Pu et.al.	2410.19590	null
2024-10-24	Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction	Hongxin Peng et.al.	2410.18433	null
2024-10-24	Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images	Dong-Guw Lee et.al.	2410.18340	link
2024-10-25	UnCLe: Unsupervised Continual Learning of Depth Completion	Suchisrit Gangopadhyay et.al.	2410.18074	null
2024-10-21	TIPS: Text-Image Pretraining with Spatial Awareness	Kevis-Kokitsi Maninis et.al.	2410.16512	null
2024-10-22	DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain	Kun Wang et.al.	2410.14980	link
2024-10-17	DepthSplat: Connecting Gaussian Splatting and Depth	Haofei Xu et.al.	2410.13862	link
2024-10-16	DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning	Jiabao Wei et.al.	2410.12501	null
2024-10-16	Depth Estimation From Monocular Images With Enhanced Encoder-Decoder Architecture	Dabbrata Das et.al.	2410.11610	null
2024-10-16	CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction	Pranav Gupta et.al.	2410.11211	link
2024-10-14	When Does Perceptual Alignment Benefit Vision Representations?	Shobhita Sundaram et.al.	2410.10817	null
2024-10-14	Depth Any Video with Scalable Synthetic Data	Honghui Yang et.al.	2410.10815	link
2024-10-15	Improved Depth Estimation of Bayesian Neural Networks	Bart van Erp et.al.	2410.10395	link
2024-10-10	Color-Guided Flying Pixel Correction in Depth Images	Ekamresh Vasudevan et.al.	2410.08084	null
2024-10-09	Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models	Ange Lou et.al.	2410.07434	null
2024-10-09	Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation	Runze Chen et.al.	2410.06982	null
2024-10-09	Analysis of different disparity estimation techniques on aerial stereo image datasets	Ishan Narayan et.al.	2410.06711	null
2024-10-08	Vision Transformer based Random Walk for Group Re-Identification	Guoqing Zhang et.al.	2410.05808	null
2024-10-08	CUBE360: Learning Cubic Field Representation for Monocular 360 Depth Estimation for Virtual Reality	Wenjie Chang et.al.	2410.05735	null
2024-10-07	PhotoReg: Photometrically Registering 3D Gaussian Splatting Models	Ziwen Yuan et.al.	2410.05044	null
2024-10-10	Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy	Pengcheng Chen et.al.	2410.04041	null
2024-10-04	Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering	Laura Fink et.al.	2410.03861	null
2024-10-03	RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions	Ziyao Zeng et.al.	2410.02924	null
2024-10-02	Depth Pro: Sharp Monocular Metric Depth in Less Than a Second	Aleksei Bochkovskii et.al.	2410.02073	link
2024-10-10	Towards Full-parameter and Parameter-efficient Self-learning For Endoscopic Camera Depth Estimation	Shuting Zhao et.al.	2410.00979	null
2024-10-01	Radar Meets Vision: Robustifying Monocular Metric Depth Prediction for Mobile Robotics	Marco Job et.al.	2410.00736	null
2024-10-06	Drone Stereo Vision for Radiata Pine Branch Detection and Distance Measurement: Utilizing Deep Learning and YOLO Integration	Yida Lin et.al.	2410.00503	null
2024-10-01	Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance	Hongchao Shu et.al.	2410.00386	null
2024-09-30	CCDepth: A Lightweight Self-supervised Depth Estimation Network with Enhanced Interpretability	Xi Zhang et.al.	2409.19933	null
2024-09-30	EndoDepth: A Benchmark for Assessing Robustness in Endoscopic Depth Prediction	Ivan Reyes-Amezcua et.al.	2409.19930	link
2024-09-29	fCOP: Focal Length Estimation from Category-level Object Priors	Xinyue Zhang et.al.	2409.19641	null
2024-09-29	KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation	Soofiyan Atar et.al.	2409.19490	null
2024-09-27	Speckle-illumination spatial frequency domain imaging with a stereo laparoscope for profile-corrected optical property mapping	Anthony A. Song et.al.	2409.19153	null
2024-09-26	Self-supervised Monocular Depth Estimation with Large Kernel Attention	Xuezhi Xiang et.al.	2409.17895	null
2024-09-26	Self-Distilled Depth Refinement with Noisy Poisson Fusion	Jiaqi Li et.al.	2409.17880	null
2024-09-27	A New Dataset for Monocular Depth Estimation Under Viewpoint Shifts	Aurel Pjetri et.al.	2409.17851	null
2024-09-26	Event-based Stereo Depth Estimation: A Survey	Suman Ghosh et.al.	2409.17680	null
2024-09-26	CAMOT: Camera Angle-aware Multi-Object Tracking	Felix Limanta et.al.	2409.17533	null
2024-09-25	Optical Lens Attack on Deep Learning Based Monocular Depth Estimation	Ce Zhou et.al.	2409.17376	null
2024-09-25	Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation	Richard D. Paul et.al.	2409.17085	null
2024-09-25	EventHDR: from Event to High-Speed HDR Videos and Beyond	Yunhao Zou et.al.	2409.17029	null
2024-09-25	3DDX: Bone Surface Reconstruction from a Single Standard-Geometry Radiograph via Dual-Face Depth Estimation	Yi Gu et.al.	2409.16702	null
2024-09-24	MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling	Yifang Men et.al.	2409.16160	null
2024-09-24	Benchmarking Robustness of Endoscopic Depth Estimation with Synthetically Corrupted Data	An Wang et.al.	2409.16063	link
2024-09-23	FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera	Guoyang Zhao et.al.	2409.15054	link
2024-09-23	DepthART: Monocular Depth Estimation as Autoregressive Refinement Task	Bulat Gabdullin et.al.	2409.15010	null
2024-09-23	Generalizing monocular colonoscopy image depth estimation by uncertainty-based global and local fusion network	Sijia Du et.al.	2409.15006	null
2024-09-23	GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth	Aurélien Cecille et.al.	2409.14850	null
2024-09-23	Robust and Flexible Omnidirectional Depth Estimation with Multiple 360° Cameras	Ming Li et.al.	2409.14766	null
2024-09-18	Panoptic-Depth Forecasting	Juana Valeria Hurtado et.al.	2409.12008	null
2024-09-17	Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think	Gonzalo Martin Garcia et.al.	2409.11355	link
2024-09-15	GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion	Vitor Guizilini et.al.	2409.09896	null
2024-09-15	Towards Single-Lens Controllable Depth-of-Field Imaging via All-in-Focus Aberration Correction and Monocular Depth Estimation	Xiaolong Qian et.al.	2409.09754	link
2024-09-13	PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage	Denis Zavadski et.al.	2409.09144	link
2024-09-25	Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding	Rania Hossam et.al.	2409.08695	link
2024-09-12	Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor	Andrea Conti et.al.	2409.08277	null
2024-09-12	LED: Light Enhanced Depth Estimation at Night	Simon de Moreau et.al.	2409.08031	link
2024-09-12	Real-time Multi-view Omnidirectional Depth Estimation System for Robots and Autonomous Driving on Real Scenes	Ming Li et.al.	2409.07843	null
2024-09-12	Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy	Bojian Li et.al.	2409.07723	null
2024-09-12	FIReStereo: Forest InfraRed Stereo Dataset for UAS Depth Perception in Visually Degraded Environments	Devansh Dhrafani et.al.	2409.07715	null
2024-09-10	Deep Neural Networks: Multi-Classification and Universal Approximation	Martín Hernández et.al.	2409.06555	null
2024-09-10	EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation	Nischal Khanal et.al.	2409.06183	link
2024-09-11	EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels	Qingyao Tian et.al.	2409.05442	null
2024-09-09	Spontaneous magnetic field and disorder effects in BaPtAs_1-x_Sb_x_ with honeycomb network	T. Adachi et.al.	2409.05266	null
2024-09-08	TanDepth: Leveraging Global DEMs for Metric Monocular Depth Estimation in UAVs	Horatiu Florea et.al.	2409.05142	null
2024-09-12	Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective	Tim Bader et.al.	2409.04086	link
2024-09-08	Estimating Indoor Scene Depth Maps from Ultrasonic Echoes	Junpei Honma et.al.	2409.03336	null
2024-09-04	iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation	Hayeon Jo et.al.	2409.02838	null
2024-09-02	GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling	Huawei Sun et.al.	2409.02720	null
2024-09-04	Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects	Kyungmin Jo et.al.	2409.02653	null
2024-09-04	UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching	Soomin Kim et.al.	2409.02545	null
2024-09-04	SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction	Sumin Son et.al.	2409.02513	null
2024-09-04	Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation	Li Liu et.al.	2409.02494	null
2024-09-04	Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization	Cho-Ying Wu et.al.	2409.02486	null
2024-09-04	GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving	Huasong Han et.al.	2409.02382	null
2024-09-03	DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos	Wenbo Hu et.al.	2409.02095	null
2024-09-02	Large Language Models Can Understanding Depth from Monocular Images	Zhongyi Xia et.al.	2409.01133	null
2024-08-30	DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model	Mona Sheikh Zeinoddin et.al.	2408.17433	null
2024-08-30	Enhancing Underwater Imaging with 4-D Light Fields: Dataset and Method	Yuji Lin et.al.	2408.17339	null
2024-08-30	Synthetic Lunar Terrain: A Multimodal Open Dataset for Training and Evaluating Neuromorphic Vision Algorithms	Marcus Märtens et.al.	2408.16971	null
2024-08-29	EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More	Kanghao Chen et.al.	2408.16254	null
2024-08-30	Revisiting 360 Depth Estimation with PanoGabor: A New Fusion Perspective	Zhijie Shen et.al.	2408.16227	link
2024-08-27	Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack	Naufal Suryanto et.al.	2408.14879	null
2024-08-26	NimbleD: Enhancing Self-supervised Monocular Depth Estimation with Pseudo-labels and Large-scale Video Pre-training	Albert Luginov et.al.	2408.14177	null
2024-08-26	Pixel-Aligned Multi-View Generation with Depth Guided Decoder	Zhenggang Tang et.al.	2408.14016	null
2024-08-25	TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers	Chuanrui Zhang et.al.	2408.13770	null
2024-08-25	InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth	Cho-Ying Wu et.al.	2408.13708	null
2024-08-25	SeeBelow: Sub-dermal 3D Reconstruction of Tumors with Surgical Robotic Palpation and Tactile Exploration	Raghava Uppuluri et.al.	2408.13699	null
2024-08-27	Sapiens: Foundation for Human Vision Models	Rawal Khirodkar et.al.	2408.12569	null
2024-08-21	LiFCal: Online Light Field Camera Calibration via Bundle Adjustment	Aymeric Fleith et.al.	2408.11682	null
2024-08-19	Structure-preserving Image Translation for Depth Estimation in Colonoscopy Video	Shuxian Wang et.al.	2408.10153	null
2024-08-19	SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition	Wiktor Mucha et.al.	2408.10037	link
2024-08-19	P3P: Pseudo-3D Pre-training for Scaling 3D Masked Autoencoders	Xuechao Chen et.al.	2408.10007	null
2024-08-14	Enhanced Scale-aware Depth Estimation for Monocular Endoscopic Scenes with Geometric Modeling	Ruofeng Wei et.al.	2408.07266	null
2024-08-12	Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces	Junrui Zhang et.al.	2408.06083	null
2024-08-08	Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation	Daniele Rege Cambrin et.al.	2408.04523	link
2024-08-08	Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework	Subhasis Dasgupta et.al.	2408.04360	null
2024-08-08	Design and Implementation of Smart Infrastructures and Connected Vehicles in A Mini-city Platform	Daniel Vargas et.al.	2408.04195	null
2024-08-07	Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach	Benedikt W. Hosp et.al.	2408.03591	null
2024-08-06	BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications	G. Manni et.al.	2408.03078	link
2024-08-05	Gaussian Mixture based Evidential Learning for Stereo Matching	Weide Liu et.al.	2408.02796	null
2024-08-05	Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining	Dongyang Liu et.al.	2408.02657	link
2024-08-03	MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas	Feng Qiao et.al.	2408.01653	null
2024-08-02	Self-Supervised Depth Estimation Based on Camera Models	Jinchang Zhang et.al.	2408.01565	null
2024-08-01	MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection	Youjia Fu et.al.	2408.00438	null
2024-08-01	High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior	Wencheng Han et.al.	2408.00361	null
2024-07-31	Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching	Pengjie Zhang et.al.	2407.21735	null
2024-07-29	BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation	Kieran Saunders et.al.	2407.20437	null
2024-07-29	Analysis and Improvement of Rank-Ordered Mean Algorithm in Single-Photon LiDAR	William C. Yau et.al.	2407.20399	null
2024-07-29	Improving 2D Feature Representations by 3D-Aware Fine-Tuning	Yuanwen Yue et.al.	2407.20229	null
2024-07-27	Revisit Self-supervised Depth Estimation with Local Structure-from-Motion	Shengjie Zhu et.al.	2407.19166	null
2024-07-27	RePLAy: Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry	Shengjie Zhu et.al.	2407.19154	null
2024-07-26	HybridDepth: Robust Depth Fusion for Mobile AR by Leveraging Depth from Focus and Single-Image Priors	Ashkan Ganj et.al.	2407.18443	link
2024-07-26	Enhanced Depth Estimation and 3D Geometry Reconstruction using Bayesian Helmholtz Stereopsis with Belief Propagation	Razieh Azizi et.al.	2407.18195	null
2024-07-25	BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation	Xiang Zhang et.al.	2407.17952	null
2024-07-25	UMono: Physical Model Informed Hybrid CNN-Transformer Framework for Underwater Monocular Depth Estimation	Jian Wang et.al.	2407.17838	null
2024-07-24	DarSwin-Unet: Distortion Aware Encoder-Decoder Architecture	Akshaya Athwale et.al.	2407.17328	null
2024-07-24	Physical Adversarial Attack on Monocular Depth Estimation via Shape-Varying Patches	Chenxing Zhao et.al.	2407.17312	null
2024-07-23	SINDER: Repairing the Singular Defects of DINOv2	Haoqi Wang et.al.	2407.16826	link
2024-07-23	Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions	Fabio Tosi et.al.	2407.16698	link
2024-07-23	ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation	Zhenhua Wu et.al.	2407.16508	null
2024-07-19	Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation	Jinfeng Liu et.al.	2407.14126	link
2024-07-18	Unveiling the purely young star formation history of the SMC's northeastern shell from colour-magnitude diagram fitting	Joanna D. Sakowska et.al.	2407.13876	null
2024-07-18	Many Perception Tasks are Highly Redundant Functions of their Input Data	Rahul Ramesh et.al.	2407.13841	null
2024-07-18	Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks	Antoni Kowalczuk et.al.	2407.12588	link
2024-07-16	Temporally Consistent Stereo Matching	Jiaxi Zeng et.al.	2407.11950	link
2024-07-15	IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation	Yuanhao Zhai et.al.	2407.10937	link
2024-07-15	OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection	Jinghua Hou et.al.	2407.10753	link
2024-07-15	Towards Scale-Aware Full Surround Monodepth with Transformers	Yuchen Yang et.al.	2407.10406	null
2024-07-12	ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion	Sungmin Woo et.al.	2407.09303	link
2024-07-11	ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation	Ruijie Zhu et.al.	2407.08187	link
2024-07-10	Controlling Space and Time with Diffusion Models	Daniel Watson et.al.	2407.07860	null
2024-07-07	SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning	Yi Feng et.al.	2407.05283	link
2024-07-05	A Physical Model-Guided Framework for Underwater Image Enhancement and Depth Estimation	Dazhao Du et.al.	2407.04230	null
2024-07-04	Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation	Laiyan Ding et.al.	2407.04041	null
2024-07-02	Parametric Modeling and Estimation of Photon Registrations for 3D Imaging	Weijian Zhang et.al.	2407.02712	null
2024-07-02	Depth-Aware Endoscopic Video Inpainting	Francis Xiatian Zhang et.al.	2407.02675	link
2024-07-04	Camera-LiDAR Cross-modality Gait Recognition	Wenxuan Guo et.al.	2407.02038	null
2024-07-07	CaFNet: A Confidence-Driven Framework for Radar Camera Depth Estimation	Huawei Sun et.al.	2407.00697	link
2024-06-28	Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey	Uchitha Rajapaksha et.al.	2406.19675	null
2024-07-05	360 in the Wild: Dataset for Depth Prediction and View Synthesis	Kibaek Park et.al.	2406.18898	null
2024-06-27	Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach	Yuxiang Huang et.al.	2406.18837	null
2024-06-26	DoubleTake: Geometry Guided Depth Estimation	Mohamed Sayed et.al.	2406.18387	null
2024-06-25	Depth-Guided Semi-Supervised Instance Segmentation	Xin Chen et.al.	2406.17413	null
2024-06-20	Uncertainty and Self-Supervision in Single-View Depth	Javier Rodriguez-Puigvert et.al.	2406.14226	null
2024-06-19	WaterMono: Teacher-Guided Anomaly Masking and Enhancement Boosting for Robust Underwater Self-Supervised Monocular Depth Estimation	Yilin Ding et.al.	2406.13344	link
2024-06-18	Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation	Ning-Hsu Wang et.al.	2406.12849	null
2024-06-21	GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models	Yongtao Ge et.al.	2406.12671	link
2024-06-17	DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features	Letian Wang et.al.	2406.12095	null
2024-06-17	MEDeA: Multi-view Efficient Depth Adjustment	Mikhail Artemyev et.al.	2406.12048	null
2024-06-16	3D Gaze Tracking for Studying Collaborative Interactions in Mixed-Reality Environments	Eduardo Davalos et.al.	2406.11003	null
2024-06-15	GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR	Bharat Singh et.al.	2406.10722	null
2024-06-14	The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences	Bria Long et.al.	2406.10447	null
2024-06-14	D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video	Moritz Kappel et.al.	2406.10078	null
2024-06-14	DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications	Li Li et.al.	2406.10068	link
2024-06-14	Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion	Runze Liu et.al.	2406.09782	null
2024-06-13	Depth Anything V2	Lihe Yang et.al.	2406.09414	null
2024-06-14	WonderWorld: Interactive 3D Scene Generation from a Single Image	Hong-Xing Yu et.al.	2406.09394	null
2024-06-13	Scale-Invariant Monocular Depth Estimation via SSI Depth	S. Mahdi H. Miangoleh et.al.	2406.09374	null
2024-06-13	Multiple Prior Representation Learning for Self-Supervised Monocular Depth Estimation via Hybrid Transformer	Guodong Sun et.al.	2406.08928	link
2024-06-13	ToSA: Token Selective Attention for Efficient Vision Transformers	Manish Kumar Singh et.al.	2406.08816	null
2024-06-11	Back to the Color: Learning Depth to Specific Color Transformation for Unsupervised Depth Estimation	Yufan Zhu et.al.	2406.07741	link
2024-06-11	PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow	Joshua Tokarsky et.al.	2406.07667	null
2024-06-11	RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks	Zhechao Wang et.al.	2406.07032	null
2024-06-10	PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation	Zhenyu Li et.al.	2406.06679	null
2024-06-09	Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks	Zhiyuan Cheng et.al.	2406.05857	link
2024-06-09	RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering	Rui Zhang et.al.	2406.05852	null
2024-06-07	Normal-guided Detail-Preserving Neural Implicit Functions for High-Fidelity 3D Surface Reconstruction	Aarya Patel et.al.	2406.04861	null
2024-06-07	UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection	Yuchao Wang et.al.	2406.04647	null
2024-06-06	MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation	Ionuţ Grigore et.al.	2406.04532	null
2024-06-06	Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image	Stanislaw Szymanowicz et.al.	2406.04343	null
2024-06-06	Neural Surface Reconstruction from Sparse Views Using Epipolar Geometry	Kaichen Zhou et.al.	2406.04301	null
2024-06-04	VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors	Markus Plack et.al.	2406.02552	null
2024-06-03	L-MAGIC: Language Model Assisted Generation of Images with Coherence	Zhipeng Cai et.al.	2406.01843	link
2024-06-04	Learning Temporally Consistent Video Depth from Video Diffusion Priors	Jiahao Shao et.al.	2406.01493	null
2024-06-03	Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry	Takayuki Kanai et.al.	2406.00929	null
2024-06-01	MoDGS: Dynamic Gaussian Splatting from Causually-captured Monocular Videos	Qingming Liu et.al.	2406.00434	null
2024-05-30	Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian	Wei Sun et.al.	2405.19657	null
2024-05-28	Hybrid Multi-Head Physics-informed Neural Network for Depth Estimation in Terahertz Imaging	Mingjun Xiang et.al.	2405.18317	null
2024-05-27	Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation	Amir El-Ghoussani et.al.	2405.17704	null
2024-05-27	Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving	Shaoyuan Xie et.al.	2405.17426	link
2024-05-27	All-day Depth Completion	Vadim Ezhov et.al.	2405.17315	null
2024-05-27	GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping	Junyoung Seo et.al.	2405.17251	null
2024-05-27	SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing	Yong-Qiang Mao et.al.	2405.17140	null
2024-05-27	DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge	Yifan Mao et.al.	2405.17102	null
2024-05-27	Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation	Steven Landgraf et.al.	2405.17097	null
2024-05-27	DCPI-Depth: Explicitly Infusing Dense Correspondence Prior to Unsupervised Monocular Depth Estimation	Mengtan Zhang et.al.	2405.16960	null
2024-05-27	ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection	Ziying Song et.al.	2405.16873	null
2024-05-27	Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations	Jingguo Liu et.al.	2405.16858	null
2024-05-26	Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians	Erik Sandström et.al.	2405.16544	null
2024-05-24	Transparent Object Depth Completion	Yifan Zhou et.al.	2405.15299	null
2024-05-24	MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method	Pan Liao et.al.	2405.15176	null
2024-05-23	EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian Splatting	Jiaxu Wang et.al.	2405.14959	link
2024-05-23	Ghost-Stereo: GhostNet-based Cost Volume Enhancement and Aggregation for Stereo Matching Networks	Xingguang Jiang et.al.	2405.14520	null
2024-05-23	Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning	Zhenyu Wei et.al.	2405.14195	null
2024-05-21	Cross-spectral Gated-RGB Stereo Depth Estimation	Samuel Brucker et.al.	2405.12759	null
2024-05-20	Depth Reconstruction with Neural Signed Distance Fields in Structured Light Systems	Rukun Qiao et.al.	2405.12006	null
2024-05-20	Depth Prompting for Sensor-Agnostic Depth Estimation	Jin-Hwi Park et.al.	2405.11867	null
2024-05-19	CRF360D: Monocular 360 Depth Estimation via Spherical Fully-Connected CRFs	Zidong Cao et.al.	2405.11564	null
2024-05-18	Dusk Till Dawn: Self-supervised Nighttime Stereo Depth Estimation using Visual Foundation Models	Madhu Vankadari et.al.	2405.11158	link
2024-05-17	FA-Depth: Toward Fast and Accurate Self-supervised Monocular Depth Estimation	Fei Wang et.al.	2405.10885	link
2024-05-17	Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory	Jonas Kälble et.al.	2405.10575	link
2024-05-16	Towards Task-Compatible Compressible Representations	Anderson de Andrade et.al.	2405.10244	link
2024-05-16	KPNDepth: Depth Estimation of Lane Images under Complex Rainy Environment	Zhengxu Shi et.al.	2405.09964	null
2024-05-14	CLIP with Quality Captions: A Strong Pretraining for Vision Tasks	Pavan Kumar Anasosalu Vasu et.al.	2405.08911	null

(back to top)

Semactic Segmentation

Publish Date	Title	Authors	PDF	Code
2024-11-21	Revisiting the Integration of Convolution and Attention for Vision Backbone	Lei Zhu et.al.	2411.14429	link
2024-11-21	CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation	Lin Sun et.al.	2411.13836	link
2024-11-21	Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals	Hussni Mohd Zakir et.al.	2411.13774	null
2024-11-20	FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting	Ola Shorinwa et.al.	2411.13753	null
2024-11-20	BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation	Umamaheswaran Raman Kumar et.al.	2411.13251	null
2024-11-20	XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation	Ziyi Wang et.al.	2411.13243	link
2024-11-20	Automating Sonologists USG Commands with AI and Voice Interface	Emad Mohamed et.al.	2411.13006	null
2024-11-19	A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation	Jiaqi Yang et.al.	2411.12615	link
2024-11-19	SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation	Ron Keuth et.al.	2411.12602	link
2024-11-19	ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator	Xiao Jiang et.al.	2411.12250	null
2024-11-18	ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements	M. Arda Aydın et.al.	2411.12044	link
2024-11-18	Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation	Hanieh Shojaei Miandashti et.al.	2411.11935	null
2024-11-18	MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models	Harshita Sharma et.al.	2411.11362	null
2024-11-18	Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications	Scarlett Raine et.al.	2411.11287	null
2024-11-16	Attention-based U-Net Method for Autonomous Lane Detection	Mohammadhamed Tangestanizadeh et.al.	2411.10902	null
2024-11-16	Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation	Jaisidh Singh et.al.	2411.10845	null
2024-11-19	Diffusion-Based Semantic Segmentation of Lumbar Spine MRI Scans of Lower Back Pain Patients	Maria Monzon et.al.	2411.10755	link
2024-11-15	Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images	Ammar Qammaz et.al.	2411.10334	null
2024-11-15	CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation	Dengke Zhang et.al.	2411.10086	null
2024-11-14	OneNet: A Channel-Wise 1D Convolutional U-Net	Sanghyun Byun et.al.	2411.09838	link
2024-11-14	Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks	Zengyi Yang et.al.	2411.09387	null
2024-11-14	Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation	Yuheng Shi et.al.	2411.09219	link
2024-11-14	Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery	Ashim Dahal et.al.	2411.09101	link
2024-11-13	CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation	Xuming Zhang et.al.	2411.09023	null
2024-11-14	Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation	Yangyang Li et.al.	2411.08756	null
2024-11-13	Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model	Jun Xie et.al.	2411.08592	null
2024-11-12	Isometric Transformations for Image Augmentation in Mueller Matrix Polarimetry	Christopher Hahne et.al.	2411.07918	link
2024-11-12	Semantic segmentation on multi-resolution optical and microwave data using deep learning	Jai G Singla et.al.	2411.07581	null
2024-11-11	SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation	Jiale Chen et.al.	2411.06991	null
2024-11-14	Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision	Yueyang Cang et.al.	2411.06727	null
2024-11-10	Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments	Deegan Atha et.al.	2411.06632	null
2024-11-09	Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing	Kaixuan Lu et.al.	2411.06091	null
2024-11-08	Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model	Shuchang Lyu et.al.	2411.05878	link
2024-11-08	Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation	Sien Li et.al.	2411.05307	link
2024-11-07	In the Era of Prompt Learning with Vision-Language Models	Ankit Jha et.al.	2411.04892	null
2024-11-11	ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset	Olaf Wysocki et.al.	2411.04865	link
2024-11-06	Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts	Zhitong Gao et.al.	2411.03829	link
2024-11-06	Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model	Yansong Qu et.al.	2411.03672	null
2024-11-05	Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation	Zhiling Yue et.al.	2411.03551	null
2024-11-05	SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture	Andrew Heschl et.al.	2411.03505	link
2024-11-05	Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need	Qishuai Wen et.al.	2411.03033	link
2024-11-05	Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation	Xavier Timoneda et.al.	2411.02969	null
2024-11-05	Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery	Mohammad Kakooei et.al.	2411.02935	null
2024-11-05	CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation	Jinchao Ge et.al.	2411.02715	null
2024-11-04	Deep Learning on 3D Semantic Segmentation: A Detailed Review	Thodoris Betsas et.al.	2411.02104	null
2024-11-04	Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models	Sharat Agarwal et.al.	2411.01925	null
2024-11-04	DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability	Bo Gao et.al.	2411.01819	null
2024-11-04	Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations	Thanh Nguyen Canh et.al.	2411.01816	null
2024-11-03	PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation	Xinyu Xu et.al.	2411.01624	null
2024-11-01	Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions	Lixiao Yang et.al.	2411.01039	null
2024-11-01	Event-guided Low-light Video Semantic Segmentation	Zhen Yao et.al.	2411.00639	null
2024-11-01	Cross-modal semantic segmentation for indoor environmental perception using single-chip millimeter-wave radar raw data	Hairuo Hu et.al.	2411.00499	null
2024-11-01	Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing	Naufal Suryanto et.al.	2411.00425	link
2024-10-31	A Recipe for Geometry-Aware 3D Mesh Transformers	Mohammad Farazi et.al.	2411.00164	null
2024-10-31	Federated Black-Box Adaptation for Semantic Segmentation	Jay N. Paranjape et.al.	2410.24181	null
2024-10-31	COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes	Muhammad Ali et.al.	2410.24139	link
2024-10-31	Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model	Hao Zhang et.al.	2410.23905	link
2024-10-30	S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving	Maciej K. Wozniak et.al.	2410.23085	null
2024-10-31	CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation	Ziyang Gong et.al.	2410.22629	link
2024-10-29	Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation	Zhaochong An et.al.	2410.22489	null
2024-10-29	Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation	Jintao Tong et.al.	2410.22135	null
2024-10-29	Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models	Imad Ali Shah et.al.	2410.22101	null
2024-10-29	Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation	Ruihao Xia et.al.	2410.21708	link
2024-10-28	Domain Adaptation with a Single Vision-Language Embedding	Mohammad Fahes et.al.	2410.21361	null
2024-10-28	IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks	Manjunath D et.al.	2410.20953	null
2024-10-27	A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models	Camilo Espinosa-Curilem et.al.	2410.20595	link
2024-10-27	Unlocking Comics: The AI4VA Dataset for Visual Understanding	Peter Grönquist et.al.	2410.20459	link
2024-10-27	Historical Test-time Prompt Tuning for Vision Foundation Models	Jingyi Zhang et.al.	2410.20346	null
2024-10-25	OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery	Philipe Dias et.al.	2410.19965	null
2024-10-25	IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation	Kaixian Qu et.al.	2410.19697	null
2024-10-25	Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation	Yao Wu et.al.	2410.19446	link
2024-10-25	Context-Based Visual-Language Place Recognition	Soojin Woo et.al.	2410.19341	link
2024-10-24	Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks	Alexander Jaus et.al.	2410.18684	null
2024-10-24	Unsupervised semantic segmentation of urban high-density multispectral point clouds	Oona Oinonen et.al.	2410.18520	null
2024-10-26	CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator	Stefanos Pasios et.al.	2410.18238	null
2024-10-23	Towards Safer Planetary Exploration: A Hybrid Architecture for Terrain Traversability Analysis in Mars Rovers	Achille Chiuchiarelli et.al.	2410.17738	null
2024-10-22	EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding	Zhiyi Pan et.al.	2410.17207	null
2024-10-22	SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments	Jumman Hossain et.al.	2410.16686	null
2024-10-21	TIPS: Text-Image Pretraining with Spatial Awareness	Kevis-Kokitsi Maninis et.al.	2410.16512	null
2024-10-21	GenGMM: Generalized Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation	Nazanin Moradinasab et.al.	2410.16485	null
2024-10-21	LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training	Thomas Kreutz et.al.	2410.15833	link
2024-10-21	TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight	Hyun-Kurl Jang et.al.	2410.15674	link
2024-10-21	Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications	Jintao Ren et.al.	2410.15584	null
2024-10-22	Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation	Fnu Neha et.al.	2410.15472	null
2024-10-18	On the Influence of Shape, Texture and Color for Learning Semantic Segmentation	Annika Mütze et.al.	2410.14878	null
2024-10-18	Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+	Arpan Mahara et.al.	2410.14836	null
2024-10-17	ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding	Guangda Ji et.al.	2410.13924	null
2024-10-17	Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks	Clément Playout et.al.	2410.13822	link
2024-10-22	EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything	Joonhyeon Song et.al.	2410.13621	link
2024-10-17	Day-Night Adaptation: An Innovative Source-free Adaptation Framework for Medical Image Segmentation	Ziyang Chen et.al.	2410.13472	null
2024-10-17	SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation in Remote Sensing	Bin Wang et.al.	2410.13471	link
2024-10-17	Railway LiDAR semantic segmentation based on intelligent semi-automated data annotation	Florian Wulff et.al.	2410.13383	null
2024-10-17	Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation	Houze Liu et.al.	2410.13099	null
2024-10-16	Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation	Wenbo Xu et.al.	2410.13094	null
2024-10-16	Risk Assessment for Autonomous Landing in Urban Environments using Semantic Segmentation	Jesús Alejandro Loera-Ponce et.al.	2410.12988	null
2024-10-16	VividMed: Vision Language Model with Versatile Visual Grounding for Medicine	Lingxiao Luo et.al.	2410.12694	link
2024-10-16	Cascade learning in multi-task encoder-decoder networks for concurrent bone segmentation and glenohumeral joint assessment in shoulder CT scans	Luca Marsilio et.al.	2410.12641	null
2024-10-16	SAM-Guided Masked Token Prediction for 3D Scene Understanding	Zhimin Chen et.al.	2410.12158	null
2024-10-15	WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation	Chenghao Qian et.al.	2410.12075	null
2024-10-15	Development and Testing of a Wood Panels Bark Removal Equipment Based on Deep Learning	Rijun Wang et.al.	2410.11913	null
2024-10-15	RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation	Anton Antonov et.al.	2410.11722	link
2024-10-15	InvSeg: Test-Time Prompt Inversion for Semantic Segmentation	Jiayi Lin et.al.	2410.11473	null
2024-10-15	MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation	Xianping Ma et.al.	2410.11160	link
2024-10-14	Locality Alignment Improves Vision-Language Models	Ian Covert et.al.	2410.11087	null
2024-10-14	Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes	Tim Broedermann et.al.	2410.10791	null
2024-10-14	UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation	Lihe Yang et.al.	2410.10777	link
2024-10-14	Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation	Daniel Fusaro et.al.	2410.10510	link
2024-10-14	LKASeg:Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections	Xuezhi Xiang et.al.	2410.10433	null
2024-10-14	V2M: Visual 2-Dimensional Mamba for Image Representation Learning	Chengkun Wang et.al.	2410.10382	link
2024-10-14	GlobalMamba: Global Image Serialization for Vision Mamba	Chengkun Wang et.al.	2410.10316	link
2024-10-13	AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model	Yuchen Li et.al.	2410.09714	null
2024-10-12	An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation	Wei Liang et.al.	2410.09443	null
2024-10-11	Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation	Varduhi Yeghiazaryan et.al.	2410.08946	null
2024-10-11	Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation	Hanieh Shojaei et.al.	2410.08687	null
2024-10-11	DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention	Nguyen Huu Bao Long et.al.	2410.08582	link
2024-10-10	Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving?	Samir Abou Haidar et.al.	2410.08365	null
2024-10-10	Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation	Zhiyi Pan et.al.	2410.08091	null
2024-10-10	Shift and matching queries for video semantic segmentation	Tsubasa Mizuno et.al.	2410.07635	null
2024-10-10	3D Vision-Language Gaussian Splatting	Qucheng Peng et.al.	2410.07577	null
2024-10-11	Bridge the Points: Graph-based Few-shot Segment Anything Semantically	Anqi Zhang et.al.	2410.06964	null
2024-10-09	Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation	Seungho Lee et.al.	2410.06893	null
2024-10-09	Rethinking the Evaluation of Visible and Infrared Image Fusion	Dayan Guan et.al.	2410.06811	link
2024-10-10	QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model	Fei Xie et.al.	2410.06806	link
2024-10-09	Transesophageal Echocardiography Generation using Anatomical Models	Emmanuel Oladokun et.al.	2410.06781	null
2024-10-09	Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy	Qinfeng Zhu et.al.	2410.06725	null
2024-10-09	Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments	Meng Yu et.al.	2410.06626	null
2024-10-09	Towards Natural Image Matting in the Wild via Real-Scenario Prior	Ruihao Xia et.al.	2410.06593	link
2024-10-08	Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions	Mateus Karvat et.al.	2410.06380	null
2024-10-08	Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading	Fang Gao et.al.	2410.05762	null
2024-10-07	Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation	Vince Zhu et.al.	2410.04689	null
2024-10-04	SpecSAR-Former: A Lightweight Transformer-based Network for Global LULC Mapping Using Integrated Sentinel-1 and Sentinel-2	Hao Yu et.al.	2410.03962	null
2024-10-04	Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features	Benyuan Meng et.al.	2410.03558	link
2024-10-04	Semantic Segmentation Based Quality Control of Histopathology Whole Slide Images	Abhijeet Patil et.al.	2410.03289	link
2024-10-04	HRVMamba: High-Resolution Visual State Space Model for Dense Prediction	Hao Zhang et.al.	2410.03174	null
2024-10-03	HiFiSeg: High-Frequency Information Enhanced Polyp Segmentation with Global-Local Vision Transformer	Jingjing Ren et.al.	2410.02528	null
2024-10-04	Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation	Muzhi Zhu et.al.	2410.02369	null
2024-10-03	RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds	Remco Royen et.al.	2410.02323	null
2024-10-03	Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network	Yangyang Qiu et.al.	2410.02224	null
2024-10-03	Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images	Qingyuan Liu et.al.	2410.02207	null
2024-10-02	SegEarth-OV: Towards Traning-Free Open-Vocabulary Segmentation for Remote Sensing Images	Kaiyu Li et.al.	2410.01768	link
2024-10-02	One-Shot Robust Imitation Learning for Long-Horizon Visuomotor Tasks from Unsegmented Demonstrations	Shaokang Wu et.al.	2410.01630	null
2024-10-02	Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation	Zhaofeng Shi et.al.	2410.01341	null
2024-10-02	VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings	Andrea Carrara et.al.	2410.01336	null
2024-10-01	RobustEMD: Domain Robust Matching for Cross-domain Few-shot Medical Image Segmentation	Yazhou Zhu et.al.	2410.01110	null
2024-10-01	Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer	Vlatko Spasev et.al.	2410.01092	null
2024-10-01	Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time	Chiao-An Yang et.al.	2410.01083	link
2024-10-01	DeepAerialMapper: Deep Learning-based Semi-automatic HD Map Creation for Highly Automated Vehicles	Robert Krajewski et.al.	2410.00769	null
2024-10-01	Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection	Pengxi Zeng et.al.	2410.00582	null
2024-10-01	Precise Workcell Sketching from Point Clouds Using an AR Toolbox	Krzysztof Zieliński et.al.	2410.00479	null
2024-09-30	AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation	Boyu Han et.al.	2409.20398	null
2024-09-30	Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation	Tillmann Rheude et.al.	2409.20287	link
2024-09-30	Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model	Fulong Ma et.al.	2409.20164	null
2024-09-30	Segmenting Wood Rot using Computer Vision Models	Roland Kammerbauer et.al.	2409.20137	null
2024-09-30	Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels	Heeseong Shin et.al.	2409.19846	null
2024-09-27	Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation	Raphael Hagmanns et.al.	2409.18788	null
2024-09-27	Learning from Pattern Completion: Self-supervised Controllable Generation	Zhiqiang Chen et.al.	2409.18694	link
2024-09-27	Reducing Semantic Ambiguity In Domain Adaptive Semantic Segmentation Via Probabilistic Prototypical Pixel Contrast	Xiaoke Hao et.al.	2409.18543	link
2024-10-01	Get It For Free: Radar Segmentation without Expert Labels and Its Application in Odometry and Localization	Siru Li et.al.	2409.18434	null
2024-09-26	Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning	Siyi Lu et.al.	2409.17659	null
2024-09-26	Global-Local Medical SAM Adaptor Based on Full Adaption	Meng Wang et.al.	2409.17486	null
2024-09-25	VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection	Liangyu Zhong et.al.	2409.17330	null
2024-09-25	2024 BRAVO Challenge Track 1 1st Place Report: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation	Tommie Kerssies et.al.	2409.17208	link
2024-09-25	WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks	Alberto Bacchin et.al.	2409.16999	link
2024-09-25	Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis	Illia Tsiporenko et.al.	2409.16940	null
2024-09-24	A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation	Avisha Kumar et.al.	2409.16441	null
2024-09-24	Instance Segmentation of Reinforced Concrete Bridges with Synthetic Point Clouds	Asad Ur Rahman et.al.	2409.16381	null
2024-09-24	Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation	Hannah Kerner et.al.	2409.16252	link
2024-09-24	Deep Learning for Precision Agriculture: Post-Spraying Evaluation and Deposition Estimation	Harry Rogers et.al.	2409.16213	link
2024-09-24	Potential Field as Scene Affordance for Behavior Change-Based Visual Risk Object Identification	Pang-Yuan Pao et.al.	2409.15846	null
2024-09-24	DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation	Soojin Jang et.al.	2409.15801	null
2024-09-24	Autonomous Hiking Trail Navigation via Semantic Segmentation and Geometric Analysis	Camndon Reed et.al.	2409.15671	null
2024-09-23	ZeroSCD: Zero-Shot Street Scene Change Detection	Shyam Sundar Kannan et.al.	2409.15255	null
2024-09-17	Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks	Edgar Heinert et.al.	2409.11373	null
2024-09-17	MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping	Amirreza Fateh et.al.	2409.11316	link
2024-09-17	Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark	Clifford Broni-Bediako et.al.	2409.11227	link
2024-09-17	HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios	Nick Theisen et.al.	2409.11205	link
2024-09-16	Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning	Amin Karimi Monsefi et.al.	2409.10362	null
2024-09-16	BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images	Wentao Wang et.al.	2409.10269	null
2024-09-15	Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation	Zhanteng Xie et.al.	2409.09899	null
2024-09-15	Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation	Qilong Zhangli et.al.	2409.09893	null
2024-09-15	High Definition Map Mapping and Update: A General Overview and Future Directions	Benny Wijaya et.al.	2409.09726	null
2024-09-14	Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation	Hugo Porta et.al.	2409.09497	null
2024-09-13	AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation	Zechao Sun et.al.	2409.08516	null
2024-09-13	VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation	Ezra MacDonald et.al.	2409.08461	link
2024-09-12	Bayesian Self-Training for Semi-Supervised 3D Segmentation	Ozan Unal et.al.	2409.08102	null
2024-09-12	Depth Matters: Exploring Deep Interactions of RGB-D for Semantic Segmentation in Traffic Scenes	Siyu Chen et.al.	2409.07995	null
2024-09-12	SURGIVID: Annotation-Efficient Surgical Video Object Discovery	Çağhan Köksal et.al.	2409.07801	null
2024-09-12	Lagrange Duality and Compound Multi-Attention Transformer for Semi-Supervised Medical Image Segmentation	Fuchen Zheng et.al.	2409.07793	link
2024-09-12	ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation	Fuchen Zheng et.al.	2409.07779	link
2024-09-12	Open-Vocabulary Remote Sensing Image Semantic Segmentation	Qinglong Cao et.al.	2409.07683	null
2024-09-11	Token Turing Machines are Efficient Vision Models	Purvish Jajal et.al.	2409.07613	null
2024-09-11	AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution	Wangduo Xie et.al.	2409.07171	null
2024-09-11	Brain-Inspired Stepwise Patch Merging for Vision Transformers	Yonghao Yu et.al.	2409.06963	null
2024-09-10	Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds	Mu Cai et.al.	2409.06827	link
2024-09-10	A Semantic Segmentation Approach on Sweet Orange Leaf Diseases Detection Utilizing YOLO	Sabit Ahamed Preanto et.al.	2409.06671	null
2024-09-10	PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation	Yin Hu et.al.	2409.06309	null
2024-09-10	EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation	Nischal Khanal et.al.	2409.06183	link
2024-09-09	SVS-GAN: Leveraging GANs for Semantic Video Synthesis	Khaled M. Seyam et.al.	2409.06074	null
2024-09-09	Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance	Quang-Huy Che et.al.	2409.06002	null
2024-09-09	Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features	Jacob Gildenblat et.al.	2409.05697	null
2024-09-09	ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions	Furqan Ahmed Shaik et.al.	2409.05327	null
2024-09-08	RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network	Zhiwei Lin et.al.	2409.04979	null
2024-09-06	Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation	Björn Michele et.al.	2409.04409	link
2024-09-05	Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution	Marga Don et.al.	2409.03754	link
2024-09-05	LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones	Moritz Nottebaum et.al.	2409.03460	link
2024-09-05	Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications	Tong Bu et.al.	2409.03368	null
2024-09-05	UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking	Md. Mahfuzur Rahman et.al.	2409.03245	null
2024-09-05	Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation	Xixi Jiang et.al.	2409.03228	link
2024-09-06	iSeg: An Iterative Refinement-based Framework for Training-free Segmentation	Lin Sun et.al.	2409.03209	link
2024-09-04	iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation	Hayeon Jo et.al.	2409.02838	null
2024-09-04	CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation	Minhee Cho et.al.	2409.02699	null
2024-09-04	SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction	Sumin Son et.al.	2409.02513	null
2024-09-03	K-Origins: Better Colour Quantification for Neural Networks	Lewis Mason et.al.	2409.02281	link
2024-09-03	AllWeatherNet:Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions	Chenghao Qian et.al.	2409.02045	null
2024-09-03	Segmenting Object Affordances: Reproducibility and Sensitivity to Scale	Tommaso Apicella et.al.	2409.01814	link
2024-09-03	Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation	Haodong Wang et.al.	2409.01662	null
2024-09-02	Semantic Segmentation from Image Labels by Reconstruction from Structured Decomposition	Xuanrui Zeng et.al.	2409.01472	link
2024-09-02	SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation	Alberto Bacchin et.al.	2409.01109	link
2024-09-02	Towards Robust Online Domain Adaptive Semantic Segmentation under Adverse Weather Conditions	Taorong Liu et.al.	2409.01072	null
2024-08-30	Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes	Li Zhang et.al.	2408.17421	link
2024-08-30	Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations	Ahmed Hammam et.al.	2408.17311	null
2024-08-30	Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training	Zizheng Huang et.al.	2408.17081	link
2024-08-30	Transient Fault Tolerant Semantic Segmentation for Autonomous Driving	Leonardo Iurada et.al.	2408.16952	link
2024-08-29	SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection	Rohit Venkata Sai Dulam et.al.	2408.16645	null
2024-08-29	MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation	Linyan Yang et.al.	2408.16478	null
2024-08-29	Multi-source Domain Adaptation for Panoramic Semantic Segmentation	Jing Jiang et.al.	2408.16469	null
2024-08-29	EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More	Kanghao Chen et.al.	2408.16254	null
2024-08-28	SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors	Zhiqing Zhang et.al.	2408.15887	null
2024-08-28	DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries	Yu Yang et.al.	2408.15813	null
2024-08-28	TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation	Junbao Zhou et.al.	2408.15657	link
2024-08-27	Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images	Silvia Seidlitz et.al.	2408.15373	link
2024-08-27	An Investigation on The Position Encoding in Vision-Based Dynamics Prediction	Jiageng Zhu et.al.	2408.15201	null
2024-08-27	Applying ViT in Generalized Few-shot Semantic Segmentation	Liyuan Geng et.al.	2408.14957	link
2024-08-27	Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack	Naufal Suryanto et.al.	2408.14879	null
2024-08-27	MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation	Yuanbing Zhu et.al.	2408.14776	null
2024-08-26	Physically Feasible Semantic Segmentation	Shamik Basu et.al.	2408.14672	link
2024-08-25	OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation	Muhammad Rameez ur Rahman et.al.	2408.13936	link
2024-08-25	Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation	Yuwen Pan et.al.	2408.13838	null
2024-08-25	TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather	Xiongwei Zhao et.al.	2408.13802	link
2024-08-25	ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation	Xin Zhang et.al.	2408.13771	null
2024-08-25	Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation	Zhaoyang Li et.al.	2408.13752	null
2024-08-24	ESA: Annotation-Efficient Active Learning for Semantic Segmentation	Jinchao Ge et.al.	2408.13491	link
2024-08-23	Accuracy Improvement of Cell Image Segmentation Using Feedback Former	Hinako Mitsuoka et.al.	2408.12974	null
2024-08-23	Image Segmentation in Foundation Model Era: A Survey	Tianfei Zhou et.al.	2408.12957	null
2024-08-23	Symmetric masking strategy enhances the performance of Masked Image Modeling	Khanh-Binh Nguyen et.al.	2408.12772	null
2024-08-22	Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets	Wolfgang Boettcher et.al.	2408.12489	null
2024-08-22	The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation	Tuyen Tran et.al.	2408.12447	null
2024-08-21	UNetMamba: Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images	Enze Zhu et.al.	2408.11545	null
2024-08-21	Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation	Chuandong Liu et.al.	2408.11280	null
2024-08-20	NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency	Valentinos Pariza et.al.	2408.11054	null
2024-08-20	CO2Wounds-V2: Extended Chronic Wounds Dataset From Leprosy Patients	Karen Sanchez et.al.	2408.10827	null
2024-08-20	Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?	Chen Liang et.al.	2408.10627	null
2024-08-20	Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation	Jiawei Han et.al.	2408.10537	link
2024-08-19	Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network	Rasha Alshawi et.al.	2408.10181	null
2024-08-19	Dynamic Label Injection for Imbalanced Industrial Defect Segmentation	Emanuele Caruso et.al.	2408.10031	link
2024-08-19	Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis	Kira Maag et.al.	2408.10021	null
2024-08-19	Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving	Jun Yan et.al.	2408.09839	link
2024-08-18	OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras	Muhammad Rameez Ur Rahman et.al.	2408.09424	link
2024-08-18	Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration	Hao Ai et.al.	2408.09336	null
2024-08-17	Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology	Junchao Zhu et.al.	2408.09278	link
2024-08-17	GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation	Weiming Zhang et.al.	2408.09115	null
2024-08-17	Depth-guided Texture Diffusion for Image Semantic Segmentation	Wei Sun et.al.	2408.09097	null
2024-08-15	5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks	Dongshuo Yin et.al.	2408.08345	link
2024-08-14	MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis	Nimeesha Chan et.al.	2408.07773	link
2024-08-15	MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation	Beoungwoo Kang et.al.	2408.07576	link
2024-08-15	MagicFace: Training-free Universal-Style Human Image Customized Synthesis	Yibin Wang et.al.	2408.07433	null
2024-08-14	Segment Using Just One Example	Pratik Vora et.al.	2408.07393	null
2024-08-14	Ensemble architecture in polyp segmentation	Hao-Yun Hsu et.al.	2408.07262	link
2024-08-14	Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks	Raghavendra Singh et.al.	2408.07243	null
2024-08-14	Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training	Ethan Kou et.al.	2408.07239	null
2024-08-13	ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation	Jingyun Wang et.al.	2408.06747	link
2024-08-10	Dilated Convolution with Learnable Spacings	Ismail Khalfaoui-Hassani et.al.	2408.06383	null
2024-08-12	Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images	Siladittya Manna et.al.	2408.06235	null
2024-08-12	A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting	Felix Assion et.al.	2408.06071	null
2024-08-12	Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning	Xinrong Hu et.al.	2408.05889	null
2024-08-11	Seg-CycleGAN : SAR-to-optical image translation guided by a downstream task	Hannuo Zhang et.al.	2408.05777	null
2024-08-11	MacFormer: Semantic Segmentation with Fine Object Boundaries	Guoan Xu et.al.	2408.05699	null
2024-08-10	Multimodal generative semantic communication based on latent diffusion model	Weiqi Fu et.al.	2408.05455	null
2024-08-09	In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation	Dahyun Kang et.al.	2408.04961	link
2024-08-09	ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation	Mengcheng Lan et.al.	2408.04883	link
2024-08-09	Extracting Signal Electron Trajectories in the COMET Phase-I Cylindrical Drift Chamber Using Deep Learning	Fumihiro Kaneko et.al.	2408.04795	null
2024-08-08	SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation	Jieming Yu et.al.	2408.04593	null
2024-08-08	SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios	Sriram Mandalika et.al.	2408.04482	null
2024-08-08	What could go wrong? Discovering and describing failure modes in computer vision	Gabriela Csurka et.al.	2408.04471	null
2024-08-07	CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications	Tianfang Zhang et.al.	2408.03703	link
2024-08-07	SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology	Mingya Zhang et.al.	2408.03651	link
2024-08-06	Post-Mortem Human Iris Segmentation Analysis with Deep Learning	Afzal Hossain et.al.	2408.03448	null
2024-08-06	Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression	Jonas Schmitt et.al.	2408.03046	link
2024-08-05	Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation	Sai Prasanna et.al.	2408.02297	null
2024-08-05	Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs	Jeongkee Lim et.al.	2408.02261	null
2024-08-05	Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders	Muhammad Abdullah Jamal et.al.	2408.02245	null
2024-08-04	Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation	Ye Du et.al.	2408.02039	null
2024-08-03	Bayesian Active Learning for Semantic Segmentation	Sima Didari et.al.	2408.01694	null
2024-08-03	A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection	Omkar Oak et.al.	2408.01692	null
2024-08-03	Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation	Balázs Opra et.al.	2408.01640	null
2024-08-02	Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans	Lukas Kratochvila et.al.	2408.01526	null
2024-08-02	Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation	Yuanzhi Su et.al.	2408.01356	null
2024-08-02	StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation	Bingyu Li et.al.	2408.01343	null
2024-08-02	Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach	Yabin Zhu et.al.	2408.00969	null
2024-08-01	Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation	Siyu Jiao et.al.	2408.00744	null
2024-08-01	Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function	Matias Oscar Volman Stern et.al.	2408.00707	null
2024-08-01	AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation	Asbjørn Munk et.al.	2408.00640	null
2024-08-01	SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation	Shengbo Tan et.al.	2408.00496	null
2024-07-31	Open-Vocabulary Audio-Visual Semantic Segmentation	Ruohao Guo et.al.	2407.21721	null
2024-07-31	MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment	Anurag Das et.al.	2407.21654	null
2024-07-31	Small Object Few-shot Segmentation for Vision-based Industrial Inspection	Zilong Zhang et.al.	2407.21351	null
2024-07-31	On-the-fly Point Feature Representation for Point Clouds Analysis	Jiangyi Wang et.al.	2407.21335	null
2024-07-31	Fine-grained Metrics for Point Cloud Semantic Segmentation	Zhuheng Lu et.al.	2407.21289	null
2024-07-30	PLANesT-3D: A new annotated dataset for segmentation of 3D plant point clouds	Kerem Mertoğlu et.al.	2407.21150	null
2024-07-30	Learning Ordinality in Semantic Segmentation	Rafael Cristino et.al.	2407.20959	null
2024-07-29	Improving 2D Feature Representations by 3D-Aware Fine-Tuning	Yuanwen Yue et.al.	2407.20229	null
2024-07-29	Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection With Sky-Annotated Dataset	Yimian Dai et.al.	2407.20078	link
2024-07-29	Language-driven Grasp Detection with Mask-guided Attention	Tuan Van Vo et.al.	2407.19877	null
2024-07-29	Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets	Muhammad Abdullah Jamal et.al.	2407.19714	null
2024-07-29	ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement	Ezequiel Perez-Zarate et.al.	2407.19708	link
2024-07-28	ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding	Zhen Chen et.al.	2407.19435	link
2024-07-27	Ensembling convolutional neural networks for human skin segmentation	Patryk Kuban et.al.	2407.19310	null
2024-07-27	Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network	Gang Pan et.al.	2407.19271	null
2024-07-26	Sparse Refinement for Efficient High-Resolution Semantic Segmentation	Zhijian Liu et.al.	2407.19014	null
2024-07-29	Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation	Jingjun Yi et.al.	2407.18568	null
2024-07-25	Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception	Julia Hindel et.al.	2407.18145	null
2024-07-25	TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework	Guanfeng Tang et.al.	2407.18038	null
2024-07-25	Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions	Jan Nikolas Morshuis et.al.	2407.18026	link
2024-07-24	Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation	Hyunwoo Yu et.al.	2407.17261	link
2024-07-24	Trans2Unet: Neural fusion for Nuclei Semantic Segmentation	Dinh-Phu Tran et.al.	2407.17181	null
2024-07-24	PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning	Mu Chen et.al.	2407.17101	null
2024-07-25	Enhancing Environmental Monitoring through Multispectral Imaging: The WasteMS Dataset for Semantic Segmentation of Lakeside Waste	Qinfeng Zhu et.al.	2407.17028	link
2024-07-24	Progressive Query Refinement Framework for Bird's-Eye-View Semantic Segmentation from Surrounding Images	Dooseop Choi et.al.	2407.17003	link
2024-07-23	Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving	Anam Manzoor et.al.	2407.16647	null
2024-07-23	Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging	Daniela L. Ramos et.al.	2407.16608	null
2024-07-23	Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision	Aditya Krishnan et.al.	2407.16102	null
2024-07-22	MILAN: Milli-Annotations for Lidar Semantic Segmentation	Nermin Samet et.al.	2407.15797	null
2024-07-22	Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond	Silvio Galesso et.al.	2407.15739	link
2024-07-22	MSSPlace: Multi-Sensor Place Recognition with Visual and Text Semantics	Alexander Melekhin et.al.	2407.15663	link
2024-07-22	Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling	Bo Yuan et.al.	2407.15429	link
2024-07-22	Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data	Junha Song et.al.	2407.15383	null
2024-07-21	Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation	Xiaoyang Wu et.al.	2407.15282	null
2024-07-20	Downstream-Pretext Domain Knowledge Traceback for Active Learning	Beichen Zhang et.al.	2407.14720	null
2024-07-19	Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model	Kun Zhao et.al.	2407.14326	null
2024-07-19	Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation	Zhengyuan Xie et.al.	2407.14142	link
2024-07-19	GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation	Florian Chabot et.al.	2407.14108	null
2024-07-18	Many Perception Tasks are Highly Redundant Functions of their Input Data	Rahul Ramesh et.al.	2407.13841	null
2024-07-18	GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model	Abdelrahman Shaker et.al.	2407.13772	link
2024-07-18	SegPoint: Segment Any Point Cloud via Large Language Model	Shuting He et.al.	2407.13761	null
2024-07-18	MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis	Ziming Zhong et.al.	2407.13675	link
2024-07-18	Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models	Xiaoyu Zhu et.al.	2407.13642	null
2024-07-18	FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures	Hao Lu et.al.	2407.13500	link
2024-07-18	FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions	Sohyun Lee et.al.	2407.13437	null
2024-07-18	Lightweight Uncertainty Quantification with Simplex Semantic Segmentation for Terrain Traversability	Judith Dijk et.al.	2407.13392	null
2024-07-18	Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation	Chang Liu et.al.	2407.13363	null
2024-07-18	Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation	Shoumeng Qiu et.al.	2407.13254	null
2024-07-18	OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird's-eye-view Vehicle Semantic Segmentation	Jian Sun et.al.	2407.13137	null
2024-07-16	Mitigating Background Shift in Class-Incremental Semantic Segmentation	Gilhan Park et.al.	2407.11859	link
2024-07-16	Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation	Juncheng Ma et.al.	2407.11820	null
2024-07-16	XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach	Truong Thanh Hung Nguyen et.al.	2407.11771	null
2024-07-16	OAM-TCD: A globally diverse dataset of high-resolution tree cover maps	Josh Veitch-Michaelis et.al.	2407.11743	null
2024-07-16	SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds	Yanbo Wang et.al.	2407.11569	link
2024-07-16	Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations	Yunya Gao et.al.	2407.11381	link
2024-07-16	Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities	Xu Zheng et.al.	2407.11351	null
2024-07-16	Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation	Xu Zheng et.al.	2407.11344	null
2024-07-16	TCFormer: Visual Recognition via Token Clustering Transformer	Wang Zeng et.al.	2407.11321	link
2024-07-15	Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding	Danish Nazir et.al.	2407.11224	null
2024-07-15	No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations	Walter Simoncini et.al.	2407.10964	link
2024-07-15	APC: Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2407.10649	null
2024-07-15	Automated Label Unification for Multi-Dataset Semantic Segmentation with GNNs	Rong Ma et.al.	2407.10534	null
2024-07-14	Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data	Tuo Feng et.al.	2407.10200	link
2024-07-14	RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation	Li Li et.al.	2407.10159	link
2024-07-14	HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation	Chengjie Jiang et.al.	2407.10047	null
2024-07-13	Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation	Anqi Zhang et.al.	2407.09838	null
2024-07-13	Enhancing Semantic Segmentation with Adaptive Focal Loss: A Novel Approach	Md Rakibul Islam et.al.	2407.09828	null
2024-07-13	3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance	Xiaoxu Xu et.al.	2407.09826	null
2024-07-13	TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation	Xiaopei Wu et.al.	2407.09751	null
2024-07-12	FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background	Muhammad Ali et.al.	2407.09379	link
2024-07-12	Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy	Julian Wyatt et.al.	2407.09192	null
2024-07-12	Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off	Levente Halmosi et.al.	2407.09150	link
2024-07-12	Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation	Wei Cong et.al.	2407.09047	null
2024-07-12	Textual Query-Driven Mask Transformer for Domain Generalized Segmentation	Byeonghyun Pak et.al.	2407.09033	null
2024-07-12	Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation	Zihao Li et.al.	2407.08994	null
2024-07-11	Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation	Tong Shao et.al.	2407.08268	null
2024-07-11	Enrich the content of the image Using Context-Aware Copy Paste	Qiushi Guo et.al.	2407.08151	null
2024-07-10	MambaVision: A Hybrid Mamba-Transformer Vision Backbone	Ali Hatamizadeh et.al.	2407.08083	link
2024-07-10	Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift	Elliot Vincent et.al.	2407.07616	link
2024-07-10	H-FCBFormer Hierarchical Fully Convolutional Branch Transformer for Occlusal Contact Segmentation with Articulating Paper	Ryan Banks et.al.	2407.07604	link
2024-07-11	Trainable Highly-expressive Activation Functions	Irit Chelly et.al.	2407.07564	null
2024-07-10	Deformable-Heatmap-Segmentation for Automobile Visual Perception	Hongyu Jin et.al.	2407.07493	null
2024-07-10	Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining	Tianfang Sun et.al.	2407.07465	null
2024-07-11	HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation	Guoan Xu et.al.	2407.07441	null
2024-07-09	ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation	Yuyuan Liu et.al.	2407.07171	link
2024-07-08	Training-free CryoET Tomogram Segmentation	Yizhou Zhao et.al.	2407.06833	link
2024-07-09	CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM	Aditya Murali et.al.	2407.06795	null
2024-07-09	LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration	Jiayi Liu et.al.	2407.06512	link
2024-07-08	Leveraging image captions for selective whole slide image annotation	Jingna Qiu et.al.	2407.06363	null
2024-07-08	Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots	Siva Krishna Ravipati et.al.	2407.06077	null
2024-07-08	Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts	Puzuo Wang et.al.	2407.06043	null
2024-07-08	RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation	Sarah Elmahdy et.al.	2407.06016	link
2024-07-07	Semantic Segmentation for Real-World and Synthetic Vehicle's Forward-Facing Camera Images	Tuan T. Nguyen et.al.	2407.05452	null
2024-07-07	Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness	Idris Hamoud et.al.	2407.05448	null
2024-07-06	A Study of Test-time Contrastive Concepts for Open-world, Open-vocabulary Semantic Segmentation	Monika Wysoczańska et.al.	2407.05061	null
2024-07-06	BlessemFlood21: Advancing Flood Analysis with a High-Resolution Georeferenced Dataset for Humanitarian Aid Support	Vladyslav Polushko et.al.	2407.05007	null
2024-07-05	Explainable Metric Learning for Deflating Data Bias	Emma Andrews et.al.	2407.04866	null
2024-07-05	LMSeg: A deep graph message-passing network for efficient and accurate semantic segmentation of large-scale 3D landscape meshes	Zexian Huang et.al.	2407.04326	null
2024-07-04	Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier	Prantik Howlader et.al.	2407.04036	link
2024-07-04	Relative Difficulty Distillation for Semantic Segmentation	Dong Liang et.al.	2407.03719	null
2024-07-04	POSTURE: Pose Guided Unsupervised Domain Adaptation for Human Body Part Segmentation	Arindam Dutta et.al.	2407.03549	null
2024-07-03	A Unified Framework for 3D Scene Understanding	Wei Xu et.al.	2407.03263	null
2024-07-03	ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation	Chang Li et.al.	2407.03033	null
2024-07-03	ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation	Yipin Guo et.al.	2407.02881	null
2024-07-03	Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation	Tao Chen et.al.	2407.02768	null
2024-07-02	Open Panoramic Segmentation	Junwei Zheng et.al.	2407.02685	null
2024-07-02	Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction	Tinghuai Wang et.al.	2407.02639	null
2024-07-02	Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather	Junsung Park et.al.	2407.02286	link
2024-07-02	MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders	Baijiong Lin et.al.	2407.02228	link
2024-07-02	Occlusion-Aware Seamless Segmentation	Yihong Cao et.al.	2407.02182	link
2024-07-02	VRBiom: A New Periocular Dataset for Biometric Applications of HMD	Ketan Kotwal et.al.	2407.02150	null
2024-07-02	Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts	Pasquale De Marinis et.al.	2407.02075	null
2024-07-02	Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning	Chengchao Shen et.al.	2407.02014	link
2024-07-01	Label-free Neural Semantic Image Synthesis	Jiayi Wang et.al.	2407.01790	null
2024-07-01	PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction	Xuan Yu et.al.	2407.01349	null
2024-07-01	CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes	Danial Qashqai et.al.	2407.01328	link
2024-06-29	SolarSAM: Building-scale Photovoltaic Potential Assessment Based on Segment Anything Model (SAM) and Remote Sensing for Emerging City	Guohao Wang et.al.	2407.00296	link
2024-07-01	Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding	Yifan Tang et.al.	2406.19791	null
2024-06-28	Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation	Junsung Park et.al.	2406.19638	link
2024-06-28	PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation	Deyi Ji et.al.	2406.19632	null
2024-06-27	Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model	Haobo Yuan et.al.	2406.19369	null
2024-06-27	ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation	Nazanin Moradinasab et.al.	2406.19225	null
2024-06-30	Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO	Fuseini Mumuni et.al.	2406.19057	null
2024-06-27	Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation	Tao Lian et.al.	2406.18809	null
2024-06-26	CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data	Nikolaos Dionelis et.al.	2406.18279	null
2024-06-26	The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval	Meinardus Boris et.al.	2406.18113	link
2024-06-26	Few-Shot Medical Image Segmentation with High-Fidelity Prototypes	Song Tang et.al.	2406.18074	link
2024-06-25	Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation	Xuming Zhang et.al.	2406.17679	null
2024-06-25	DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation	Ahmad Mohammadshirazi et.al.	2406.17591	link
2024-06-25	Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation	Felix Stillger et.al.	2406.17541	null
2024-06-25	Investigating Self-Supervised Methods for Label-Efficient Learning	Srinivasa Rao Nandam et.al.	2406.17460	null
2024-06-25	Pseudo Labelling for Enhanced Masked Autoencoders	Srinivasa Rao Nandam et.al.	2406.17450	null
2024-06-25	Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model	Zhuoyuan Li et.al.	2406.17442	null
2024-06-25	Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes	Qi Ma et.al.	2406.17438	link
2024-06-24	Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation	Yizheng Wu et.al.	2406.16776	link
2024-06-24	μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation	Pierangela Bruno et.al.	2406.16724	null
2024-06-24	GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection	Harnaik Dhami et.al.	2406.16625	null
2024-06-24	LOGCAN++: Local-global class-aware network for semantic segmentation of remote sensing images	Xiaowen Ma et.al.	2406.16502	link
2024-06-24	Cascade Reward Sampling for Efficient Decoding-Time Alignment	Bolian Li et.al.	2406.16306	null
2024-06-24	SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments	Neng Wang et.al.	2406.16279	link
2024-06-23	UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery	Pengfei Zhang et.al.	2406.16129	null
2024-06-22	Fine-grained Background Representation for Weakly Supervised Semantic Segmentation	Xu Yin et.al.	2406.15755	null
2024-06-20	Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery	Ilham Adi Panuntun et.al.	2406.14220	null
2024-06-20	Trusting Semantic Segmentation Networks	Samik Some et.al.	2406.14201	null
2024-06-20	EvSegSNN: Neuromorphic Semantic Segmentation for Event Data	Dalia Hareb et.al.	2406.14178	null
2024-06-20	Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images	Qinfeng Zhu et.al.	2406.14086	link
2024-06-19	Search-based DNN Testing and Retraining with GAN-enhanced Simulations	Mohammed Oualid Attaoui et.al.	2406.13359	null
2024-06-19	Deep Learning-Based 3D Instance and Semantic Segmentation: A Review	Siddiqui Muhammad Yasir et.al.	2406.13308	null
2024-06-18	Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation	Guoyu Yang et.al.	2406.12496	link
2024-06-18	Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble	Wang Liu et.al.	2406.12271	null
2024-06-17	OoDIS: Anomaly Instance Segmentation Benchmark	Alexey Nekrasov et.al.	2406.11835	link
2024-06-17	Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT	Maximilian E. Tschuchnig et.al.	2406.11650	null
2024-06-17	SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation	Zhenchao Lin et.al.	2406.11441	link
2024-06-17	Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding	Yunsong Wang et.al.	2406.11283	null
2024-06-17	Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation	Bingfeng Zhang et.al.	2406.11189	null
2024-06-16	$α$ -SSC: Uncertainty-Aware Camera-based 3D Semantic Scene Completion	Sanbao Su et.al.	2406.11021	null
2024-06-16	PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery	Libo Wang et.al.	2406.10828	link
2024-06-15	GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR	Bharat Singh et.al.	2406.10722	null
2024-06-15	A Late-Stage Bitemporal Feature Fusion Network for Semantic Change Detection	Chenyao Zhou et.al.	2406.10678	link
2024-06-14	ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers	Narges Norouzi et.al.	2406.09936	null
2024-06-14	Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions	Aldi Piroli et.al.	2406.09906	null
2024-06-14	Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation	Brunó B. Englert et.al.	2406.09896	link
2024-06-14	Open-Vocabulary Semantic Segmentation with Image Embedding Balancing	Xiangheng Shan et.al.	2406.09829	link
2024-06-13	Instance-level quantitative saliency in multiple sclerosis lesion segmentation	Federico Spagnolo et.al.	2406.09335	null
2024-06-13	APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation	Weizhao He et.al.	2406.08372	null
2024-06-12	Dataset Enhancement with Instance-Level Augmentations	Orest Kupyn et.al.	2406.08249	link
2024-06-13	A $^{2}$ -MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder	Lixian Zhang et.al.	2406.08079	null
2024-06-12	OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding	Yinan Deng et.al.	2406.08009	link
2024-06-12	SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation	Chanda Grover Kamra et.al.	2406.07986	link
2024-06-12	Small Scale Data-Free Knowledge Distillation	He Liu et.al.	2406.07876	link
2024-06-11	Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph	Sergey Linok et.al.	2406.07113	null
2024-06-11	PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving	Yining Shi et.al.	2406.07037	null
2024-06-12	LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection	Jiahua Xu et.al.	2406.07023	null
2024-06-10	Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation	Dong Zhao et.al.	2406.06813	link
2024-06-09	Transforming Heart Chamber Imaging: Self-Supervised Learning for Whole Heart Reconstruction and Segmentation	Abdul Qayyum et.al.	2406.06643	null
2024-06-10	Merlin: A Vision Language Foundation Model for 3D Computed Tomography	Louis Blankemeier et.al.	2406.06512	null
2024-06-10	UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving	Daniel Bogdoll et.al.	2406.06370	null
2024-06-09	Scaling Graph Convolutions for Mobile Vision	William Avery et.al.	2406.05850	link
2024-06-09	Solution for CVPR 2024 UG2+ Challenge Track on All Weather Semantic Segmentation	Jun Yu et.al.	2406.05837	null
2024-06-09	Convolution and Attention-Free Mamba-based Cardiac Image Segmentation	Abbas Khan et.al.	2406.05786	null
2024-06-09	Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language	Mark Hamilton et.al.	2406.05629	link
2024-06-08	A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+	Jianzhao Wang et.al.	2406.05513	null
2024-06-08	Layered Image Vectorization via Semantic Simplification	Zhenyu Wang et.al.	2406.05404	null
2024-06-08	1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation	Qingfeng Liu et.al.	2406.05352	null
2024-06-07	USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation	Xiaoqi Wang et.al.	2406.05271	null
2024-06-07	Semantic Segmentation on VSPW Dataset through Masked Video Consistency	Chen Liang et.al.	2406.04979	null
2024-06-07	Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment	Venkanna Babu Guthula et.al.	2406.04949	null
2024-06-06	Characterizing segregation in blast rock piles a deep-learning approach leveraging aerial image analysis	Chengeng Liu et.al.	2406.04149	null
2024-06-06	Frequency-based Matcher for Long-tailed Semantic Segmentation	Shan Li et.al.	2406.03917	link
2024-06-07	Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge	Nan Zhang et.al.	2406.03799	link
2024-06-06	DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation	Zilu Guo et.al.	2406.03702	link
2024-06-05	Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation	Maximilian Zenk et.al.	2406.03323	null
2024-06-05	Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy	Yunho Kim et.al.	2406.02989	null
2024-06-04	W-RIZZ: A Weakly-Supervised Framework for Relative Traversability Estimation in Mobile Robotics	Andre Schreiber et.al.	2406.02822	link
2024-06-04	Window to Wall Ratio Detection using SegFormer	Zoe De Simone et.al.	2406.02706	link
2024-06-04	Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning	Heather Doig et.al.	2406.01932	null
2024-06-03	EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding	Thanh-Dat Truong et.al.	2406.01429	null
2024-06-03	TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation	Antonio Santo et.al.	2406.01395	link
2024-06-03	ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds	Ka Lung Cheung et.al.	2406.01337	link
2024-06-03	LSKSANet: A Novel Architecture for Remote Sensing Image Semantic Segmentation Leveraging Large Selective Kernel and Sparse Attention Mechanism	Miao Fu et.al.	2406.01228	null
2024-06-04	GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer	Ding Jia et.al.	2406.01210	link
2024-06-03	S-CycleGAN: Semantic Segmentation Enhanced CT-Ultrasound Image-to-Image Translation for Robotic Ultrasonography	Yuhan Song et.al.	2406.01191	null
2024-06-02	Diffusion Features to Bridge Domain Gap for Semantic Segmentation	Yuxiang Ji et.al.	2406.00777	null
2024-06-02	Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation	Yunheng Li et.al.	2406.00670	null
2024-06-02	Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024	Biao Wu et.al.	2406.00587	null
2024-05-31	Uncertainty Quantification for Bird's Eye View Semantic Segmentation: Methods and Benchmarks	Linlin Yu et.al.	2405.20986	null
2024-05-31	Revisiting and Maximizing Temporal Knowledge in Semi-supervised Semantic Segmentation	Wooseok Shin et.al.	2405.20610	link
2024-05-30	P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation	Qi Zhang et.al.	2405.20443	null
2024-05-30	SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow	Chaoyang Wang et.al.	2405.20282	link
2024-05-30	MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion	Angel Villar-Corrales et.al.	2405.19921	link
2024-05-30	Open-Set Domain Adaptation for Semantic Segmentation	Seun-An Choe et.al.	2405.19899	link
2024-05-30	DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation	Ron Keuth et.al.	2405.19746	link
2024-05-30	Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes	Yong-Qiang Mao et.al.	2405.19735	null
2024-05-30	CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation	Ankush Gajanan Arudkar et.al.	2405.19672	null
2024-05-29	Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation	Lianlei Shan et.al.	2405.19568	null
2024-05-29	Enabling Visual Recognition at Radio Frequency	Haowen Lai et.al.	2405.19516	null
2024-05-29	Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models	Tianrun Chen et.al.	2405.19326	null
2024-05-29	A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation	Niclas Vödisch et.al.	2405.19035	link
2024-05-29	Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation	Zelin Peng et.al.	2405.18840	null
2024-05-28	Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation	JuneHyoung Kwon et.al.	2405.18148	null
2024-05-28	Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images	Lianlei Shan et.al.	2405.18078	null
2024-05-28	RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields	Mihnea-Bogdan Jurca et.al.	2405.18033	null
2024-05-28	DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture	Shentong Mo et.al.	2405.17995	null
2024-05-28	The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention	Xingyu Ding et.al.	2405.17776	null
2024-05-27	Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation	Steven Landgraf et.al.	2405.17097	null
2024-05-27	DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking	Hongtao Wang et.al.	2405.16980	null
2024-05-27	Collective Perception Datasets for Autonomous Driving: A Comprehensive Review	Sven Teufel et.al.	2405.16973	null
2024-05-27	Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models	Qian Wang et.al.	2405.16947	null
2024-05-27	A re-calibration method for object detection with multi-modal alignment bias in autonomous driving	Zhihang Song et.al.	2405.16848	null
2024-05-25	BOLD: Boolean Logic Deep Learning	Van Minh Nguyen et.al.	2405.16339	null
2024-05-25	Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation	Huizhou Chen et.al.	2405.16099	null
2024-05-25	Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality	Hakim Ikebayashi et.al.	2405.16008	null
2024-05-24	Visualize and Paint GAN Activations	Rudolf Herdt et.al.	2405.15636	null
2024-05-24	Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets	Hoàng-Ân Lê et.al.	2405.15394	null
2024-05-24	U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation	Bingyu Li et.al.	2405.15365	link
2024-05-24	Cross-Domain Few-Shot Semantic Segmentation via Doubly Matching Transformation	Jiayi Chen et.al.	2405.15265	null
2024-05-23	Mamba-R: Vision Mamba ALSO Needs Registers	Feng Wang et.al.	2405.14858	null
2024-05-23	Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation	Daniel Kienzle et.al.	2405.14467	null
2024-05-23	MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models	Jiuming Liu et.al.	2405.14338	null
2024-05-23	Tuning-free Universally-Supervised Semantic Segmentation	Xiaobo Yang et.al.	2405.14294	null
2024-05-23	SCMix: Stochastic Compound Mixing for Open Compound Domain Adaptation in Semantic Segmentation	Kai Yao et.al.	2405.14278	null
2024-05-23	Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations	Mohammed Baharoon et.al.	2405.14239	null
2024-05-24	Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification	Taylor Archibald et.al.	2405.14162	null
2024-05-23	Skip-SCAR: A Modular Approach to ObjectGoal Navigation with Sparsity and Adaptive Skips	Yaotian Liu et.al.	2405.14154	null
2024-05-22	TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System	Diogo Lavado et.al.	2405.13989	null
2024-05-22	Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer	Qihang Fan et.al.	2405.13337	null
2024-05-21	Transparency Distortion Robustness for SOTA Image Segmentation Tasks	Volker Knauthe et.al.	2405.12864	null
2024-05-20	A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation	Sushmita Sarker et.al.	2405.11903	null
2024-05-20	Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments	Jooyong Park et.al.	2405.11855	null
2024-05-20	Universal Organizer of SAM for Unsupervised Semantic Segmentation	Tingting Li et.al.	2405.11742	null
2024-05-19	Interpreting a Semantic Segmentation Model for Coastline Detection	Conor O'Sullivan et.al.	2405.11500	null
2024-05-17	CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation	Mushui Liu et.al.	2405.10530	link
2024-05-16	Towards Task-Compatible Compressible Representations	Anderson de Andrade et.al.	2405.10244	link
2024-05-16	A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance	Andrea Matteazzi et.al.	2405.10046	null
2024-05-16	Towards Realistic Incremental Scenario in Class Incremental Semantic Segmentation	Jihwan Kwak et.al.	2405.09858	null
2024-05-15	Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation	Guo Yachan et.al.	2405.09682	null

(back to top)

Name		Name	Last commit message	Last commit date
Latest commit History 207 Commits
.github/workflows		.github/workflows
docs		docs
README.md		README.md
config.yaml		config.yaml
daily_arxiv.py		daily_arxiv.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Updated on 2024.11.24

Depth Estimation

Semactic Segmentation

About

Releases

Packages

Languages

ZhuYingJessica/cv-daily

Folders and files

Latest commit

History

Repository files navigation

Updated on 2024.11.24

Depth Estimation

Semactic Segmentation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages