Skip to content

ZhuYingJessica/cv-daily

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Contributors Forks Stargazers Issues

Updated on 2024.11.24

Usage instructions: here

Table of Contents
  1. Depth Estimation
  2. Semactic Segmentation

Depth Estimation

Publish Date Title Authors PDF Code
2024-11-21 StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart Jian Shi et.al. 2411.14295 null
2024-11-20 DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild Weicai Ye et.al. 2411.13291 null
2024-11-20 OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater Imaging Rajini Makam et.al. 2411.13230 null
2024-11-15 SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction Yutao Tang et.al. 2411.12592 link
2024-11-18 Towards Degradation-Robust Reconstruction in Generalizable NeRF Chan Ho Park et.al. 2411.11691 null
2024-11-18 MGNiceNet: Unified Monocular Geometric Scene Understanding Markus Schön et.al. 2411.11466 null
2024-11-18 The ADUULM-360 Dataset -- A Multi-Modal Dataset for Depth Estimation in Adverse Weather Markus Schön et.al. 2411.11455 null
2024-11-18 GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views Boyao Zhou et.al. 2411.11363 null
2024-11-18 Scalable Autoregressive Monocular Depth Estimation Jinhong Wang et.al. 2411.11361 null
2024-11-16 MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation Ansh Shah et.al. 2411.10886 link
2024-11-19 EVT: Efficient View Transformation for Multi-Modal 3D Object Detection Yongjin Lee et.al. 2411.10715 null
2024-11-15 Efficient Depth Estimation for Unstable Stereo Camera Systems on AR Glasses Yongfan Liu et.al. 2411.10013 null
2024-11-14 Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting Yian Wang et.al. 2411.09823 null
2024-11-14 Adversarial Attacks Using Differentiable Rendering: A Survey Matthew Hull et.al. 2411.09749 null
2024-11-14 Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching Yuran Wang et.al. 2411.09151 null
2024-11-13 OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances Youqi Liao et.al. 2411.08665 null
2024-11-13 Scaling Properties of Diffusion Models for Perceptual Tasks Rahul Ravishankar et.al. 2411.08034 null
2024-11-11 $SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation Yinshuang Xu et.al. 2411.07326 null
2024-11-08 Enhancing Depth Image Estimation for Underwater Robots by Combining Image Processing and Machine Learning Quang Truong Nguyen et.al. 2411.05344 null
2024-11-08 SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection Yun Zhao et.al. 2411.05292 null
2024-11-07 D $^3$ epth: Self-Supervised Depth Estimation with Dynamic Mask in Dynamic Scenes Siyu Chen et.al. 2411.04826 null
2024-11-06 Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation Teppei Kurita et.al. 2411.04714 null
2024-11-07 Enhancing Bronchoscopy Depth Estimation through Synthetic-to-Real Domain Adaptation Qingyao Tian et.al. 2411.04404 null
2024-11-04 PMPNet: Pixel Movement Prediction Network for Monocular Depth Estimation in Dynamic Scenes Kebin Peng et.al. 2411.04227 null
2024-11-06 Adaptive Stereo Depth Estimation with Multi-Spectral Images Across All Lighting Conditions Zihan Qin et.al. 2411.03638 null
2024-11-05 Monocular Event-Based Vision for Obstacle Avoidance with a Quadrotor Anish Bhattacharya et.al. 2411.03303 null
2024-11-05 Correlation of Object Detection Performance with Visual Saliency and Depth Estimation Matthias Bartolo et.al. 2411.02844 link
2024-11-05 FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training Ruihong Yin et.al. 2411.02229 null
2024-11-05 Improving Domain Generalization in Self-supervised Monocular Depth Estimation via Stabilized Adversarial Training Yuanqi Yao et.al. 2411.02149 null
2024-11-01 MultiDepth: Multi-Sample Priors for Refining Monocular Metric Depth Estimations in Indoor Scenes Sanghyun Byun et.al. 2411.01048 null
2024-11-01 On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDAR Li Li et.al. 2411.00600 link
2024-10-31 Optical Lens Attack on Monocular Depth Estimation for Autonomous Driving Ce Zhou et.al. 2411.00192 null
2024-10-31 ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images Timing Yang et.al. 2410.24001 link
2024-10-30 Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe Songyu Xu et.al. 2410.23154 null
2024-10-29 Active Event Alignment for Monocular Distance Estimation Nan Cai et.al. 2410.22280 null
2024-10-29 PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting Sunghwan Hong et.al. 2410.22128 link
2024-10-27 Unlocking Comics: The AI4VA Dataset for Visual Understanding Peter Grönquist et.al. 2410.20459 link
2024-10-27 Depth Attention for Robust RGB Tracking Yu Liu et.al. 2410.20395 link
2024-10-21 YOLO11 and Vision Transformers based 3D Pose Estimation of Immature Green Fruits in Commercial Apple Orchards for Robotic Thinning Ranjan Sapkota et.al. 2410.19846 null
2024-10-25 MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors Fanqi Pu et.al. 2410.19590 null
2024-10-24 Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction Hongxin Peng et.al. 2410.18433 null
2024-10-24 Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images Dong-Guw Lee et.al. 2410.18340 link
2024-10-25 UnCLe: Unsupervised Continual Learning of Depth Completion Suchisrit Gangopadhyay et.al. 2410.18074 null
2024-10-21 TIPS: Text-Image Pretraining with Spatial Awareness Kevis-Kokitsi Maninis et.al. 2410.16512 null
2024-10-22 DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain Kun Wang et.al. 2410.14980 link
2024-10-17 DepthSplat: Connecting Gaussian Splatting and Depth Haofei Xu et.al. 2410.13862 link
2024-10-16 DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning Jiabao Wei et.al. 2410.12501 null
2024-10-16 Depth Estimation From Monocular Images With Enhanced Encoder-Decoder Architecture Dabbrata Das et.al. 2410.11610 null
2024-10-16 CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction Pranav Gupta et.al. 2410.11211 link
2024-10-14 When Does Perceptual Alignment Benefit Vision Representations? Shobhita Sundaram et.al. 2410.10817 null
2024-10-14 Depth Any Video with Scalable Synthetic Data Honghui Yang et.al. 2410.10815 link
2024-10-15 Improved Depth Estimation of Bayesian Neural Networks Bart van Erp et.al. 2410.10395 link
2024-10-10 Color-Guided Flying Pixel Correction in Depth Images Ekamresh Vasudevan et.al. 2410.08084 null
2024-10-09 Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models Ange Lou et.al. 2410.07434 null
2024-10-09 Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation Runze Chen et.al. 2410.06982 null
2024-10-09 Analysis of different disparity estimation techniques on aerial stereo image datasets Ishan Narayan et.al. 2410.06711 null
2024-10-08 Vision Transformer based Random Walk for Group Re-Identification Guoqing Zhang et.al. 2410.05808 null
2024-10-08 CUBE360: Learning Cubic Field Representation for Monocular 360 Depth Estimation for Virtual Reality Wenjie Chang et.al. 2410.05735 null
2024-10-07 PhotoReg: Photometrically Registering 3D Gaussian Splatting Models Ziwen Yuan et.al. 2410.05044 null
2024-10-10 Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy Pengcheng Chen et.al. 2410.04041 null
2024-10-04 Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering Laura Fink et.al. 2410.03861 null
2024-10-03 RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions Ziyao Zeng et.al. 2410.02924 null
2024-10-02 Depth Pro: Sharp Monocular Metric Depth in Less Than a Second Aleksei Bochkovskii et.al. 2410.02073 link
2024-10-10 Towards Full-parameter and Parameter-efficient Self-learning For Endoscopic Camera Depth Estimation Shuting Zhao et.al. 2410.00979 null
2024-10-01 Radar Meets Vision: Robustifying Monocular Metric Depth Prediction for Mobile Robotics Marco Job et.al. 2410.00736 null
2024-10-06 Drone Stereo Vision for Radiata Pine Branch Detection and Distance Measurement: Utilizing Deep Learning and YOLO Integration Yida Lin et.al. 2410.00503 null
2024-10-01 Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance Hongchao Shu et.al. 2410.00386 null
2024-09-30 CCDepth: A Lightweight Self-supervised Depth Estimation Network with Enhanced Interpretability Xi Zhang et.al. 2409.19933 null
2024-09-30 EndoDepth: A Benchmark for Assessing Robustness in Endoscopic Depth Prediction Ivan Reyes-Amezcua et.al. 2409.19930 link
2024-09-29 fCOP: Focal Length Estimation from Category-level Object Priors Xinyue Zhang et.al. 2409.19641 null
2024-09-29 KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation Soofiyan Atar et.al. 2409.19490 null
2024-09-27 Speckle-illumination spatial frequency domain imaging with a stereo laparoscope for profile-corrected optical property mapping Anthony A. Song et.al. 2409.19153 null
2024-09-26 Self-supervised Monocular Depth Estimation with Large Kernel Attention Xuezhi Xiang et.al. 2409.17895 null
2024-09-26 Self-Distilled Depth Refinement with Noisy Poisson Fusion Jiaqi Li et.al. 2409.17880 null
2024-09-27 A New Dataset for Monocular Depth Estimation Under Viewpoint Shifts Aurel Pjetri et.al. 2409.17851 null
2024-09-26 Event-based Stereo Depth Estimation: A Survey Suman Ghosh et.al. 2409.17680 null
2024-09-26 CAMOT: Camera Angle-aware Multi-Object Tracking Felix Limanta et.al. 2409.17533 null
2024-09-25 Optical Lens Attack on Deep Learning Based Monocular Depth Estimation Ce Zhou et.al. 2409.17376 null
2024-09-25 Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation Richard D. Paul et.al. 2409.17085 null
2024-09-25 EventHDR: from Event to High-Speed HDR Videos and Beyond Yunhao Zou et.al. 2409.17029 null
2024-09-25 3DDX: Bone Surface Reconstruction from a Single Standard-Geometry Radiograph via Dual-Face Depth Estimation Yi Gu et.al. 2409.16702 null
2024-09-24 MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling Yifang Men et.al. 2409.16160 null
2024-09-24 Benchmarking Robustness of Endoscopic Depth Estimation with Synthetically Corrupted Data An Wang et.al. 2409.16063 link
2024-09-23 FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera Guoyang Zhao et.al. 2409.15054 link
2024-09-23 DepthART: Monocular Depth Estimation as Autoregressive Refinement Task Bulat Gabdullin et.al. 2409.15010 null
2024-09-23 Generalizing monocular colonoscopy image depth estimation by uncertainty-based global and local fusion network Sijia Du et.al. 2409.15006 null
2024-09-23 GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth Aurélien Cecille et.al. 2409.14850 null
2024-09-23 Robust and Flexible Omnidirectional Depth Estimation with Multiple 360° Cameras Ming Li et.al. 2409.14766 null
2024-09-18 Panoptic-Depth Forecasting Juana Valeria Hurtado et.al. 2409.12008 null
2024-09-17 Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think Gonzalo Martin Garcia et.al. 2409.11355 link
2024-09-15 GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion Vitor Guizilini et.al. 2409.09896 null
2024-09-15 Towards Single-Lens Controllable Depth-of-Field Imaging via All-in-Focus Aberration Correction and Monocular Depth Estimation Xiaolong Qian et.al. 2409.09754 link
2024-09-13 PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage Denis Zavadski et.al. 2409.09144 link
2024-09-25 Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding Rania Hossam et.al. 2409.08695 link
2024-09-12 Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor Andrea Conti et.al. 2409.08277 null
2024-09-12 LED: Light Enhanced Depth Estimation at Night Simon de Moreau et.al. 2409.08031 link
2024-09-12 Real-time Multi-view Omnidirectional Depth Estimation System for Robots and Autonomous Driving on Real Scenes Ming Li et.al. 2409.07843 null
2024-09-12 Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy Bojian Li et.al. 2409.07723 null
2024-09-12 FIReStereo: Forest InfraRed Stereo Dataset for UAS Depth Perception in Visually Degraded Environments Devansh Dhrafani et.al. 2409.07715 null
2024-09-10 Deep Neural Networks: Multi-Classification and Universal Approximation Martín Hernández et.al. 2409.06555 null
2024-09-10 EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation Nischal Khanal et.al. 2409.06183 link
2024-09-11 EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels Qingyao Tian et.al. 2409.05442 null
2024-09-09 Spontaneous magnetic field and disorder effects in BaPtAs_1-x_Sb_x_ with honeycomb network T. Adachi et.al. 2409.05266 null
2024-09-08 TanDepth: Leveraging Global DEMs for Metric Monocular Depth Estimation in UAVs Horatiu Florea et.al. 2409.05142 null
2024-09-12 Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective Tim Bader et.al. 2409.04086 link
2024-09-08 Estimating Indoor Scene Depth Maps from Ultrasonic Echoes Junpei Honma et.al. 2409.03336 null
2024-09-04 iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation Hayeon Jo et.al. 2409.02838 null
2024-09-02 GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling Huawei Sun et.al. 2409.02720 null
2024-09-04 Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects Kyungmin Jo et.al. 2409.02653 null
2024-09-04 UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching Soomin Kim et.al. 2409.02545 null
2024-09-04 SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction Sumin Son et.al. 2409.02513 null
2024-09-04 Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation Li Liu et.al. 2409.02494 null
2024-09-04 Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization Cho-Ying Wu et.al. 2409.02486 null
2024-09-04 GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving Huasong Han et.al. 2409.02382 null
2024-09-03 DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos Wenbo Hu et.al. 2409.02095 null
2024-09-02 Large Language Models Can Understanding Depth from Monocular Images Zhongyi Xia et.al. 2409.01133 null
2024-08-30 DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model Mona Sheikh Zeinoddin et.al. 2408.17433 null
2024-08-30 Enhancing Underwater Imaging with 4-D Light Fields: Dataset and Method Yuji Lin et.al. 2408.17339 null
2024-08-30 Synthetic Lunar Terrain: A Multimodal Open Dataset for Training and Evaluating Neuromorphic Vision Algorithms Marcus Märtens et.al. 2408.16971 null
2024-08-29 EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More Kanghao Chen et.al. 2408.16254 null
2024-08-30 Revisiting 360 Depth Estimation with PanoGabor: A New Fusion Perspective Zhijie Shen et.al. 2408.16227 link
2024-08-27 Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack Naufal Suryanto et.al. 2408.14879 null
2024-08-26 NimbleD: Enhancing Self-supervised Monocular Depth Estimation with Pseudo-labels and Large-scale Video Pre-training Albert Luginov et.al. 2408.14177 null
2024-08-26 Pixel-Aligned Multi-View Generation with Depth Guided Decoder Zhenggang Tang et.al. 2408.14016 null
2024-08-25 TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers Chuanrui Zhang et.al. 2408.13770 null
2024-08-25 InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth Cho-Ying Wu et.al. 2408.13708 null
2024-08-25 SeeBelow: Sub-dermal 3D Reconstruction of Tumors with Surgical Robotic Palpation and Tactile Exploration Raghava Uppuluri et.al. 2408.13699 null
2024-08-27 Sapiens: Foundation for Human Vision Models Rawal Khirodkar et.al. 2408.12569 null
2024-08-21 LiFCal: Online Light Field Camera Calibration via Bundle Adjustment Aymeric Fleith et.al. 2408.11682 null
2024-08-19 Structure-preserving Image Translation for Depth Estimation in Colonoscopy Video Shuxian Wang et.al. 2408.10153 null
2024-08-19 SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition Wiktor Mucha et.al. 2408.10037 link
2024-08-19 P3P: Pseudo-3D Pre-training for Scaling 3D Masked Autoencoders Xuechao Chen et.al. 2408.10007 null
2024-08-14 Enhanced Scale-aware Depth Estimation for Monocular Endoscopic Scenes with Geometric Modeling Ruofeng Wei et.al. 2408.07266 null
2024-08-12 Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces Junrui Zhang et.al. 2408.06083 null
2024-08-08 Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation Daniele Rege Cambrin et.al. 2408.04523 link
2024-08-08 Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework Subhasis Dasgupta et.al. 2408.04360 null
2024-08-08 Design and Implementation of Smart Infrastructures and Connected Vehicles in A Mini-city Platform Daniel Vargas et.al. 2408.04195 null
2024-08-07 Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach Benedikt W. Hosp et.al. 2408.03591 null
2024-08-06 BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications G. Manni et.al. 2408.03078 link
2024-08-05 Gaussian Mixture based Evidential Learning for Stereo Matching Weide Liu et.al. 2408.02796 null
2024-08-05 Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining Dongyang Liu et.al. 2408.02657 link
2024-08-03 MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas Feng Qiao et.al. 2408.01653 null
2024-08-02 Self-Supervised Depth Estimation Based on Camera Models Jinchang Zhang et.al. 2408.01565 null
2024-08-01 MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection Youjia Fu et.al. 2408.00438 null
2024-08-01 High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior Wencheng Han et.al. 2408.00361 null
2024-07-31 Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching Pengjie Zhang et.al. 2407.21735 null
2024-07-29 BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation Kieran Saunders et.al. 2407.20437 null
2024-07-29 Analysis and Improvement of Rank-Ordered Mean Algorithm in Single-Photon LiDAR William C. Yau et.al. 2407.20399 null
2024-07-29 Improving 2D Feature Representations by 3D-Aware Fine-Tuning Yuanwen Yue et.al. 2407.20229 null
2024-07-27 Revisit Self-supervised Depth Estimation with Local Structure-from-Motion Shengjie Zhu et.al. 2407.19166 null
2024-07-27 RePLAy: Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry Shengjie Zhu et.al. 2407.19154 null
2024-07-26 HybridDepth: Robust Depth Fusion for Mobile AR by Leveraging Depth from Focus and Single-Image Priors Ashkan Ganj et.al. 2407.18443 link
2024-07-26 Enhanced Depth Estimation and 3D Geometry Reconstruction using Bayesian Helmholtz Stereopsis with Belief Propagation Razieh Azizi et.al. 2407.18195 null
2024-07-25 BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation Xiang Zhang et.al. 2407.17952 null
2024-07-25 UMono: Physical Model Informed Hybrid CNN-Transformer Framework for Underwater Monocular Depth Estimation Jian Wang et.al. 2407.17838 null
2024-07-24 DarSwin-Unet: Distortion Aware Encoder-Decoder Architecture Akshaya Athwale et.al. 2407.17328 null
2024-07-24 Physical Adversarial Attack on Monocular Depth Estimation via Shape-Varying Patches Chenxing Zhao et.al. 2407.17312 null
2024-07-23 SINDER: Repairing the Singular Defects of DINOv2 Haoqi Wang et.al. 2407.16826 link
2024-07-23 Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions Fabio Tosi et.al. 2407.16698 link
2024-07-23 ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation Zhenhua Wu et.al. 2407.16508 null
2024-07-19 Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation Jinfeng Liu et.al. 2407.14126 link
2024-07-18 Unveiling the purely young star formation history of the SMC's northeastern shell from colour-magnitude diagram fitting Joanna D. Sakowska et.al. 2407.13876 null
2024-07-18 Many Perception Tasks are Highly Redundant Functions of their Input Data Rahul Ramesh et.al. 2407.13841 null
2024-07-18 Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks Antoni Kowalczuk et.al. 2407.12588 link
2024-07-16 Temporally Consistent Stereo Matching Jiaxi Zeng et.al. 2407.11950 link
2024-07-15 IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation Yuanhao Zhai et.al. 2407.10937 link
2024-07-15 OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection Jinghua Hou et.al. 2407.10753 link
2024-07-15 Towards Scale-Aware Full Surround Monodepth with Transformers Yuchen Yang et.al. 2407.10406 null
2024-07-12 ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion Sungmin Woo et.al. 2407.09303 link
2024-07-11 ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation Ruijie Zhu et.al. 2407.08187 link
2024-07-10 Controlling Space and Time with Diffusion Models Daniel Watson et.al. 2407.07860 null
2024-07-07 SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning Yi Feng et.al. 2407.05283 link
2024-07-05 A Physical Model-Guided Framework for Underwater Image Enhancement and Depth Estimation Dazhao Du et.al. 2407.04230 null
2024-07-04 Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation Laiyan Ding et.al. 2407.04041 null
2024-07-02 Parametric Modeling and Estimation of Photon Registrations for 3D Imaging Weijian Zhang et.al. 2407.02712 null
2024-07-02 Depth-Aware Endoscopic Video Inpainting Francis Xiatian Zhang et.al. 2407.02675 link
2024-07-04 Camera-LiDAR Cross-modality Gait Recognition Wenxuan Guo et.al. 2407.02038 null
2024-07-07 CaFNet: A Confidence-Driven Framework for Radar Camera Depth Estimation Huawei Sun et.al. 2407.00697 link
2024-06-28 Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey Uchitha Rajapaksha et.al. 2406.19675 null
2024-07-05 360 in the Wild: Dataset for Depth Prediction and View Synthesis Kibaek Park et.al. 2406.18898 null
2024-06-27 Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach Yuxiang Huang et.al. 2406.18837 null
2024-06-26 DoubleTake: Geometry Guided Depth Estimation Mohamed Sayed et.al. 2406.18387 null
2024-06-25 Depth-Guided Semi-Supervised Instance Segmentation Xin Chen et.al. 2406.17413 null
2024-06-20 Uncertainty and Self-Supervision in Single-View Depth Javier Rodriguez-Puigvert et.al. 2406.14226 null
2024-06-19 WaterMono: Teacher-Guided Anomaly Masking and Enhancement Boosting for Robust Underwater Self-Supervised Monocular Depth Estimation Yilin Ding et.al. 2406.13344 link
2024-06-18 Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation Ning-Hsu Wang et.al. 2406.12849 null
2024-06-21 GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models Yongtao Ge et.al. 2406.12671 link
2024-06-17 DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features Letian Wang et.al. 2406.12095 null
2024-06-17 MEDeA: Multi-view Efficient Depth Adjustment Mikhail Artemyev et.al. 2406.12048 null
2024-06-16 3D Gaze Tracking for Studying Collaborative Interactions in Mixed-Reality Environments Eduardo Davalos et.al. 2406.11003 null
2024-06-15 GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR Bharat Singh et.al. 2406.10722 null
2024-06-14 The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences Bria Long et.al. 2406.10447 null
2024-06-14 D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video Moritz Kappel et.al. 2406.10078 null
2024-06-14 DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications Li Li et.al. 2406.10068 link
2024-06-14 Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion Runze Liu et.al. 2406.09782 null
2024-06-13 Depth Anything V2 Lihe Yang et.al. 2406.09414 null
2024-06-14 WonderWorld: Interactive 3D Scene Generation from a Single Image Hong-Xing Yu et.al. 2406.09394 null
2024-06-13 Scale-Invariant Monocular Depth Estimation via SSI Depth S. Mahdi H. Miangoleh et.al. 2406.09374 null
2024-06-13 Multiple Prior Representation Learning for Self-Supervised Monocular Depth Estimation via Hybrid Transformer Guodong Sun et.al. 2406.08928 link
2024-06-13 ToSA: Token Selective Attention for Efficient Vision Transformers Manish Kumar Singh et.al. 2406.08816 null
2024-06-11 Back to the Color: Learning Depth to Specific Color Transformation for Unsupervised Depth Estimation Yufan Zhu et.al. 2406.07741 link
2024-06-11 PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow Joshua Tokarsky et.al. 2406.07667 null
2024-06-11 RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks Zhechao Wang et.al. 2406.07032 null
2024-06-10 PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation Zhenyu Li et.al. 2406.06679 null
2024-06-09 Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks Zhiyuan Cheng et.al. 2406.05857 link
2024-06-09 RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering Rui Zhang et.al. 2406.05852 null
2024-06-07 Normal-guided Detail-Preserving Neural Implicit Functions for High-Fidelity 3D Surface Reconstruction Aarya Patel et.al. 2406.04861 null
2024-06-07 UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection Yuchao Wang et.al. 2406.04647 null
2024-06-06 MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation Ionuţ Grigore et.al. 2406.04532 null
2024-06-06 Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image Stanislaw Szymanowicz et.al. 2406.04343 null
2024-06-06 Neural Surface Reconstruction from Sparse Views Using Epipolar Geometry Kaichen Zhou et.al. 2406.04301 null
2024-06-04 VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors Markus Plack et.al. 2406.02552 null
2024-06-03 L-MAGIC: Language Model Assisted Generation of Images with Coherence Zhipeng Cai et.al. 2406.01843 link
2024-06-04 Learning Temporally Consistent Video Depth from Video Diffusion Priors Jiahao Shao et.al. 2406.01493 null
2024-06-03 Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry Takayuki Kanai et.al. 2406.00929 null
2024-06-01 MoDGS: Dynamic Gaussian Splatting from Causually-captured Monocular Videos Qingming Liu et.al. 2406.00434 null
2024-05-30 Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian Wei Sun et.al. 2405.19657 null
2024-05-28 Hybrid Multi-Head Physics-informed Neural Network for Depth Estimation in Terahertz Imaging Mingjun Xiang et.al. 2405.18317 null
2024-05-27 Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation Amir El-Ghoussani et.al. 2405.17704 null
2024-05-27 Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving Shaoyuan Xie et.al. 2405.17426 link
2024-05-27 All-day Depth Completion Vadim Ezhov et.al. 2405.17315 null
2024-05-27 GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping Junyoung Seo et.al. 2405.17251 null
2024-05-27 SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing Yong-Qiang Mao et.al. 2405.17140 null
2024-05-27 DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge Yifan Mao et.al. 2405.17102 null
2024-05-27 Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation Steven Landgraf et.al. 2405.17097 null
2024-05-27 DCPI-Depth: Explicitly Infusing Dense Correspondence Prior to Unsupervised Monocular Depth Estimation Mengtan Zhang et.al. 2405.16960 null
2024-05-27 ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection Ziying Song et.al. 2405.16873 null
2024-05-27 Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations Jingguo Liu et.al. 2405.16858 null
2024-05-26 Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians Erik Sandström et.al. 2405.16544 null
2024-05-24 Transparent Object Depth Completion Yifan Zhou et.al. 2405.15299 null
2024-05-24 MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method Pan Liao et.al. 2405.15176 null
2024-05-23 EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian Splatting Jiaxu Wang et.al. 2405.14959 link
2024-05-23 Ghost-Stereo: GhostNet-based Cost Volume Enhancement and Aggregation for Stereo Matching Networks Xingguang Jiang et.al. 2405.14520 null
2024-05-23 Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning Zhenyu Wei et.al. 2405.14195 null
2024-05-21 Cross-spectral Gated-RGB Stereo Depth Estimation Samuel Brucker et.al. 2405.12759 null
2024-05-20 Depth Reconstruction with Neural Signed Distance Fields in Structured Light Systems Rukun Qiao et.al. 2405.12006 null
2024-05-20 Depth Prompting for Sensor-Agnostic Depth Estimation Jin-Hwi Park et.al. 2405.11867 null
2024-05-19 CRF360D: Monocular 360 Depth Estimation via Spherical Fully-Connected CRFs Zidong Cao et.al. 2405.11564 null
2024-05-18 Dusk Till Dawn: Self-supervised Nighttime Stereo Depth Estimation using Visual Foundation Models Madhu Vankadari et.al. 2405.11158 link
2024-05-17 FA-Depth: Toward Fast and Accurate Self-supervised Monocular Depth Estimation Fei Wang et.al. 2405.10885 link
2024-05-17 Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory Jonas Kälble et.al. 2405.10575 link
2024-05-16 Towards Task-Compatible Compressible Representations Anderson de Andrade et.al. 2405.10244 link
2024-05-16 KPNDepth: Depth Estimation of Lane Images under Complex Rainy Environment Zhengxu Shi et.al. 2405.09964 null
2024-05-14 CLIP with Quality Captions: A Strong Pretraining for Vision Tasks Pavan Kumar Anasosalu Vasu et.al. 2405.08911 null

(back to top)

Semactic Segmentation

Publish Date Title Authors PDF Code
2024-11-21 Revisiting the Integration of Convolution and Attention for Vision Backbone Lei Zhu et.al. 2411.14429 link
2024-11-21 CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation Lin Sun et.al. 2411.13836 link
2024-11-21 Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals Hussni Mohd Zakir et.al. 2411.13774 null
2024-11-20 FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting Ola Shorinwa et.al. 2411.13753 null
2024-11-20 BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation Umamaheswaran Raman Kumar et.al. 2411.13251 null
2024-11-20 XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation Ziyi Wang et.al. 2411.13243 link
2024-11-20 Automating Sonologists USG Commands with AI and Voice Interface Emad Mohamed et.al. 2411.13006 null
2024-11-19 A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation Jiaqi Yang et.al. 2411.12615 link
2024-11-19 SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation Ron Keuth et.al. 2411.12602 link
2024-11-19 ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator Xiao Jiang et.al. 2411.12250 null
2024-11-18 ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements M. Arda Aydın et.al. 2411.12044 link
2024-11-18 Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation Hanieh Shojaei Miandashti et.al. 2411.11935 null
2024-11-18 MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models Harshita Sharma et.al. 2411.11362 null
2024-11-18 Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications Scarlett Raine et.al. 2411.11287 null
2024-11-16 Attention-based U-Net Method for Autonomous Lane Detection Mohammadhamed Tangestanizadeh et.al. 2411.10902 null
2024-11-16 Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation Jaisidh Singh et.al. 2411.10845 null
2024-11-19 Diffusion-Based Semantic Segmentation of Lumbar Spine MRI Scans of Lower Back Pain Patients Maria Monzon et.al. 2411.10755 link
2024-11-15 Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images Ammar Qammaz et.al. 2411.10334 null
2024-11-15 CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation Dengke Zhang et.al. 2411.10086 null
2024-11-14 OneNet: A Channel-Wise 1D Convolutional U-Net Sanghyun Byun et.al. 2411.09838 link
2024-11-14 Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks Zengyi Yang et.al. 2411.09387 null
2024-11-14 Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation Yuheng Shi et.al. 2411.09219 link
2024-11-14 Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery Ashim Dahal et.al. 2411.09101 link
2024-11-13 CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation Xuming Zhang et.al. 2411.09023 null
2024-11-14 Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation Yangyang Li et.al. 2411.08756 null
2024-11-13 Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model Jun Xie et.al. 2411.08592 null
2024-11-12 Isometric Transformations for Image Augmentation in Mueller Matrix Polarimetry Christopher Hahne et.al. 2411.07918 link
2024-11-12 Semantic segmentation on multi-resolution optical and microwave data using deep learning Jai G Singla et.al. 2411.07581 null
2024-11-11 SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation Jiale Chen et.al. 2411.06991 null
2024-11-14 Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision Yueyang Cang et.al. 2411.06727 null
2024-11-10 Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments Deegan Atha et.al. 2411.06632 null
2024-11-09 Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing Kaixuan Lu et.al. 2411.06091 null
2024-11-08 Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model Shuchang Lyu et.al. 2411.05878 link
2024-11-08 Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation Sien Li et.al. 2411.05307 link
2024-11-07 In the Era of Prompt Learning with Vision-Language Models Ankit Jha et.al. 2411.04892 null
2024-11-11 ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset Olaf Wysocki et.al. 2411.04865 link
2024-11-06 Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts Zhitong Gao et.al. 2411.03829 link
2024-11-06 Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model Yansong Qu et.al. 2411.03672 null
2024-11-05 Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation Zhiling Yue et.al. 2411.03551 null
2024-11-05 SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture Andrew Heschl et.al. 2411.03505 link
2024-11-05 Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need Qishuai Wen et.al. 2411.03033 link
2024-11-05 Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation Xavier Timoneda et.al. 2411.02969 null
2024-11-05 Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery Mohammad Kakooei et.al. 2411.02935 null
2024-11-05 CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation Jinchao Ge et.al. 2411.02715 null
2024-11-04 Deep Learning on 3D Semantic Segmentation: A Detailed Review Thodoris Betsas et.al. 2411.02104 null
2024-11-04 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925 null
2024-11-04 DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability Bo Gao et.al. 2411.01819 null
2024-11-04 Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations Thanh Nguyen Canh et.al. 2411.01816 null
2024-11-03 PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation Xinyu Xu et.al. 2411.01624 null
2024-11-01 Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions Lixiao Yang et.al. 2411.01039 null
2024-11-01 Event-guided Low-light Video Semantic Segmentation Zhen Yao et.al. 2411.00639 null
2024-11-01 Cross-modal semantic segmentation for indoor environmental perception using single-chip millimeter-wave radar raw data Hairuo Hu et.al. 2411.00499 null
2024-11-01 Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing Naufal Suryanto et.al. 2411.00425 link
2024-10-31 A Recipe for Geometry-Aware 3D Mesh Transformers Mohammad Farazi et.al. 2411.00164 null
2024-10-31 Federated Black-Box Adaptation for Semantic Segmentation Jay N. Paranjape et.al. 2410.24181 null
2024-10-31 COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes Muhammad Ali et.al. 2410.24139 link
2024-10-31 Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model Hao Zhang et.al. 2410.23905 link
2024-10-30 S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving Maciej K. Wozniak et.al. 2410.23085 null
2024-10-31 CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation Ziyang Gong et.al. 2410.22629 link
2024-10-29 Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation Zhaochong An et.al. 2410.22489 null
2024-10-29 Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation Jintao Tong et.al. 2410.22135 null
2024-10-29 Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models Imad Ali Shah et.al. 2410.22101 null
2024-10-29 Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation Ruihao Xia et.al. 2410.21708 link
2024-10-28 Domain Adaptation with a Single Vision-Language Embedding Mohammad Fahes et.al. 2410.21361 null
2024-10-28 IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks Manjunath D et.al. 2410.20953 null
2024-10-27 A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models Camilo Espinosa-Curilem et.al. 2410.20595 link
2024-10-27 Unlocking Comics: The AI4VA Dataset for Visual Understanding Peter Grönquist et.al. 2410.20459 link
2024-10-27 Historical Test-time Prompt Tuning for Vision Foundation Models Jingyi Zhang et.al. 2410.20346 null
2024-10-25 OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery Philipe Dias et.al. 2410.19965 null
2024-10-25 IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation Kaixian Qu et.al. 2410.19697 null
2024-10-25 Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation Yao Wu et.al. 2410.19446 link
2024-10-25 Context-Based Visual-Language Place Recognition Soojin Woo et.al. 2410.19341 link
2024-10-24 Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks Alexander Jaus et.al. 2410.18684 null
2024-10-24 Unsupervised semantic segmentation of urban high-density multispectral point clouds Oona Oinonen et.al. 2410.18520 null
2024-10-26 CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator Stefanos Pasios et.al. 2410.18238 null
2024-10-23 Towards Safer Planetary Exploration: A Hybrid Architecture for Terrain Traversability Analysis in Mars Rovers Achille Chiuchiarelli et.al. 2410.17738 null
2024-10-22 EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding Zhiyi Pan et.al. 2410.17207 null
2024-10-22 SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments Jumman Hossain et.al. 2410.16686 null
2024-10-21 TIPS: Text-Image Pretraining with Spatial Awareness Kevis-Kokitsi Maninis et.al. 2410.16512 null
2024-10-21 GenGMM: Generalized Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation Nazanin Moradinasab et.al. 2410.16485 null
2024-10-21 LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training Thomas Kreutz et.al. 2410.15833 link
2024-10-21 TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight Hyun-Kurl Jang et.al. 2410.15674 link
2024-10-21 Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications Jintao Ren et.al. 2410.15584 null
2024-10-22 Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation Fnu Neha et.al. 2410.15472 null
2024-10-18 On the Influence of Shape, Texture and Color for Learning Semantic Segmentation Annika Mütze et.al. 2410.14878 null
2024-10-18 Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+ Arpan Mahara et.al. 2410.14836 null
2024-10-17 ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding Guangda Ji et.al. 2410.13924 null
2024-10-17 Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks Clément Playout et.al. 2410.13822 link
2024-10-22 EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything Joonhyeon Song et.al. 2410.13621 link
2024-10-17 Day-Night Adaptation: An Innovative Source-free Adaptation Framework for Medical Image Segmentation Ziyang Chen et.al. 2410.13472 null
2024-10-17 SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation in Remote Sensing Bin Wang et.al. 2410.13471 link
2024-10-17 Railway LiDAR semantic segmentation based on intelligent semi-automated data annotation Florian Wulff et.al. 2410.13383 null
2024-10-17 Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation Houze Liu et.al. 2410.13099 null
2024-10-16 Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation Wenbo Xu et.al. 2410.13094 null
2024-10-16 Risk Assessment for Autonomous Landing in Urban Environments using Semantic Segmentation Jesús Alejandro Loera-Ponce et.al. 2410.12988 null
2024-10-16 VividMed: Vision Language Model with Versatile Visual Grounding for Medicine Lingxiao Luo et.al. 2410.12694 link
2024-10-16 Cascade learning in multi-task encoder-decoder networks for concurrent bone segmentation and glenohumeral joint assessment in shoulder CT scans Luca Marsilio et.al. 2410.12641 null
2024-10-16 SAM-Guided Masked Token Prediction for 3D Scene Understanding Zhimin Chen et.al. 2410.12158 null
2024-10-15 WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation Chenghao Qian et.al. 2410.12075 null
2024-10-15 Development and Testing of a Wood Panels Bark Removal Equipment Based on Deep Learning Rijun Wang et.al. 2410.11913 null
2024-10-15 RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation Anton Antonov et.al. 2410.11722 link
2024-10-15 InvSeg: Test-Time Prompt Inversion for Semantic Segmentation Jiayi Lin et.al. 2410.11473 null
2024-10-15 MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation Xianping Ma et.al. 2410.11160 link
2024-10-14 Locality Alignment Improves Vision-Language Models Ian Covert et.al. 2410.11087 null
2024-10-14 Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes Tim Broedermann et.al. 2410.10791 null
2024-10-14 UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation Lihe Yang et.al. 2410.10777 link
2024-10-14 Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation Daniel Fusaro et.al. 2410.10510 link
2024-10-14 LKASeg:Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections Xuezhi Xiang et.al. 2410.10433 null
2024-10-14 V2M: Visual 2-Dimensional Mamba for Image Representation Learning Chengkun Wang et.al. 2410.10382 link
2024-10-14 GlobalMamba: Global Image Serialization for Vision Mamba Chengkun Wang et.al. 2410.10316 link
2024-10-13 AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model Yuchen Li et.al. 2410.09714 null
2024-10-12 An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation Wei Liang et.al. 2410.09443 null
2024-10-11 Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation Varduhi Yeghiazaryan et.al. 2410.08946 null
2024-10-11 Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation Hanieh Shojaei et.al. 2410.08687 null
2024-10-11 DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention Nguyen Huu Bao Long et.al. 2410.08582 link
2024-10-10 Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? Samir Abou Haidar et.al. 2410.08365 null
2024-10-10 Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation Zhiyi Pan et.al. 2410.08091 null
2024-10-10 Shift and matching queries for video semantic segmentation Tsubasa Mizuno et.al. 2410.07635 null
2024-10-10 3D Vision-Language Gaussian Splatting Qucheng Peng et.al. 2410.07577 null
2024-10-11 Bridge the Points: Graph-based Few-shot Segment Anything Semantically Anqi Zhang et.al. 2410.06964 null
2024-10-09 Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation Seungho Lee et.al. 2410.06893 null
2024-10-09 Rethinking the Evaluation of Visible and Infrared Image Fusion Dayan Guan et.al. 2410.06811 link
2024-10-10 QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model Fei Xie et.al. 2410.06806 link
2024-10-09 Transesophageal Echocardiography Generation using Anatomical Models Emmanuel Oladokun et.al. 2410.06781 null
2024-10-09 Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy Qinfeng Zhu et.al. 2410.06725 null
2024-10-09 Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments Meng Yu et.al. 2410.06626 null
2024-10-09 Towards Natural Image Matting in the Wild via Real-Scenario Prior Ruihao Xia et.al. 2410.06593 link
2024-10-08 Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions Mateus Karvat et.al. 2410.06380 null
2024-10-08 Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading Fang Gao et.al. 2410.05762 null
2024-10-07 Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation Vince Zhu et.al. 2410.04689 null
2024-10-04 SpecSAR-Former: A Lightweight Transformer-based Network for Global LULC Mapping Using Integrated Sentinel-1 and Sentinel-2 Hao Yu et.al. 2410.03962 null
2024-10-04 Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features Benyuan Meng et.al. 2410.03558 link
2024-10-04 Semantic Segmentation Based Quality Control of Histopathology Whole Slide Images Abhijeet Patil et.al. 2410.03289 link
2024-10-04 HRVMamba: High-Resolution Visual State Space Model for Dense Prediction Hao Zhang et.al. 2410.03174 null
2024-10-03 HiFiSeg: High-Frequency Information Enhanced Polyp Segmentation with Global-Local Vision Transformer Jingjing Ren et.al. 2410.02528 null
2024-10-04 Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation Muzhi Zhu et.al. 2410.02369 null
2024-10-03 RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds Remco Royen et.al. 2410.02323 null
2024-10-03 Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network Yangyang Qiu et.al. 2410.02224 null
2024-10-03 Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images Qingyuan Liu et.al. 2410.02207 null
2024-10-02 SegEarth-OV: Towards Traning-Free Open-Vocabulary Segmentation for Remote Sensing Images Kaiyu Li et.al. 2410.01768 link
2024-10-02 One-Shot Robust Imitation Learning for Long-Horizon Visuomotor Tasks from Unsegmented Demonstrations Shaokang Wu et.al. 2410.01630 null
2024-10-02 Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation Zhaofeng Shi et.al. 2410.01341 null
2024-10-02 VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings Andrea Carrara et.al. 2410.01336 null
2024-10-01 RobustEMD: Domain Robust Matching for Cross-domain Few-shot Medical Image Segmentation Yazhou Zhu et.al. 2410.01110 null
2024-10-01 Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer Vlatko Spasev et.al. 2410.01092 null
2024-10-01 Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time Chiao-An Yang et.al. 2410.01083 link
2024-10-01 DeepAerialMapper: Deep Learning-based Semi-automatic HD Map Creation for Highly Automated Vehicles Robert Krajewski et.al. 2410.00769 null
2024-10-01 Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection Pengxi Zeng et.al. 2410.00582 null
2024-10-01 Precise Workcell Sketching from Point Clouds Using an AR Toolbox Krzysztof Zieliński et.al. 2410.00479 null
2024-09-30 AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation Boyu Han et.al. 2409.20398 null
2024-09-30 Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation Tillmann Rheude et.al. 2409.20287 link
2024-09-30 Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model Fulong Ma et.al. 2409.20164 null
2024-09-30 Segmenting Wood Rot using Computer Vision Models Roland Kammerbauer et.al. 2409.20137 null
2024-09-30 Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels Heeseong Shin et.al. 2409.19846 null
2024-09-27 Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation Raphael Hagmanns et.al. 2409.18788 null
2024-09-27 Learning from Pattern Completion: Self-supervised Controllable Generation Zhiqiang Chen et.al. 2409.18694 link
2024-09-27 Reducing Semantic Ambiguity In Domain Adaptive Semantic Segmentation Via Probabilistic Prototypical Pixel Contrast Xiaoke Hao et.al. 2409.18543 link
2024-10-01 Get It For Free: Radar Segmentation without Expert Labels and Its Application in Odometry and Localization Siru Li et.al. 2409.18434 null
2024-09-26 Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning Siyi Lu et.al. 2409.17659 null
2024-09-26 Global-Local Medical SAM Adaptor Based on Full Adaption Meng Wang et.al. 2409.17486 null
2024-09-25 VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection Liangyu Zhong et.al. 2409.17330 null
2024-09-25 2024 BRAVO Challenge Track 1 1st Place Report: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation Tommie Kerssies et.al. 2409.17208 link
2024-09-25 WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks Alberto Bacchin et.al. 2409.16999 link
2024-09-25 Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis Illia Tsiporenko et.al. 2409.16940 null
2024-09-24 A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation Avisha Kumar et.al. 2409.16441 null
2024-09-24 Instance Segmentation of Reinforced Concrete Bridges with Synthetic Point Clouds Asad Ur Rahman et.al. 2409.16381 null
2024-09-24 Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation Hannah Kerner et.al. 2409.16252 link
2024-09-24 Deep Learning for Precision Agriculture: Post-Spraying Evaluation and Deposition Estimation Harry Rogers et.al. 2409.16213 link
2024-09-24 Potential Field as Scene Affordance for Behavior Change-Based Visual Risk Object Identification Pang-Yuan Pao et.al. 2409.15846 null
2024-09-24 DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation Soojin Jang et.al. 2409.15801 null
2024-09-24 Autonomous Hiking Trail Navigation via Semantic Segmentation and Geometric Analysis Camndon Reed et.al. 2409.15671 null
2024-09-23 ZeroSCD: Zero-Shot Street Scene Change Detection Shyam Sundar Kannan et.al. 2409.15255 null
2024-09-17 Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks Edgar Heinert et.al. 2409.11373 null
2024-09-17 MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping Amirreza Fateh et.al. 2409.11316 link
2024-09-17 Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark Clifford Broni-Bediako et.al. 2409.11227 link
2024-09-17 HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios Nick Theisen et.al. 2409.11205 link
2024-09-16 Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning Amin Karimi Monsefi et.al. 2409.10362 null
2024-09-16 BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images Wentao Wang et.al. 2409.10269 null
2024-09-15 Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation Zhanteng Xie et.al. 2409.09899 null
2024-09-15 Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation Qilong Zhangli et.al. 2409.09893 null
2024-09-15 High Definition Map Mapping and Update: A General Overview and Future Directions Benny Wijaya et.al. 2409.09726 null
2024-09-14 Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation Hugo Porta et.al. 2409.09497 null
2024-09-13 AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation Zechao Sun et.al. 2409.08516 null
2024-09-13 VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation Ezra MacDonald et.al. 2409.08461 link
2024-09-12 Bayesian Self-Training for Semi-Supervised 3D Segmentation Ozan Unal et.al. 2409.08102 null
2024-09-12 Depth Matters: Exploring Deep Interactions of RGB-D for Semantic Segmentation in Traffic Scenes Siyu Chen et.al. 2409.07995 null
2024-09-12 SURGIVID: Annotation-Efficient Surgical Video Object Discovery Çağhan Köksal et.al. 2409.07801 null
2024-09-12 Lagrange Duality and Compound Multi-Attention Transformer for Semi-Supervised Medical Image Segmentation Fuchen Zheng et.al. 2409.07793 link
2024-09-12 ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation Fuchen Zheng et.al. 2409.07779 link
2024-09-12 Open-Vocabulary Remote Sensing Image Semantic Segmentation Qinglong Cao et.al. 2409.07683 null
2024-09-11 Token Turing Machines are Efficient Vision Models Purvish Jajal et.al. 2409.07613 null
2024-09-11 AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution Wangduo Xie et.al. 2409.07171 null
2024-09-11 Brain-Inspired Stepwise Patch Merging for Vision Transformers Yonghao Yu et.al. 2409.06963 null
2024-09-10 Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds Mu Cai et.al. 2409.06827 link
2024-09-10 A Semantic Segmentation Approach on Sweet Orange Leaf Diseases Detection Utilizing YOLO Sabit Ahamed Preanto et.al. 2409.06671 null
2024-09-10 PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation Yin Hu et.al. 2409.06309 null
2024-09-10 EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation Nischal Khanal et.al. 2409.06183 link
2024-09-09 SVS-GAN: Leveraging GANs for Semantic Video Synthesis Khaled M. Seyam et.al. 2409.06074 null
2024-09-09 Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance Quang-Huy Che et.al. 2409.06002 null
2024-09-09 Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features Jacob Gildenblat et.al. 2409.05697 null
2024-09-09 ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions Furqan Ahmed Shaik et.al. 2409.05327 null
2024-09-08 RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network Zhiwei Lin et.al. 2409.04979 null
2024-09-06 Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation Björn Michele et.al. 2409.04409 link
2024-09-05 Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution Marga Don et.al. 2409.03754 link
2024-09-05 LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones Moritz Nottebaum et.al. 2409.03460 link
2024-09-05 Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications Tong Bu et.al. 2409.03368 null
2024-09-05 UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking Md. Mahfuzur Rahman et.al. 2409.03245 null
2024-09-05 Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation Xixi Jiang et.al. 2409.03228 link
2024-09-06 iSeg: An Iterative Refinement-based Framework for Training-free Segmentation Lin Sun et.al. 2409.03209 link
2024-09-04 iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation Hayeon Jo et.al. 2409.02838 null
2024-09-04 CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation Minhee Cho et.al. 2409.02699 null
2024-09-04 SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction Sumin Son et.al. 2409.02513 null
2024-09-03 K-Origins: Better Colour Quantification for Neural Networks Lewis Mason et.al. 2409.02281 link
2024-09-03 AllWeatherNet:Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions Chenghao Qian et.al. 2409.02045 null
2024-09-03 Segmenting Object Affordances: Reproducibility and Sensitivity to Scale Tommaso Apicella et.al. 2409.01814 link
2024-09-03 Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation Haodong Wang et.al. 2409.01662 null
2024-09-02 Semantic Segmentation from Image Labels by Reconstruction from Structured Decomposition Xuanrui Zeng et.al. 2409.01472 link
2024-09-02 SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation Alberto Bacchin et.al. 2409.01109 link
2024-09-02 Towards Robust Online Domain Adaptive Semantic Segmentation under Adverse Weather Conditions Taorong Liu et.al. 2409.01072 null
2024-08-30 Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes Li Zhang et.al. 2408.17421 link
2024-08-30 Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations Ahmed Hammam et.al. 2408.17311 null
2024-08-30 Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training Zizheng Huang et.al. 2408.17081 link
2024-08-30 Transient Fault Tolerant Semantic Segmentation for Autonomous Driving Leonardo Iurada et.al. 2408.16952 link
2024-08-29 SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection Rohit Venkata Sai Dulam et.al. 2408.16645 null
2024-08-29 MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation Linyan Yang et.al. 2408.16478 null
2024-08-29 Multi-source Domain Adaptation for Panoramic Semantic Segmentation Jing Jiang et.al. 2408.16469 null
2024-08-29 EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More Kanghao Chen et.al. 2408.16254 null
2024-08-28 SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors Zhiqing Zhang et.al. 2408.15887 null
2024-08-28 DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries Yu Yang et.al. 2408.15813 null
2024-08-28 TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation Junbao Zhou et.al. 2408.15657 link
2024-08-27 Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images Silvia Seidlitz et.al. 2408.15373 link
2024-08-27 An Investigation on The Position Encoding in Vision-Based Dynamics Prediction Jiageng Zhu et.al. 2408.15201 null
2024-08-27 Applying ViT in Generalized Few-shot Semantic Segmentation Liyuan Geng et.al. 2408.14957 link
2024-08-27 Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack Naufal Suryanto et.al. 2408.14879 null
2024-08-27 MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation Yuanbing Zhu et.al. 2408.14776 null
2024-08-26 Physically Feasible Semantic Segmentation Shamik Basu et.al. 2408.14672 link
2024-08-25 OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation Muhammad Rameez ur Rahman et.al. 2408.13936 link
2024-08-25 Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation Yuwen Pan et.al. 2408.13838 null
2024-08-25 TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather Xiongwei Zhao et.al. 2408.13802 link
2024-08-25 ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation Xin Zhang et.al. 2408.13771 null
2024-08-25 Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation Zhaoyang Li et.al. 2408.13752 null
2024-08-24 ESA: Annotation-Efficient Active Learning for Semantic Segmentation Jinchao Ge et.al. 2408.13491 link
2024-08-23 Accuracy Improvement of Cell Image Segmentation Using Feedback Former Hinako Mitsuoka et.al. 2408.12974 null
2024-08-23 Image Segmentation in Foundation Model Era: A Survey Tianfei Zhou et.al. 2408.12957 null
2024-08-23 Symmetric masking strategy enhances the performance of Masked Image Modeling Khanh-Binh Nguyen et.al. 2408.12772 null
2024-08-22 Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets Wolfgang Boettcher et.al. 2408.12489 null
2024-08-22 The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation Tuyen Tran et.al. 2408.12447 null
2024-08-21 UNetMamba: Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images Enze Zhu et.al. 2408.11545 null
2024-08-21 Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation Chuandong Liu et.al. 2408.11280 null
2024-08-20 NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency Valentinos Pariza et.al. 2408.11054 null
2024-08-20 CO2Wounds-V2: Extended Chronic Wounds Dataset From Leprosy Patients Karen Sanchez et.al. 2408.10827 null
2024-08-20 Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended? Chen Liang et.al. 2408.10627 null
2024-08-20 Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation Jiawei Han et.al. 2408.10537 link
2024-08-19 Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network Rasha Alshawi et.al. 2408.10181 null
2024-08-19 Dynamic Label Injection for Imbalanced Industrial Defect Segmentation Emanuele Caruso et.al. 2408.10031 link
2024-08-19 Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis Kira Maag et.al. 2408.10021 null
2024-08-19 Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving Jun Yan et.al. 2408.09839 link
2024-08-18 OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras Muhammad Rameez Ur Rahman et.al. 2408.09424 link
2024-08-18 Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration Hao Ai et.al. 2408.09336 null
2024-08-17 Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology Junchao Zhu et.al. 2408.09278 link
2024-08-17 GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation Weiming Zhang et.al. 2408.09115 null
2024-08-17 Depth-guided Texture Diffusion for Image Semantic Segmentation Wei Sun et.al. 2408.09097 null
2024-08-15 5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks Dongshuo Yin et.al. 2408.08345 link
2024-08-14 MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis Nimeesha Chan et.al. 2408.07773 link
2024-08-15 MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation Beoungwoo Kang et.al. 2408.07576 link
2024-08-15 MagicFace: Training-free Universal-Style Human Image Customized Synthesis Yibin Wang et.al. 2408.07433 null
2024-08-14 Segment Using Just One Example Pratik Vora et.al. 2408.07393 null
2024-08-14 Ensemble architecture in polyp segmentation Hao-Yun Hsu et.al. 2408.07262 link
2024-08-14 Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks Raghavendra Singh et.al. 2408.07243 null
2024-08-14 Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training Ethan Kou et.al. 2408.07239 null
2024-08-13 ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation Jingyun Wang et.al. 2408.06747 link
2024-08-10 Dilated Convolution with Learnable Spacings Ismail Khalfaoui-Hassani et.al. 2408.06383 null
2024-08-12 Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images Siladittya Manna et.al. 2408.06235 null
2024-08-12 A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting Felix Assion et.al. 2408.06071 null
2024-08-12 Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning Xinrong Hu et.al. 2408.05889 null
2024-08-11 Seg-CycleGAN : SAR-to-optical image translation guided by a downstream task Hannuo Zhang et.al. 2408.05777 null
2024-08-11 MacFormer: Semantic Segmentation with Fine Object Boundaries Guoan Xu et.al. 2408.05699 null
2024-08-10 Multimodal generative semantic communication based on latent diffusion model Weiqi Fu et.al. 2408.05455 null
2024-08-09 In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation Dahyun Kang et.al. 2408.04961 link
2024-08-09 ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation Mengcheng Lan et.al. 2408.04883 link
2024-08-09 Extracting Signal Electron Trajectories in the COMET Phase-I Cylindrical Drift Chamber Using Deep Learning Fumihiro Kaneko et.al. 2408.04795 null
2024-08-08 SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation Jieming Yu et.al. 2408.04593 null
2024-08-08 SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios Sriram Mandalika et.al. 2408.04482 null
2024-08-08 What could go wrong? Discovering and describing failure modes in computer vision Gabriela Csurka et.al. 2408.04471 null
2024-08-07 CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications Tianfang Zhang et.al. 2408.03703 link
2024-08-07 SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology Mingya Zhang et.al. 2408.03651 link
2024-08-06 Post-Mortem Human Iris Segmentation Analysis with Deep Learning Afzal Hossain et.al. 2408.03448 null
2024-08-06 Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression Jonas Schmitt et.al. 2408.03046 link
2024-08-05 Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation Sai Prasanna et.al. 2408.02297 null
2024-08-05 Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs Jeongkee Lim et.al. 2408.02261 null
2024-08-05 Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders Muhammad Abdullah Jamal et.al. 2408.02245 null
2024-08-04 Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation Ye Du et.al. 2408.02039 null
2024-08-03 Bayesian Active Learning for Semantic Segmentation Sima Didari et.al. 2408.01694 null
2024-08-03 A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection Omkar Oak et.al. 2408.01692 null
2024-08-03 Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation Balázs Opra et.al. 2408.01640 null
2024-08-02 Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans Lukas Kratochvila et.al. 2408.01526 null
2024-08-02 Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation Yuanzhi Su et.al. 2408.01356 null
2024-08-02 StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation Bingyu Li et.al. 2408.01343 null
2024-08-02 Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach Yabin Zhu et.al. 2408.00969 null
2024-08-01 Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation Siyu Jiao et.al. 2408.00744 null
2024-08-01 Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function Matias Oscar Volman Stern et.al. 2408.00707 null
2024-08-01 AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation Asbjørn Munk et.al. 2408.00640 null
2024-08-01 SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation Shengbo Tan et.al. 2408.00496 null
2024-07-31 Open-Vocabulary Audio-Visual Semantic Segmentation Ruohao Guo et.al. 2407.21721 null
2024-07-31 MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment Anurag Das et.al. 2407.21654 null
2024-07-31 Small Object Few-shot Segmentation for Vision-based Industrial Inspection Zilong Zhang et.al. 2407.21351 null
2024-07-31 On-the-fly Point Feature Representation for Point Clouds Analysis Jiangyi Wang et.al. 2407.21335 null
2024-07-31 Fine-grained Metrics for Point Cloud Semantic Segmentation Zhuheng Lu et.al. 2407.21289 null
2024-07-30 PLANesT-3D: A new annotated dataset for segmentation of 3D plant point clouds Kerem Mertoğlu et.al. 2407.21150 null
2024-07-30 Learning Ordinality in Semantic Segmentation Rafael Cristino et.al. 2407.20959 null
2024-07-29 Improving 2D Feature Representations by 3D-Aware Fine-Tuning Yuanwen Yue et.al. 2407.20229 null
2024-07-29 Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection With Sky-Annotated Dataset Yimian Dai et.al. 2407.20078 link
2024-07-29 Language-driven Grasp Detection with Mask-guided Attention Tuan Van Vo et.al. 2407.19877 null
2024-07-29 Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets Muhammad Abdullah Jamal et.al. 2407.19714 null
2024-07-29 ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement Ezequiel Perez-Zarate et.al. 2407.19708 link
2024-07-28 ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding Zhen Chen et.al. 2407.19435 link
2024-07-27 Ensembling convolutional neural networks for human skin segmentation Patryk Kuban et.al. 2407.19310 null
2024-07-27 Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network Gang Pan et.al. 2407.19271 null
2024-07-26 Sparse Refinement for Efficient High-Resolution Semantic Segmentation Zhijian Liu et.al. 2407.19014 null
2024-07-29 Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation Jingjun Yi et.al. 2407.18568 null
2024-07-25 Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception Julia Hindel et.al. 2407.18145 null
2024-07-25 TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework Guanfeng Tang et.al. 2407.18038 null
2024-07-25 Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions Jan Nikolas Morshuis et.al. 2407.18026 link
2024-07-24 Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation Hyunwoo Yu et.al. 2407.17261 link
2024-07-24 Trans2Unet: Neural fusion for Nuclei Semantic Segmentation Dinh-Phu Tran et.al. 2407.17181 null
2024-07-24 PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning Mu Chen et.al. 2407.17101 null
2024-07-25 Enhancing Environmental Monitoring through Multispectral Imaging: The WasteMS Dataset for Semantic Segmentation of Lakeside Waste Qinfeng Zhu et.al. 2407.17028 link
2024-07-24 Progressive Query Refinement Framework for Bird's-Eye-View Semantic Segmentation from Surrounding Images Dooseop Choi et.al. 2407.17003 link
2024-07-23 Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving Anam Manzoor et.al. 2407.16647 null
2024-07-23 Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging Daniela L. Ramos et.al. 2407.16608 null
2024-07-23 Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision Aditya Krishnan et.al. 2407.16102 null
2024-07-22 MILAN: Milli-Annotations for Lidar Semantic Segmentation Nermin Samet et.al. 2407.15797 null
2024-07-22 Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond Silvio Galesso et.al. 2407.15739 link
2024-07-22 MSSPlace: Multi-Sensor Place Recognition with Visual and Text Semantics Alexander Melekhin et.al. 2407.15663 link
2024-07-22 Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling Bo Yuan et.al. 2407.15429 link
2024-07-22 Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data Junha Song et.al. 2407.15383 null
2024-07-21 Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation Xiaoyang Wu et.al. 2407.15282 null
2024-07-20 Downstream-Pretext Domain Knowledge Traceback for Active Learning Beichen Zhang et.al. 2407.14720 null
2024-07-19 Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model Kun Zhao et.al. 2407.14326 null
2024-07-19 Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation Zhengyuan Xie et.al. 2407.14142 link
2024-07-19 GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation Florian Chabot et.al. 2407.14108 null
2024-07-18 Many Perception Tasks are Highly Redundant Functions of their Input Data Rahul Ramesh et.al. 2407.13841 null
2024-07-18 GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model Abdelrahman Shaker et.al. 2407.13772 link
2024-07-18 SegPoint: Segment Any Point Cloud via Large Language Model Shuting He et.al. 2407.13761 null
2024-07-18 MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis Ziming Zhong et.al. 2407.13675 link
2024-07-18 Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models Xiaoyu Zhu et.al. 2407.13642 null
2024-07-18 FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures Hao Lu et.al. 2407.13500 link
2024-07-18 FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions Sohyun Lee et.al. 2407.13437 null
2024-07-18 Lightweight Uncertainty Quantification with Simplex Semantic Segmentation for Terrain Traversability Judith Dijk et.al. 2407.13392 null
2024-07-18 Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation Chang Liu et.al. 2407.13363 null
2024-07-18 Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation Shoumeng Qiu et.al. 2407.13254 null
2024-07-18 OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird's-eye-view Vehicle Semantic Segmentation Jian Sun et.al. 2407.13137 null
2024-07-16 Mitigating Background Shift in Class-Incremental Semantic Segmentation Gilhan Park et.al. 2407.11859 link
2024-07-16 Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation Juncheng Ma et.al. 2407.11820 null
2024-07-16 XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach Truong Thanh Hung Nguyen et.al. 2407.11771 null
2024-07-16 OAM-TCD: A globally diverse dataset of high-resolution tree cover maps Josh Veitch-Michaelis et.al. 2407.11743 null
2024-07-16 SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds Yanbo Wang et.al. 2407.11569 link
2024-07-16 Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations Yunya Gao et.al. 2407.11381 link
2024-07-16 Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities Xu Zheng et.al. 2407.11351 null
2024-07-16 Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation Xu Zheng et.al. 2407.11344 null
2024-07-16 TCFormer: Visual Recognition via Token Clustering Transformer Wang Zeng et.al. 2407.11321 link
2024-07-15 Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding Danish Nazir et.al. 2407.11224 null
2024-07-15 No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations Walter Simoncini et.al. 2407.10964 link
2024-07-15 APC: Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2407.10649 null
2024-07-15 Automated Label Unification for Multi-Dataset Semantic Segmentation with GNNs Rong Ma et.al. 2407.10534 null
2024-07-14 Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data Tuo Feng et.al. 2407.10200 link
2024-07-14 RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation Li Li et.al. 2407.10159 link
2024-07-14 HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation Chengjie Jiang et.al. 2407.10047 null
2024-07-13 Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation Anqi Zhang et.al. 2407.09838 null
2024-07-13 Enhancing Semantic Segmentation with Adaptive Focal Loss: A Novel Approach Md Rakibul Islam et.al. 2407.09828 null
2024-07-13 3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance Xiaoxu Xu et.al. 2407.09826 null
2024-07-13 TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation Xiaopei Wu et.al. 2407.09751 null
2024-07-12 FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background Muhammad Ali et.al. 2407.09379 link
2024-07-12 Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy Julian Wyatt et.al. 2407.09192 null
2024-07-12 Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off Levente Halmosi et.al. 2407.09150 link
2024-07-12 Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation Wei Cong et.al. 2407.09047 null
2024-07-12 Textual Query-Driven Mask Transformer for Domain Generalized Segmentation Byeonghyun Pak et.al. 2407.09033 null
2024-07-12 Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation Zihao Li et.al. 2407.08994 null
2024-07-11 Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation Tong Shao et.al. 2407.08268 null
2024-07-11 Enrich the content of the image Using Context-Aware Copy Paste Qiushi Guo et.al. 2407.08151 null
2024-07-10 MambaVision: A Hybrid Mamba-Transformer Vision Backbone Ali Hatamizadeh et.al. 2407.08083 link
2024-07-10 Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift Elliot Vincent et.al. 2407.07616 link
2024-07-10 H-FCBFormer Hierarchical Fully Convolutional Branch Transformer for Occlusal Contact Segmentation with Articulating Paper Ryan Banks et.al. 2407.07604 link
2024-07-11 Trainable Highly-expressive Activation Functions Irit Chelly et.al. 2407.07564 null
2024-07-10 Deformable-Heatmap-Segmentation for Automobile Visual Perception Hongyu Jin et.al. 2407.07493 null
2024-07-10 Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining Tianfang Sun et.al. 2407.07465 null
2024-07-11 HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation Guoan Xu et.al. 2407.07441 null
2024-07-09 ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation Yuyuan Liu et.al. 2407.07171 link
2024-07-08 Training-free CryoET Tomogram Segmentation Yizhou Zhao et.al. 2407.06833 link
2024-07-09 CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM Aditya Murali et.al. 2407.06795 null
2024-07-09 LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration Jiayi Liu et.al. 2407.06512 link
2024-07-08 Leveraging image captions for selective whole slide image annotation Jingna Qiu et.al. 2407.06363 null
2024-07-08 Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots Siva Krishna Ravipati et.al. 2407.06077 null
2024-07-08 Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts Puzuo Wang et.al. 2407.06043 null
2024-07-08 RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation Sarah Elmahdy et.al. 2407.06016 link
2024-07-07 Semantic Segmentation for Real-World and Synthetic Vehicle's Forward-Facing Camera Images Tuan T. Nguyen et.al. 2407.05452 null
2024-07-07 Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness Idris Hamoud et.al. 2407.05448 null
2024-07-06 A Study of Test-time Contrastive Concepts for Open-world, Open-vocabulary Semantic Segmentation Monika Wysoczańska et.al. 2407.05061 null
2024-07-06 BlessemFlood21: Advancing Flood Analysis with a High-Resolution Georeferenced Dataset for Humanitarian Aid Support Vladyslav Polushko et.al. 2407.05007 null
2024-07-05 Explainable Metric Learning for Deflating Data Bias Emma Andrews et.al. 2407.04866 null
2024-07-05 LMSeg: A deep graph message-passing network for efficient and accurate semantic segmentation of large-scale 3D landscape meshes Zexian Huang et.al. 2407.04326 null
2024-07-04 Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier Prantik Howlader et.al. 2407.04036 link
2024-07-04 Relative Difficulty Distillation for Semantic Segmentation Dong Liang et.al. 2407.03719 null
2024-07-04 POSTURE: Pose Guided Unsupervised Domain Adaptation for Human Body Part Segmentation Arindam Dutta et.al. 2407.03549 null
2024-07-03 A Unified Framework for 3D Scene Understanding Wei Xu et.al. 2407.03263 null
2024-07-03 ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation Chang Li et.al. 2407.03033 null
2024-07-03 ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation Yipin Guo et.al. 2407.02881 null
2024-07-03 Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation Tao Chen et.al. 2407.02768 null
2024-07-02 Open Panoramic Segmentation Junwei Zheng et.al. 2407.02685 null
2024-07-02 Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction Tinghuai Wang et.al. 2407.02639 null
2024-07-02 Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather Junsung Park et.al. 2407.02286 link
2024-07-02 MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders Baijiong Lin et.al. 2407.02228 link
2024-07-02 Occlusion-Aware Seamless Segmentation Yihong Cao et.al. 2407.02182 link
2024-07-02 VRBiom: A New Periocular Dataset for Biometric Applications of HMD Ketan Kotwal et.al. 2407.02150 null
2024-07-02 Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts Pasquale De Marinis et.al. 2407.02075 null
2024-07-02 Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Chengchao Shen et.al. 2407.02014 link
2024-07-01 Label-free Neural Semantic Image Synthesis Jiayi Wang et.al. 2407.01790 null
2024-07-01 PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction Xuan Yu et.al. 2407.01349 null
2024-07-01 CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes Danial Qashqai et.al. 2407.01328 link
2024-06-29 SolarSAM: Building-scale Photovoltaic Potential Assessment Based on Segment Anything Model (SAM) and Remote Sensing for Emerging City Guohao Wang et.al. 2407.00296 link
2024-07-01 Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding Yifan Tang et.al. 2406.19791 null
2024-06-28 Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation Junsung Park et.al. 2406.19638 link
2024-06-28 PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation Deyi Ji et.al. 2406.19632 null
2024-06-27 Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model Haobo Yuan et.al. 2406.19369 null
2024-06-27 ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation Nazanin Moradinasab et.al. 2406.19225 null
2024-06-30 Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO Fuseini Mumuni et.al. 2406.19057 null
2024-06-27 Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation Tao Lian et.al. 2406.18809 null
2024-06-26 CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data Nikolaos Dionelis et.al. 2406.18279 null
2024-06-26 The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval Meinardus Boris et.al. 2406.18113 link
2024-06-26 Few-Shot Medical Image Segmentation with High-Fidelity Prototypes Song Tang et.al. 2406.18074 link
2024-06-25 Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation Xuming Zhang et.al. 2406.17679 null
2024-06-25 DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation Ahmad Mohammadshirazi et.al. 2406.17591 link
2024-06-25 Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation Felix Stillger et.al. 2406.17541 null
2024-06-25 Investigating Self-Supervised Methods for Label-Efficient Learning Srinivasa Rao Nandam et.al. 2406.17460 null
2024-06-25 Pseudo Labelling for Enhanced Masked Autoencoders Srinivasa Rao Nandam et.al. 2406.17450 null
2024-06-25 Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model Zhuoyuan Li et.al. 2406.17442 null
2024-06-25 Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes Qi Ma et.al. 2406.17438 link
2024-06-24 Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation Yizheng Wu et.al. 2406.16776 link
2024-06-24 μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation Pierangela Bruno et.al. 2406.16724 null
2024-06-24 GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection Harnaik Dhami et.al. 2406.16625 null
2024-06-24 LOGCAN++: Local-global class-aware network for semantic segmentation of remote sensing images Xiaowen Ma et.al. 2406.16502 link
2024-06-24 Cascade Reward Sampling for Efficient Decoding-Time Alignment Bolian Li et.al. 2406.16306 null
2024-06-24 SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments Neng Wang et.al. 2406.16279 link
2024-06-23 UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery Pengfei Zhang et.al. 2406.16129 null
2024-06-22 Fine-grained Background Representation for Weakly Supervised Semantic Segmentation Xu Yin et.al. 2406.15755 null
2024-06-20 Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery Ilham Adi Panuntun et.al. 2406.14220 null
2024-06-20 Trusting Semantic Segmentation Networks Samik Some et.al. 2406.14201 null
2024-06-20 EvSegSNN: Neuromorphic Semantic Segmentation for Event Data Dalia Hareb et.al. 2406.14178 null
2024-06-20 Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images Qinfeng Zhu et.al. 2406.14086 link
2024-06-19 Search-based DNN Testing and Retraining with GAN-enhanced Simulations Mohammed Oualid Attaoui et.al. 2406.13359 null
2024-06-19 Deep Learning-Based 3D Instance and Semantic Segmentation: A Review Siddiqui Muhammad Yasir et.al. 2406.13308 null
2024-06-18 Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation Guoyu Yang et.al. 2406.12496 link
2024-06-18 Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble Wang Liu et.al. 2406.12271 null
2024-06-17 OoDIS: Anomaly Instance Segmentation Benchmark Alexey Nekrasov et.al. 2406.11835 link
2024-06-17 Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT Maximilian E. Tschuchnig et.al. 2406.11650 null
2024-06-17 SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation Zhenchao Lin et.al. 2406.11441 link
2024-06-17 Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding Yunsong Wang et.al. 2406.11283 null
2024-06-17 Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation Bingfeng Zhang et.al. 2406.11189 null
2024-06-16 $α$ -SSC: Uncertainty-Aware Camera-based 3D Semantic Scene Completion Sanbao Su et.al. 2406.11021 null
2024-06-16 PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery Libo Wang et.al. 2406.10828 link
2024-06-15 GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR Bharat Singh et.al. 2406.10722 null
2024-06-15 A Late-Stage Bitemporal Feature Fusion Network for Semantic Change Detection Chenyao Zhou et.al. 2406.10678 link
2024-06-14 ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers Narges Norouzi et.al. 2406.09936 null
2024-06-14 Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions Aldi Piroli et.al. 2406.09906 null
2024-06-14 Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation Brunó B. Englert et.al. 2406.09896 link
2024-06-14 Open-Vocabulary Semantic Segmentation with Image Embedding Balancing Xiangheng Shan et.al. 2406.09829 link
2024-06-13 Instance-level quantitative saliency in multiple sclerosis lesion segmentation Federico Spagnolo et.al. 2406.09335 null
2024-06-13 APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation Weizhao He et.al. 2406.08372 null
2024-06-12 Dataset Enhancement with Instance-Level Augmentations Orest Kupyn et.al. 2406.08249 link
2024-06-13 A $^{2}$ -MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder Lixian Zhang et.al. 2406.08079 null
2024-06-12 OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding Yinan Deng et.al. 2406.08009 link
2024-06-12 SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation Chanda Grover Kamra et.al. 2406.07986 link
2024-06-12 Small Scale Data-Free Knowledge Distillation He Liu et.al. 2406.07876 link
2024-06-11 Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph Sergey Linok et.al. 2406.07113 null
2024-06-11 PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving Yining Shi et.al. 2406.07037 null
2024-06-12 LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection Jiahua Xu et.al. 2406.07023 null
2024-06-10 Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation Dong Zhao et.al. 2406.06813 link
2024-06-09 Transforming Heart Chamber Imaging: Self-Supervised Learning for Whole Heart Reconstruction and Segmentation Abdul Qayyum et.al. 2406.06643 null
2024-06-10 Merlin: A Vision Language Foundation Model for 3D Computed Tomography Louis Blankemeier et.al. 2406.06512 null
2024-06-10 UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving Daniel Bogdoll et.al. 2406.06370 null
2024-06-09 Scaling Graph Convolutions for Mobile Vision William Avery et.al. 2406.05850 link
2024-06-09 Solution for CVPR 2024 UG2+ Challenge Track on All Weather Semantic Segmentation Jun Yu et.al. 2406.05837 null
2024-06-09 Convolution and Attention-Free Mamba-based Cardiac Image Segmentation Abbas Khan et.al. 2406.05786 null
2024-06-09 Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language Mark Hamilton et.al. 2406.05629 link
2024-06-08 A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+ Jianzhao Wang et.al. 2406.05513 null
2024-06-08 Layered Image Vectorization via Semantic Simplification Zhenyu Wang et.al. 2406.05404 null
2024-06-08 1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation Qingfeng Liu et.al. 2406.05352 null
2024-06-07 USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation Xiaoqi Wang et.al. 2406.05271 null
2024-06-07 Semantic Segmentation on VSPW Dataset through Masked Video Consistency Chen Liang et.al. 2406.04979 null
2024-06-07 Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment Venkanna Babu Guthula et.al. 2406.04949 null
2024-06-06 Characterizing segregation in blast rock piles a deep-learning approach leveraging aerial image analysis Chengeng Liu et.al. 2406.04149 null
2024-06-06 Frequency-based Matcher for Long-tailed Semantic Segmentation Shan Li et.al. 2406.03917 link
2024-06-07 Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge Nan Zhang et.al. 2406.03799 link
2024-06-06 DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation Zilu Guo et.al. 2406.03702 link
2024-06-05 Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation Maximilian Zenk et.al. 2406.03323 null
2024-06-05 Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy Yunho Kim et.al. 2406.02989 null
2024-06-04 W-RIZZ: A Weakly-Supervised Framework for Relative Traversability Estimation in Mobile Robotics Andre Schreiber et.al. 2406.02822 link
2024-06-04 Window to Wall Ratio Detection using SegFormer Zoe De Simone et.al. 2406.02706 link
2024-06-04 Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning Heather Doig et.al. 2406.01932 null
2024-06-03 EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding Thanh-Dat Truong et.al. 2406.01429 null
2024-06-03 TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation Antonio Santo et.al. 2406.01395 link
2024-06-03 ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds Ka Lung Cheung et.al. 2406.01337 link
2024-06-03 LSKSANet: A Novel Architecture for Remote Sensing Image Semantic Segmentation Leveraging Large Selective Kernel and Sparse Attention Mechanism Miao Fu et.al. 2406.01228 null
2024-06-04 GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer Ding Jia et.al. 2406.01210 link
2024-06-03 S-CycleGAN: Semantic Segmentation Enhanced CT-Ultrasound Image-to-Image Translation for Robotic Ultrasonography Yuhan Song et.al. 2406.01191 null
2024-06-02 Diffusion Features to Bridge Domain Gap for Semantic Segmentation Yuxiang Ji et.al. 2406.00777 null
2024-06-02 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation Yunheng Li et.al. 2406.00670 null
2024-06-02 Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024 Biao Wu et.al. 2406.00587 null
2024-05-31 Uncertainty Quantification for Bird's Eye View Semantic Segmentation: Methods and Benchmarks Linlin Yu et.al. 2405.20986 null
2024-05-31 Revisiting and Maximizing Temporal Knowledge in Semi-supervised Semantic Segmentation Wooseok Shin et.al. 2405.20610 link
2024-05-30 P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation Qi Zhang et.al. 2405.20443 null
2024-05-30 SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow Chaoyang Wang et.al. 2405.20282 link
2024-05-30 MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion Angel Villar-Corrales et.al. 2405.19921 link
2024-05-30 Open-Set Domain Adaptation for Semantic Segmentation Seun-An Choe et.al. 2405.19899 link
2024-05-30 DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation Ron Keuth et.al. 2405.19746 link
2024-05-30 Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes Yong-Qiang Mao et.al. 2405.19735 null
2024-05-30 CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation Ankush Gajanan Arudkar et.al. 2405.19672 null
2024-05-29 Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation Lianlei Shan et.al. 2405.19568 null
2024-05-29 Enabling Visual Recognition at Radio Frequency Haowen Lai et.al. 2405.19516 null
2024-05-29 Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models Tianrun Chen et.al. 2405.19326 null
2024-05-29 A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation Niclas Vödisch et.al. 2405.19035 link
2024-05-29 Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation Zelin Peng et.al. 2405.18840 null
2024-05-28 Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation JuneHyoung Kwon et.al. 2405.18148 null
2024-05-28 Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images Lianlei Shan et.al. 2405.18078 null
2024-05-28 RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields Mihnea-Bogdan Jurca et.al. 2405.18033 null
2024-05-28 DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture Shentong Mo et.al. 2405.17995 null
2024-05-28 The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention Xingyu Ding et.al. 2405.17776 null
2024-05-27 Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation Steven Landgraf et.al. 2405.17097 null
2024-05-27 DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking Hongtao Wang et.al. 2405.16980 null
2024-05-27 Collective Perception Datasets for Autonomous Driving: A Comprehensive Review Sven Teufel et.al. 2405.16973 null
2024-05-27 Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models Qian Wang et.al. 2405.16947 null
2024-05-27 A re-calibration method for object detection with multi-modal alignment bias in autonomous driving Zhihang Song et.al. 2405.16848 null
2024-05-25 BOLD: Boolean Logic Deep Learning Van Minh Nguyen et.al. 2405.16339 null
2024-05-25 Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation Huizhou Chen et.al. 2405.16099 null
2024-05-25 Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality Hakim Ikebayashi et.al. 2405.16008 null
2024-05-24 Visualize and Paint GAN Activations Rudolf Herdt et.al. 2405.15636 null
2024-05-24 Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets Hoàng-Ân Lê et.al. 2405.15394 null
2024-05-24 U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation Bingyu Li et.al. 2405.15365 link
2024-05-24 Cross-Domain Few-Shot Semantic Segmentation via Doubly Matching Transformation Jiayi Chen et.al. 2405.15265 null
2024-05-23 Mamba-R: Vision Mamba ALSO Needs Registers Feng Wang et.al. 2405.14858 null
2024-05-23 Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation Daniel Kienzle et.al. 2405.14467 null
2024-05-23 MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models Jiuming Liu et.al. 2405.14338 null
2024-05-23 Tuning-free Universally-Supervised Semantic Segmentation Xiaobo Yang et.al. 2405.14294 null
2024-05-23 SCMix: Stochastic Compound Mixing for Open Compound Domain Adaptation in Semantic Segmentation Kai Yao et.al. 2405.14278 null
2024-05-23 Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations Mohammed Baharoon et.al. 2405.14239 null
2024-05-24 Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification Taylor Archibald et.al. 2405.14162 null
2024-05-23 Skip-SCAR: A Modular Approach to ObjectGoal Navigation with Sparsity and Adaptive Skips Yaotian Liu et.al. 2405.14154 null
2024-05-22 TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System Diogo Lavado et.al. 2405.13989 null
2024-05-22 Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer Qihang Fan et.al. 2405.13337 null
2024-05-21 Transparency Distortion Robustness for SOTA Image Segmentation Tasks Volker Knauthe et.al. 2405.12864 null
2024-05-20 A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation Sushmita Sarker et.al. 2405.11903 null
2024-05-20 Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments Jooyong Park et.al. 2405.11855 null
2024-05-20 Universal Organizer of SAM for Unsupervised Semantic Segmentation Tingting Li et.al. 2405.11742 null
2024-05-19 Interpreting a Semantic Segmentation Model for Coastline Detection Conor O'Sullivan et.al. 2405.11500 null
2024-05-17 CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation Mushui Liu et.al. 2405.10530 link
2024-05-16 Towards Task-Compatible Compressible Representations Anderson de Andrade et.al. 2405.10244 link
2024-05-16 A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance Andrea Matteazzi et.al. 2405.10046 null
2024-05-16 Towards Realistic Incremental Scenario in Class Incremental Semantic Segmentation Jihwan Kwak et.al. 2405.09858 null
2024-05-15 Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation Guo Yachan et.al. 2405.09682 null

(back to top)

About

This is an Arxiv paper collection

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages