Updated on 2024.11.24

Usage instructions: here

Table of Contents
  1. Depth Estimation
  2. Semantic Segmentation

Depth Estimation

Publish Date Title Authors PDF Code
2024-11-21 StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart Jian Shi et.al. 2411.14295 null
2024-11-20 DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild Weicai Ye et.al. 2411.13291 null
2024-11-20 OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater Imaging Rajini Makam et.al. 2411.13230 null
2024-11-15 SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction Yutao Tang et.al. 2411.12592 link
2024-11-18 Towards Degradation-Robust Reconstruction in Generalizable NeRF Chan Ho Park et.al. 2411.11691 null
2024-11-18 MGNiceNet: Unified Monocular Geometric Scene Understanding Markus Schön et.al. 2411.11466 null
2024-11-18 The ADUULM-360 Dataset -- A Multi-Modal Dataset for Depth Estimation in Adverse Weather Markus Schön et.al. 2411.11455 null
2024-11-18 GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views Boyao Zhou et.al. 2411.11363 null
2024-11-18 Scalable Autoregressive Monocular Depth Estimation Jinhong Wang et.al. 2411.11361 null
2024-11-16 MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation Ansh Shah et.al. 2411.10886 link
2024-11-19 EVT: Efficient View Transformation for Multi-Modal 3D Object Detection Yongjin Lee et.al. 2411.10715 null
2024-11-15 Efficient Depth Estimation for Unstable Stereo Camera Systems on AR Glasses Yongfan Liu et.al. 2411.10013 null
2024-11-14 Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting Yian Wang et.al. 2411.09823 null
2024-11-14 Adversarial Attacks Using Differentiable Rendering: A Survey Matthew Hull et.al. 2411.09749 null
2024-11-14 Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching Yuran Wang et.al. 2411.09151 null
2024-11-13 OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances Youqi Liao et.al. 2411.08665 null
2024-11-13 Scaling Properties of Diffusion Models for Perceptual Tasks Rahul Ravishankar et.al. 2411.08034 null
2024-11-11 $SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation Yinshuang Xu et.al. 2411.07326 null
2024-11-08 Enhancing Depth Image Estimation for Underwater Robots by Combining Image Processing and Machine Learning Quang Truong Nguyen et.al. 2411.05344 null
2024-11-08 SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection Yun Zhao et.al. 2411.05292 null
2024-11-07 D$^3$epth: Self-Supervised Depth Estimation with Dynamic Mask in Dynamic Scenes Siyu Chen et.al. 2411.04826 null
2024-11-06 Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation Teppei Kurita et.al. 2411.04714 null
2024-11-07 Enhancing Bronchoscopy Depth Estimation through Synthetic-to-Real Domain Adaptation Qingyao Tian et.al. 2411.04404 null
2024-11-04 PMPNet: Pixel Movement Prediction Network for Monocular Depth Estimation in Dynamic Scenes Kebin Peng et.al. 2411.04227 null
2024-11-06 Adaptive Stereo Depth Estimation with Multi-Spectral Images Across All Lighting Conditions Zihan Qin et.al. 2411.03638 null
2024-11-05 Monocular Event-Based Vision for Obstacle Avoidance with a Quadrotor Anish Bhattacharya et.al. 2411.03303 null
2024-11-05 Correlation of Object Detection Performance with Visual Saliency and Depth Estimation Matthias Bartolo et.al. 2411.02844 link
2024-11-05 FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training Ruihong Yin et.al. 2411.02229 null
2024-11-05 Improving Domain Generalization in Self-supervised Monocular Depth Estimation via Stabilized Adversarial Training Yuanqi Yao et.al. 2411.02149 null
2024-11-01 MultiDepth: Multi-Sample Priors for Refining Monocular Metric Depth Estimations in Indoor Scenes Sanghyun Byun et.al. 2411.01048 null
2024-11-01 On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDAR Li Li et.al. 2411.00600 link
2024-10-31 Optical Lens Attack on Monocular Depth Estimation for Autonomous Driving Ce Zhou et.al. 2411.00192 null
2024-10-31 ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images Timing Yang et.al. 2410.24001 link
2024-10-30 Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe Songyu Xu et.al. 2410.23154 null
2024-10-29 Active Event Alignment for Monocular Distance Estimation Nan Cai et.al. 2410.22280 null
2024-10-29 PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting Sunghwan Hong et.al. 2410.22128 link
2024-10-27 Unlocking Comics: The AI4VA Dataset for Visual Understanding Peter Grönquist et.al. 2410.20459 link
2024-10-27 Depth Attention for Robust RGB Tracking Yu Liu et.al. 2410.20395 link
2024-10-21 YOLO11 and Vision Transformers based 3D Pose Estimation of Immature Green Fruits in Commercial Apple Orchards for Robotic Thinning Ranjan Sapkota et.al. 2410.19846 null
2024-10-25 MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors Fanqi Pu et.al. 2410.19590 null
2024-10-24 Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction Hongxin Peng et.al. 2410.18433 null
2024-10-24 Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images Dong-Guw Lee et.al. 2410.18340 link
2024-10-25 UnCLe: Unsupervised Continual Learning of Depth Completion Suchisrit Gangopadhyay et.al. 2410.18074 null
2024-10-21 TIPS: Text-Image Pretraining with Spatial Awareness Kevis-Kokitsi Maninis et.al. 2410.16512 null
2024-10-22 DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain Kun Wang et.al. 2410.14980 link
2024-10-17 DepthSplat: Connecting Gaussian Splatting and Depth Haofei Xu et.al. 2410.13862 link
2024-10-16 DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning Jiabao Wei et.al. 2410.12501 null
2024-10-16 Depth Estimation From Monocular Images With Enhanced Encoder-Decoder Architecture Dabbrata Das et.al. 2410.11610 null
2024-10-16 CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction Pranav Gupta et.al. 2410.11211 link
2024-10-14 When Does Perceptual Alignment Benefit Vision Representations? Shobhita Sundaram et.al. 2410.10817 null
2024-10-14 Depth Any Video with Scalable Synthetic Data Honghui Yang et.al. 2410.10815 link
2024-10-15 Improved Depth Estimation of Bayesian Neural Networks Bart van Erp et.al. 2410.10395 link
2024-10-10 Color-Guided Flying Pixel Correction in Depth Images Ekamresh Vasudevan et.al. 2410.08084 null
2024-10-09 Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models Ange Lou et.al. 2410.07434 null
2024-10-09 Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation Runze Chen et.al. 2410.06982 null
2024-10-09 Analysis of different disparity estimation techniques on aerial stereo image datasets Ishan Narayan et.al. 2410.06711 null
2024-10-08 Vision Transformer based Random Walk for Group Re-Identification Guoqing Zhang et.al. 2410.05808 null
2024-10-08 CUBE360: Learning Cubic Field Representation for Monocular 360 Depth Estimation for Virtual Reality Wenjie Chang et.al. 2410.05735 null
2024-10-07 PhotoReg: Photometrically Registering 3D Gaussian Splatting Models Ziwen Yuan et.al. 2410.05044 null
2024-10-10 Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy Pengcheng Chen et.al. 2410.04041 null
2024-10-04 Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering Laura Fink et.al. 2410.03861 null
2024-10-03 RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions Ziyao Zeng et.al. 2410.02924 null
2024-10-02 Depth Pro: Sharp Monocular Metric Depth in Less Than a Second Aleksei Bochkovskii et.al. 2410.02073 link
2024-10-10 Towards Full-parameter and Parameter-efficient Self-learning For Endoscopic Camera Depth Estimation Shuting Zhao et.al. 2410.00979 null
2024-10-01 Radar Meets Vision: Robustifying Monocular Metric Depth Prediction for Mobile Robotics Marco Job et.al. 2410.00736 null
2024-10-06 Drone Stereo Vision for Radiata Pine Branch Detection and Distance Measurement: Utilizing Deep Learning and YOLO Integration Yida Lin et.al. 2410.00503 null
2024-10-01 Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance Hongchao Shu et.al. 2410.00386 null
2024-09-30 CCDepth: A Lightweight Self-supervised Depth Estimation Network with Enhanced Interpretability Xi Zhang et.al. 2409.19933 null
2024-09-30 EndoDepth: A Benchmark for Assessing Robustness in Endoscopic Depth Prediction Ivan Reyes-Amezcua et.al. 2409.19930 link
2024-09-29 fCOP: Focal Length Estimation from Category-level Object Priors Xinyue Zhang et.al. 2409.19641 null
2024-09-29 KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation Soofiyan Atar et.al. 2409.19490 null
2024-09-27 Speckle-illumination spatial frequency domain imaging with a stereo laparoscope for profile-corrected optical property mapping Anthony A. Song et.al. 2409.19153 null
2024-09-26 Self-supervised Monocular Depth Estimation with Large Kernel Attention Xuezhi Xiang et.al. 2409.17895 null
2024-09-26 Self-Distilled Depth Refinement with Noisy Poisson Fusion Jiaqi Li et.al. 2409.17880 null
2024-09-27 A New Dataset for Monocular Depth Estimation Under Viewpoint Shifts Aurel Pjetri et.al. 2409.17851 null
2024-09-26 Event-based Stereo Depth Estimation: A Survey Suman Ghosh et.al. 2409.17680 null
2024-09-26 CAMOT: Camera Angle-aware Multi-Object Tracking Felix Limanta et.al. 2409.17533 null
2024-09-25 Optical Lens Attack on Deep Learning Based Monocular Depth Estimation Ce Zhou et.al. 2409.17376 null
2024-09-25 Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation Richard D. Paul et.al. 2409.17085 null
2024-09-25 EventHDR: from Event to High-Speed HDR Videos and Beyond Yunhao Zou et.al. 2409.17029 null
2024-09-25 3DDX: Bone Surface Reconstruction from a Single Standard-Geometry Radiograph via Dual-Face Depth Estimation Yi Gu et.al. 2409.16702 null
2024-09-24 MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling Yifang Men et.al. 2409.16160 null
2024-09-24 Benchmarking Robustness of Endoscopic Depth Estimation with Synthetically Corrupted Data An Wang et.al. 2409.16063 link
2024-09-23 FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera Guoyang Zhao et.al. 2409.15054 link
2024-09-23 DepthART: Monocular Depth Estimation as Autoregressive Refinement Task Bulat Gabdullin et.al. 2409.15010 null
2024-09-23 Generalizing monocular colonoscopy image depth estimation by uncertainty-based global and local fusion network Sijia Du et.al. 2409.15006 null
2024-09-23 GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth Aurélien Cecille et.al. 2409.14850 null
2024-09-23 Robust and Flexible Omnidirectional Depth Estimation with Multiple 360° Cameras Ming Li et.al. 2409.14766 null
2024-09-18 Panoptic-Depth Forecasting Juana Valeria Hurtado et.al. 2409.12008 null
2024-09-17 Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think Gonzalo Martin Garcia et.al. 2409.11355 link
2024-09-15 GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion Vitor Guizilini et.al. 2409.09896 null
2024-09-15 Towards Single-Lens Controllable Depth-of-Field Imaging via All-in-Focus Aberration Correction and Monocular Depth Estimation Xiaolong Qian et.al. 2409.09754 link
2024-09-13 PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage Denis Zavadski et.al. 2409.09144 link
2024-09-25 Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding Rania Hossam et.al. 2409.08695 link
2024-09-12 Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor Andrea Conti et.al. 2409.08277 null
2024-09-12 LED: Light Enhanced Depth Estimation at Night Simon de Moreau et.al. 2409.08031 link
2024-09-12 Real-time Multi-view Omnidirectional Depth Estimation System for Robots and Autonomous Driving on Real Scenes Ming Li et.al. 2409.07843 null
2024-09-12 Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy Bojian Li et.al. 2409.07723 null
2024-09-12 FIReStereo: Forest InfraRed Stereo Dataset for UAS Depth Perception in Visually Degraded Environments Devansh Dhrafani et.al. 2409.07715 null
2024-09-10 Deep Neural Networks: Multi-Classification and Universal Approximation Martín Hernández et.al. 2409.06555 null
2024-09-10 EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation Nischal Khanal et.al. 2409.06183 link
2024-09-11 EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels Qingyao Tian et.al. 2409.05442 null
2024-09-09 Spontaneous magnetic field and disorder effects in BaPtAs$_{1-x}$Sb$_x$ with honeycomb network T. Adachi et.al. 2409.05266 null
2024-09-08 TanDepth: Leveraging Global DEMs for Metric Monocular Depth Estimation in UAVs Horatiu Florea et.al. 2409.05142 null
2024-09-12 Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective Tim Bader et.al. 2409.04086 link
2024-09-08 Estimating Indoor Scene Depth Maps from Ultrasonic Echoes Junpei Honma et.al. 2409.03336 null
2024-09-04 iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation Hayeon Jo et.al. 2409.02838 null
2024-09-02 GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling Huawei Sun et.al. 2409.02720 null
2024-09-04 Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects Kyungmin Jo et.al. 2409.02653 null
2024-09-04 UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching Soomin Kim et.al. 2409.02545 null
2024-09-04 SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction Sumin Son et.al. 2409.02513 null
2024-09-04 Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation Li Liu et.al. 2409.02494 null
2024-09-04 Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization Cho-Ying Wu et.al. 2409.02486 null
2024-09-04 GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving Huasong Han et.al. 2409.02382 null
2024-09-03 DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos Wenbo Hu et.al. 2409.02095 null
2024-09-02 Large Language Models Can Understanding Depth from Monocular Images Zhongyi Xia et.al. 2409.01133 null
2024-08-30 DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model Mona Sheikh Zeinoddin et.al. 2408.17433 null
2024-08-30 Enhancing Underwater Imaging with 4-D Light Fields: Dataset and Method Yuji Lin et.al. 2408.17339 null
2024-08-30 Synthetic Lunar Terrain: A Multimodal Open Dataset for Training and Evaluating Neuromorphic Vision Algorithms Marcus Märtens et.al. 2408.16971 null
2024-08-29 EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More Kanghao Chen et.al. 2408.16254 null
2024-08-30 Revisiting 360 Depth Estimation with PanoGabor: A New Fusion Perspective Zhijie Shen et.al. 2408.16227 link
2024-08-27 Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack Naufal Suryanto et.al. 2408.14879 null
2024-08-26 NimbleD: Enhancing Self-supervised Monocular Depth Estimation with Pseudo-labels and Large-scale Video Pre-training Albert Luginov et.al. 2408.14177 null
2024-08-26 Pixel-Aligned Multi-View Generation with Depth Guided Decoder Zhenggang Tang et.al. 2408.14016 null
2024-08-25 TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers Chuanrui Zhang et.al. 2408.13770 null
2024-08-25 InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth Cho-Ying Wu et.al. 2408.13708 null
2024-08-25 SeeBelow: Sub-dermal 3D Reconstruction of Tumors with Surgical Robotic Palpation and Tactile Exploration Raghava Uppuluri et.al. 2408.13699 null
2024-08-27 Sapiens: Foundation for Human Vision Models Rawal Khirodkar et.al. 2408.12569 null
2024-08-21 LiFCal: Online Light Field Camera Calibration via Bundle Adjustment Aymeric Fleith et.al. 2408.11682 null
2024-08-19 Structure-preserving Image Translation for Depth Estimation in Colonoscopy Video Shuxian Wang et.al. 2408.10153 null
2024-08-19 SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition Wiktor Mucha et.al. 2408.10037 link
2024-08-19 P3P: Pseudo-3D Pre-training for Scaling 3D Masked Autoencoders Xuechao Chen et.al. 2408.10007 null
2024-08-14 Enhanced Scale-aware Depth Estimation for Monocular Endoscopic Scenes with Geometric Modeling Ruofeng Wei et.al. 2408.07266 null
2024-08-12 Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces Junrui Zhang et.al. 2408.06083 null
2024-08-08 Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation Daniele Rege Cambrin et.al. 2408.04523 link
2024-08-08 Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework Subhasis Dasgupta et.al. 2408.04360 null
2024-08-08 Design and Implementation of Smart Infrastructures and Connected Vehicles in A Mini-city Platform Daniel Vargas et.al. 2408.04195 null
2024-08-07 Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach Benedikt W. Hosp et.al. 2408.03591 null
2024-08-06 BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications G. Manni et.al. 2408.03078 link
2024-08-05 Gaussian Mixture based Evidential Learning for Stereo Matching Weide Liu et.al. 2408.02796 null
2024-08-05 Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining Dongyang Liu et.al. 2408.02657 link
2024-08-03 MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas Feng Qiao et.al. 2408.01653 null
2024-08-02 Self-Supervised Depth Estimation Based on Camera Models Jinchang Zhang et.al. 2408.01565 null
2024-08-01 MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection Youjia Fu et.al. 2408.00438 null
2024-08-01 High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior Wencheng Han et.al. 2408.00361 null
2024-07-31 Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching Pengjie Zhang et.al. 2407.21735 null
2024-07-29 BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation Kieran Saunders et.al. 2407.20437 null
2024-07-29 Analysis and Improvement of Rank-Ordered Mean Algorithm in Single-Photon LiDAR William C. Yau et.al. 2407.20399 null
2024-07-29 Improving 2D Feature Representations by 3D-Aware Fine-Tuning Yuanwen Yue et.al. 2407.20229 null
2024-07-27 Revisit Self-supervised Depth Estimation with Local Structure-from-Motion Shengjie Zhu et.al. 2407.19166 null
2024-07-27 RePLAy: Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry Shengjie Zhu et.al. 2407.19154 null
2024-07-26 HybridDepth: Robust Depth Fusion for Mobile AR by Leveraging Depth from Focus and Single-Image Priors Ashkan Ganj et.al. 2407.18443 link
2024-07-26 Enhanced Depth Estimation and 3D Geometry Reconstruction using Bayesian Helmholtz Stereopsis with Belief Propagation Razieh Azizi et.al. 2407.18195 null
2024-07-25 BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation Xiang Zhang et.al. 2407.17952 null
2024-07-25 UMono: Physical Model Informed Hybrid CNN-Transformer Framework for Underwater Monocular Depth Estimation Jian Wang et.al. 2407.17838 null
2024-07-24 DarSwin-Unet: Distortion Aware Encoder-Decoder Architecture Akshaya Athwale et.al. 2407.17328 null
2024-07-24 Physical Adversarial Attack on Monocular Depth Estimation via Shape-Varying Patches Chenxing Zhao et.al. 2407.17312 null
2024-07-23 SINDER: Repairing the Singular Defects of DINOv2 Haoqi Wang et.al. 2407.16826 link
2024-07-23 Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions Fabio Tosi et.al. 2407.16698 link
2024-07-23 ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation Zhenhua Wu et.al. 2407.16508 null
2024-07-19 Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation Jinfeng Liu et.al. 2407.14126 link
2024-07-18 Unveiling the purely young star formation history of the SMC's northeastern shell from colour-magnitude diagram fitting Joanna D. Sakowska et.al. 2407.13876 null
2024-07-18 Many Perception Tasks are Highly Redundant Functions of their Input Data Rahul Ramesh et.al. 2407.13841 null
2024-07-18 Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks Antoni Kowalczuk et.al. 2407.12588 link
2024-07-16 Temporally Consistent Stereo Matching Jiaxi Zeng et.al. 2407.11950 link
2024-07-15 IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation Yuanhao Zhai et.al. 2407.10937 link
2024-07-15 OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection Jinghua Hou et.al. 2407.10753 link
2024-07-15 Towards Scale-Aware Full Surround Monodepth with Transformers Yuchen Yang et.al. 2407.10406 null
2024-07-12 ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion Sungmin Woo et.al. 2407.09303 link
2024-07-11 ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation Ruijie Zhu et.al. 2407.08187 link
2024-07-10 Controlling Space and Time with Diffusion Models Daniel Watson et.al. 2407.07860 null
2024-07-07 SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning Yi Feng et.al. 2407.05283 link
2024-07-05 A Physical Model-Guided Framework for Underwater Image Enhancement and Depth Estimation Dazhao Du et.al. 2407.04230 null
2024-07-04 Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation Laiyan Ding et.al. 2407.04041 null
2024-07-02 Parametric Modeling and Estimation of Photon Registrations for 3D Imaging Weijian Zhang et.al. 2407.02712 null
2024-07-02 Depth-Aware Endoscopic Video Inpainting Francis Xiatian Zhang et.al. 2407.02675 link
2024-07-04 Camera-LiDAR Cross-modality Gait Recognition Wenxuan Guo et.al. 2407.02038 null
2024-07-07 CaFNet: A Confidence-Driven Framework for Radar Camera Depth Estimation Huawei Sun et.al. 2407.00697 link
2024-06-28 Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey Uchitha Rajapaksha et.al. 2406.19675 null
2024-07-05 360 in the Wild: Dataset for Depth Prediction and View Synthesis Kibaek Park et.al. 2406.18898 null
2024-06-27 Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach Yuxiang Huang et.al. 2406.18837 null
2024-06-26 DoubleTake: Geometry Guided Depth Estimation Mohamed Sayed et.al. 2406.18387 null
2024-06-25 Depth-Guided Semi-Supervised Instance Segmentation Xin Chen et.al. 2406.17413 null
2024-06-20 Uncertainty and Self-Supervision in Single-View Depth Javier Rodriguez-Puigvert et.al. 2406.14226 null
2024-06-19 WaterMono: Teacher-Guided Anomaly Masking and Enhancement Boosting for Robust Underwater Self-Supervised Monocular Depth Estimation Yilin Ding et.al. 2406.13344 link
2024-06-18 Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation Ning-Hsu Wang et.al. 2406.12849 null
2024-06-21 GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models Yongtao Ge et.al. 2406.12671 link
2024-06-17 DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features Letian Wang et.al. 2406.12095 null
2024-06-17 MEDeA: Multi-view Efficient Depth Adjustment Mikhail Artemyev et.al. 2406.12048 null
2024-06-16 3D Gaze Tracking for Studying Collaborative Interactions in Mixed-Reality Environments Eduardo Davalos et.al. 2406.11003 null
2024-06-15 GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR Bharat Singh et.al. 2406.10722 null
2024-06-14 The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences Bria Long et.al. 2406.10447 null
2024-06-14 D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video Moritz Kappel et.al. 2406.10078 null
2024-06-14 DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications Li Li et.al. 2406.10068 link
2024-06-14 Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion Runze Liu et.al. 2406.09782 null
2024-06-13 Depth Anything V2 Lihe Yang et.al. 2406.09414 null
2024-06-14 WonderWorld: Interactive 3D Scene Generation from a Single Image Hong-Xing Yu et.al. 2406.09394 null
2024-06-13 Scale-Invariant Monocular Depth Estimation via SSI Depth S. Mahdi H. Miangoleh et.al. 2406.09374 null
2024-06-13 Multiple Prior Representation Learning for Self-Supervised Monocular Depth Estimation via Hybrid Transformer Guodong Sun et.al. 2406.08928 link
2024-06-13 ToSA: Token Selective Attention for Efficient Vision Transformers Manish Kumar Singh et.al. 2406.08816 null
2024-06-11 Back to the Color: Learning Depth to Specific Color Transformation for Unsupervised Depth Estimation Yufan Zhu et.al. 2406.07741 link
2024-06-11 PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow Joshua Tokarsky et.al. 2406.07667 null
2024-06-11 RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks Zhechao Wang et.al. 2406.07032 null
2024-06-10 PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation Zhenyu Li et.al. 2406.06679 null
2024-06-09 Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks Zhiyuan Cheng et.al. 2406.05857 link
2024-06-09 RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering Rui Zhang et.al. 2406.05852 null
2024-06-07 Normal-guided Detail-Preserving Neural Implicit Functions for High-Fidelity 3D Surface Reconstruction Aarya Patel et.al. 2406.04861 null
2024-06-07 UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection Yuchao Wang et.al. 2406.04647 null
2024-06-06 MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation Ionuţ Grigore et.al. 2406.04532 null
2024-06-06 Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image Stanislaw Szymanowicz et.al. 2406.04343 null
2024-06-06 Neural Surface Reconstruction from Sparse Views Using Epipolar Geometry Kaichen Zhou et.al. 2406.04301 null
2024-06-04 VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors Markus Plack et.al. 2406.02552 null
2024-06-03 L-MAGIC: Language Model Assisted Generation of Images with Coherence Zhipeng Cai et.al. 2406.01843 link
2024-06-04 Learning Temporally Consistent Video Depth from Video Diffusion Priors Jiahao Shao et.al. 2406.01493 null
2024-06-03 Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry Takayuki Kanai et.al. 2406.00929 null
2024-06-01 MoDGS: Dynamic Gaussian Splatting from Causually-captured Monocular Videos Qingming Liu et.al. 2406.00434 null
2024-05-30 Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian Wei Sun et.al. 2405.19657 null
2024-05-28 Hybrid Multi-Head Physics-informed Neural Network for Depth Estimation in Terahertz Imaging Mingjun Xiang et.al. 2405.18317 null
2024-05-27 Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation Amir El-Ghoussani et.al. 2405.17704 null
2024-05-27 Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving Shaoyuan Xie et.al. 2405.17426 link
2024-05-27 All-day Depth Completion Vadim Ezhov et.al. 2405.17315 null
2024-05-27 GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping Junyoung Seo et.al. 2405.17251 null
2024-05-27 SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing Yong-Qiang Mao et.al. 2405.17140 null
2024-05-27 DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge Yifan Mao et.al. 2405.17102 null
2024-05-27 Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation Steven Landgraf et.al. 2405.17097 null
2024-05-27 DCPI-Depth: Explicitly Infusing Dense Correspondence Prior to Unsupervised Monocular Depth Estimation Mengtan Zhang et.al. 2405.16960 null
2024-05-27 ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection Ziying Song et.al. 2405.16873 null
2024-05-27 Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations Jingguo Liu et.al. 2405.16858 null
2024-05-26 Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians Erik Sandström et.al. 2405.16544 null
2024-05-24 Transparent Object Depth Completion Yifan Zhou et.al. 2405.15299 null
2024-05-24 MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method Pan Liao et.al. 2405.15176 null
2024-05-23 EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian Splatting Jiaxu Wang et.al. 2405.14959 link
2024-05-23 Ghost-Stereo: GhostNet-based Cost Volume Enhancement and Aggregation for Stereo Matching Networks Xingguang Jiang et.al. 2405.14520 null
2024-05-23 Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning Zhenyu Wei et.al. 2405.14195 null
2024-05-21 Cross-spectral Gated-RGB Stereo Depth Estimation Samuel Brucker et.al. 2405.12759 null
2024-05-20 Depth Reconstruction with Neural Signed Distance Fields in Structured Light Systems Rukun Qiao et.al. 2405.12006 null
2024-05-20 Depth Prompting for Sensor-Agnostic Depth Estimation Jin-Hwi Park et.al. 2405.11867 null
2024-05-19 CRF360D: Monocular 360 Depth Estimation via Spherical Fully-Connected CRFs Zidong Cao et.al. 2405.11564 null
2024-05-18 Dusk Till Dawn: Self-supervised Nighttime Stereo Depth Estimation using Visual Foundation Models Madhu Vankadari et.al. 2405.11158 link
2024-05-17 FA-Depth: Toward Fast and Accurate Self-supervised Monocular Depth Estimation Fei Wang et.al. 2405.10885 link
2024-05-17 Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory Jonas Kälble et.al. 2405.10575 link
2024-05-16 Towards Task-Compatible Compressible Representations Anderson de Andrade et.al. 2405.10244 link
2024-05-16 KPNDepth: Depth Estimation of Lane Images under Complex Rainy Environment Zhengxu Shi et.al. 2405.09964 null
2024-05-14 CLIP with Quality Captions: A Strong Pretraining for Vision Tasks Pavan Kumar Anasosalu Vasu et.al. 2405.08911 null

(back to top)

Semantic Segmentation

Publish Date Title Authors PDF Code
2024-11-21 Revisiting the Integration of Convolution and Attention for Vision Backbone Lei Zhu et.al. 2411.14429 link
2024-11-21 CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation Lin Sun et.al. 2411.13836 link
2024-11-21 Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals Hussni Mohd Zakir et.al. 2411.13774 null
2024-11-20 FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting Ola Shorinwa et.al. 2411.13753 null
2024-11-20 BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation Umamaheswaran Raman Kumar et.al. 2411.13251 null
2024-11-20 XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation Ziyi Wang et.al. 2411.13243 link
2024-11-20 Automating Sonologists USG Commands with AI and Voice Interface Emad Mohamed et.al. 2411.13006 null
2024-11-19 A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation Jiaqi Yang et.al. 2411.12615 link
2024-11-19 SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation Ron Keuth et.al. 2411.12602 link
2024-11-19 ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator Xiao Jiang et.al. 2411.12250 null
2024-11-18 ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements M. Arda Aydın et.al. 2411.12044 link
2024-11-18 Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation Hanieh Shojaei Miandashti et.al. 2411.11935 null
2024-11-18 MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models Harshita Sharma et.al. 2411.11362 null
2024-11-18 Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications Scarlett Raine et.al. 2411.11287 null
2024-11-16 Attention-based U-Net Method for Autonomous Lane Detection Mohammadhamed Tangestanizadeh et.al. 2411.10902 null
2024-11-16 Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation Jaisidh Singh et.al. 2411.10845 null
2024-11-19 Diffusion-Based Semantic Segmentation of Lumbar Spine MRI Scans of Lower Back Pain Patients Maria Monzon et.al. 2411.10755 link
2024-11-15 Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images Ammar Qammaz et.al. 2411.10334 null
2024-11-15 CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation Dengke Zhang et.al. 2411.10086 null
2024-11-14 OneNet: A Channel-Wise 1D Convolutional U-Net Sanghyun Byun et.al. 2411.09838 link
2024-11-14 Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks Zengyi Yang et.al. 2411.09387 null
2024-11-14 Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation Yuheng Shi et.al. 2411.09219 link
2024-11-14 Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery Ashim Dahal et.al. 2411.09101 link
2024-11-13 CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation Xuming Zhang et.al. 2411.09023 null
2024-11-14 Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation Yangyang Li et.al. 2411.08756 null
2024-11-13 Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model Jun Xie et.al. 2411.08592 null
2024-11-12 Isometric Transformations for Image Augmentation in Mueller Matrix Polarimetry Christopher Hahne et.al. 2411.07918 link
2024-11-12 Semantic segmentation on multi-resolution optical and microwave data using deep learning Jai G Singla et.al. 2411.07581 null
2024-11-11 SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation Jiale Chen et.al. 2411.06991 null
2024-11-14 Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision Yueyang Cang et.al. 2411.06727 null
2024-11-10 Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments Deegan Atha et.al. 2411.06632 null
2024-11-09 Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing Kaixuan Lu et.al. 2411.06091 null
2024-11-08 Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model Shuchang Lyu et.al. 2411.05878 link
2024-11-08 Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation Sien Li et.al. 2411.05307 link
2024-11-07 In the Era of Prompt Learning with Vision-Language Models Ankit Jha et.al. 2411.04892 null
2024-11-11 ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset Olaf Wysocki et.al. 2411.04865 link
2024-11-06 Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts Zhitong Gao et.al. 2411.03829 link
2024-11-06 Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model Yansong Qu et.al. 2411.03672 null
2024-11-05 Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation Zhiling Yue et.al. 2411.03551 null
2024-11-05 SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture Andrew Heschl et.al. 2411.03505 link
2024-11-05 Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need Qishuai Wen et.al. 2411.03033 link
2024-11-05 Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation Xavier Timoneda et.al. 2411.02969 null
2024-11-05 Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery Mohammad Kakooei et.al. 2411.02935 null
2024-11-05 CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation Jinchao Ge et.al. 2411.02715 null
2024-11-04 Deep Learning on 3D Semantic Segmentation: A Detailed Review Thodoris Betsas et.al. 2411.02104 null
2024-11-04 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925 null
2024-11-04 DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability Bo Gao et.al. 2411.01819 null
2024-11-04 Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations Thanh Nguyen Canh et.al. 2411.01816 null
2024-11-03 PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation Xinyu Xu et.al. 2411.01624 null
2024-11-01 Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions Lixiao Yang et.al. 2411.01039 null
2024-11-01 Event-guided Low-light Video Semantic Segmentation Zhen Yao et.al. 2411.00639 null
2024-11-01 Cross-modal semantic segmentation for indoor environmental perception using single-chip millimeter-wave radar raw data Hairuo Hu et.al. 2411.00499 null
2024-11-01 Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing Naufal Suryanto et.al. 2411.00425 link
2024-10-31 A Recipe for Geometry-Aware 3D Mesh Transformers Mohammad Farazi et.al. 2411.00164 null
2024-10-31 Federated Black-Box Adaptation for Semantic Segmentation Jay N. Paranjape et.al. 2410.24181 null
2024-10-31 COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes Muhammad Ali et.al. 2410.24139 link
2024-10-31 Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model Hao Zhang et.al. 2410.23905 link
2024-10-30 S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving Maciej K. Wozniak et.al. 2410.23085 null
2024-10-31 CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation Ziyang Gong et.al. 2410.22629 link
2024-10-29 Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation Zhaochong An et.al. 2410.22489 null
2024-10-29 Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation Jintao Tong et.al. 2410.22135 null
2024-10-29 Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models Imad Ali Shah et.al. 2410.22101 null
2024-10-29 Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation Ruihao Xia et.al. 2410.21708 link
2024-10-28 Domain Adaptation with a Single Vision-Language Embedding Mohammad Fahes et.al. 2410.21361 null
2024-10-28 IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks Manjunath D et.al. 2410.20953 null
2024-10-27 A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models Camilo Espinosa-Curilem et.al. 2410.20595 link
2024-10-27 Unlocking Comics: The AI4VA Dataset for Visual Understanding Peter Grönquist et.al. 2410.20459 link
2024-10-27 Historical Test-time Prompt Tuning for Vision Foundation Models Jingyi Zhang et.al. 2410.20346 null
2024-10-25 OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery Philipe Dias et.al. 2410.19965 null
2024-10-25 IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation Kaixian Qu et.al. 2410.19697 null
2024-10-25 Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation Yao Wu et.al. 2410.19446 link
2024-10-25 Context-Based Visual-Language Place Recognition Soojin Woo et.al. 2410.19341 link
2024-10-24 Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks Alexander Jaus et.al. 2410.18684 null
2024-10-24 Unsupervised semantic segmentation of urban high-density multispectral point clouds Oona Oinonen et.al. 2410.18520 null
2024-10-26 CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator Stefanos Pasios et.al. 2410.18238 null
2024-10-23 Towards Safer Planetary Exploration: A Hybrid Architecture for Terrain Traversability Analysis in Mars Rovers Achille Chiuchiarelli et.al. 2410.17738 null
2024-10-22 EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding Zhiyi Pan et.al. 2410.17207 null
2024-10-22 SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments Jumman Hossain et.al. 2410.16686 null
2024-10-21 TIPS: Text-Image Pretraining with Spatial Awareness Kevis-Kokitsi Maninis et.al. 2410.16512 null
2024-10-21 GenGMM: Generalized Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation Nazanin Moradinasab et.al. 2410.16485 null
2024-10-21 LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training Thomas Kreutz et.al. 2410.15833 link
2024-10-21 TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight Hyun-Kurl Jang et.al. 2410.15674 link
2024-10-21 Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications Jintao Ren et.al. 2410.15584 null
2024-10-22 Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation Fnu Neha et.al. 2410.15472 null
2024-10-18 On the Influence of Shape, Texture and Color for Learning Semantic Segmentation Annika Mütze et.al. 2410.14878 null
2024-10-18 Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+ Arpan Mahara et.al. 2410.14836 null
2024-10-17 ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding Guangda Ji et.al. 2410.13924 null
2024-10-17 Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks Clément Playout et.al. 2410.13822 link
2024-10-22 EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything Joonhyeon Song et.al. 2410.13621 link
2024-10-17 Day-Night Adaptation: An Innovative Source-free Adaptation Framework for Medical Image Segmentation Ziyang Chen et.al. 2410.13472 null
2024-10-17 SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation in Remote Sensing Bin Wang et.al. 2410.13471 link
2024-10-17 Railway LiDAR semantic segmentation based on intelligent semi-automated data annotation Florian Wulff et.al. 2410.13383 null
2024-10-17 Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation Houze Liu et.al. 2410.13099 null
2024-10-16 Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation Wenbo Xu et.al. 2410.13094 null
2024-10-16 Risk Assessment for Autonomous Landing in Urban Environments using Semantic Segmentation Jesús Alejandro Loera-Ponce et.al. 2410.12988 null
2024-10-16 VividMed: Vision Language Model with Versatile Visual Grounding for Medicine Lingxiao Luo et.al. 2410.12694 link
2024-10-16 Cascade learning in multi-task encoder-decoder networks for concurrent bone segmentation and glenohumeral joint assessment in shoulder CT scans Luca Marsilio et.al. 2410.12641 null
2024-10-16 SAM-Guided Masked Token Prediction for 3D Scene Understanding Zhimin Chen et.al. 2410.12158 null
2024-10-15 WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation Chenghao Qian et.al. 2410.12075 null
2024-10-15 Development and Testing of a Wood Panels Bark Removal Equipment Based on Deep Learning Rijun Wang et.al. 2410.11913 null
2024-10-15 RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation Anton Antonov et.al. 2410.11722 link
2024-10-15 InvSeg: Test-Time Prompt Inversion for Semantic Segmentation Jiayi Lin et.al. 2410.11473 null
2024-10-15 MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation Xianping Ma et.al. 2410.11160 link
2024-10-14 Locality Alignment Improves Vision-Language Models Ian Covert et.al. 2410.11087 null
2024-10-14 Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes Tim Broedermann et.al. 2410.10791 null
2024-10-14 UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation Lihe Yang et.al. 2410.10777 link
2024-10-14 Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation Daniel Fusaro et.al. 2410.10510 link
2024-10-14 LKASeg: Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections Xuezhi Xiang et.al. 2410.10433 null
2024-10-14 V2M: Visual 2-Dimensional Mamba for Image Representation Learning Chengkun Wang et.al. 2410.10382 link
2024-10-14 GlobalMamba: Global Image Serialization for Vision Mamba Chengkun Wang et.al. 2410.10316 link
2024-10-13 AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model Yuchen Li et.al. 2410.09714 null
2024-10-12 An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation Wei Liang et.al. 2410.09443 null
2024-10-11 Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation Varduhi Yeghiazaryan et.al. 2410.08946 null
2024-10-11 Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation Hanieh Shojaei et.al. 2410.08687 null
2024-10-11 DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention Nguyen Huu Bao Long et.al. 2410.08582 link
2024-10-10 Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? Samir Abou Haidar et.al. 2410.08365 null
2024-10-10 Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation Zhiyi Pan et.al. 2410.08091 null
2024-10-10 Shift and matching queries for video semantic segmentation Tsubasa Mizuno et.al. 2410.07635 null
2024-10-10 3D Vision-Language Gaussian Splatting Qucheng Peng et.al. 2410.07577 null
2024-10-11 Bridge the Points: Graph-based Few-shot Segment Anything Semantically Anqi Zhang et.al. 2410.06964 null
2024-10-09 Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation Seungho Lee et.al. 2410.06893 null
2024-10-09 Rethinking the Evaluation of Visible and Infrared Image Fusion Dayan Guan et.al. 2410.06811 link
2024-10-10 QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model Fei Xie et.al. 2410.06806 link
2024-10-09 Transesophageal Echocardiography Generation using Anatomical Models Emmanuel Oladokun et.al. 2410.06781 null
2024-10-09 Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy Qinfeng Zhu et.al. 2410.06725 null
2024-10-09 Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments Meng Yu et.al. 2410.06626 null
2024-10-09 Towards Natural Image Matting in the Wild via Real-Scenario Prior Ruihao Xia et.al. 2410.06593 link
2024-10-08 Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions Mateus Karvat et.al. 2410.06380 null
2024-10-08 Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading Fang Gao et.al. 2410.05762 null
2024-10-07 Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation Vince Zhu et.al. 2410.04689 null
2024-10-04 SpecSAR-Former: A Lightweight Transformer-based Network for Global LULC Mapping Using Integrated Sentinel-1 and Sentinel-2 Hao Yu et.al. 2410.03962 null
2024-10-04 Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features Benyuan Meng et.al. 2410.03558 link
2024-10-04 Semantic Segmentation Based Quality Control of Histopathology Whole Slide Images Abhijeet Patil et.al. 2410.03289 link
2024-10-04 HRVMamba: High-Resolution Visual State Space Model for Dense Prediction Hao Zhang et.al. 2410.03174 null
2024-10-03 HiFiSeg: High-Frequency Information Enhanced Polyp Segmentation with Global-Local Vision Transformer Jingjing Ren et.al. 2410.02528 null
2024-10-04 Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation Muzhi Zhu et.al. 2410.02369 null
2024-10-03 RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds Remco Royen et.al. 2410.02323 null
2024-10-03 Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network Yangyang Qiu et.al. 2410.02224 null
2024-10-03 Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images Qingyuan Liu et.al. 2410.02207 null
2024-10-02 SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images Kaiyu Li et.al. 2410.01768 link
2024-10-02 One-Shot Robust Imitation Learning for Long-Horizon Visuomotor Tasks from Unsegmented Demonstrations Shaokang Wu et.al. 2410.01630 null
2024-10-02 Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation Zhaofeng Shi et.al. 2410.01341 null
2024-10-02 VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings Andrea Carrara et.al. 2410.01336 null
2024-10-01 RobustEMD: Domain Robust Matching for Cross-domain Few-shot Medical Image Segmentation Yazhou Zhu et.al. 2410.01110 null
2024-10-01 Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer Vlatko Spasev et.al. 2410.01092 null
2024-10-01 Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time Chiao-An Yang et.al. 2410.01083 link
2024-10-01 DeepAerialMapper: Deep Learning-based Semi-automatic HD Map Creation for Highly Automated Vehicles Robert Krajewski et.al. 2410.00769 null
2024-10-01 Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection Pengxi Zeng et.al. 2410.00582 null
2024-10-01 Precise Workcell Sketching from Point Clouds Using an AR Toolbox Krzysztof Zieliński et.al. 2410.00479 null
2024-09-30 AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation Boyu Han et.al. 2409.20398 null
2024-09-30 Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation Tillmann Rheude et.al. 2409.20287 link
2024-09-30 Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model Fulong Ma et.al. 2409.20164 null
2024-09-30 Segmenting Wood Rot using Computer Vision Models Roland Kammerbauer et.al. 2409.20137 null
2024-09-30 Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels Heeseong Shin et.al. 2409.19846 null
2024-09-27 Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation Raphael Hagmanns et.al. 2409.18788 null
2024-09-27 Learning from Pattern Completion: Self-supervised Controllable Generation Zhiqiang Chen et.al. 2409.18694 link
2024-09-27 Reducing Semantic Ambiguity In Domain Adaptive Semantic Segmentation Via Probabilistic Prototypical Pixel Contrast Xiaoke Hao et.al. 2409.18543 link
2024-10-01 Get It For Free: Radar Segmentation without Expert Labels and Its Application in Odometry and Localization Siru Li et.al. 2409.18434 null
2024-09-26 Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning Siyi Lu et.al. 2409.17659 null
2024-09-26 Global-Local Medical SAM Adaptor Based on Full Adaption Meng Wang et.al. 2409.17486 null
2024-09-25 VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection Liangyu Zhong et.al. 2409.17330 null
2024-09-25 2024 BRAVO Challenge Track 1 1st Place Report: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation Tommie Kerssies et.al. 2409.17208 link
2024-09-25 WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks Alberto Bacchin et.al. 2409.16999 link
2024-09-25 Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis Illia Tsiporenko et.al. 2409.16940 null
2024-09-24 A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation Avisha Kumar et.al. 2409.16441 null
2024-09-24 Instance Segmentation of Reinforced Concrete Bridges with Synthetic Point Clouds Asad Ur Rahman et.al. 2409.16381 null
2024-09-24 Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation Hannah Kerner et.al. 2409.16252 link
2024-09-24 Deep Learning for Precision Agriculture: Post-Spraying Evaluation and Deposition Estimation Harry Rogers et.al. 2409.16213 link
2024-09-24 Potential Field as Scene Affordance for Behavior Change-Based Visual Risk Object Identification Pang-Yuan Pao et.al. 2409.15846 null
2024-09-24 DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation Soojin Jang et.al. 2409.15801 null
2024-09-24 Autonomous Hiking Trail Navigation via Semantic Segmentation and Geometric Analysis Camndon Reed et.al. 2409.15671 null
2024-09-23 ZeroSCD: Zero-Shot Street Scene Change Detection Shyam Sundar Kannan et.al. 2409.15255 null
2024-09-17 Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks Edgar Heinert et.al. 2409.11373 null
2024-09-17 MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping Amirreza Fateh et.al. 2409.11316 link
2024-09-17 Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark Clifford Broni-Bediako et.al. 2409.11227 link
2024-09-17 HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios Nick Theisen et.al. 2409.11205 link
2024-09-16 Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning Amin Karimi Monsefi et.al. 2409.10362 null
2024-09-16 BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images Wentao Wang et.al. 2409.10269 null
2024-09-15 Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation Zhanteng Xie et.al. 2409.09899 null
2024-09-15 Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation Qilong Zhangli et.al. 2409.09893 null
2024-09-15 High Definition Map Mapping and Update: A General Overview and Future Directions Benny Wijaya et.al. 2409.09726 null
2024-09-14 Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation Hugo Porta et.al. 2409.09497 null
2024-09-13 AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation Zechao Sun et.al. 2409.08516 null
2024-09-13 VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation Ezra MacDonald et.al. 2409.08461 link
2024-09-12 Bayesian Self-Training for Semi-Supervised 3D Segmentation Ozan Unal et.al. 2409.08102 null
2024-09-12 Depth Matters: Exploring Deep Interactions of RGB-D for Semantic Segmentation in Traffic Scenes Siyu Chen et.al. 2409.07995 null
2024-09-12 SURGIVID: Annotation-Efficient Surgical Video Object Discovery Çağhan Köksal et.al. 2409.07801 null
2024-09-12 Lagrange Duality and Compound Multi-Attention Transformer for Semi-Supervised Medical Image Segmentation Fuchen Zheng et.al. 2409.07793 link
2024-09-12 ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation Fuchen Zheng et.al. 2409.07779 link
2024-09-12 Open-Vocabulary Remote Sensing Image Semantic Segmentation Qinglong Cao et.al. 2409.07683 null
2024-09-11 Token Turing Machines are Efficient Vision Models Purvish Jajal et.al. 2409.07613 null
2024-09-11 AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution Wangduo Xie et.al. 2409.07171 null
2024-09-11 Brain-Inspired Stepwise Patch Merging for Vision Transformers Yonghao Yu et.al. 2409.06963 null
2024-09-10 Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds Mu Cai et.al. 2409.06827 link
2024-09-10 A Semantic Segmentation Approach on Sweet Orange Leaf Diseases Detection Utilizing YOLO Sabit Ahamed Preanto et.al. 2409.06671 null
2024-09-10 PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation Yin Hu et.al. 2409.06309 null
2024-09-10 EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation Nischal Khanal et.al. 2409.06183 link
2024-09-09 SVS-GAN: Leveraging GANs for Semantic Video Synthesis Khaled M. Seyam et.al. 2409.06074 null
2024-09-09 Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance Quang-Huy Che et.al. 2409.06002 null
2024-09-09 Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features Jacob Gildenblat et.al. 2409.05697 null
2024-09-09 ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions Furqan Ahmed Shaik et.al. 2409.05327 null
2024-09-08 RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network Zhiwei Lin et.al. 2409.04979 null
2024-09-06 Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation Björn Michele et.al. 2409.04409 link
2024-09-05 Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution Marga Don et.al. 2409.03754 link
2024-09-05 LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones Moritz Nottebaum et.al. 2409.03460 link
2024-09-05 Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications Tong Bu et.al. 2409.03368 null
2024-09-05 UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking Md. Mahfuzur Rahman et.al. 2409.03245 null
2024-09-05 Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation Xixi Jiang et.al. 2409.03228 link
2024-09-06 iSeg: An Iterative Refinement-based Framework for Training-free Segmentation Lin Sun et.al. 2409.03209 link
2024-09-04 iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation Hayeon Jo et.al. 2409.02838 null
2024-09-04 CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation Minhee Cho et.al. 2409.02699 null
2024-09-04 SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction Sumin Son et.al. 2409.02513 null
2024-09-03 K-Origins: Better Colour Quantification for Neural Networks Lewis Mason et.al. 2409.02281 link
2024-09-03 AllWeatherNet: Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions Chenghao Qian et.al. 2409.02045 null
2024-09-03 Segmenting Object Affordances: Reproducibility and Sensitivity to Scale Tommaso Apicella et.al. 2409.01814 link
2024-09-03 Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation Haodong Wang et.al. 2409.01662 null
2024-09-02 Semantic Segmentation from Image Labels by Reconstruction from Structured Decomposition Xuanrui Zeng et.al. 2409.01472 link
2024-09-02 SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation Alberto Bacchin et.al. 2409.01109 link
2024-09-02 Towards Robust Online Domain Adaptive Semantic Segmentation under Adverse Weather Conditions Taorong Liu et.al. 2409.01072 null
2024-08-30 Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes Li Zhang et.al. 2408.17421 link
2024-08-30 Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations Ahmed Hammam et.al. 2408.17311 null
2024-08-30 Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training Zizheng Huang et.al. 2408.17081 link
2024-08-30 Transient Fault Tolerant Semantic Segmentation for Autonomous Driving Leonardo Iurada et.al. 2408.16952 link
2024-08-29 SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection Rohit Venkata Sai Dulam et.al. 2408.16645 null
2024-08-29 MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation Linyan Yang et.al. 2408.16478 null
2024-08-29 Multi-source Domain Adaptation for Panoramic Semantic Segmentation Jing Jiang et.al. 2408.16469 null
2024-08-29 EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More Kanghao Chen et.al. 2408.16254 null
2024-08-28 SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors Zhiqing Zhang et.al. 2408.15887 null
2024-08-28 DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries Yu Yang et.al. 2408.15813 null
2024-08-28 TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation Junbao Zhou et.al. 2408.15657 link
2024-08-27 Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images Silvia Seidlitz et.al. 2408.15373 link
2024-08-27 An Investigation on The Position Encoding in Vision-Based Dynamics Prediction Jiageng Zhu et.al. 2408.15201 null
2024-08-27 Applying ViT in Generalized Few-shot Semantic Segmentation Liyuan Geng et.al. 2408.14957 link
2024-08-27 Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack Naufal Suryanto et.al. 2408.14879 null
2024-08-27 MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation Yuanbing Zhu et.al. 2408.14776 null
2024-08-26 Physically Feasible Semantic Segmentation Shamik Basu et.al. 2408.14672 link
2024-08-25 OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation Muhammad Rameez ur Rahman et.al. 2408.13936 link
2024-08-25 Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation Yuwen Pan et.al. 2408.13838 null
2024-08-25 TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather Xiongwei Zhao et.al. 2408.13802 link
2024-08-25 ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation Xin Zhang et.al. 2408.13771 null
2024-08-25 Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation Zhaoyang Li et.al. 2408.13752 null
2024-08-24 ESA: Annotation-Efficient Active Learning for Semantic Segmentation Jinchao Ge et.al. 2408.13491 link
2024-08-23 Accuracy Improvement of Cell Image Segmentation Using Feedback Former Hinako Mitsuoka et.al. 2408.12974 null
2024-08-23 Image Segmentation in Foundation Model Era: A Survey Tianfei Zhou et.al. 2408.12957 null
2024-08-23 Symmetric masking strategy enhances the performance of Masked Image Modeling Khanh-Binh Nguyen et.al. 2408.12772 null
2024-08-22 Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets Wolfgang Boettcher et.al. 2408.12489 null
2024-08-22 The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation Tuyen Tran et.al. 2408.12447 null
2024-08-21 UNetMamba: Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images Enze Zhu et.al. 2408.11545 null
2024-08-21 Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation Chuandong Liu et.al. 2408.11280 null
2024-08-20 NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency Valentinos Pariza et.al. 2408.11054 null
2024-08-20 CO2Wounds-V2: Extended Chronic Wounds Dataset From Leprosy Patients Karen Sanchez et.al. 2408.10827 null
2024-08-20 Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended? Chen Liang et.al. 2408.10627 null
2024-08-20 Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation Jiawei Han et.al. 2408.10537 link
2024-08-19 Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network Rasha Alshawi et.al. 2408.10181 null
2024-08-19 Dynamic Label Injection for Imbalanced Industrial Defect Segmentation Emanuele Caruso et.al. 2408.10031 link
2024-08-19 Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis Kira Maag et.al. 2408.10021 null
2024-08-19 Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving Jun Yan et.al. 2408.09839 link
2024-08-18 OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras Muhammad Rameez Ur Rahman et.al. 2408.09424 link
2024-08-18 Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration Hao Ai et.al. 2408.09336 null
2024-08-17 Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology Junchao Zhu et.al. 2408.09278 link
2024-08-17 GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation Weiming Zhang et.al. 2408.09115 null
2024-08-17 Depth-guided Texture Diffusion for Image Semantic Segmentation Wei Sun et.al. 2408.09097 null
2024-08-15 5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks Dongshuo Yin et.al. 2408.08345 link
2024-08-14 MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis Nimeesha Chan et.al. 2408.07773 link
2024-08-15 MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation Beoungwoo Kang et.al. 2408.07576 link
2024-08-15 MagicFace: Training-free Universal-Style Human Image Customized Synthesis Yibin Wang et.al. 2408.07433 null
2024-08-14 Segment Using Just One Example Pratik Vora et.al. 2408.07393 null
2024-08-14 Ensemble architecture in polyp segmentation Hao-Yun Hsu et.al. 2408.07262 link
2024-08-14 Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks Raghavendra Singh et.al. 2408.07243 null
2024-08-14 Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training Ethan Kou et.al. 2408.07239 null
2024-08-13 ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation Jingyun Wang et.al. 2408.06747 link
2024-08-10 Dilated Convolution with Learnable Spacings Ismail Khalfaoui-Hassani et.al. 2408.06383 null
2024-08-12 Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images Siladittya Manna et.al. 2408.06235 null
2024-08-12 A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting Felix Assion et.al. 2408.06071 null
2024-08-12 Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning Xinrong Hu et.al. 2408.05889 null
2024-08-11 Seg-CycleGAN: SAR-to-optical image translation guided by a downstream task Hannuo Zhang et.al. 2408.05777 null
2024-08-11 MacFormer: Semantic Segmentation with Fine Object Boundaries Guoan Xu et.al. 2408.05699 null
2024-08-10 Multimodal generative semantic communication based on latent diffusion model Weiqi Fu et.al. 2408.05455 null
2024-08-09 In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation Dahyun Kang et.al. 2408.04961 link
2024-08-09 ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation Mengcheng Lan et.al. 2408.04883 link
2024-08-09 Extracting Signal Electron Trajectories in the COMET Phase-I Cylindrical Drift Chamber Using Deep Learning Fumihiro Kaneko et.al. 2408.04795 null
2024-08-08 SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation Jieming Yu et.al. 2408.04593 null
2024-08-08 SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios Sriram Mandalika et.al. 2408.04482 null
2024-08-08 What could go wrong? Discovering and describing failure modes in computer vision Gabriela Csurka et.al. 2408.04471 null
2024-08-07 CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications Tianfang Zhang et.al. 2408.03703 link
2024-08-07 SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology Mingya Zhang et.al. 2408.03651 link
2024-08-06 Post-Mortem Human Iris Segmentation Analysis with Deep Learning Afzal Hossain et.al. 2408.03448 null
2024-08-06 Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression Jonas Schmitt et.al. 2408.03046 link
2024-08-05 Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation Sai Prasanna et.al. 2408.02297 null
2024-08-05 Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs Jeongkee Lim et.al. 2408.02261 null
2024-08-05 Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders Muhammad Abdullah Jamal et.al. 2408.02245 null
2024-08-04 Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation Ye Du et.al. 2408.02039 null
2024-08-03 Bayesian Active Learning for Semantic Segmentation Sima Didari et.al. 2408.01694 null
2024-08-03 A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection Omkar Oak et.al. 2408.01692 null
2024-08-03 Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation Balázs Opra et.al. 2408.01640 null
2024-08-02 Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans Lukas Kratochvila et.al. 2408.01526 null
2024-08-02 Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation Yuanzhi Su et.al. 2408.01356 null
2024-08-02 StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation Bingyu Li et.al. 2408.01343 null
2024-08-02 Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach Yabin Zhu et.al. 2408.00969 null
2024-08-01 Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation Siyu Jiao et.al. 2408.00744 null
2024-08-01 Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function Matias Oscar Volman Stern et.al. 2408.00707 null
2024-08-01 AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation Asbjørn Munk et.al. 2408.00640 null
2024-08-01 SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation Shengbo Tan et.al. 2408.00496 null
2024-07-31 Open-Vocabulary Audio-Visual Semantic Segmentation Ruohao Guo et.al. 2407.21721 null
2024-07-31 MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment Anurag Das et.al. 2407.21654 null
2024-07-31 Small Object Few-shot Segmentation for Vision-based Industrial Inspection Zilong Zhang et.al. 2407.21351 null
2024-07-31 On-the-fly Point Feature Representation for Point Clouds Analysis Jiangyi Wang et.al. 2407.21335 null
2024-07-31 Fine-grained Metrics for Point Cloud Semantic Segmentation Zhuheng Lu et.al. 2407.21289 null
2024-07-30 PLANesT-3D: A new annotated dataset for segmentation of 3D plant point clouds Kerem Mertoğlu et.al. 2407.21150 null
2024-07-30 Learning Ordinality in Semantic Segmentation Rafael Cristino et.al. 2407.20959 null
2024-07-29 Improving 2D Feature Representations by 3D-Aware Fine-Tuning Yuanwen Yue et.al. 2407.20229 null
2024-07-29 Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection With Sky-Annotated Dataset Yimian Dai et.al. 2407.20078 link
2024-07-29 Language-driven Grasp Detection with Mask-guided Attention Tuan Van Vo et.al. 2407.19877 null
2024-07-29 Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets Muhammad Abdullah Jamal et.al. 2407.19714 null
2024-07-29 ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement Ezequiel Perez-Zarate et.al. 2407.19708 link
2024-07-28 ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding Zhen Chen et.al. 2407.19435 link
2024-07-27 Ensembling convolutional neural networks for human skin segmentation Patryk Kuban et.al. 2407.19310 null
2024-07-27 Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network Gang Pan et.al. 2407.19271 null
2024-07-26 Sparse Refinement for Efficient High-Resolution Semantic Segmentation Zhijian Liu et.al. 2407.19014 null
2024-07-29 Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation Jingjun Yi et.al. 2407.18568 null
2024-07-25 Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception Julia Hindel et.al. 2407.18145 null
2024-07-25 TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework Guanfeng Tang et.al. 2407.18038 null
2024-07-25 Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions Jan Nikolas Morshuis et.al. 2407.18026 link
2024-07-24 Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation Hyunwoo Yu et.al. 2407.17261 link
2024-07-24 Trans2Unet: Neural fusion for Nuclei Semantic Segmentation Dinh-Phu Tran et.al. 2407.17181 null
2024-07-24 PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning Mu Chen et.al. 2407.17101 null
2024-07-25 Enhancing Environmental Monitoring through Multispectral Imaging: The WasteMS Dataset for Semantic Segmentation of Lakeside Waste Qinfeng Zhu et.al. 2407.17028 link
2024-07-24 Progressive Query Refinement Framework for Bird's-Eye-View Semantic Segmentation from Surrounding Images Dooseop Choi et.al. 2407.17003 link
2024-07-23 Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving Anam Manzoor et.al. 2407.16647 null
2024-07-23 Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging Daniela L. Ramos et.al. 2407.16608 null
2024-07-23 Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision Aditya Krishnan et.al. 2407.16102 null
2024-07-22 MILAN: Milli-Annotations for Lidar Semantic Segmentation Nermin Samet et.al. 2407.15797 null
2024-07-22 Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond Silvio Galesso et.al. 2407.15739 link
2024-07-22 MSSPlace: Multi-Sensor Place Recognition with Visual and Text Semantics Alexander Melekhin et.al. 2407.15663 link
2024-07-22 Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling Bo Yuan et.al. 2407.15429 link
2024-07-22 Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data Junha Song et.al. 2407.15383 null
2024-07-21 Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation Xiaoyang Wu et.al. 2407.15282 null
2024-07-20 Downstream-Pretext Domain Knowledge Traceback for Active Learning Beichen Zhang et.al. 2407.14720 null
2024-07-19 Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model Kun Zhao et.al. 2407.14326 null
2024-07-19 Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation Zhengyuan Xie et.al. 2407.14142 link
2024-07-19 GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation Florian Chabot et.al. 2407.14108 null
2024-07-18 Many Perception Tasks are Highly Redundant Functions of their Input Data Rahul Ramesh et.al. 2407.13841 null
2024-07-18 GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model Abdelrahman Shaker et.al. 2407.13772 link
2024-07-18 SegPoint: Segment Any Point Cloud via Large Language Model Shuting He et.al. 2407.13761 null
2024-07-18 MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis Ziming Zhong et.al. 2407.13675 link
2024-07-18 Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models Xiaoyu Zhu et.al. 2407.13642 null
2024-07-18 FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures Hao Lu et.al. 2407.13500 link
2024-07-18 FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions Sohyun Lee et.al. 2407.13437 null
2024-07-18 Lightweight Uncertainty Quantification with Simplex Semantic Segmentation for Terrain Traversability Judith Dijk et.al. 2407.13392 null
2024-07-18 Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation Chang Liu et.al. 2407.13363 null
2024-07-18 Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation Shoumeng Qiu et.al. 2407.13254 null
2024-07-18 OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird's-eye-view Vehicle Semantic Segmentation Jian Sun et.al. 2407.13137 null
2024-07-16 Mitigating Background Shift in Class-Incremental Semantic Segmentation Gilhan Park et.al. 2407.11859 link
2024-07-16 Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation Juncheng Ma et.al. 2407.11820 null
2024-07-16 XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach Truong Thanh Hung Nguyen et.al. 2407.11771 null
2024-07-16 OAM-TCD: A globally diverse dataset of high-resolution tree cover maps Josh Veitch-Michaelis et.al. 2407.11743 null
2024-07-16 SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds Yanbo Wang et.al. 2407.11569 link
2024-07-16 Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations Yunya Gao et.al. 2407.11381 link
2024-07-16 Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities Xu Zheng et.al. 2407.11351 null
2024-07-16 Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation Xu Zheng et.al. 2407.11344 null
2024-07-16 TCFormer: Visual Recognition via Token Clustering Transformer Wang Zeng et.al. 2407.11321 link
2024-07-15 Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding Danish Nazir et.al. 2407.11224 null
2024-07-15 No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations Walter Simoncini et.al. 2407.10964 link
2024-07-15 APC: Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2407.10649 null
2024-07-15 Automated Label Unification for Multi-Dataset Semantic Segmentation with GNNs Rong Ma et.al. 2407.10534 null
2024-07-14 Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data Tuo Feng et.al. 2407.10200 link
2024-07-14 RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation Li Li et.al. 2407.10159 link
2024-07-14 HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation Chengjie Jiang et.al. 2407.10047 null
2024-07-13 Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation Anqi Zhang et.al. 2407.09838 null
2024-07-13 Enhancing Semantic Segmentation with Adaptive Focal Loss: A Novel Approach Md Rakibul Islam et.al. 2407.09828 null
2024-07-13 3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance Xiaoxu Xu et.al. 2407.09826 null
2024-07-13 TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation Xiaopei Wu et.al. 2407.09751 null
2024-07-12 FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background Muhammad Ali et.al. 2407.09379 link
2024-07-12 Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy Julian Wyatt et.al. 2407.09192 null
2024-07-12 Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off Levente Halmosi et.al. 2407.09150 link
2024-07-12 Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation Wei Cong et.al. 2407.09047 null
2024-07-12 Textual Query-Driven Mask Transformer for Domain Generalized Segmentation Byeonghyun Pak et.al. 2407.09033 null
2024-07-12 Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation Zihao Li et.al. 2407.08994 null
2024-07-11 Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation Tong Shao et.al. 2407.08268 null
2024-07-11 Enrich the content of the image Using Context-Aware Copy Paste Qiushi Guo et.al. 2407.08151 null
2024-07-10 MambaVision: A Hybrid Mamba-Transformer Vision Backbone Ali Hatamizadeh et.al. 2407.08083 link
2024-07-10 Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift Elliot Vincent et.al. 2407.07616 link
2024-07-10 H-FCBFormer Hierarchical Fully Convolutional Branch Transformer for Occlusal Contact Segmentation with Articulating Paper Ryan Banks et.al. 2407.07604 link
2024-07-11 Trainable Highly-expressive Activation Functions Irit Chelly et.al. 2407.07564 null
2024-07-10 Deformable-Heatmap-Segmentation for Automobile Visual Perception Hongyu Jin et.al. 2407.07493 null
2024-07-10 Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining Tianfang Sun et.al. 2407.07465 null
2024-07-11 HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation Guoan Xu et.al. 2407.07441 null
2024-07-09 ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation Yuyuan Liu et.al. 2407.07171 link
2024-07-08 Training-free CryoET Tomogram Segmentation Yizhou Zhao et.al. 2407.06833 link
2024-07-09 CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM Aditya Murali et.al. 2407.06795 null
2024-07-09 LuSNAR: A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration Jiayi Liu et.al. 2407.06512 link
2024-07-08 Leveraging image captions for selective whole slide image annotation Jingna Qiu et.al. 2407.06363 null
2024-07-08 Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots Siva Krishna Ravipati et.al. 2407.06077 null
2024-07-08 Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts Puzuo Wang et.al. 2407.06043 null
2024-07-08 RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation Sarah Elmahdy et.al. 2407.06016 link
2024-07-07 Semantic Segmentation for Real-World and Synthetic Vehicle's Forward-Facing Camera Images Tuan T. Nguyen et.al. 2407.05452 null
2024-07-07 Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness Idris Hamoud et.al. 2407.05448 null
2024-07-06 A Study of Test-time Contrastive Concepts for Open-world, Open-vocabulary Semantic Segmentation Monika Wysoczańska et.al. 2407.05061 null
2024-07-06 BlessemFlood21: Advancing Flood Analysis with a High-Resolution Georeferenced Dataset for Humanitarian Aid Support Vladyslav Polushko et.al. 2407.05007 null
2024-07-05 Explainable Metric Learning for Deflating Data Bias Emma Andrews et.al. 2407.04866 null
2024-07-05 LMSeg: A deep graph message-passing network for efficient and accurate semantic segmentation of large-scale 3D landscape meshes Zexian Huang et.al. 2407.04326 null
2024-07-04 Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier Prantik Howlader et.al. 2407.04036 link
2024-07-04 Relative Difficulty Distillation for Semantic Segmentation Dong Liang et.al. 2407.03719 null
2024-07-04 POSTURE: Pose Guided Unsupervised Domain Adaptation for Human Body Part Segmentation Arindam Dutta et.al. 2407.03549 null
2024-07-03 A Unified Framework for 3D Scene Understanding Wei Xu et.al. 2407.03263 null
2024-07-03 ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation Chang Li et.al. 2407.03033 null
2024-07-03 ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation Yipin Guo et.al. 2407.02881 null
2024-07-03 Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation Tao Chen et.al. 2407.02768 null
2024-07-02 Open Panoramic Segmentation Junwei Zheng et.al. 2407.02685 null
2024-07-02 Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction Tinghuai Wang et.al. 2407.02639 null
2024-07-02 Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather Junsung Park et.al. 2407.02286 link
2024-07-02 MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders Baijiong Lin et.al. 2407.02228 link
2024-07-02 Occlusion-Aware Seamless Segmentation Yihong Cao et.al. 2407.02182 link
2024-07-02 VRBiom: A New Periocular Dataset for Biometric Applications of HMD Ketan Kotwal et.al. 2407.02150 null
2024-07-02 Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts Pasquale De Marinis et.al. 2407.02075 null
2024-07-02 Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Chengchao Shen et.al. 2407.02014 link
2024-07-01 Label-free Neural Semantic Image Synthesis Jiayi Wang et.al. 2407.01790 null
2024-07-01 PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction Xuan Yu et.al. 2407.01349 null
2024-07-01 CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes Danial Qashqai et.al. 2407.01328 link
2024-06-29 SolarSAM: Building-scale Photovoltaic Potential Assessment Based on Segment Anything Model (SAM) and Remote Sensing for Emerging City Guohao Wang et.al. 2407.00296 link
2024-07-01 Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding Yifan Tang et.al. 2406.19791 null
2024-06-28 Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation Junsung Park et.al. 2406.19638 link
2024-06-28 PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation Deyi Ji et.al. 2406.19632 null
2024-06-27 Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model Haobo Yuan et.al. 2406.19369 null
2024-06-27 ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation Nazanin Moradinasab et.al. 2406.19225 null
2024-06-30 Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO Fuseini Mumuni et.al. 2406.19057 null
2024-06-27 Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation Tao Lian et.al. 2406.18809 null
2024-06-26 CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data Nikolaos Dionelis et.al. 2406.18279 null
2024-06-26 The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval Boris Meinardus et.al. 2406.18113 link
2024-06-26 Few-Shot Medical Image Segmentation with High-Fidelity Prototypes Song Tang et.al. 2406.18074 link
2024-06-25 Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation Xuming Zhang et.al. 2406.17679 null
2024-06-25 DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation Ahmad Mohammadshirazi et.al. 2406.17591 link
2024-06-25 Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation Felix Stillger et.al. 2406.17541 null
2024-06-25 Investigating Self-Supervised Methods for Label-Efficient Learning Srinivasa Rao Nandam et.al. 2406.17460 null
2024-06-25 Pseudo Labelling for Enhanced Masked Autoencoders Srinivasa Rao Nandam et.al. 2406.17450 null
2024-06-25 Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model Zhuoyuan Li et.al. 2406.17442 null
2024-06-25 Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes Qi Ma et.al. 2406.17438 link
2024-06-24 Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation Yizheng Wu et.al. 2406.16776 link
2024-06-24 μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation Pierangela Bruno et.al. 2406.16724 null
2024-06-24 GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection Harnaik Dhami et.al. 2406.16625 null
2024-06-24 LOGCAN++: Local-global class-aware network for semantic segmentation of remote sensing images Xiaowen Ma et.al. 2406.16502 link
2024-06-24 Cascade Reward Sampling for Efficient Decoding-Time Alignment Bolian Li et.al. 2406.16306 null
2024-06-24 SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments Neng Wang et.al. 2406.16279 link
2024-06-23 UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery Pengfei Zhang et.al. 2406.16129 null
2024-06-22 Fine-grained Background Representation for Weakly Supervised Semantic Segmentation Xu Yin et.al. 2406.15755 null
2024-06-20 Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery Ilham Adi Panuntun et.al. 2406.14220 null
2024-06-20 Trusting Semantic Segmentation Networks Samik Some et.al. 2406.14201 null
2024-06-20 EvSegSNN: Neuromorphic Semantic Segmentation for Event Data Dalia Hareb et.al. 2406.14178 null
2024-06-20 Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images Qinfeng Zhu et.al. 2406.14086 link
2024-06-19 Search-based DNN Testing and Retraining with GAN-enhanced Simulations Mohammed Oualid Attaoui et.al. 2406.13359 null
2024-06-19 Deep Learning-Based 3D Instance and Semantic Segmentation: A Review Siddiqui Muhammad Yasir et.al. 2406.13308 null
2024-06-18 Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation Guoyu Yang et.al. 2406.12496 link
2024-06-18 Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble Wang Liu et.al. 2406.12271 null
2024-06-17 OoDIS: Anomaly Instance Segmentation Benchmark Alexey Nekrasov et.al. 2406.11835 link
2024-06-17 Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT Maximilian E. Tschuchnig et.al. 2406.11650 null
2024-06-17 SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation Zhenchao Lin et.al. 2406.11441 link
2024-06-17 Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding Yunsong Wang et.al. 2406.11283 null
2024-06-17 Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation Bingfeng Zhang et.al. 2406.11189 null
2024-06-16 α-SSC: Uncertainty-Aware Camera-based 3D Semantic Scene Completion Sanbao Su et.al. 2406.11021 null
2024-06-16 PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery Libo Wang et.al. 2406.10828 link
2024-06-15 GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR Bharat Singh et.al. 2406.10722 null
2024-06-15 A Late-Stage Bitemporal Feature Fusion Network for Semantic Change Detection Chenyao Zhou et.al. 2406.10678 link
2024-06-14 ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers Narges Norouzi et.al. 2406.09936 null
2024-06-14 Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions Aldi Piroli et.al. 2406.09906 null
2024-06-14 Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation Brunó B. Englert et.al. 2406.09896 link
2024-06-14 Open-Vocabulary Semantic Segmentation with Image Embedding Balancing Xiangheng Shan et.al. 2406.09829 link
2024-06-13 Instance-level quantitative saliency in multiple sclerosis lesion segmentation Federico Spagnolo et.al. 2406.09335 null
2024-06-13 APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation Weizhao He et.al. 2406.08372 null
2024-06-12 Dataset Enhancement with Instance-Level Augmentations Orest Kupyn et.al. 2406.08249 link
2024-06-13 A²-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder Lixian Zhang et.al. 2406.08079 null
2024-06-12 OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding Yinan Deng et.al. 2406.08009 link
2024-06-12 SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation Chanda Grover Kamra et.al. 2406.07986 link
2024-06-12 Small Scale Data-Free Knowledge Distillation He Liu et.al. 2406.07876 link
2024-06-11 Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph Sergey Linok et.al. 2406.07113 null
2024-06-11 PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving Yining Shi et.al. 2406.07037 null
2024-06-12 LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection Jiahua Xu et.al. 2406.07023 null
2024-06-10 Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation Dong Zhao et.al. 2406.06813 link
2024-06-09 Transforming Heart Chamber Imaging: Self-Supervised Learning for Whole Heart Reconstruction and Segmentation Abdul Qayyum et.al. 2406.06643 null
2024-06-10 Merlin: A Vision Language Foundation Model for 3D Computed Tomography Louis Blankemeier et.al. 2406.06512 null
2024-06-10 UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving Daniel Bogdoll et.al. 2406.06370 null
2024-06-09 Scaling Graph Convolutions for Mobile Vision William Avery et.al. 2406.05850 link
2024-06-09 Solution for CVPR 2024 UG2+ Challenge Track on All Weather Semantic Segmentation Jun Yu et.al. 2406.05837 null
2024-06-09 Convolution and Attention-Free Mamba-based Cardiac Image Segmentation Abbas Khan et.al. 2406.05786 null
2024-06-09 Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language Mark Hamilton et.al. 2406.05629 link
2024-06-08 A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+ Jianzhao Wang et.al. 2406.05513 null
2024-06-08 Layered Image Vectorization via Semantic Simplification Zhenyu Wang et.al. 2406.05404 null
2024-06-08 1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation Qingfeng Liu et.al. 2406.05352 null
2024-06-07 USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation Xiaoqi Wang et.al. 2406.05271 null
2024-06-07 Semantic Segmentation on VSPW Dataset through Masked Video Consistency Chen Liang et.al. 2406.04979 null
2024-06-07 Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment Venkanna Babu Guthula et.al. 2406.04949 null
2024-06-06 Characterizing segregation in blast rock piles a deep-learning approach leveraging aerial image analysis Chengeng Liu et.al. 2406.04149 null
2024-06-06 Frequency-based Matcher for Long-tailed Semantic Segmentation Shan Li et.al. 2406.03917 link
2024-06-07 Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge Nan Zhang et.al. 2406.03799 link
2024-06-06 DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation Zilu Guo et.al. 2406.03702 link
2024-06-05 Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation Maximilian Zenk et.al. 2406.03323 null
2024-06-05 Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy Yunho Kim et.al. 2406.02989 null
2024-06-04 W-RIZZ: A Weakly-Supervised Framework for Relative Traversability Estimation in Mobile Robotics Andre Schreiber et.al. 2406.02822 link
2024-06-04 Window to Wall Ratio Detection using SegFormer Zoe De Simone et.al. 2406.02706 link
2024-06-04 Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning Heather Doig et.al. 2406.01932 null
2024-06-03 EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding Thanh-Dat Truong et.al. 2406.01429 null
2024-06-03 TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation Antonio Santo et.al. 2406.01395 link
2024-06-03 ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds Ka Lung Cheung et.al. 2406.01337 link
2024-06-03 LSKSANet: A Novel Architecture for Remote Sensing Image Semantic Segmentation Leveraging Large Selective Kernel and Sparse Attention Mechanism Miao Fu et.al. 2406.01228 null
2024-06-04 GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer Ding Jia et.al. 2406.01210 link
2024-06-03 S-CycleGAN: Semantic Segmentation Enhanced CT-Ultrasound Image-to-Image Translation for Robotic Ultrasonography Yuhan Song et.al. 2406.01191 null
2024-06-02 Diffusion Features to Bridge Domain Gap for Semantic Segmentation Yuxiang Ji et.al. 2406.00777 null
2024-06-02 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation Yunheng Li et.al. 2406.00670 null
2024-06-02 Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024 Biao Wu et.al. 2406.00587 null
2024-05-31 Uncertainty Quantification for Bird's Eye View Semantic Segmentation: Methods and Benchmarks Linlin Yu et.al. 2405.20986 null
2024-05-31 Revisiting and Maximizing Temporal Knowledge in Semi-supervised Semantic Segmentation Wooseok Shin et.al. 2405.20610 link
2024-05-30 P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation Qi Zhang et.al. 2405.20443 null
2024-05-30 SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow Chaoyang Wang et.al. 2405.20282 link
2024-05-30 MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion Angel Villar-Corrales et.al. 2405.19921 link
2024-05-30 Open-Set Domain Adaptation for Semantic Segmentation Seun-An Choe et.al. 2405.19899 link
2024-05-30 DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation Ron Keuth et.al. 2405.19746 link
2024-05-30 Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes Yong-Qiang Mao et.al. 2405.19735 null
2024-05-30 CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation Ankush Gajanan Arudkar et.al. 2405.19672 null
2024-05-29 Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation Lianlei Shan et.al. 2405.19568 null
2024-05-29 Enabling Visual Recognition at Radio Frequency Haowen Lai et.al. 2405.19516 null
2024-05-29 Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models Tianrun Chen et.al. 2405.19326 null
2024-05-29 A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation Niclas Vödisch et.al. 2405.19035 link
2024-05-29 Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation Zelin Peng et.al. 2405.18840 null
2024-05-28 Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation JuneHyoung Kwon et.al. 2405.18148 null
2024-05-28 Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images Lianlei Shan et.al. 2405.18078 null
2024-05-28 RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields Mihnea-Bogdan Jurca et.al. 2405.18033 null
2024-05-28 DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture Shentong Mo et.al. 2405.17995 null
2024-05-28 The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention Xingyu Ding et.al. 2405.17776 null
2024-05-27 Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation Steven Landgraf et.al. 2405.17097 null
2024-05-27 DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking Hongtao Wang et.al. 2405.16980 null
2024-05-27 Collective Perception Datasets for Autonomous Driving: A Comprehensive Review Sven Teufel et.al. 2405.16973 null
2024-05-27 Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models Qian Wang et.al. 2405.16947 null
2024-05-27 A re-calibration method for object detection with multi-modal alignment bias in autonomous driving Zhihang Song et.al. 2405.16848 null
2024-05-25 BOLD: Boolean Logic Deep Learning Van Minh Nguyen et.al. 2405.16339 null
2024-05-25 Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation Huizhou Chen et.al. 2405.16099 null
2024-05-25 Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality Hakim Ikebayashi et.al. 2405.16008 null
2024-05-24 Visualize and Paint GAN Activations Rudolf Herdt et.al. 2405.15636 null
2024-05-24 Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets Hoàng-Ân Lê et.al. 2405.15394 null
2024-05-24 U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation Bingyu Li et.al. 2405.15365 link
2024-05-24 Cross-Domain Few-Shot Semantic Segmentation via Doubly Matching Transformation Jiayi Chen et.al. 2405.15265 null
2024-05-23 Mamba-R: Vision Mamba ALSO Needs Registers Feng Wang et.al. 2405.14858 null
2024-05-23 Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation Daniel Kienzle et.al. 2405.14467 null
2024-05-23 MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models Jiuming Liu et.al. 2405.14338 null
2024-05-23 Tuning-free Universally-Supervised Semantic Segmentation Xiaobo Yang et.al. 2405.14294 null
2024-05-23 SCMix: Stochastic Compound Mixing for Open Compound Domain Adaptation in Semantic Segmentation Kai Yao et.al. 2405.14278 null
2024-05-23 Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations Mohammed Baharoon et.al. 2405.14239 null
2024-05-24 Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification Taylor Archibald et.al. 2405.14162 null
2024-05-23 Skip-SCAR: A Modular Approach to ObjectGoal Navigation with Sparsity and Adaptive Skips Yaotian Liu et.al. 2405.14154 null
2024-05-22 TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System Diogo Lavado et.al. 2405.13989 null
2024-05-22 Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer Qihang Fan et.al. 2405.13337 null
2024-05-21 Transparency Distortion Robustness for SOTA Image Segmentation Tasks Volker Knauthe et.al. 2405.12864 null
2024-05-20 A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation Sushmita Sarker et.al. 2405.11903 null
2024-05-20 Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments Jooyong Park et.al. 2405.11855 null
2024-05-20 Universal Organizer of SAM for Unsupervised Semantic Segmentation Tingting Li et.al. 2405.11742 null
2024-05-19 Interpreting a Semantic Segmentation Model for Coastline Detection Conor O'Sullivan et.al. 2405.11500 null
2024-05-17 CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation Mushui Liu et.al. 2405.10530 link
2024-05-16 Towards Task-Compatible Compressible Representations Anderson de Andrade et.al. 2405.10244 link
2024-05-16 A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance Andrea Matteazzi et.al. 2405.10046 null
2024-05-16 Towards Realistic Incremental Scenario in Class Incremental Semantic Segmentation Jihwan Kwak et.al. 2405.09858 null
2024-05-15 Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation Guo Yachan et.al. 2405.09682 null
