2024-11-21 |
Revisiting the Integration of Convolution and Attention for Vision Backbone |
Lei Zhu et.al. |
2411.14429 |
link |
2024-11-21 |
CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation |
Lin Sun et.al. |
2411.13836 |
link |
2024-11-21 |
Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals |
Hussni Mohd Zakir et.al. |
2411.13774 |
null |
2024-11-20 |
FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting |
Ola Shorinwa et.al. |
2411.13753 |
null |
2024-11-20 |
BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation |
Umamaheswaran Raman Kumar et.al. |
2411.13251 |
null |
2024-11-20 |
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation |
Ziyi Wang et.al. |
2411.13243 |
link |
2024-11-20 |
Automating Sonologists USG Commands with AI and Voice Interface |
Emad Mohamed et.al. |
2411.13006 |
null |
2024-11-19 |
A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation |
Jiaqi Yang et.al. |
2411.12615 |
link |
2024-11-19 |
SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation |
Ron Keuth et.al. |
2411.12602 |
link |
2024-11-19 |
ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator |
Xiao Jiang et.al. |
2411.12250 |
null |
2024-11-18 |
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements |
M. Arda Aydın et.al. |
2411.12044 |
link |
2024-11-18 |
Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation |
Hanieh Shojaei Miandashti et.al. |
2411.11935 |
null |
2024-11-18 |
MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models |
Harshita Sharma et.al. |
2411.11362 |
null |
2024-11-18 |
Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications |
Scarlett Raine et.al. |
2411.11287 |
null |
2024-11-16 |
Attention-based U-Net Method for Autonomous Lane Detection |
Mohammadhamed Tangestanizadeh et.al. |
2411.10902 |
null |
2024-11-16 |
Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation |
Jaisidh Singh et.al. |
2411.10845 |
null |
2024-11-19 |
Diffusion-Based Semantic Segmentation of Lumbar Spine MRI Scans of Lower Back Pain Patients |
Maria Monzon et.al. |
2411.10755 |
link |
2024-11-15 |
Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images |
Ammar Qammaz et.al. |
2411.10334 |
null |
2024-11-15 |
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation |
Dengke Zhang et.al. |
2411.10086 |
null |
2024-11-14 |
OneNet: A Channel-Wise 1D Convolutional U-Net |
Sanghyun Byun et.al. |
2411.09838 |
link |
2024-11-14 |
Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks |
Zengyi Yang et.al. |
2411.09387 |
null |
2024-11-14 |
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation |
Yuheng Shi et.al. |
2411.09219 |
link |
2024-11-14 |
Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery |
Ashim Dahal et.al. |
2411.09101 |
link |
2024-11-13 |
CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation |
Xuming Zhang et.al. |
2411.09023 |
null |
2024-11-14 |
Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation |
Yangyang Li et.al. |
2411.08756 |
null |
2024-11-13 |
Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model |
Jun Xie et.al. |
2411.08592 |
null |
2024-11-12 |
Isometric Transformations for Image Augmentation in Mueller Matrix Polarimetry |
Christopher Hahne et.al. |
2411.07918 |
link |
2024-11-12 |
Semantic segmentation on multi-resolution optical and microwave data using deep learning |
Jai G Singla et.al. |
2411.07581 |
null |
2024-11-11 |
SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation |
Jiale Chen et.al. |
2411.06991 |
null |
2024-11-14 |
Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision |
Yueyang Cang et.al. |
2411.06727 |
null |
2024-11-10 |
Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments |
Deegan Atha et.al. |
2411.06632 |
null |
2024-11-09 |
Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing |
Kaixuan Lu et.al. |
2411.06091 |
null |
2024-11-08 |
Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model |
Shuchang Lyu et.al. |
2411.05878 |
link |
2024-11-08 |
Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation |
Sien Li et.al. |
2411.05307 |
link |
2024-11-07 |
In the Era of Prompt Learning with Vision-Language Models |
Ankit Jha et.al. |
2411.04892 |
null |
2024-11-11 |
ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset |
Olaf Wysocki et.al. |
2411.04865 |
link |
2024-11-06 |
Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts |
Zhitong Gao et.al. |
2411.03829 |
link |
2024-11-06 |
Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model |
Yansong Qu et.al. |
2411.03672 |
null |
2024-11-05 |
Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation |
Zhiling Yue et.al. |
2411.03551 |
null |
2024-11-05 |
SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture |
Andrew Heschl et.al. |
2411.03505 |
link |
2024-11-05 |
Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need |
Qishuai Wen et.al. |
2411.03033 |
link |
2024-11-05 |
Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation |
Xavier Timoneda et.al. |
2411.02969 |
null |
2024-11-05 |
Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery |
Mohammad Kakooei et.al. |
2411.02935 |
null |
2024-11-05 |
CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation |
Jinchao Ge et.al. |
2411.02715 |
null |
2024-11-04 |
Deep Learning on 3D Semantic Segmentation: A Detailed Review |
Thodoris Betsas et.al. |
2411.02104 |
null |
2024-11-04 |
Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models |
Sharat Agarwal et.al. |
2411.01925 |
null |
2024-11-04 |
DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability |
Bo Gao et.al. |
2411.01819 |
null |
2024-11-04 |
Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations |
Thanh Nguyen Canh et.al. |
2411.01816 |
null |
2024-11-03 |
PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation |
Xinyu Xu et.al. |
2411.01624 |
null |
2024-11-01 |
Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions |
Lixiao Yang et.al. |
2411.01039 |
null |
2024-11-01 |
Event-guided Low-light Video Semantic Segmentation |
Zhen Yao et.al. |
2411.00639 |
null |
2024-11-01 |
Cross-modal semantic segmentation for indoor environmental perception using single-chip millimeter-wave radar raw data |
Hairuo Hu et.al. |
2411.00499 |
null |
2024-11-01 |
Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing |
Naufal Suryanto et.al. |
2411.00425 |
link |
2024-10-31 |
A Recipe for Geometry-Aware 3D Mesh Transformers |
Mohammad Farazi et.al. |
2411.00164 |
null |
2024-10-31 |
Federated Black-Box Adaptation for Semantic Segmentation |
Jay N. Paranjape et.al. |
2410.24181 |
null |
2024-10-31 |
COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes |
Muhammad Ali et.al. |
2410.24139 |
link |
2024-10-31 |
Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model |
Hao Zhang et.al. |
2410.23905 |
link |
2024-10-30 |
S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving |
Maciej K. Wozniak et.al. |
2410.23085 |
null |
2024-10-31 |
CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation |
Ziyang Gong et.al. |
2410.22629 |
link |
2024-10-29 |
Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation |
Zhaochong An et.al. |
2410.22489 |
null |
2024-10-29 |
Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation |
Jintao Tong et.al. |
2410.22135 |
null |
2024-10-29 |
Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models |
Imad Ali Shah et.al. |
2410.22101 |
null |
2024-10-29 |
Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation |
Ruihao Xia et.al. |
2410.21708 |
link |
2024-10-28 |
Domain Adaptation with a Single Vision-Language Embedding |
Mohammad Fahes et.al. |
2410.21361 |
null |
2024-10-28 |
IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks |
Manjunath D et.al. |
2410.20953 |
null |
2024-10-27 |
A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models |
Camilo Espinosa-Curilem et.al. |
2410.20595 |
link |
2024-10-27 |
Unlocking Comics: The AI4VA Dataset for Visual Understanding |
Peter Grönquist et.al. |
2410.20459 |
link |
2024-10-27 |
Historical Test-time Prompt Tuning for Vision Foundation Models |
Jingyi Zhang et.al. |
2410.20346 |
null |
2024-10-25 |
OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery |
Philipe Dias et.al. |
2410.19965 |
null |
2024-10-25 |
IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation |
Kaixian Qu et.al. |
2410.19697 |
null |
2024-10-25 |
Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation |
Yao Wu et.al. |
2410.19446 |
link |
2024-10-25 |
Context-Based Visual-Language Place Recognition |
Soojin Woo et.al. |
2410.19341 |
link |
2024-10-24 |
Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks |
Alexander Jaus et.al. |
2410.18684 |
null |
2024-10-24 |
Unsupervised semantic segmentation of urban high-density multispectral point clouds |
Oona Oinonen et.al. |
2410.18520 |
null |
2024-10-26 |
CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator |
Stefanos Pasios et.al. |
2410.18238 |
null |
2024-10-23 |
Towards Safer Planetary Exploration: A Hybrid Architecture for Terrain Traversability Analysis in Mars Rovers |
Achille Chiuchiarelli et.al. |
2410.17738 |
null |
2024-10-22 |
EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding |
Zhiyi Pan et.al. |
2410.17207 |
null |
2024-10-22 |
SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments |
Jumman Hossain et.al. |
2410.16686 |
null |
2024-10-21 |
TIPS: Text-Image Pretraining with Spatial Awareness |
Kevis-Kokitsi Maninis et.al. |
2410.16512 |
null |
2024-10-21 |
GenGMM: Generalized Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation |
Nazanin Moradinasab et.al. |
2410.16485 |
null |
2024-10-21 |
LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training |
Thomas Kreutz et.al. |
2410.15833 |
link |
2024-10-21 |
TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight |
Hyun-Kurl Jang et.al. |
2410.15674 |
link |
2024-10-21 |
Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications |
Jintao Ren et.al. |
2410.15584 |
null |
2024-10-22 |
Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation |
Fnu Neha et.al. |
2410.15472 |
null |
2024-10-18 |
On the Influence of Shape, Texture and Color for Learning Semantic Segmentation |
Annika Mütze et.al. |
2410.14878 |
null |
2024-10-18 |
Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+ |
Arpan Mahara et.al. |
2410.14836 |
null |
2024-10-17 |
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding |
Guangda Ji et.al. |
2410.13924 |
null |
2024-10-17 |
Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks |
Clément Playout et.al. |
2410.13822 |
link |
2024-10-22 |
EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything |
Joonhyeon Song et.al. |
2410.13621 |
link |
2024-10-17 |
Day-Night Adaptation: An Innovative Source-free Adaptation Framework for Medical Image Segmentation |
Ziyang Chen et.al. |
2410.13472 |
null |
2024-10-17 |
SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation in Remote Sensing |
Bin Wang et.al. |
2410.13471 |
link |
2024-10-17 |
Railway LiDAR semantic segmentation based on intelligent semi-automated data annotation |
Florian Wulff et.al. |
2410.13383 |
null |
2024-10-17 |
Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation |
Houze Liu et.al. |
2410.13099 |
null |
2024-10-16 |
Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation |
Wenbo Xu et.al. |
2410.13094 |
null |
2024-10-16 |
Risk Assessment for Autonomous Landing in Urban Environments using Semantic Segmentation |
Jesús Alejandro Loera-Ponce et.al. |
2410.12988 |
null |
2024-10-16 |
VividMed: Vision Language Model with Versatile Visual Grounding for Medicine |
Lingxiao Luo et.al. |
2410.12694 |
link |
2024-10-16 |
Cascade learning in multi-task encoder-decoder networks for concurrent bone segmentation and glenohumeral joint assessment in shoulder CT scans |
Luca Marsilio et.al. |
2410.12641 |
null |
2024-10-16 |
SAM-Guided Masked Token Prediction for 3D Scene Understanding |
Zhimin Chen et.al. |
2410.12158 |
null |
2024-10-15 |
WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation |
Chenghao Qian et.al. |
2410.12075 |
null |
2024-10-15 |
Development and Testing of a Wood Panels Bark Removal Equipment Based on Deep Learning |
Rijun Wang et.al. |
2410.11913 |
null |
2024-10-15 |
RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation |
Anton Antonov et.al. |
2410.11722 |
link |
2024-10-15 |
InvSeg: Test-Time Prompt Inversion for Semantic Segmentation |
Jiayi Lin et.al. |
2410.11473 |
null |
2024-10-15 |
MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation |
Xianping Ma et.al. |
2410.11160 |
link |
2024-10-14 |
Locality Alignment Improves Vision-Language Models |
Ian Covert et.al. |
2410.11087 |
null |
2024-10-14 |
Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes |
Tim Broedermann et.al. |
2410.10791 |
null |
2024-10-14 |
UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation |
Lihe Yang et.al. |
2410.10777 |
link |
2024-10-14 |
Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation |
Daniel Fusaro et.al. |
2410.10510 |
link |
2024-10-14 |
LKASeg: Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections |
Xuezhi Xiang et.al. |
2410.10433 |
null |
2024-10-14 |
V2M: Visual 2-Dimensional Mamba for Image Representation Learning |
Chengkun Wang et.al. |
2410.10382 |
link |
2024-10-14 |
GlobalMamba: Global Image Serialization for Vision Mamba |
Chengkun Wang et.al. |
2410.10316 |
link |
2024-10-13 |
AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model |
Yuchen Li et.al. |
2410.09714 |
null |
2024-10-12 |
An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation |
Wei Liang et.al. |
2410.09443 |
null |
2024-10-11 |
Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation |
Varduhi Yeghiazaryan et.al. |
2410.08946 |
null |
2024-10-11 |
Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation |
Hanieh Shojaei et.al. |
2410.08687 |
null |
2024-10-11 |
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention |
Nguyen Huu Bao Long et.al. |
2410.08582 |
link |
2024-10-10 |
Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? |
Samir Abou Haidar et.al. |
2410.08365 |
null |
2024-10-10 |
Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation |
Zhiyi Pan et.al. |
2410.08091 |
null |
2024-10-10 |
Shift and matching queries for video semantic segmentation |
Tsubasa Mizuno et.al. |
2410.07635 |
null |
2024-10-10 |
3D Vision-Language Gaussian Splatting |
Qucheng Peng et.al. |
2410.07577 |
null |
2024-10-11 |
Bridge the Points: Graph-based Few-shot Segment Anything Semantically |
Anqi Zhang et.al. |
2410.06964 |
null |
2024-10-09 |
Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation |
Seungho Lee et.al. |
2410.06893 |
null |
2024-10-09 |
Rethinking the Evaluation of Visible and Infrared Image Fusion |
Dayan Guan et.al. |
2410.06811 |
link |
2024-10-10 |
QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model |
Fei Xie et.al. |
2410.06806 |
link |
2024-10-09 |
Transesophageal Echocardiography Generation using Anatomical Models |
Emmanuel Oladokun et.al. |
2410.06781 |
null |
2024-10-09 |
Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy |
Qinfeng Zhu et.al. |
2410.06725 |
null |
2024-10-09 |
Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments |
Meng Yu et.al. |
2410.06626 |
null |
2024-10-09 |
Towards Natural Image Matting in the Wild via Real-Scenario Prior |
Ruihao Xia et.al. |
2410.06593 |
link |
2024-10-08 |
Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions |
Mateus Karvat et.al. |
2410.06380 |
null |
2024-10-08 |
Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading |
Fang Gao et.al. |
2410.05762 |
null |
2024-10-07 |
Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation |
Vince Zhu et.al. |
2410.04689 |
null |
2024-10-04 |
SpecSAR-Former: A Lightweight Transformer-based Network for Global LULC Mapping Using Integrated Sentinel-1 and Sentinel-2 |
Hao Yu et.al. |
2410.03962 |
null |
2024-10-04 |
Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features |
Benyuan Meng et.al. |
2410.03558 |
link |
2024-10-04 |
Semantic Segmentation Based Quality Control of Histopathology Whole Slide Images |
Abhijeet Patil et.al. |
2410.03289 |
link |
2024-10-04 |
HRVMamba: High-Resolution Visual State Space Model for Dense Prediction |
Hao Zhang et.al. |
2410.03174 |
null |
2024-10-03 |
HiFiSeg: High-Frequency Information Enhanced Polyp Segmentation with Global-Local Vision Transformer |
Jingjing Ren et.al. |
2410.02528 |
null |
2024-10-04 |
Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation |
Muzhi Zhu et.al. |
2410.02369 |
null |
2024-10-03 |
RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds |
Remco Royen et.al. |
2410.02323 |
null |
2024-10-03 |
Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network |
Yangyang Qiu et.al. |
2410.02224 |
null |
2024-10-03 |
Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images |
Qingyuan Liu et.al. |
2410.02207 |
null |
2024-10-02 |
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images |
Kaiyu Li et.al. |
2410.01768 |
link |
2024-10-02 |
One-Shot Robust Imitation Learning for Long-Horizon Visuomotor Tasks from Unsegmented Demonstrations |
Shaokang Wu et.al. |
2410.01630 |
null |
2024-10-02 |
Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation |
Zhaofeng Shi et.al. |
2410.01341 |
null |
2024-10-02 |
VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings |
Andrea Carrara et.al. |
2410.01336 |
null |
2024-10-01 |
RobustEMD: Domain Robust Matching for Cross-domain Few-shot Medical Image Segmentation |
Yazhou Zhu et.al. |
2410.01110 |
null |
2024-10-01 |
Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer |
Vlatko Spasev et.al. |
2410.01092 |
null |
2024-10-01 |
Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time |
Chiao-An Yang et.al. |
2410.01083 |
link |
2024-10-01 |
DeepAerialMapper: Deep Learning-based Semi-automatic HD Map Creation for Highly Automated Vehicles |
Robert Krajewski et.al. |
2410.00769 |
null |
2024-10-01 |
Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection |
Pengxi Zeng et.al. |
2410.00582 |
null |
2024-10-01 |
Precise Workcell Sketching from Point Clouds Using an AR Toolbox |
Krzysztof Zieliński et.al. |
2410.00479 |
null |
2024-09-30 |
AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation |
Boyu Han et.al. |
2409.20398 |
null |
2024-09-30 |
Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation |
Tillmann Rheude et.al. |
2409.20287 |
link |
2024-09-30 |
Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model |
Fulong Ma et.al. |
2409.20164 |
null |
2024-09-30 |
Segmenting Wood Rot using Computer Vision Models |
Roland Kammerbauer et.al. |
2409.20137 |
null |
2024-09-30 |
Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels |
Heeseong Shin et.al. |
2409.19846 |
null |
2024-09-27 |
Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation |
Raphael Hagmanns et.al. |
2409.18788 |
null |
2024-09-27 |
Learning from Pattern Completion: Self-supervised Controllable Generation |
Zhiqiang Chen et.al. |
2409.18694 |
link |
2024-09-27 |
Reducing Semantic Ambiguity In Domain Adaptive Semantic Segmentation Via Probabilistic Prototypical Pixel Contrast |
Xiaoke Hao et.al. |
2409.18543 |
link |
2024-10-01 |
Get It For Free: Radar Segmentation without Expert Labels and Its Application in Odometry and Localization |
Siru Li et.al. |
2409.18434 |
null |
2024-09-26 |
Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning |
Siyi Lu et.al. |
2409.17659 |
null |
2024-09-26 |
Global-Local Medical SAM Adaptor Based on Full Adaption |
Meng Wang et.al. |
2409.17486 |
null |
2024-09-25 |
VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection |
Liangyu Zhong et.al. |
2409.17330 |
null |
2024-09-25 |
2024 BRAVO Challenge Track 1 1st Place Report: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation |
Tommie Kerssies et.al. |
2409.17208 |
link |
2024-09-25 |
WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks |
Alberto Bacchin et.al. |
2409.16999 |
link |
2024-09-25 |
Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis |
Illia Tsiporenko et.al. |
2409.16940 |
null |
2024-09-24 |
A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation |
Avisha Kumar et.al. |
2409.16441 |
null |
2024-09-24 |
Instance Segmentation of Reinforced Concrete Bridges with Synthetic Point Clouds |
Asad Ur Rahman et.al. |
2409.16381 |
null |
2024-09-24 |
Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation |
Hannah Kerner et.al. |
2409.16252 |
link |
2024-09-24 |
Deep Learning for Precision Agriculture: Post-Spraying Evaluation and Deposition Estimation |
Harry Rogers et.al. |
2409.16213 |
link |
2024-09-24 |
Potential Field as Scene Affordance for Behavior Change-Based Visual Risk Object Identification |
Pang-Yuan Pao et.al. |
2409.15846 |
null |
2024-09-24 |
DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation |
Soojin Jang et.al. |
2409.15801 |
null |
2024-09-24 |
Autonomous Hiking Trail Navigation via Semantic Segmentation and Geometric Analysis |
Camndon Reed et.al. |
2409.15671 |
null |
2024-09-23 |
ZeroSCD: Zero-Shot Street Scene Change Detection |
Shyam Sundar Kannan et.al. |
2409.15255 |
null |
2024-09-17 |
Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks |
Edgar Heinert et.al. |
2409.11373 |
null |
2024-09-17 |
MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping |
Amirreza Fateh et.al. |
2409.11316 |
link |
2024-09-17 |
Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark |
Clifford Broni-Bediako et.al. |
2409.11227 |
link |
2024-09-17 |
HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios |
Nick Theisen et.al. |
2409.11205 |
link |
2024-09-16 |
Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning |
Amin Karimi Monsefi et.al. |
2409.10362 |
null |
2024-09-16 |
BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images |
Wentao Wang et.al. |
2409.10269 |
null |
2024-09-15 |
Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation |
Zhanteng Xie et.al. |
2409.09899 |
null |
2024-09-15 |
Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation |
Qilong Zhangli et.al. |
2409.09893 |
null |
2024-09-15 |
High Definition Map Mapping and Update: A General Overview and Future Directions |
Benny Wijaya et.al. |
2409.09726 |
null |
2024-09-14 |
Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation |
Hugo Porta et.al. |
2409.09497 |
null |
2024-09-13 |
AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation |
Zechao Sun et.al. |
2409.08516 |
null |
2024-09-13 |
VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation |
Ezra MacDonald et.al. |
2409.08461 |
link |
2024-09-12 |
Bayesian Self-Training for Semi-Supervised 3D Segmentation |
Ozan Unal et.al. |
2409.08102 |
null |
2024-09-12 |
Depth Matters: Exploring Deep Interactions of RGB-D for Semantic Segmentation in Traffic Scenes |
Siyu Chen et.al. |
2409.07995 |
null |
2024-09-12 |
SURGIVID: Annotation-Efficient Surgical Video Object Discovery |
Çağhan Köksal et.al. |
2409.07801 |
null |
2024-09-12 |
Lagrange Duality and Compound Multi-Attention Transformer for Semi-Supervised Medical Image Segmentation |
Fuchen Zheng et.al. |
2409.07793 |
link |
2024-09-12 |
ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation |
Fuchen Zheng et.al. |
2409.07779 |
link |
2024-09-12 |
Open-Vocabulary Remote Sensing Image Semantic Segmentation |
Qinglong Cao et.al. |
2409.07683 |
null |
2024-09-11 |
Token Turing Machines are Efficient Vision Models |
Purvish Jajal et.al. |
2409.07613 |
null |
2024-09-11 |
AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution |
Wangduo Xie et.al. |
2409.07171 |
null |
2024-09-11 |
Brain-Inspired Stepwise Patch Merging for Vision Transformers |
Yonghao Yu et.al. |
2409.06963 |
null |
2024-09-10 |
Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds |
Mu Cai et.al. |
2409.06827 |
link |
2024-09-10 |
A Semantic Segmentation Approach on Sweet Orange Leaf Diseases Detection Utilizing YOLO |
Sabit Ahamed Preanto et.al. |
2409.06671 |
null |
2024-09-10 |
PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation |
Yin Hu et.al. |
2409.06309 |
null |
2024-09-10 |
EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation |
Nischal Khanal et.al. |
2409.06183 |
link |
2024-09-09 |
SVS-GAN: Leveraging GANs for Semantic Video Synthesis |
Khaled M. Seyam et.al. |
2409.06074 |
null |
2024-09-09 |
Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance |
Quang-Huy Che et.al. |
2409.06002 |
null |
2024-09-09 |
Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features |
Jacob Gildenblat et.al. |
2409.05697 |
null |
2024-09-09 |
ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions |
Furqan Ahmed Shaik et.al. |
2409.05327 |
null |
2024-09-08 |
RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network |
Zhiwei Lin et.al. |
2409.04979 |
null |
2024-09-06 |
Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation |
Björn Michele et.al. |
2409.04409 |
link |
2024-09-05 |
Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution |
Marga Don et.al. |
2409.03754 |
link |
2024-09-05 |
LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones |
Moritz Nottebaum et.al. |
2409.03460 |
link |
2024-09-05 |
Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications |
Tong Bu et.al. |
2409.03368 |
null |
2024-09-05 |
UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking |
Md. Mahfuzur Rahman et.al. |
2409.03245 |
null |
2024-09-05 |
Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation |
Xixi Jiang et.al. |
2409.03228 |
link |
2024-09-06 |
iSeg: An Iterative Refinement-based Framework for Training-free Segmentation |
Lin Sun et.al. |
2409.03209 |
link |
2024-09-04 |
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation |
Hayeon Jo et.al. |
2409.02838 |
null |
2024-09-04 |
CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation |
Minhee Cho et.al. |
2409.02699 |
null |
2024-09-04 |
SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction |
Sumin Son et.al. |
2409.02513 |
null |
2024-09-03 |
K-Origins: Better Colour Quantification for Neural Networks |
Lewis Mason et.al. |
2409.02281 |
link |
2024-09-03 |
AllWeatherNet: Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions |
Chenghao Qian et.al. |
2409.02045 |
null |
2024-09-03 |
Segmenting Object Affordances: Reproducibility and Sensitivity to Scale |
Tommaso Apicella et.al. |
2409.01814 |
link |
2024-09-03 |
Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation |
Haodong Wang et.al. |
2409.01662 |
null |
2024-09-02 |
Semantic Segmentation from Image Labels by Reconstruction from Structured Decomposition |
Xuanrui Zeng et.al. |
2409.01472 |
link |
2024-09-02 |
SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation |
Alberto Bacchin et.al. |
2409.01109 |
link |
2024-09-02 |
Towards Robust Online Domain Adaptive Semantic Segmentation under Adverse Weather Conditions |
Taorong Liu et.al. |
2409.01072 |
null |
2024-08-30 |
Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes |
Li Zhang et.al. |
2408.17421 |
link |
2024-08-30 |
Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations |
Ahmed Hammam et.al. |
2408.17311 |
null |
2024-08-30 |
Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training |
Zizheng Huang et.al. |
2408.17081 |
link |
2024-08-30 |
Transient Fault Tolerant Semantic Segmentation for Autonomous Driving |
Leonardo Iurada et.al. |
2408.16952 |
link |
2024-08-29 |
SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection |
Rohit Venkata Sai Dulam et.al. |
2408.16645 |
null |
2024-08-29 |
MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation |
Linyan Yang et.al. |
2408.16478 |
null |
2024-08-29 |
Multi-source Domain Adaptation for Panoramic Semantic Segmentation |
Jing Jiang et.al. |
2408.16469 |
null |
2024-08-29 |
EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More |
Kanghao Chen et.al. |
2408.16254 |
null |
2024-08-28 |
SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors |
Zhiqing Zhang et.al. |
2408.15887 |
null |
2024-08-28 |
DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries |
Yu Yang et.al. |
2408.15813 |
null |
2024-08-28 |
TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation |
Junbao Zhou et.al. |
2408.15657 |
link |
2024-08-27 |
Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images |
Silvia Seidlitz et.al. |
2408.15373 |
link |
2024-08-27 |
An Investigation on The Position Encoding in Vision-Based Dynamics Prediction |
Jiageng Zhu et.al. |
2408.15201 |
null |
2024-08-27 |
Applying ViT in Generalized Few-shot Semantic Segmentation |
Liyuan Geng et.al. |
2408.14957 |
link |
2024-08-27 |
Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack |
Naufal Suryanto et.al. |
2408.14879 |
null |
2024-08-27 |
MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation |
Yuanbing Zhu et.al. |
2408.14776 |
null |
2024-08-26 |
Physically Feasible Semantic Segmentation |
Shamik Basu et.al. |
2408.14672 |
link |
2024-08-25 |
OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation |
Muhammad Rameez ur Rahman et.al. |
2408.13936 |
link |
2024-08-25 |
Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation |
Yuwen Pan et.al. |
2408.13838 |
null |
2024-08-25 |
TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather |
Xiongwei Zhao et.al. |
2408.13802 |
link |
2024-08-25 |
ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation |
Xin Zhang et.al. |
2408.13771 |
null |
2024-08-25 |
Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation |
Zhaoyang Li et.al. |
2408.13752 |
null |
2024-08-24 |
ESA: Annotation-Efficient Active Learning for Semantic Segmentation |
Jinchao Ge et.al. |
2408.13491 |
link |
2024-08-23 |
Accuracy Improvement of Cell Image Segmentation Using Feedback Former |
Hinako Mitsuoka et.al. |
2408.12974 |
null |
2024-08-23 |
Image Segmentation in Foundation Model Era: A Survey |
Tianfei Zhou et.al. |
2408.12957 |
null |
2024-08-23 |
Symmetric masking strategy enhances the performance of Masked Image Modeling |
Khanh-Binh Nguyen et.al. |
2408.12772 |
null |
2024-08-22 |
Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets |
Wolfgang Boettcher et.al. |
2408.12489 |
null |
2024-08-22 |
The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation |
Tuyen Tran et.al. |
2408.12447 |
null |
2024-08-21 |
UNetMamba: Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images |
Enze Zhu et.al. |
2408.11545 |
null |
2024-08-21 |
Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation |
Chuandong Liu et.al. |
2408.11280 |
null |
2024-08-20 |
NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency |
Valentinos Pariza et.al. |
2408.11054 |
null |
2024-08-20 |
CO2Wounds-V2: Extended Chronic Wounds Dataset From Leprosy Patients |
Karen Sanchez et.al. |
2408.10827 |
null |
2024-08-20 |
Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended? |
Chen Liang et.al. |
2408.10627 |
null |
2024-08-20 |
Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation |
Jiawei Han et.al. |
2408.10537 |
link |
2024-08-19 |
Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network |
Rasha Alshawi et.al. |
2408.10181 |
null |
2024-08-19 |
Dynamic Label Injection for Imbalanced Industrial Defect Segmentation |
Emanuele Caruso et.al. |
2408.10031 |
link |
2024-08-19 |
Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis |
Kira Maag et.al. |
2408.10021 |
null |
2024-08-19 |
Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving |
Jun Yan et.al. |
2408.09839 |
link |
2024-08-18 |
OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras |
Muhammad Rameez Ur Rahman et.al. |
2408.09424 |
link |
2024-08-18 |
Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration |
Hao Ai et.al. |
2408.09336 |
null |
2024-08-17 |
Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology |
Junchao Zhu et.al. |
2408.09278 |
link |
2024-08-17 |
GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation |
Weiming Zhang et.al. |
2408.09115 |
null |
2024-08-17 |
Depth-guided Texture Diffusion for Image Semantic Segmentation |
Wei Sun et.al. |
2408.09097 |
null |
2024-08-15 |
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks |
Dongshuo Yin et.al. |
2408.08345 |
link |
2024-08-14 |
MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis |
Nimeesha Chan et.al. |
2408.07773 |
link |
2024-08-15 |
MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation |
Beoungwoo Kang et.al. |
2408.07576 |
link |
2024-08-15 |
MagicFace: Training-free Universal-Style Human Image Customized Synthesis |
Yibin Wang et.al. |
2408.07433 |
null |
2024-08-14 |
Segment Using Just One Example |
Pratik Vora et.al. |
2408.07393 |
null |
2024-08-14 |
Ensemble architecture in polyp segmentation |
Hao-Yun Hsu et.al. |
2408.07262 |
link |
2024-08-14 |
Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks |
Raghavendra Singh et.al. |
2408.07243 |
null |
2024-08-14 |
Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training |
Ethan Kou et.al. |
2408.07239 |
null |
2024-08-13 |
ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation |
Jingyun Wang et.al. |
2408.06747 |
link |
2024-08-10 |
Dilated Convolution with Learnable Spacings |
Ismail Khalfaoui-Hassani et.al. |
2408.06383 |
null |
2024-08-12 |
Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images |
Siladittya Manna et.al. |
2408.06235 |
null |
2024-08-12 |
A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting |
Felix Assion et.al. |
2408.06071 |
null |
2024-08-12 |
Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning |
Xinrong Hu et.al. |
2408.05889 |
null |
2024-08-11 |
Seg-CycleGAN: SAR-to-optical image translation guided by a downstream task |
Hannuo Zhang et.al. |
2408.05777 |
null |
2024-08-11 |
MacFormer: Semantic Segmentation with Fine Object Boundaries |
Guoan Xu et.al. |
2408.05699 |
null |
2024-08-10 |
Multimodal generative semantic communication based on latent diffusion model |
Weiqi Fu et.al. |
2408.05455 |
null |
2024-08-09 |
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation |
Dahyun Kang et.al. |
2408.04961 |
link |
2024-08-09 |
ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation |
Mengcheng Lan et.al. |
2408.04883 |
link |
2024-08-09 |
Extracting Signal Electron Trajectories in the COMET Phase-I Cylindrical Drift Chamber Using Deep Learning |
Fumihiro Kaneko et.al. |
2408.04795 |
null |
2024-08-08 |
SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation |
Jieming Yu et.al. |
2408.04593 |
null |
2024-08-08 |
SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios |
Sriram Mandalika et.al. |
2408.04482 |
null |
2024-08-08 |
What could go wrong? Discovering and describing failure modes in computer vision |
Gabriela Csurka et.al. |
2408.04471 |
null |
2024-08-07 |
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications |
Tianfang Zhang et.al. |
2408.03703 |
link |
2024-08-07 |
SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology |
Mingya Zhang et.al. |
2408.03651 |
link |
2024-08-06 |
Post-Mortem Human Iris Segmentation Analysis with Deep Learning |
Afzal Hossain et.al. |
2408.03448 |
null |
2024-08-06 |
Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression |
Jonas Schmitt et.al. |
2408.03046 |
link |
2024-08-05 |
Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation |
Sai Prasanna et.al. |
2408.02297 |
null |
2024-08-05 |
Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs |
Jeongkee Lim et.al. |
2408.02261 |
null |
2024-08-05 |
Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders |
Muhammad Abdullah Jamal et.al. |
2408.02245 |
null |
2024-08-04 |
Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation |
Ye Du et.al. |
2408.02039 |
null |
2024-08-03 |
Bayesian Active Learning for Semantic Segmentation |
Sima Didari et.al. |
2408.01694 |
null |
2024-08-03 |
A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection |
Omkar Oak et.al. |
2408.01692 |
null |
2024-08-03 |
Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation |
Balázs Opra et.al. |
2408.01640 |
null |
2024-08-02 |
Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans |
Lukas Kratochvila et.al. |
2408.01526 |
null |
2024-08-02 |
Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation |
Yuanzhi Su et.al. |
2408.01356 |
null |
2024-08-02 |
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation |
Bingyu Li et.al. |
2408.01343 |
null |
2024-08-02 |
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach |
Yabin Zhu et.al. |
2408.00969 |
null |
2024-08-01 |
Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation |
Siyu Jiao et.al. |
2408.00744 |
null |
2024-08-01 |
Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function |
Matias Oscar Volman Stern et.al. |
2408.00707 |
null |
2024-08-01 |
AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation |
Asbjørn Munk et.al. |
2408.00640 |
null |
2024-08-01 |
SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation |
Shengbo Tan et.al. |
2408.00496 |
null |
2024-07-31 |
Open-Vocabulary Audio-Visual Semantic Segmentation |
Ruohao Guo et.al. |
2407.21721 |
null |
2024-07-31 |
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment |
Anurag Das et.al. |
2407.21654 |
null |
2024-07-31 |
Small Object Few-shot Segmentation for Vision-based Industrial Inspection |
Zilong Zhang et.al. |
2407.21351 |
null |
2024-07-31 |
On-the-fly Point Feature Representation for Point Clouds Analysis |
Jiangyi Wang et.al. |
2407.21335 |
null |
2024-07-31 |
Fine-grained Metrics for Point Cloud Semantic Segmentation |
Zhuheng Lu et.al. |
2407.21289 |
null |
2024-07-30 |
PLANesT-3D: A new annotated dataset for segmentation of 3D plant point clouds |
Kerem Mertoğlu et.al. |
2407.21150 |
null |
2024-07-30 |
Learning Ordinality in Semantic Segmentation |
Rafael Cristino et.al. |
2407.20959 |
null |
2024-07-29 |
Improving 2D Feature Representations by 3D-Aware Fine-Tuning |
Yuanwen Yue et.al. |
2407.20229 |
null |
2024-07-29 |
Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection With Sky-Annotated Dataset |
Yimian Dai et.al. |
2407.20078 |
link |
2024-07-29 |
Language-driven Grasp Detection with Mask-guided Attention |
Tuan Van Vo et.al. |
2407.19877 |
null |
2024-07-29 |
Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets |
Muhammad Abdullah Jamal et.al. |
2407.19714 |
null |
2024-07-29 |
ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement |
Ezequiel Perez-Zarate et.al. |
2407.19708 |
link |
2024-07-28 |
ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding |
Zhen Chen et.al. |
2407.19435 |
link |
2024-07-27 |
Ensembling convolutional neural networks for human skin segmentation |
Patryk Kuban et.al. |
2407.19310 |
null |
2024-07-27 |
Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network |
Gang Pan et.al. |
2407.19271 |
null |
2024-07-26 |
Sparse Refinement for Efficient High-Resolution Semantic Segmentation |
Zhijian Liu et.al. |
2407.19014 |
null |
2024-07-29 |
Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation |
Jingjun Yi et.al. |
2407.18568 |
null |
2024-07-25 |
Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception |
Julia Hindel et.al. |
2407.18145 |
null |
2024-07-25 |
TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework |
Guanfeng Tang et.al. |
2407.18038 |
null |
2024-07-25 |
Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions |
Jan Nikolas Morshuis et.al. |
2407.18026 |
link |
2024-07-24 |
Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation |
Hyunwoo Yu et.al. |
2407.17261 |
link |
2024-07-24 |
Trans2Unet: Neural fusion for Nuclei Semantic Segmentation |
Dinh-Phu Tran et.al. |
2407.17181 |
null |
2024-07-24 |
PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning |
Mu Chen et.al. |
2407.17101 |
null |
2024-07-25 |
Enhancing Environmental Monitoring through Multispectral Imaging: The WasteMS Dataset for Semantic Segmentation of Lakeside Waste |
Qinfeng Zhu et.al. |
2407.17028 |
link |
2024-07-24 |
Progressive Query Refinement Framework for Bird's-Eye-View Semantic Segmentation from Surrounding Images |
Dooseop Choi et.al. |
2407.17003 |
link |
2024-07-23 |
Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving |
Anam Manzoor et.al. |
2407.16647 |
null |
2024-07-23 |
Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging |
Daniela L. Ramos et.al. |
2407.16608 |
null |
2024-07-23 |
Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision |
Aditya Krishnan et.al. |
2407.16102 |
null |
2024-07-22 |
MILAN: Milli-Annotations for Lidar Semantic Segmentation |
Nermin Samet et.al. |
2407.15797 |
null |
2024-07-22 |
Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond |
Silvio Galesso et.al. |
2407.15739 |
link |
2024-07-22 |
MSSPlace: Multi-Sensor Place Recognition with Visual and Text Semantics |
Alexander Melekhin et.al. |
2407.15663 |
link |
2024-07-22 |
Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling |
Bo Yuan et.al. |
2407.15429 |
link |
2024-07-22 |
Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data |
Junha Song et.al. |
2407.15383 |
null |
2024-07-21 |
Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation |
Xiaoyang Wu et.al. |
2407.15282 |
null |
2024-07-20 |
Downstream-Pretext Domain Knowledge Traceback for Active Learning |
Beichen Zhang et.al. |
2407.14720 |
null |
2024-07-19 |
Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model |
Kun Zhao et.al. |
2407.14326 |
null |
2024-07-19 |
Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation |
Zhengyuan Xie et.al. |
2407.14142 |
link |
2024-07-19 |
GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation |
Florian Chabot et.al. |
2407.14108 |
null |
2024-07-18 |
Many Perception Tasks are Highly Redundant Functions of their Input Data |
Rahul Ramesh et.al. |
2407.13841 |
null |
2024-07-18 |
GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model |
Abdelrahman Shaker et.al. |
2407.13772 |
link |
2024-07-18 |
SegPoint: Segment Any Point Cloud via Large Language Model |
Shuting He et.al. |
2407.13761 |
null |
2024-07-18 |
MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis |
Ziming Zhong et.al. |
2407.13675 |
link |
2024-07-18 |
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models |
Xiaoyu Zhu et.al. |
2407.13642 |
null |
2024-07-18 |
FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures |
Hao Lu et.al. |
2407.13500 |
link |
2024-07-18 |
FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions |
Sohyun Lee et.al. |
2407.13437 |
null |
2024-07-18 |
Lightweight Uncertainty Quantification with Simplex Semantic Segmentation for Terrain Traversability |
Judith Dijk et.al. |
2407.13392 |
null |
2024-07-18 |
Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation |
Chang Liu et.al. |
2407.13363 |
null |
2024-07-18 |
Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation |
Shoumeng Qiu et.al. |
2407.13254 |
null |
2024-07-18 |
OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird's-eye-view Vehicle Semantic Segmentation |
Jian Sun et.al. |
2407.13137 |
null |
2024-07-16 |
Mitigating Background Shift in Class-Incremental Semantic Segmentation |
Gilhan Park et.al. |
2407.11859 |
link |
2024-07-16 |
Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation |
Juncheng Ma et.al. |
2407.11820 |
null |
2024-07-16 |
XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach |
Truong Thanh Hung Nguyen et.al. |
2407.11771 |
null |
2024-07-16 |
OAM-TCD: A globally diverse dataset of high-resolution tree cover maps |
Josh Veitch-Michaelis et.al. |
2407.11743 |
null |
2024-07-16 |
SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds |
Yanbo Wang et.al. |
2407.11569 |
link |
2024-07-16 |
Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations |
Yunya Gao et.al. |
2407.11381 |
link |
2024-07-16 |
Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities |
Xu Zheng et.al. |
2407.11351 |
null |
2024-07-16 |
Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation |
Xu Zheng et.al. |
2407.11344 |
null |
2024-07-16 |
TCFormer: Visual Recognition via Token Clustering Transformer |
Wang Zeng et.al. |
2407.11321 |
link |
2024-07-15 |
Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding |
Danish Nazir et.al. |
2407.11224 |
null |
2024-07-15 |
No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations |
Walter Simoncini et.al. |
2407.10964 |
link |
2024-07-15 |
APC: Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation |
Wangyu Wu et.al. |
2407.10649 |
null |
2024-07-15 |
Automated Label Unification for Multi-Dataset Semantic Segmentation with GNNs |
Rong Ma et.al. |
2407.10534 |
null |
2024-07-14 |
Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data |
Tuo Feng et.al. |
2407.10200 |
link |
2024-07-14 |
RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation |
Li Li et.al. |
2407.10159 |
link |
2024-07-14 |
HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation |
Chengjie Jiang et.al. |
2407.10047 |
null |
2024-07-13 |
Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation |
Anqi Zhang et.al. |
2407.09838 |
null |
2024-07-13 |
Enhancing Semantic Segmentation with Adaptive Focal Loss: A Novel Approach |
Md Rakibul Islam et.al. |
2407.09828 |
null |
2024-07-13 |
3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance |
Xiaoxu Xu et.al. |
2407.09826 |
null |
2024-07-13 |
TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation |
Xiaopei Wu et.al. |
2407.09751 |
null |
2024-07-12 |
FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background |
Muhammad Ali et.al. |
2407.09379 |
link |
2024-07-12 |
Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy |
Julian Wyatt et.al. |
2407.09192 |
null |
2024-07-12 |
Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off |
Levente Halmosi et.al. |
2407.09150 |
link |
2024-07-12 |
Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation |
Wei Cong et.al. |
2407.09047 |
null |
2024-07-12 |
Textual Query-Driven Mask Transformer for Domain Generalized Segmentation |
Byeonghyun Pak et.al. |
2407.09033 |
null |
2024-07-12 |
Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation |
Zihao Li et.al. |
2407.08994 |
null |
2024-07-11 |
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation |
Tong Shao et.al. |
2407.08268 |
null |
2024-07-11 |
Enrich the content of the image Using Context-Aware Copy Paste |
Qiushi Guo et.al. |
2407.08151 |
null |
2024-07-10 |
MambaVision: A Hybrid Mamba-Transformer Vision Backbone |
Ali Hatamizadeh et.al. |
2407.08083 |
link |
2024-07-10 |
Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift |
Elliot Vincent et.al. |
2407.07616 |
link |
2024-07-10 |
H-FCBFormer Hierarchical Fully Convolutional Branch Transformer for Occlusal Contact Segmentation with Articulating Paper |
Ryan Banks et.al. |
2407.07604 |
link |
2024-07-11 |
Trainable Highly-expressive Activation Functions |
Irit Chelly et.al. |
2407.07564 |
null |
2024-07-10 |
Deformable-Heatmap-Segmentation for Automobile Visual Perception |
Hongyu Jin et.al. |
2407.07493 |
null |
2024-07-10 |
Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining |
Tianfang Sun et.al. |
2407.07465 |
null |
2024-07-11 |
HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation |
Guoan Xu et.al. |
2407.07441 |
null |
2024-07-09 |
ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation |
Yuyuan Liu et.al. |
2407.07171 |
link |
2024-07-08 |
Training-free CryoET Tomogram Segmentation |
Yizhou Zhao et.al. |
2407.06833 |
link |
2024-07-09 |
CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM |
Aditya Murali et.al. |
2407.06795 |
null |
2024-07-09 |
LuSNAR: A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration |
Jiayi Liu et.al. |
2407.06512 |
link |
2024-07-08 |
Leveraging image captions for selective whole slide image annotation |
Jingna Qiu et.al. |
2407.06363 |
null |
2024-07-08 |
Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots |
Siva Krishna Ravipati et.al. |
2407.06077 |
null |
2024-07-08 |
Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts |
Puzuo Wang et.al. |
2407.06043 |
null |
2024-07-08 |
RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation |
Sarah Elmahdy et.al. |
2407.06016 |
link |
2024-07-07 |
Semantic Segmentation for Real-World and Synthetic Vehicle's Forward-Facing Camera Images |
Tuan T. Nguyen et.al. |
2407.05452 |
null |
2024-07-07 |
Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness |
Idris Hamoud et.al. |
2407.05448 |
null |
2024-07-06 |
A Study of Test-time Contrastive Concepts for Open-world, Open-vocabulary Semantic Segmentation |
Monika Wysoczańska et.al. |
2407.05061 |
null |
2024-07-06 |
BlessemFlood21: Advancing Flood Analysis with a High-Resolution Georeferenced Dataset for Humanitarian Aid Support |
Vladyslav Polushko et.al. |
2407.05007 |
null |
2024-07-05 |
Explainable Metric Learning for Deflating Data Bias |
Emma Andrews et.al. |
2407.04866 |
null |
2024-07-05 |
LMSeg: A deep graph message-passing network for efficient and accurate semantic segmentation of large-scale 3D landscape meshes |
Zexian Huang et.al. |
2407.04326 |
null |
2024-07-04 |
Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier |
Prantik Howlader et.al. |
2407.04036 |
link |
2024-07-04 |
Relative Difficulty Distillation for Semantic Segmentation |
Dong Liang et.al. |
2407.03719 |
null |
2024-07-04 |
POSTURE: Pose Guided Unsupervised Domain Adaptation for Human Body Part Segmentation |
Arindam Dutta et.al. |
2407.03549 |
null |
2024-07-03 |
A Unified Framework for 3D Scene Understanding |
Wei Xu et.al. |
2407.03263 |
null |
2024-07-03 |
ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation |
Chang Li et.al. |
2407.03033 |
null |
2024-07-03 |
ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation |
Yipin Guo et.al. |
2407.02881 |
null |
2024-07-03 |
Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation |
Tao Chen et.al. |
2407.02768 |
null |
2024-07-02 |
Open Panoramic Segmentation |
Junwei Zheng et.al. |
2407.02685 |
null |
2024-07-02 |
Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction |
Tinghuai Wang et.al. |
2407.02639 |
null |
2024-07-02 |
Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather |
Junsung Park et.al. |
2407.02286 |
link |
2024-07-02 |
MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders |
Baijiong Lin et.al. |
2407.02228 |
link |
2024-07-02 |
Occlusion-Aware Seamless Segmentation |
Yihong Cao et.al. |
2407.02182 |
link |
2024-07-02 |
VRBiom: A New Periocular Dataset for Biometric Applications of HMD |
Ketan Kotwal et.al. |
2407.02150 |
null |
2024-07-02 |
Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts |
Pasquale De Marinis et.al. |
2407.02075 |
null |
2024-07-02 |
Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning |
Chengchao Shen et.al. |
2407.02014 |
link |
2024-07-01 |
Label-free Neural Semantic Image Synthesis |
Jiayi Wang et.al. |
2407.01790 |
null |
2024-07-01 |
PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction |
Xuan Yu et.al. |
2407.01349 |
null |
2024-07-01 |
CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes |
Danial Qashqai et.al. |
2407.01328 |
link |
2024-06-29 |
SolarSAM: Building-scale Photovoltaic Potential Assessment Based on Segment Anything Model (SAM) and Remote Sensing for Emerging City |
Guohao Wang et.al. |
2407.00296 |
link |
2024-07-01 |
Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding |
Yifan Tang et.al. |
2406.19791 |
null |
2024-06-28 |
Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation |
Junsung Park et.al. |
2406.19638 |
link |
2024-06-28 |
PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation |
Deyi Ji et.al. |
2406.19632 |
null |
2024-06-27 |
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model |
Haobo Yuan et.al. |
2406.19369 |
null |
2024-06-27 |
ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation |
Nazanin Moradinasab et.al. |
2406.19225 |
null |
2024-06-30 |
Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO |
Fuseini Mumuni et.al. |
2406.19057 |
null |
2024-06-27 |
Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation |
Tao Lian et.al. |
2406.18809 |
null |
2024-06-26 |
CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data |
Nikolaos Dionelis et.al. |
2406.18279 |
null |
2024-06-26 |
The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval |
Meinardus Boris et.al. |
2406.18113 |
link |
2024-06-26 |
Few-Shot Medical Image Segmentation with High-Fidelity Prototypes |
Song Tang et.al. |
2406.18074 |
link |
2024-06-25 |
Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation |
Xuming Zhang et.al. |
2406.17679 |
null |
2024-06-25 |
DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation |
Ahmad Mohammadshirazi et.al. |
2406.17591 |
link |
2024-06-25 |
Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation |
Felix Stillger et.al. |
2406.17541 |
null |
2024-06-25 |
Investigating Self-Supervised Methods for Label-Efficient Learning |
Srinivasa Rao Nandam et.al. |
2406.17460 |
null |
2024-06-25 |
Pseudo Labelling for Enhanced Masked Autoencoders |
Srinivasa Rao Nandam et.al. |
2406.17450 |
null |
2024-06-25 |
Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model |
Zhuoyuan Li et.al. |
2406.17442 |
null |
2024-06-25 |
Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes |
Qi Ma et.al. |
2406.17438 |
link |
2024-06-24 |
Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation |
Yizheng Wu et.al. |
2406.16776 |
link |
2024-06-24 |
μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation |
Pierangela Bruno et.al. |
2406.16724 |
null |
2024-06-24 |
GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection |
Harnaik Dhami et.al. |
2406.16625 |
null |
2024-06-24 |
LOGCAN++: Local-global class-aware network for semantic segmentation of remote sensing images |
Xiaowen Ma et.al. |
2406.16502 |
link |
2024-06-24 |
Cascade Reward Sampling for Efficient Decoding-Time Alignment |
Bolian Li et.al. |
2406.16306 |
null |
2024-06-24 |
SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments |
Neng Wang et.al. |
2406.16279 |
link |
2024-06-23 |
UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery |
Pengfei Zhang et.al. |
2406.16129 |
null |
2024-06-22 |
Fine-grained Background Representation for Weakly Supervised Semantic Segmentation |
Xu Yin et.al. |
2406.15755 |
null |
2024-06-20 |
Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery |
Ilham Adi Panuntun et.al. |
2406.14220 |
null |
2024-06-20 |
Trusting Semantic Segmentation Networks |
Samik Some et.al. |
2406.14201 |
null |
2024-06-20 |
EvSegSNN: Neuromorphic Semantic Segmentation for Event Data |
Dalia Hareb et.al. |
2406.14178 |
null |
2024-06-20 |
Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images |
Qinfeng Zhu et.al. |
2406.14086 |
link |
2024-06-19 |
Search-based DNN Testing and Retraining with GAN-enhanced Simulations |
Mohammed Oualid Attaoui et.al. |
2406.13359 |
null |
2024-06-19 |
Deep Learning-Based 3D Instance and Semantic Segmentation: A Review |
Siddiqui Muhammad Yasir et.al. |
2406.13308 |
null |
2024-06-18 |
Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation |
Guoyu Yang et.al. |
2406.12496 |
link |
2024-06-18 |
Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble |
Wang Liu et.al. |
2406.12271 |
null |
2024-06-17 |
OoDIS: Anomaly Instance Segmentation Benchmark |
Alexey Nekrasov et.al. |
2406.11835 |
link |
2024-06-17 |
Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT |
Maximilian E. Tschuchnig et.al. |
2406.11650 |
null |
2024-06-17 |
SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation |
Zhenchao Lin et.al. |
2406.11441 |
link |
2024-06-17 |
Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding |
Yunsong Wang et.al. |
2406.11283 |
null |
2024-06-17 |
Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation |
Bingfeng Zhang et.al. |
2406.11189 |
null |
2024-06-16 |
$α$-SSC: Uncertainty-Aware Camera-based 3D Semantic Scene Completion |
Sanbao Su et.al. |
2406.11021 |
null |
2024-06-16 |
PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery |
Libo Wang et.al. |
2406.10828 |
link |
2024-06-15 |
GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR |
Bharat Singh et.al. |
2406.10722 |
null |
2024-06-15 |
A Late-Stage Bitemporal Feature Fusion Network for Semantic Change Detection |
Chenyao Zhou et.al. |
2406.10678 |
link |
2024-06-14 |
ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers |
Narges Norouzi et.al. |
2406.09936 |
null |
2024-06-14 |
Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions |
Aldi Piroli et.al. |
2406.09906 |
null |
2024-06-14 |
Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation |
Brunó B. Englert et.al. |
2406.09896 |
link |
2024-06-14 |
Open-Vocabulary Semantic Segmentation with Image Embedding Balancing |
Xiangheng Shan et.al. |
2406.09829 |
link |
2024-06-13 |
Instance-level quantitative saliency in multiple sclerosis lesion segmentation |
Federico Spagnolo et.al. |
2406.09335 |
null |
2024-06-13 |
APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation |
Weizhao He et.al. |
2406.08372 |
null |
2024-06-12 |
Dataset Enhancement with Instance-Level Augmentations |
Orest Kupyn et.al. |
2406.08249 |
link |
2024-06-13 |
A$^{2}$-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder |
Lixian Zhang et.al. |
2406.08079 |
null |
2024-06-12 |
OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding |
Yinan Deng et.al. |
2406.08009 |
link |
2024-06-12 |
SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation |
Chanda Grover Kamra et.al. |
2406.07986 |
link |
2024-06-12 |
Small Scale Data-Free Knowledge Distillation |
He Liu et.al. |
2406.07876 |
link |
2024-06-11 |
Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph |
Sergey Linok et.al. |
2406.07113 |
null |
2024-06-11 |
PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving |
Yining Shi et.al. |
2406.07037 |
null |
2024-06-12 |
LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection |
Jiahua Xu et.al. |
2406.07023 |
null |
2024-06-10 |
Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation |
Dong Zhao et.al. |
2406.06813 |
link |
2024-06-09 |
Transforming Heart Chamber Imaging: Self-Supervised Learning for Whole Heart Reconstruction and Segmentation |
Abdul Qayyum et.al. |
2406.06643 |
null |
2024-06-10 |
Merlin: A Vision Language Foundation Model for 3D Computed Tomography |
Louis Blankemeier et.al. |
2406.06512 |
null |
2024-06-10 |
UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving |
Daniel Bogdoll et.al. |
2406.06370 |
null |
2024-06-09 |
Scaling Graph Convolutions for Mobile Vision |
William Avery et.al. |
2406.05850 |
link |
2024-06-09 |
Solution for CVPR 2024 UG2+ Challenge Track on All Weather Semantic Segmentation |
Jun Yu et.al. |
2406.05837 |
null |
2024-06-09 |
Convolution and Attention-Free Mamba-based Cardiac Image Segmentation |
Abbas Khan et.al. |
2406.05786 |
null |
2024-06-09 |
Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language |
Mark Hamilton et.al. |
2406.05629 |
link |
2024-06-08 |
A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+ |
Jianzhao Wang et.al. |
2406.05513 |
null |
2024-06-08 |
Layered Image Vectorization via Semantic Simplification |
Zhenyu Wang et.al. |
2406.05404 |
null |
2024-06-08 |
1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation |
Qingfeng Liu et.al. |
2406.05352 |
null |
2024-06-07 |
USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation |
Xiaoqi Wang et.al. |
2406.05271 |
null |
2024-06-07 |
Semantic Segmentation on VSPW Dataset through Masked Video Consistency |
Chen Liang et.al. |
2406.04979 |
null |
2024-06-07 |
Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment |
Venkanna Babu Guthula et.al. |
2406.04949 |
null |
2024-06-06 |
Characterizing segregation in blast rock piles: a deep-learning approach leveraging aerial image analysis |
Chengeng Liu et.al. |
2406.04149 |
null |
2024-06-06 |
Frequency-based Matcher for Long-tailed Semantic Segmentation |
Shan Li et.al. |
2406.03917 |
link |
2024-06-07 |
Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge |
Nan Zhang et.al. |
2406.03799 |
link |
2024-06-06 |
DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation |
Zilu Guo et.al. |
2406.03702 |
link |
2024-06-05 |
Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation |
Maximilian Zenk et.al. |
2406.03323 |
null |
2024-06-05 |
Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy |
Yunho Kim et.al. |
2406.02989 |
null |
2024-06-04 |
W-RIZZ: A Weakly-Supervised Framework for Relative Traversability Estimation in Mobile Robotics |
Andre Schreiber et.al. |
2406.02822 |
link |
2024-06-04 |
Window to Wall Ratio Detection using SegFormer |
Zoe De Simone et.al. |
2406.02706 |
link |
2024-06-04 |
Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning |
Heather Doig et.al. |
2406.01932 |
null |
2024-06-03 |
EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding |
Thanh-Dat Truong et.al. |
2406.01429 |
null |
2024-06-03 |
TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation |
Antonio Santo et.al. |
2406.01395 |
link |
2024-06-03 |
ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds |
Ka Lung Cheung et.al. |
2406.01337 |
link |
2024-06-03 |
LSKSANet: A Novel Architecture for Remote Sensing Image Semantic Segmentation Leveraging Large Selective Kernel and Sparse Attention Mechanism |
Miao Fu et.al. |
2406.01228 |
null |
2024-06-04 |
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer |
Ding Jia et.al. |
2406.01210 |
link |
2024-06-03 |
S-CycleGAN: Semantic Segmentation Enhanced CT-Ultrasound Image-to-Image Translation for Robotic Ultrasonography |
Yuhan Song et.al. |
2406.01191 |
null |
2024-06-02 |
Diffusion Features to Bridge Domain Gap for Semantic Segmentation |
Yuxiang Ji et.al. |
2406.00777 |
null |
2024-06-02 |
Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation |
Yunheng Li et.al. |
2406.00670 |
null |
2024-06-02 |
Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024 |
Biao Wu et.al. |
2406.00587 |
null |
2024-05-31 |
Uncertainty Quantification for Bird's Eye View Semantic Segmentation: Methods and Benchmarks |
Linlin Yu et.al. |
2405.20986 |
null |
2024-05-31 |
Revisiting and Maximizing Temporal Knowledge in Semi-supervised Semantic Segmentation |
Wooseok Shin et.al. |
2405.20610 |
link |
2024-05-30 |
P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation |
Qi Zhang et.al. |
2405.20443 |
null |
2024-05-30 |
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow |
Chaoyang Wang et.al. |
2405.20282 |
link |
2024-05-30 |
MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion |
Angel Villar-Corrales et.al. |
2405.19921 |
link |
2024-05-30 |
Open-Set Domain Adaptation for Semantic Segmentation |
Seun-An Choe et.al. |
2405.19899 |
link |
2024-05-30 |
DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation |
Ron Keuth et.al. |
2405.19746 |
link |
2024-05-30 |
Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes |
Yong-Qiang Mao et.al. |
2405.19735 |
null |
2024-05-30 |
CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation |
Ankush Gajanan Arudkar et.al. |
2405.19672 |
null |
2024-05-29 |
Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation |
Lianlei Shan et.al. |
2405.19568 |
null |
2024-05-29 |
Enabling Visual Recognition at Radio Frequency |
Haowen Lai et.al. |
2405.19516 |
null |
2024-05-29 |
Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models |
Tianrun Chen et.al. |
2405.19326 |
null |
2024-05-29 |
A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation |
Niclas Vödisch et.al. |
2405.19035 |
link |
2024-05-29 |
Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation |
Zelin Peng et.al. |
2405.18840 |
null |
2024-05-28 |
Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation |
JuneHyoung Kwon et.al. |
2405.18148 |
null |
2024-05-28 |
Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images |
Lianlei Shan et.al. |
2405.18078 |
null |
2024-05-28 |
RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields |
Mihnea-Bogdan Jurca et.al. |
2405.18033 |
null |
2024-05-28 |
DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture |
Shentong Mo et.al. |
2405.17995 |
null |
2024-05-28 |
The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention |
Xingyu Ding et.al. |
2405.17776 |
null |
2024-05-27 |
Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation |
Steven Landgraf et.al. |
2405.17097 |
null |
2024-05-27 |
DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking |
Hongtao Wang et.al. |
2405.16980 |
null |
2024-05-27 |
Collective Perception Datasets for Autonomous Driving: A Comprehensive Review |
Sven Teufel et.al. |
2405.16973 |
null |
2024-05-27 |
Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models |
Qian Wang et.al. |
2405.16947 |
null |
2024-05-27 |
A re-calibration method for object detection with multi-modal alignment bias in autonomous driving |
Zhihang Song et.al. |
2405.16848 |
null |
2024-05-25 |
BOLD: Boolean Logic Deep Learning |
Van Minh Nguyen et.al. |
2405.16339 |
null |
2024-05-25 |
Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation |
Huizhou Chen et.al. |
2405.16099 |
null |
2024-05-25 |
Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality |
Hakim Ikebayashi et.al. |
2405.16008 |
null |
2024-05-24 |
Visualize and Paint GAN Activations |
Rudolf Herdt et.al. |
2405.15636 |
null |
2024-05-24 |
Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets |
Hoàng-Ân Lê et.al. |
2405.15394 |
null |
2024-05-24 |
U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation |
Bingyu Li et.al. |
2405.15365 |
link |
2024-05-24 |
Cross-Domain Few-Shot Semantic Segmentation via Doubly Matching Transformation |
Jiayi Chen et.al. |
2405.15265 |
null |
2024-05-23 |
Mamba-R: Vision Mamba ALSO Needs Registers |
Feng Wang et.al. |
2405.14858 |
null |
2024-05-23 |
Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation |
Daniel Kienzle et.al. |
2405.14467 |
null |
2024-05-23 |
MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models |
Jiuming Liu et.al. |
2405.14338 |
null |
2024-05-23 |
Tuning-free Universally-Supervised Semantic Segmentation |
Xiaobo Yang et.al. |
2405.14294 |
null |
2024-05-23 |
SCMix: Stochastic Compound Mixing for Open Compound Domain Adaptation in Semantic Segmentation |
Kai Yao et.al. |
2405.14278 |
null |
2024-05-23 |
Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations |
Mohammed Baharoon et.al. |
2405.14239 |
null |
2024-05-24 |
Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification |
Taylor Archibald et.al. |
2405.14162 |
null |
2024-05-23 |
Skip-SCAR: A Modular Approach to ObjectGoal Navigation with Sparsity and Adaptive Skips |
Yaotian Liu et.al. |
2405.14154 |
null |
2024-05-22 |
TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System |
Diogo Lavado et.al. |
2405.13989 |
null |
2024-05-22 |
Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer |
Qihang Fan et.al. |
2405.13337 |
null |
2024-05-21 |
Transparency Distortion Robustness for SOTA Image Segmentation Tasks |
Volker Knauthe et.al. |
2405.12864 |
null |
2024-05-20 |
A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation |
Sushmita Sarker et.al. |
2405.11903 |
null |
2024-05-20 |
Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments |
Jooyong Park et.al. |
2405.11855 |
null |
2024-05-20 |
Universal Organizer of SAM for Unsupervised Semantic Segmentation |
Tingting Li et.al. |
2405.11742 |
null |
2024-05-19 |
Interpreting a Semantic Segmentation Model for Coastline Detection |
Conor O'Sullivan et.al. |
2405.11500 |
null |
2024-05-17 |
CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation |
Mushui Liu et.al. |
2405.10530 |
link |
2024-05-16 |
Towards Task-Compatible Compressible Representations |
Anderson de Andrade et.al. |
2405.10244 |
link |
2024-05-16 |
A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance |
Andrea Matteazzi et.al. |
2405.10046 |
null |
2024-05-16 |
Towards Realistic Incremental Scenario in Class Incremental Semantic Segmentation |
Jihwan Kwak et.al. |
2405.09858 |
null |
2024-05-15 |
Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation |
Guo Yachan et.al. |
2405.09682 |
null |