2505
* 0-Shot Self-Attention Mechanism for Accelerated Diagonal Attention, A
* 1D Cascaded Denoising and Classification Framework for Micro-Doppler-Based Radar Target Recognition, A
* 360PanT: Training-Free Text-Driven 360-Degree Panorama-to-Panorama Translation
* 3D Edge Sketch from Multiview Images
* 3D microvascular reconstruction in retinal OCT angiography images via domain-adaptive learning
* 3D Part Segmentation via Geometric Aggregation of 2D Visual Features
* 3D Reconstruction of Gas Cloud Leakage Based on Multi-Spectral Imaging Systems, A
* 3D Shape Completion using Multi-resolution Spectral Encoding
* 3D Synthesis for Architectural Design
* 3D Understanding of Deformable Linear Objects: Datasets and Transferability Benchmark
* Few-Shot Incremental Learning (H4)
* Gas Plume, Air Flow (H3)
* Large-Scale 3-D Semantic Object Detection (H4)
* Layout to Image, Image Based Rendering (H3)
* Water Plume, River Flow, Streamflow (H3)
* @BENCH: Benchmarking Vision-Language Models for Human-centered Assistive Technology
* A2VIS: Amodal-Aware Approach to Video Instance Segmentation
* AC-IND: Sparse CT Reconstruction Based on Attenuation Coefficient Estimation and Implicit Neural Distribution
* Accelerated and Interpretable Flood Susceptibility Mapping Through Explainable Deep Learning with Hydrological Prior Knowledge
* Accelerated Testing and Evaluation for Black-Box Autonomous Driving Systems via Adaptive Markov Chain Monte Carlo
* Accurate and Robust Three-Intersection-Chord-Invariant Ellipse Detection
* Accurate Mapping of Downed Deadwood in a Dense Deciduous Forest Using UAV-SfM Data and Deep Learning
* ACE: Action Concept Enhancement of Video-Language Models in Procedural Videos
* ACE: Anatomically Consistent Embeddings in Composition and Decomposition
* Achieving Byzantine-Resilient Federated Learning via Layer-Adaptive Sparsified Model Aggregation
* Achieving high performance on sketch-based image retrieval without real sketches for training
* ActionDiffusion: An Action-Aware Diffusion Model for Procedure Planning in Instructional Videos
* Active and Passive Integrated Lightning Localization and Imaging Technology Based on Very-High-Frequency Radar
* Active Event Alignment for Monocular Distance Estimation
* Active Learning for Image Segmentation with Binary User Feedback
* Active Learning for Vision-Language Models
* Active Learning with Context Sampling and One-vs-Rest Entropy for Semantic Segmentation
* Active Supervised Cross-Modal Retrieval
* AD-Det: Boosting Object Detection in UAV Images with Focused Small Objects and Balanced Tail Classes
* Ad2Mix: Adversarial and Adaptive Mixup for Unsupervised Domain Adaptation
* Ada-VE: Training-Free Consistent Video Editing Using Adaptive Motion Prior
* AdamGraph: Adaptive Attention-Modulated Graph Network for EEG Emotion Recognition
* AdaPrefix++: Integrating Adapters, Prefixes and Hypernetwork for Continual Learning
* Adapting the High-Resolution PlanetScope Biomass Model to Low-Resolution VIIRS Imagery Using Spectral Harmonization: A Case of Grassland Monitoring in Mongolia
* Adaptive and Temporally Consistent Gaussian Surfels for Multi-View Dynamic Reconstruction
* Adaptive Attention Based on Mixture Distribution for Zero-Shot Non-Line-of-Sight Imaging
* Adaptive Clustering With Similarity Learning for Enhanced Multi-Scenario Radar Signal Processing
* Adaptive Colour-Depth Aware Attention for RGB-D Object Tracking
* Adaptive Conditional Reasoning for Remote Sensing Visual Question Answering
* Adaptive Deviation Learning for Visual Anomaly Detection with Data Contamination
* Adaptive Finite-Time Prescribed Performance Control of Vehicular Platoons With Multilevel Threshold and Asymptotic Convergence
* Adaptive Occlusion-Aware Network for Occluded Person Re-Identification
* Adaptive Online Graph Learning
* Adaptive Parameter Evolutionary Marine Predators Algorithm for Joint Resource Scheduling of Cooperative Jamming Networked Radar Systems, An
* Adaptive spatially regularized target attribute-aware background suppressed deep correlation filter for object tracking
* Adaptive Target Detection Architecture for Mismatched Signals, An
* Adaptive Total-Variation and Nonconvex Low-Rank Model for Image Denoising
* Adaptively robust high-order tensor factorization for low-rank tensor reconstruction
* Adjacent Channel Interference and Congestion Control for Multi-Channel Operation in Vehicular Networks
* AdQuestA: Knowledge-Guided Visual Question Answer Framework for Advertisements
* Advanced Real-Time Internal Calibration Scheme for the DBF-SCORE Spaceborne SAR Systems, An
* Advanced transformer for high-noise image denoising: Enhanced attention and detail preservation
* Advancements in satellite-based methane point source monitoring: A systematic review
* Advances in Predictive RAHT for Geometric Point Cloud Compression
* Advances in Research and Application of Techniques for Measuring Photosynthetically Active Radiation
* Advancing Chart Question Answering with Robust Chart Component Recognition
* Advancing Corn Yield Mapping in Kenya Through Transfer Learning
* Advancing evolution characterization in dynamic networks: A quantum walk and thermodynamics perspective
* Advancing LiDAR Intensity Simulation Through Learning With Novel Physics-Based Modalities
* Advancing Sparse Vegetation Monitoring in the Arctic and Antarctic: A Review of Satellite and UAV Remote Sensing, Machine Learning, and Sensor Fusion
* Advancing video self-supervised learning via image foundation models
* Advancing Weight and Channel Sparsification with Enhanced Saliency
* Adversarial Attention Deficit: Fooling Deformable Vision Transformers with Collaborative Adversarial Patches
* Adversarial Learning Based Knowledge Distillation on 3D Point Clouds
* Adversarial temporal sentence grounding by learning from external data
* Aerial Mirage: Unmasking Hallucinations in Large Vision Language Models
* Aerosol Distribution Due to Wildfire in Sumatra, Indonesia Considered from Model Simulation
* Aerosol Forcing from Ground-Based Synergies over a Decade in Barcelona, Spain
* Aerosol Retrieval Method Using Multi-Angle Data from GF-5 02 DPC over the Jing-Jin-Ji Region
* AES-AUDIO: An Encryption Scheme for Audio Supporting Differentiated Decryption
* AGCD: An Attention-Guided Graph Convolution Network for Change Detection of Remote Sensing Images
* AGFNet: Adaptive Gated Fusion Network for RGB-T Semantic Segmentation
* Aggregated Attributions for Explanatory Analysis of 3D Segmentation Models
* AgroGPT: Efficient Agricultural Vision-Language Model with Expert Tuning
* Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models
* AH-OCDA: Amplitude-Based Curriculum Learning and Hopfield Segmentation Model for Open Compound Domain Adaptation
* AI Goes Fishing: An Alphabet Spin-Off is Making Aquaculture More Sustainable
* AIC3DOD: Advancing Indoor Class-Incremental 3D Object Detection with Point Transformer Architecture and Room Layout Constraints
* AiDe: Improving 3D Open-Vocabulary Semantic Segmentation by Aligned Vision-Language Learning
* AJANet: SAR Ship Detection Network Based on Adaptive Channel Attention and Large Separable Kernel Adaptation
* Align and Blend: A Unified Multi-Modal LiDAR Segmentation Network
* AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models
* All-in-One Image Compression and Restoration
* All-in-one weather removal via Multi-Depth Gated Transformer with gradient modulation
* ALPI: Auto-Labeller with Proxy Injection for 3D Object Detection using 2D Labels Only
* ALPS: An Auto-Labeling and Pre-Training Scheme for Remote Sensing Segmentation With Segment Anything Model
* ALSTER: A Local Spatio-Temporal Expert for Online 3D Semantic Reconstruction
* Ambiguity Resolution Strategy for GPS/LEO Integrated Orbit Determination Based on Regional Ground Stations
* AMNCutter: Affinity-Attention-Guided Multi-View Normalized Cutter for Unsupervised Surgical Instrument Segmentation
* AMP-ViT: Optimizing Vision Transformer Efficiency with Adaptive Mixed-Precision Post-Training Quantization
* Analysing the performance of Viola-Jones and multi-task convolution neural networks face detection algorithms using real-time video sequences
* Analysis and Correction of Antenna Pattern Errors for In-Orbit Fully Polarimetric Aperture Synthesis Radiometer
* Analysis and Experiments of an Electromagnetic Docking Mechanism for Repeated Docking and Separation of the CubeSats
* Analysis of a Summer Convective Precipitation Event in the Shanghai Region Using Data from a Novel Single-Polarization X-Band Phased-Array Radar and Other Meteorological Observations
* Analysis of Factors Affecting Random Measurement Error in LiDAR Point Cloud Feature Matching Positioning
* Analysis of Grassland Vegetation Coverage Changes and Driving Factors in China-Mongolia-Russia Economic Corridor from 2000 to 2023 Based on RF and BFAST Algorithm
* Analysis of Ionospheric Disturbances in China During the December 2023 Geomagnetic Storm Using Multi-Instrument Data
* Analysis of Powder, Hard-Packed, and Wet Snow in High Mountain Areas Based on SAR, Optical Data, and In Situ Data, An
* Analysis of Regional Spatial Characteristics and Optimization of Tourism Routes Based on Point Cloud Data from Unmanned Aerial Vehicles
* Analysis of Spatial and Driving Factors of National Sanitary Resources in China Using GIS
* Analysis of Tropospheric NO2 Observation Using Pandora and MAX-DOAS Instrument in Xianghe, North China
* Analysis on GNSS Common View and Precise Point Positioning Time Transfer: BDS-3/Galileo/GPS
* Analysis, Simulation, and Scanning Geometry Calibration of Palmer Scanning Units for Airborne Hyperspectral Light Detection and Ranging
* Analyzing and Improving the Skin Tone Consistency and Bias in Implicit 3D Relightable Face Generators
* Analyzing the Sources and Variations of Nighttime Lights in Hong Kong from VIIRS Monthly Data
* Anchored Diffusion for Video Face Reenactment
* Angle-Independent Blood Flow Velocity Measurement With Ultrasound Speckle Decorrelation Analysis
* AniClipart: Clipart Animation with Text-to-Video Priors
* Annual winter wheat mapping for unveiling spatiotemporal patterns in China with a knowledge-guided approach and multi-source datasets
* Anomaly Detection for People with Visual Impairments Using an Egocentric 360-Degree Camera
* Anomaly-aware self-supervised feature learning for weakly supervised video anomaly detection
* AnomalyDINO: Boosting Patch-based Few-Shot Anomaly Detection with DINOv2
* Anthropogenic Forcing on the Coevolution of Tidal Creeks and Vegetation in the Dongtan Wetland, Changjiang Estuary
* ANTHROPOS-V: Benchmarking the Novel Task of Crowd Volume Estimation
* AP-PointRend: An Improved Network for Building Extraction via High-Resolution Remote Sensing Images
* Application of Optical Remote Sensing in Harmful Algal Blooms in Lakes: A Review
* Application of Remote Sensing Floodplain Vegetation Data in a Dynamic Roughness Distributed Runoff Model
* Approximately Invertible Neural Network for Learned Image Compression
* ARD-VAE: A Statistical Formulation to Find the Relevant Latent Dimensions of Variational Autoencoders
* Are Exemplar-Based Class Incremental Learning Models Victim of Black-Box Poison Attacks?
* ARF-Plus: Controlling Perceptual Factors in Artistic Radiance Fields for 3D Scene Stylization
* Art Comes From Life: Artistic Image Aesthetics Assessment via Attribute Knowledge Amalgamation
* ARTeFACT: Benchmarking Segmentation Models on Diverse Analogue Media Damage
* Arterial Ecosignal Coordination Based on Vehicle Trajectory Estimation Using Kinematic Analytical Formula
* ARTIST: Improving the Generation of Text-Rich Images with Disentangled Diffusion Models and Large Language Models
* Assembled Feature Attentive Algorithm for Automatic Detection of Waste Water Treatment Plants Based on Multiple Neural Networks, An
* Assess Spatial Equity Considering the Similarity Between GIS-Based Supply and Demand Maps: A New Framework with Case Study in Beijing
* Assessing Coincidence of Satellite Acquisitions and Flood Events to Predict Suitability for Flood Map Synthesis
* Assessing Habitat Quality on Synergetic Land-Cover Dataset Across the Greater Mekong Subregion over the Last Four Decades
* Assessing Stone Material Recession of Cultural Heritage: New Approach Based on Satellite-Based Rainfall Data and Dose-Response Functions: Case of UNESCO Site of Matera
* Assessing the Combined Impact of Land Surface Temperature and Droughts to Heatwaves over Europe Between 2003 and 2023
* Assessing the Quality of 3D Reconstruction in the Absence of Ground Truth: Application to a Multimodal Archaeological Dataset
* Assessing the Robustness of Multispectral Satellite Imagery with LiDAR Topographic Attributes and Ancillary Data to Predict Vertical Structure in a Wet Eucalypt Forest
* Assessing Visually-Continuous Corruption Robustness of Neural Networks Relative to Human Performance
* Assessment of Integrated Multi-Satellite Retrievals for Global Precipitation Measurement (IMERG) Precipitation Products in Northwest China
* Assimilation of Moderate-Resolution Imaging Spectroradiometer Level Two Cloud Products for Typhoon Analysis and Prediction
* ASSMark: Dual Defense Against Speech Synthesis Attack via Adversarial Robust Watermarking
* Asynchronous Voice Anonymization by Learning From Speaker-Adversarial Speech
* ATHENA - Autonomous Vehicle Trajectory Planning Considered Human Action Awareness
* Attack as Defense: Proactive Adversarial Multi-Modal Learning to Evade Retrieval
* Attention plays a greater role in judging large color differences than small ones
* Attention-Based Class-Conditioned Alignment for Multi-Source Domain Adaptation of Object Detectors
* Attention-Guided Masked Autoencoders for Learning Image Representations
* Attribute Diffusion: Diffusion Driven Diverse Attribute Editing
* AUCPro: AUC-Oriented Provable Robustness Learning
* Augmenting Satellite Remote Sensing with AERONET-OC for Plume Monitoring in the Chesapeake Bay
* Auto-adjustable dual-information graph regularized NMF for multiview data clustering
* Automated design of neural networks with multi-scale convolutions via multi-path weight sampling
* Automated Detection of Pedestrian and Bicycle Lanes from High-Resolution Aerial Images by Integrating Image Processing and Artificial Intelligence (AI) Techniques
* Automated dual CNN-based feature extraction with SMOTE for imbalanced diabetic retinopathy classification
* Automated Eddy Identification and Tracking in the Northwest Pacific Based on Conventional Altimeter and SWOT Data
* Automated Evaluation of Large Vision-Language Models on Self-Driving Corner Cases
* Automated Mapping of the Freshwater Ecosystem Functional Groups of the International Union for Conservation of Nature Global Ecosystem Typology in a Large Region of Arid Australia
* Automated Patient Positioning with Learned 3D Hand Gestures
* Automated Plasma Region Classification and Boundary Layer Identification Using Machine Learning
* Automatic Detection and Identification of Underdense Meteors Based on YOLOv8n-BP Model
* Automatic Elevation Contour Vectorization: A Case Study in a Deep Learning Approach
* Automatic Registration of Multi-Temporal 3D Models Based on Phase Congruency Method
* Autonomous navigation and visual navigation in robot mission execution
* AutoProSAM: Automated Prompting SAM for 3D Multi-Organ Segmentation
* Autoregressive Adaptive Hypergraph Transformer for Skeleton-Based Activity Recognition
* AutoStory: Generating Diverse Storytelling Images with Minimal Human Efforts
* Auxiliary Particle Flow Track-Before-Detect Algorithm for Marine Neighboring Weak Targets
* Averaging illumination colors of multi-illumination ensembles
* AWDA: Adversarial and Weighted Domain Adaptation for cross-dataset change detection
* BA-LINS: A Frame-to-Frame Bundle Adjustment for LiDAR-Inertial Navigation
* Background-Aware Moment Detection for Video Moment Retrieval
* Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation
* Bandit-based Attention Mechanism in Vision Transformers
* Bandwidth-Efficient Communication Modelling for Autonomous Vehicle Collaborative Perception
* BAPS-DITS: Blockchain-Enabled Accountable Privacy-Preserving Scheme for Decentralized Intelligent Transportation Systems
* BASED: Bundle-Adjusting Surgical Endoscopic Dynamic Video Reconstruction Using Neural Radiance Fields
* Bayesian Compressive Sensing for NLOS mmWave Imaging Under Imprecisely Multiangle Surfaces
* Bayesian Language Model Adaptation for Personalized Speech Recognition
* Bayesian Optimal Latent Projection for Noisy Image Restoration
* Bayesian Time-Domain Ringing Suppression Approach in Impulse Ultrawideband Synthetic Aperture Radar
* Bayesian Variance Change Point Detection With Credible Sets
* BCN: Bidirectional Contrastive Learning Net for Multi-View Clustering
* Beamforming Designs for Hybrid Relaying in mmWave Systems Based on Deep Unfolding
* BeautyBank: Encoding Facial Makeup in Latent Space
* Benchmarking VLMs' Reasoning About Persuasive Atypical Images
* Best Practices for Applying and Interpreting the Total Operating Characteristic
* Beta Sampling is All You Need: Efficient Image Generation Strategy for Diffusion Models Using Stepwise Spectral Analysis
* Better Fit: Accommodate Variations in Clothing Types for Virtual Try-On
* BEVHeight++: Toward Robust Visual Centric 3D Object Detection
* BEVUDA++: Geometric-Aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection
* Beyond Batch Learning: Global Awareness Enhanced Domain Adaptation
* Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection
* Beyond Grids: Exploring Elastic Input Sampling for Vision Transformers
* Beyond mask: Rethinking guidance types in few-shot segmentation
* Beyond R-Barycenters: An Effective Averaging Method on Stiefel and Grassmann Manifolds
* Beyond Spatial Explanations: Explainable Face Recognition in the Frequency Domain
* Beyond sRGB: Optimizing Object Detection with Diverse Color Spaces for Precise Wildfire Risk Assessment
* BGSNet: A boundary-guided Siamese multitask network for semantic change detection from high-resolution remote sensing images
* Bi-Direction Label-Guided Semantic Enhancement for Cross-Modal Hashing
* Bi-LSTM-Based Resilient Data-Driven Integral Sliding Mode Control for UMVs Under Hybrid Attacks
* Bidirectional Multi-Step Domain Generalization for Visible-Infrared Person Re-Identification
* Bidirectional trained tree-structured decoder for Handwritten Mathematical Expression Recognition
* Bilateral Control Model for Autonomous Vehicles Based on Deep Reinforcement Learning
* Bilateral Two-Dimensional Multiview Discriminant Analysis for Image Recognition
* BioNet and NeFF: Crop Biomass Prediction from Point Clouds to Drone Imagery
* Bionic Binaural Perception-Based Performance Enhancement for Orthopedic Surgical Systems
* BioPose: Biomechanically-Accurate 3D Pose Estimation from Monocular Videos
* Bit-Flip Induced Latency Attacks in Object Detection
* BIV-Priv-Seg: Locating Private Content in Images Taken by People With Visual Impairments
* Blind Image Deblurring with FFT-ReLU Sparsity Prior
* Blind Image Quality Assessment: Exploring Content Fidelity Perceptibility via Quality Adversarial Learning
* Blind Video Quality Assessment at the Edge
* Boosting 3D Object Detection via Self-Distilling Introspective Data
* Boosting Context-Aware Speech Translation With Large Language Models
* Boosting Convolution With Efficient MLP-Permutation for Volumetric Medical Image Segmentation
* Boosting Diffusion Guidance via Learning Degradation-Aware Models for Blind Super Resolution
* Boosting of Mutual-Structure Denoising: A Plug-and-Play Solution for Compressive Sampling MRI Reconstruction With Theoretical Guarantees
* Boosting Semi-Supervised Video Action Detection with Temporal Context
* Boundary-aware and cross-modal fusion network for enhanced multi-modal brain tumor segmentation
* Boundary-Supplementary Network for Carotid Plaque Segmentation in Ultrasound Images
* Brain anatomy prior modeling to forecast clinical progression of cognitive impairment with structural MRI
* Breaking the Frame: Visual Place Recognition by Overlap Prediction
* Bridging Domain Gap of Point Cloud Representations via Self-Supervised Geometric Augmentation
* Bridging efficiency and interpretability: Explainable AI for multi-classification of pulmonary diseases utilizing modified lightweight CNNs
* Brightness contrast induced red/green balance shifts in real-world objects
* BroadTrack: Broadcast Camera Tracking for Soccer
* Building Group Recognition Method Integrating Spatial and Semantic Similarity, A
* CabNIR: A Benchmark for In-Vehicle Infrared Monocular Depth Estimation
* CACE: Sim-to-Real Indoor 3D Semantic Segmentation via Context-Aware Augmentation and Consistency Enforcement
* CAGFNet: A Cross-Attention Image-Guided Fusion Network for Disparity Estimation of High-Resolution Satellite Stereo Images
* Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding
* Calibration of Two X-Band Ground Radars Against GPM DPR Ku-Band
* CAMEL: Confidence-Aware Multi-Task Ensemble Learning with Spatial Information for Retina OCT Image Classification and Segmentation
* CamoFA: A Learnable Fourier-Based Augmentation for Camouflage Segmentation
* CAMS: Convolution and Attention-Free Mamba-Based Cardiac Image Segmentation
* Can Adversarial Examples be Parsed to Reveal Victim Model Information?
* Can Location Embeddings Enhance Super-Resolution of Satellite Imagery?
* Can Multimodal Large Language Models Truly Perform Multimodal In-Context Learning?
* Can Out-of-Domain Data Help to Learn Domain-Specific Prompts for Multimodal Misinformation Detection?
* Can satellite observations detect global ocean heat content change with high resolution by deep learning?
* Can Separation Enhance Fusion? An Efficient Framework for Target Detection in Multimodal Remote Sensing Imagery
* Cap2Aug: Caption Guided Image data Augmentation
* Capability Analysis of Earth Observation Data for Integrated Emergency Management
* CardioSyntax: End-to-End SYNTAX Score Prediction - Dataset, Benchmark and Method
* Cascade Learning Early Classification: A Novel Cascade Learning Classification Framework for Early-Season Crop Classification
* Cascade residual learning based adaptive feature aggregation for light field super-resolution
* Cascaded Dual Vision Transformer for Accurate Facial Landmark Detection
* Cascaded Physical-constraint Conditional Variational Auto Encoder with socially-aware diffusion for pedestrian trajectory prediction
* Case Study on the Use of an Unmanned Aerial System and Terrestrial Laser Scanner Combination Analysis Based on Slope Anchor Damage Factors
* CAST: Contrastive Analysis of Spatial and Temporal Features for QIM-Based VoIP Steganalysis
* CATALOG: A Camera Trap Language-Guided Contrastive Learning Model
* CCASeg: Decoding Multi-Scale Context with Convolutional Cross-Attention for Semantic Segmentation
* CE-VAE: Capsule Enhanced Variational AutoEncoder for Underwater Image Enhancement
* CEMIL: Contextual Attention Based Efficient Weakly Supervised Approach for Histopathology Image Classification
* Changes in Tourists' Perceptions of Community-Based Ecotourism (CBET) After COVID-19 Pandemic: A Study on the Country of Origin and Economic Development Level
* Channel Propagation Networks for Refreshable Vision Transformer
* Channel-Adaptive Range-Doppler Domain Filtering Serial BAQ Algorithm and Comparative Analysis, A
* CharacterFactory: Sampling Consistent Characters With GANs for Diffusion Models
* Characteristics of Eddy Dissipation Rates in Atmosphere Boundary Layer Using Doppler Lidar
* Characterization, Analysis, and Modeling of the Non-Stationary V2I Channel at 28-GHz on Highways
* Characterizing Seasonal Variation of the Atmospheric Mixing Layer Height Using Machine Learning Approaches
* CharDiff: Improving Sampling Convergence via Characteristic Function Consistency in Diffusion Models
* ChromaDistill: Colorizing Monochrome Radiance Fields with Knowledge Distillation
* CIRCOD: Co-Saliency Inspired Referring Camouflaged Object Discovery
* CISOL: An Open and Extensible Dataset for Table Structure Recognition in the Construction Industry
* CL-Cross VQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering
* Clarity Amidst Blur: A Deterministic Method for Synthetic Generation of Water Droplets on Camera Lenses
* Class activation map guided level sets for weakly supervised semantic segmentation
* Class-Agnostic Repetitive Action Counting Using Wearable Devices
* Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation
* Class-Conditioned Transformation for Enhanced Robust Image Classification
* CLASS: Conditional Latent Architecture for Search and Synthesis of Design Layouts
* Classification of Forest Stratification and Evaluation of Forest Stratification Changes over Two Periods Using UAV-LiDAR
* Classroom teacher behavior analysis: The TBU dataset and performance evaluation
* CLFace: A Scalable and Resource-Efficient Continual Learning Framework for Lifelong Face Recognition
* Click&Describe: Multimodal Grounding and Tracking for Aerial Objects
* Climatic Vulnerability of El Mirador de Lambayeque Archaeological Complex (8th-11th Century AD): Morphometric Analyses of Digital Surface Models
* CLIP-Based Camera-Agnostic Feature Learning for Intra-Camera Supervised Person Re-Identification
* CLIP-Driven Transformer for Weakly Supervised Object Localization
* CLIP-Fusion: A Spatio-Temporal Quality Metric for Frame Interpolation
* CLIP-TNseg: A Multi-Modal Hybrid Framework for Thyroid Nodule Segmentation in Ultrasound Images
* CLIPArTT: Adaptation of CLIP to New Domains at Test Time
* CLIPping Imbalances: A Novel Evaluation Baseline and PEARL Dataset for Pedestrian Attribute Recognition
* CLIPScope: Enhancing Zero-Shot OOD Detection with Bayesian Scoring
* Closing the Domain Gap in Manga Colorization via Aligned Paired Dataset
* Cloud and Aerosol Impacts on the Radiation Budget over China from 2000 to 2023
* CLRNetV2: A Faster and Stronger Lane Detector
* Clustering-Based Adaptive Query Generation for Semantic Segmentation
* CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets
* CMAE-3D: Contrastive Masked AutoEncoders for Self-Supervised 3D Object Detection
* CmdVIT: A Voluntary Facial Expression Recognition Model for Complex Mental Disorders
* CNN-Transformer Rectified Collaborative Learning for Medical Image Segmentation
* Co-Design of Enhanced Fuzzy Observer-Based Estimation and Gain-Scheduling Control for Active Suspension Systems Under Malicious Attacks
* Co-evidential fusion with information volume for semi-supervised medical image segmentation
* Codesign of Transmit Waveform and Receive Filter with Similarity Constraints for FDA-MIMO Radar
* Cognition Transferring and Decoupling for Text-Supervised Egocentric Semantic Segmentation
* ColFigPhotoAttnNet: Reliable Finger Photo Presentation Attack Detection Leveraging Window-Attention on Color Spaces
* Collaborative Static-Dynamic Teaching: A Semi-Supervised Framework for Stripe-like Space Target Detection
* Color correction for multi-camera systems: aligning white points
* Color Decoupling for Multi-Illumination Color Constancy
* Color number and texture perception
* Color statistics of images created by generative AI
* Color Vision 2025: Introduction by the feature editors
* ColorizeDiffusion: Improving Reference-Based Sketch Colorization with Latent Diffusion Model
* Combating Label Noise with a General Surrogate Model for Sample Selection
* Combined L-Band Polarimetric SAR and GPR Data to Develop Models for Leak Detection in the Water Pipeline Networks
* Combining Inherent Knowledge of Vision-Language Models with Unsupervised Domain Adaptation Through Strong-Weak Guidance
* ComFace: Facial Representation Learning with Synthetic Data for Comparing Faces
* Communication Efficient Federated Learning for Multi-Organ Segmentation via Knowledge Distillation with Image Synthesis
* Comparative Analysis of Novel View Synthesis and Photogrammetry for 3D Forest Stand Reconstruction and Extraction of Individual Tree Parameters
* Comparative Analysis of Satellite-Based Precipitation Products During Extreme Rainfall from Super Typhoon Yagi in Hanoi, Vietnam (September 2024)
* Comparative Evaluation of 3D Reconstruction Methods for Object Pose Estimation
* Comparative Evaluation of Two Bias Correction Approaches for SST Forecasting: Data Assimilation Versus Deep Learning Strategies, A
* Comparative Knowledge Distillation
* Comparing Car-Following Behavior Patterns of Human-Driven Vehicles and Autonomous Vehicles in a Mixed Traffic Environment
* Comparing Satellite-Derived and Model-Based Surface Soil Moisture for Spring Barley Yield Prediction in Central Europe
* Comparison of a Continuous Forest Inventory to an ALS-Derived Digital Inventory in Washington State
* Comparison of gap-filling methods for generating landsat-like land surface temperatures under all-weather conditions
* Comparison of Recent Global Time-Series Land Cover Products, A
* Complementary label learning with multi-view data and a semi-supervised labeling mechanism
* Complex Singular Spectrum Analysis Leveraging Adaptive Taper Windows for Enhancing Mode Reconstruction From Multivariate Signals
* Composed Image Retrieval for Training-FREE DOMain Conversion
* Compositional Segmentation of Cardiac Images Leveraging Metadata
* Comprehensive Analysis Based on Observation, Remote Sensing, and Numerical Models to Understand the Meteorological Environment in Arid Areas and Their Surrounding Areas
* Comprehensive Discussion on Remote Sensing Modeling and Dynamic Electromagnetic Scattering for Aircraft with Speed Brake Deflection
* Comprehensive Evaluation of Land Reclamation Effectiveness in Mining Areas: An Integrated Assessment of Soil, Vegetation, and Ecological Conditions, A
* Comprehensive Evaluation of the Lunar South Pole Landing Sites Using Self-Organizing Maps for Scientific and Engineering Purposes
* Comprehensive Validation of MODIS-Derived Instantaneous Air Temperature and Daily Minimum Temperature at Nighttime
* Computerized Proof of Fundamental Properties of the p-Median Problem Using Integer Linear Programming and a Theorem Prover
* Conceptual Learning via Embedding Approximations for Reinforcing Interpretability and Transparency
* Conceptual Neighborhood Graphs of Topological Relations in Z2
* ConDiSR: Contrastive Disentanglement and Style Regularization for Single Domain Generalization
* Conditional Diffusion Model for Skeleton-Based Gesture Recognition With Severe Occlusions
* Conditional GAN for Enhancing Diffusion Models in Efficient and Authentic Global Gesture Generation from Audios
* Confidence guided semi-supervised cross-modality person re-identification
* Confident Multi-View Stereo
* Conflict-Guided Evidential Multimodal Fusion for Semantic Segmentation, A
* Conformal e-prediction
* Conic Transformation Approach for Solving the Perspective-Three-Point Problem, A
* Consistency-Queried Transformer for Audio-Visual Segmentation
* Content-Based Image Retrieval (CBIR): Using Combined Color and Texture Features (TriCLR and HistLBP)
* Context Perception Parallel Decoder for Scene Text Recognition
* Context-Aware Feature Fusion Method for Multi-UAV Cooperative Air Combat, A
* Context-Aware Multi-view Stereo Network for Efficient Edge-Preserving Depth Estimation
* Context-Aware Optimal Transport Learning for Retinal Fundus Image Enhancement
* Context-Aware Outlier Rejection for Robust Multi-View 3D Tracking of Similar Small Birds in An Outdoor Aviary
* Context-Aware Real-Time Semantic View Expansion of Intraoperative 4D OCT
* ContextIQ: A Multimodal Expert-Based Video Retrieval System for Contextual Advertising
* Contextual and uncertainty-aware approach for multi-person pose estimation
* Continual Learning in 3D Point Clouds: Employing Spectral Techniques for Exemplar Selection
* Continual Learning of Personalized Generative Face Models with Experience Replay
* Continuous flood monitoring using on-demand SAR data acquired with different geometries: Methodology and test on COSMO-SkyMed images
* Continuous Spatio-Temporal Memory Networks for 4D Cardiac Cine MRI Segmentation
* Contour Knowledge-Aware Perception Learning for Semantic Segmentation
* Contrastive Learning of Image Representations Guided by Spatial Relations
* Contrastive Learning with Image Deformation and Refined NT-Xent Loss for Urban Morphology Discovery
* Contrastive Semantic-Aware Masked Autoencoder for Point Cloud Self-Supervised Learning
* Contrastive Sequential-Diffusion Learning: Non-Linear and Multi-Scene Instructional Video Synthesis
* Contrastive-Learning Framework for Unsupervised Salient Object Detection, A
* Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation
* Controlling vision-language model for enhancing image restoration
* Converting Interference to Gain: Enhancing Sensing Capabilities of ISAC Systems via Noncooperative Base Station Signals
* Convex Combination-Based Distributed Momentum Methods Over Directed Graphs, A
* ConvMixFormer: A Resource-Efficient Convolution Mixer for Transformer-Based Dynamic Hand Gesture Recognition
* Convolutional neural network framework for deepfake detection: A diffusion-based approach
* Cooperative Multi-Agent Charging Scheduling by Regrouping EVs With Differentiated Deadlines, A
* Coordinate Registration Method for Over-the-Horizon Radar Based on Graph Matching, A
* Copy or Not? Reference-Based Face Image Restoration with Fine Details
* Corgi: Cached Memory Guided Video Generation
* Corner selection and dual network blender for efficient view synthesis in outdoor scenes
* Correlated Topic Modeling for Short Texts in Spherical Embedding Spaces
* CorrFill: Enhancing Faithfulness in Reference-Based Inpainting with Correspondence Guidance in Diffusion Models
* Coseismic Rupture and Postseismic Afterslip of the 2020 Nima Mw 6.4 Earthquake
* COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes
* Counterfactual learning and saliency augmentation for weakly supervised semantic segmentation
* Counting Guidance for High Fidelity Text-to-Image Synthesis
* Covariance-Based Space Regularization for Few-Shot Class Incremental Learning
* Cover Crop Types Influence Biomass Estimation Using Unmanned Aerial Vehicle-Mounted Multispectral Sensors
* CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving
* CRAAC: Consistency Regularised Active Learning with Automatic Corrections for Real-Life Road Image Annotations
* Crackstructures and Crackensembles: The Power of Multi-View for 2.5D Crack Detection
* CRAFT: Class Ranking Aware Fine-Tuning for Enhanced Out-of-Distribution Detection
* CRAFT: Designing Creative and Functional 3D Objects
* Crafting Distribution Shifts for Validation and Training in Single Source Domain Generalization
* CRCL: Causal Representation Consistency Learning for Anomaly Detection in Surveillance Videos
* Critical Success Factors of Participatory Community Planning with Geospatial Digital Participatory Platforms
* Cross Image Feature Perturbation with Pseudo Label Fusion for Semi-Supervised Medical Image Segmentation
* Cross-Aligned Fusion For Multimodal Understanding
* Cross-cultural preferences for optimal daylight illumination in viewing human faces
* Cross-Domain and Cross-Dimension Learning for Image-to-Graph Transformers
* Cross-domain distribution adversarial diffusion model for synthesizing contrast-enhanced abdomen CT imaging
* Cross-Domain Invariant Feature Absorption and Domain-Specific Feature Retention for Domain Incremental Chest X-Ray Classification
* Cross-Domain Landslide Extraction Method Utilizing Image Masking and Morphological Information Enhancement, A
* Cross-Domain Multi-Modal Few-Shot Object Detection via Rich Text
* Cross-domain person re-identification via learning Heterogeneous Pseudo Labels
* Cross-erasure enhanced network for occluded person re-identification
* Cross-Estimation Method for Spaceborne Synthetic Aperture Radar Range Antenna Pattern Using Pseudo-Invariant Natural Scenes, A
* Cross-Granularity Network for Vehicle Make and Model Recognition
* Cross-Level Adaptive Feature Aggregation Network for Arbitrary-Oriented SAR Ship Detection
* Cross-Level Multi-Instance Distillation for Self-Supervised Fine-Grained Visual Categorization
* Cross-Modal Causal Representation Learning for Radiology Report Generation
* Cross-modal contrastive learning with multi-hierarchical tracklet clustering for multi object tracking
* Cross-Modal Feature Alignment and MMD Improve Robustness of Prompt Tuning
* Cross-Modal Hashing via Diverse Instances Matching
* Cross-Modal Hierarchical Knowledge Distillation for Image Aesthetics Assessment
* Cross-Modal Knowledge Diffusion-Based Generation for Difference-Aware Medical VQA
* Cross-scene visual context parsing with large vision-language model
* Cross-Task Affinity Learning for Multitask Dense Scene Predictions
* Cross-task and time-aware adversarial attack framework for perception of autonomous driving
* Cross-Task Crash Severity Analysis With Cost-Sensitive Transfer Graph Convolutional Network
* Cross-testing methodology for pattern learning and model transfer in rare fog events detection
* Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance
* Crossroads of Continents: Automated Artifact Extraction for Cultural Adaptation with Large Multimodal Models
* CrowdMAC: Masked Crowd Density Completion for Robust Crowd Density Forecasting
* CryoMAE: Few-Shot Cryo-EM Particle Picking with Masked Autoencoders
* CSA: Cross-scale alignment with adaptive semantic aggregation and filter for image-text retrieval
* CSFRNet: Integrating Clothing Status Awareness for Long-Term Person Re-identification
* CT to PET Translation: A Large-Scale Dataset and Domain-Knowledge-Guided Diffusion Approach
* CTIP: Towards Accurate Tabular-to-Image Generation for Tire Footprint Generation
* CUNSB-RFIE: Context-Aware Unpaired Neural Schrödinger Bridge in Retinal Fundus Image Enhancement
* CusConcept: Customized Visual Concept Decomposition with Diffusion Models
* CV-YOLO: A Complex-Valued Convolutional Neural Network for Oriented Ship Detection in Single-Polarization Single-Look Complex SAR Images
* Cycle-VQA: A Cycle-Consistent Framework for Robust Medical Visual Question Answering
* CycleCrash: A Dataset of Bicycle Collision Videos for Collision Prediction and Analysis
* CycleMatch: Cyclic pseudo-labeling distillation in semi-supervised medical image segmentation
* D-LUT: Photorealistic Style Transfer via Diffusion Process
* D2FP: Learning Implicit Prior for Human Parsing
* D3QN-Based IAB Resource Allocation and Tethered UAV Positioning for IoT Networks
* DAG Blockchain-Assisted Asynchronous Federated Mutual Learning for Autonomous Driving
* Dam: Dynamic Adapter Merging for Continual Video QA Learning
* Dance any Beat: Blending Beats with Visuals in Dance Video Generation
* DARCS: Memory-Efficient Deep Compressed Sensing Reconstruction for Acceleration of 3D Whole-Heart Coronary MR Angiography
* DARDA: Domain-Aware Real-Time Dynamic Neural Network Adaptation
* Dark channel map and union training strategy for object detection in foggy scenes
* DarSwin-Unet: Distortion Aware Architecture
* DASC-SPT: Towards Self-Supervised Panoramic Semantic Segmentation
* DashCop: Automated E-Ticket Generation for Two-Wheeler Traffic Violations Using Dashcam Videos
* Data Augmentation for Image Classification Using Generative AI
* Data Augmentation for Surgical Scene Segmentation with Anatomy-Aware Diffusion Models
* Data Generation for Hardware-Friendly Post-Training Quantization
* Data Integration Based on UAV Multispectra and Proximal Hyperspectra Sensing for Maize Canopy Nitrogen Estimation
* Data Perspective on Enhanced Identity Preservation for Diffusion Personalization, A
* Data-Driven Inverse Reinforcement Learning for Heterogeneous Optimal Robust Formation Control
* Data-driven recognition of uncertainty by integrating matrix factorization and kernel smoothing methods
* Data-Efficient 3D Visual Grounding via Order-Aware Referring
* Data-Efficient Alignment in Medical Imaging via Reconfigurable Generative Networks
* Dataset Augmentation by Mixing Visual Concepts
* DAU-YOLO: A Lightweight and Effective Method for Small Object Detection in UAV Images
* DAWN: Domain-Adaptive Weakly Supervised Nuclei Segmentation via Cross-Task Interactions
* Daytime Surface Urban Heat Island Variation in Response to Future Urban Expansion: An Assessment of Different Climate Regimes
* DB-MFENet: A Dual-Branch Multi-Frequency Feature Enhancement Network for Hyperspectral Image Classification
* DBFAM: A dual-branch network with efficient feature fusion and attention-enhanced gating for medical image segmentation
* DCCLA: Dense Cross Connections With Linear Attention for LiDAR-Based 3D Pedestrian Detection
* DDPM-CD: Denoising Diffusion Probabilistic Models as Feature Extractors for Remote Sensing Change Detection
* DDPM-EMF: a denoising diffusion probabilistic model-based feature-enhancement fusion network for medical image fusion
* DDS: Decoupled Dynamic Scene-Graph Generation Network
* Dead Sea Stromatolite Reefs: Testing Ground for Remote Sensing Automated Detection of Life Forms and Their Traces in Harsh Environments
* Debiased Mapping for Full-Reference Image Quality Assessment
* Debiasify: Self-Distillation for Unsupervised Bias Mitigation
* Decentralized and Communication-Based Multi-Agent Traffic Signal Control Model Employing a Graph Representation for the State
* Decentralized Smoothing ADMM for Quantile Regression With Non-Convex Sparse Penalties
* Decentralized, Secure, and Reliable Vehicle Platoon Formation With Privacy Protection for Autonomous Vehicles, A
* Deciding Truck Platooning Operational Rules With Stochastic Approximation for Vehicle Safe Overtaking on Two-Lane Undivided Highways
* Deciphering the Complaint Aspects: Towards an Aspect-Based Complaint Identification Model with Video Complaint Dataset in Finance
* Declining Snow Resources Since 2000 in Arid Northwest China Based on Integrated Remote Sensing Indicators
* DeCLIP: Decoding CLIP Representations for Deepfake Localization
* DecloudFormer: Quest the key to consistent thin cloud removal of wide-swath multi-spectral images
* Decoding Agricultural Drought Resilience: A Triple-Validated Random Forest Framework Integrating Multi-Source Remote Sensing for High-Resolution Monitoring in the North China Plain
* Decomposed Distribution Matching in Dataset Condensation
* Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection
* Deduce and Select Evidences with Language Models for Training-Free Video Goal Inference
* Deep Attention Learning for Pre-operative Lymph Node Metastasis Prediction in Pancreatic Cancer via Multi-object Relationship Modeling
* Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation
* Deep Joint Unrolling for Deblurring and Low-Light Image Enhancement (JUDE)
* Deep learning coupled with split window and temperature-emissivity separation (DL-SW-TES) method improves clear-sky high-resolution land surface temperature estimation
* Deep Learning for Enhanced-Resolution Reconstruction of Sentinel-1 Backscatter NRCS in China's Offshore Seas
* Deep learning in remote sensing image matching: A survey
* Deep Learning Method for Land Use Classification Based on Feature Augmentation, A
* Deep Learning-Based Bathymetry Retrieval without in-situ Depths Using Remote Sensing Imagery and SfM-MVS DSMs with Data Gaps
* Deep Learning-Based Cloud Detection for Optical Remote Sensing Images: A Survey
* Deep Learning-Based Magnetic Resonance Image Segmentation and Classification for Alzheimer's Disease Diagnosis
* Deep Metric Learning for Unsupervised Remote Sensing Change Detection
* Deep Multi-Modal Ship Detection and Classification Network
* deep reinforcement active learning method for multi-label image classification, A
* Deep Reinforcement Learning-Based Computational Offloading for Space-Air-Ground Integrated Vehicle Networks
* Deep Unfolding Network for Image Desnowing With Snow Shape Prior
* Deep Wavelet Temporal-Frequency Attention for nonlinear fMRI factorization in ASD
* DeepCA: Deep Learning-Based 3D Coronary Artery Tree Reconstruction from Two 2D Non-Simultaneous X-Ray Angiography Projections
* DeepCropClustering: A deep unsupervised clustering approach by adopting nearest and farthest neighbors for crop mapping
* Deepfake detection with domain generalization and mask-guided supervision
* DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection
* DeepMIM: Deep Supervision for Masked Image Modeling
* DeepNet: Protection of deepfake images with aid of deep learning networks
* Defending Against Adversarial Attack Through Generative Adversarial Networks
* Defending Against Repetitive Backdoor Attacks on Semi-Supervised Learning Through Lens of Rate-Distortion-Perception Trade-Off
* Dehazing Method for UAV Remote Sensing Based on Global and Local Feature Collaboration, A
* Delta-NAS: Difference of Architecture Encoding for Predictor-Based Evolutionary Neural Architecture Search
* Delving Deep into Simplicity Bias for Long-Tailed Image Recognition
* Demystifying Variational Diffusion Models
* Denoising and Feature Enhancement Network for Target Detection Based on SAR Images
* Denoising Diffusion Models for High-Resolution Microscopy Image Restoration
* Dense 3D Reconstruction Based on Multi-Aspect SAR Using a Novel SAR-DAISY Feature Descriptor
* Dense Depth from Event Focal Stack
* Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter
* Denser Teacher: Rethinking Dense Pseudo-Label for Semi-Supervised Oriented Object Detection
* DepthSSC: Monocular 3D Semantic Scene Completion via Depth-Spatial Alignment and Voxel Adaptation
* Dequantization and Color Transfer with Diffusion Models
* Design and Control of Personalized Steering Feel for Steer-by-Wire Systems
* Design and Development of a Local-First Collaborative 3D WebGIS Application for Mapping
* Design Principles of Multi-Scale J-Invariant Networks for Self-Supervised Image Denoising
* Design-O-Meter: Towards Evaluating and Refining Graphic Designs
* Detecting Flooded Areas Using Sentinel-1 SAR Imagery
* Detecting Hearing Impairment Through Localizing Abnormal Speech Patterns
* Detecting Long-Term Spatiotemporal Dynamics of Urban Green Spaces with Training Sample Migration Method
* Detecting Origin Attribution for Text-to-Image Diffusion Models
* Detecting Post-Midnight Plasma Depletions Through Plasma Density and Electric Field Measurements in the Low-Latitude Ionosphere
* Detecting the Distribution of Callery Pear (Pyrus calleryana) in an Urban U.S. Landscape Using High Spatial Resolution Satellite Imagery and Machine Learning
* Detecting Wildfires on UAVs with Real-Time Segmentation Trained by Larger Teacher Models
* Detection and Cover Integrated Waveform Design Method with Good Correlation Characteristics and Doppler Tolerance, A
* Detection and Spatiotemporal Distribution Analysis of Vertically Developing Convective Clouds over the Tibetan Plateau and East Asia Using GEO-KOMPSAT-2A Observations
* Detection of Small-Scale Subsurface Echoes Using Lunar Radar Sounder and Surface Scattering Simulations with a DEM Generated Using a Generative Adversarial Network
* Detection of Violent Content in Videos Using Attention-Augmented 3-D Convolutional Networks
* Detective Networks: Enhancing Disaster Recognition in Images Through Attention Shifting Using Optimal Masking
* Detector With Classifier²: An End-to-End Multi-Stream Feature Aggregation Network for Fine-Grained Object Detection in Remote Sensing Images
* Developing an Objective Scheme to Construct Hurricane Bogus Vortices Based on Scatterometer Sea Surface Wind Data
* Development of a Distance-Adaptive Gaussian Fitting Method for Scheimpflug LiDAR-Based Plant Phenotyping
* Development of Trio Optimal Feature Extraction Model for Attention-Based Adaptive Weighted RNN-Based Lung and Colon Cancer Detection Framework Using Histopathological Images
* DFDW: Distribution-aware Filter and Dynamic Weight for open-mixed-domain Test-time adaptation
* DFedADMM: Dual Constraint Controlled Model Inconsistency for Decentralized Federated Learning
* diabetic retinopathy classification method based on image-text contrastive learning, A
* Diagnostic Analysis of the 2024 Beijing May 30 Gale Simulation Based on Satellite Observation Products, A
* DiaMond: Dementia Diagnosis with Multi-Modal Vision Transformers Using MRI and PET
* Differential Privacy Mechanisms in Neural Tangent Kernel Regression
* Differentially Private Integrated Decision Gradients (IDG-DP) for Radar-Based Human Activity Recognition
* Difficulty, Diversity, and Plausibility: Dynamic Data-Free Quantization
* DiffMesh: A Motion-Aware Diffusion Framework for Human Mesh Recovery from Videos
* DiffMIC-v2: Medical Image Classification via Improved Diffusion Network
* DiffPAD: Denoising Diffusion-Based Adversarial Patch Decontamination
* DiffQRCoder: Diffusion-Based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement
* DiffuCE: Expert-Level CBCT Image Enhancement Using a Novel Conditional Denoising Diffusion Model with Latent Alignment
* DiffUIE: Learning Latent Global Priors in Diffusion Models for Underwater Image Enhancement
* DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion Based Generation and Model Pretraining
* DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models
* Diffusion Augmented Complex Maximum Total Correntropy Algorithm for Power System Frequency Estimation
* Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation
* Diffusion Model-Based Image Editing: A Survey
* Diffusion Models in Low-Level Vision: A Survey
* Diffusion-Based Conditional Image Editing Through Optimized Inference with Guidance
* Diffusion-Based Generative Regularization for Supervised Discriminative Learning
* Diffusion-Based Particle-DETR for BEV Perception
* Diffusion-based Visual Anagram as Multi-task Learning
* DiHuR: Diffusion-Guided Generalizable Human Reconstruction
* DiL: An Explainable and Practical Metric for Abnormal Uncertainty in Object Detection
* Direct and Indirect Effects of Large-Scale Forest Restoration on Water Yield in China's Large River Basins
* Direct Comparison of Infrared Channel Measurements by Two ABIs to Monitor Their Calibration Stability
* DisCo: Discovering Common Affordance from Large Models for Actionable Part Perception
* Discriminative Score Suppression for Weakly Supervised Video Anomaly Detection
* Disentangle Source and Target Knowledge for Continual Test-Time Adaptation
* Disentangled Noisy Correspondence Learning
* Disentanglement and codebook learning-induced feature match network to diagnose neurodegenerative diseases on incomplete multimodal data
* Disentangling Disentangled Representations: Towards Improved Latent Units via Diffusion Models
* Disentangling Spatio-Temporal Knowledge for Weakly Supervised Object Detection and Segmentation in Surgical Video
* Disentangling Subject-Irrelevant Elements in Personalized Text-to-Image Diffusion via Filtered Self-Distillation
* DisFlowEm: One-Shot Emotional Talking Head Generation Using Disentangled Pose and Expression Flow-Guidance
* Distillation of Diffusion Features for Semantic Correspondence
* Distillation-Based Cross-Model Transferable Adversarial Attack for Remote Sensing Image Classification
* Distilling Aggregated Knowledge for Weakly-Supervised Video Anomaly Detection
* Distilling Multi-Level Semantic Cues Across Multi-Modalities for Face Forgery Detection
* Distributed Cooperative Control and Robust Optimization for Nonlinear Connected Automated Vehicles With Unknown Reaction Time Delays and Jerk Dynamics
* Distributed Fault-Tolerant Control Strategy for Virtual Coupling Train System Against Measurement Errors and Loss of Actuator Effectiveness
* Distributed Low-Degree-of-Freedom Aerial Target Localization Method Based on Hybrid Measurements, A
* Distributed Modeling and Scenario-Driven Extension Hybrid-DMPC Coordinated Control of Autonomous Vehicle Chassis
* Distributed Secure State Estimation Against Stealthy Attacks
* Distribution Optimization Under Gaussian Hypothesis for Domain Adaptive Semantic Segmentation
* DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing
* DivAvatar: Diverse 3D Avatar Generation with a Single Prompt
* Divergent Domains, Convergent Grading: Enhancing Generalization in Diabetic Retinopathy Grading
* DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-Id
* DMCTDet: A density map-guided composite transformer network for object detection of UAV images
* DMPT: Decoupled Modality-Aware Prompt Tuning for Multi-Modal Object Re-Identification
* DMRN: A Dynamical Multi-Order Response Network for the Robust Lung Airway Segmentation
* DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing
* DOA Estimation for Coherent and Non-Coherent Mixed Signals Using Toeplitz Diagonal Diffusion
* DocMatcher: Document Image Dewarping via Structural and Textual Line Matching
* DocTTT: Test-Time Training for Handwritten Document Recognition Using Meta-Auxiliary Learning
* Documentation for Architectural Heritage: A Historical Building Information Modeling Data Modeling Approach for the Valentino Castle North Wing
* Does Adding a Modality Really Make Positive Impacts in Incomplete Multi-Modal Brain Tumor Segmentation?
* Does background color influence the perception of facial expression? Adjustment to neutral expression by Caucasian and Japanese participants
* Domain adaptive depth completion via spatial-error consistency
* Domain consistency learning for continual test-time adaptation in image semantic segmentation
* Domain generalization for image classification with dynamic decision boundary
* Domain Generalization using Large Pretrained Models with Mixture-of-Adapters
* Domain-Generalized Object Anti-Spoofing: Bridging Gaps and Patch Selection for Robust Detection Across Domains
* Domain-guided multi-frequency underwater image enhancement network
* Domain-Guided Weight Modulation for Semi-Supervised Domain Generalization
* Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models
* Downscaling and Gap-Filling GRACE-Based Terrestrial Water Storage Anomalies in the Qinghai-Tibet Plateau Using Deep Learning and Multi-Source Data
* DPA: Dual Prototypes Alignment for Unsupervised Adaptation of Vision-Language Models
* DragonTrack: Transformer-Enhanced Graphical Multi-Person Tracking in Complex Scenarios
* Dragtext: Rethinking Text Embedding in Point-Based Image Editing
* DreamBlend: Advancing Personalized Fine-Tuning of Text-to-Image Diffusion Models
* DreaMo: Articulated 3D Reconstruction from a Single Casual Video
* DRIFT: A Dynamic Crowd Inflow Control System Using LSTM-Based Deep Reinforcement Learning
* DrIFT: Autonomous Drone Dataset with Integrated Real and Synthetic Data, Flexible Views, and Transformed Domains
* Driving Safety Risk Analysis and Assessment in a Mixed Driving Environment of Connected and Non-Connected Vehicles: A Systematic Survey
* Drone Height from Ground Determination Using GNSS-R Based on Dual-Frequency GPS/BDS Signals
* Dropout Connects Transformers and CNNs: Transfer General Knowledge for Knowledge Distillation
* Dropout the High-Rate Downsampling: A Novel Design Paradigm for UHD Image Restoration
* DSDC-NET: Semi-supervised superficial OCTA vessel segmentation for false positive reduction
* DSDNet: Target Detection Algorithm for SDSS Photometric Images Based on Convolutional Neural Networks
* DSMT: Dual-Stage Multiscale Transformer for Hyperspectral Snapshot Compressive Imaging
* DST-Det: Open-Vocabulary Object Detection via Dynamic Self-Training
* DSTIGCN: Deformable Spatial-Temporal Interaction Graph Convolution Network for Pedestrian Trajectory Prediction
* DSTR: Dual Scenes Transformer for Cross-Modal Fusion in 3D Object Detection
* DT Assisted Task Offloading for C-V2X Networks With Imperfect DT Prediction Conditions
* DT-LSD: Deformable Transformer-Based Line Segment Detection
* DTA: Dual Temporal-channel-wise Attention for Spiking Neural Networks
* Dual region mutual enhancement network for camouflaged object detection
* Dual Space Representation Learning for Skeleton-Based Action Recognition
* Dual structure-aware consensus graph learning for incomplete multi-view clustering
* Dual-Domain Multi-Task Learning-Based Domain Adaptation for Hyperspectral Image Classification
* Dual-function discriminator for semantic image synthesis in variational GANs
* DUAL-GDFQ: A Dual-Generator, Dual-Phase Learning Approach for Data-Free Quantization
* Dual-Level Modality De-Biasing for RGB-T Tracking
* Dual-Representation Interaction Driven Image Quality Assessment with Restoration Assistance
* Dual-Schedule Inversion: Training- and Tuning-Free Inversion for Real Image Editing
* Dual-Space Video Person Re-identification
* DualCIR: Enhancing Training-Free Composed Image Retrieval via Dual-Directional Descriptions
* Dynamic accumulated attention map for interpreting evolution of decision-making in vision transformer
* Dynamic Adapter Tuning for Long-Tailed Class-Incremental Learning
* Dynamic Attention-Guided Diffusion for Image Super-Resolution
* Dynamic Coplanar Array Capacitance Imaging Method for Asphalt Materials With Concealed Damages
* Dynamic Diagnosis of an Extreme Precipitation Event over the Southern Slope of Tianshan Mountains Using Multi-Source Observations
* Dynamic Equilibrium Strategy for Road Sensing Systems Considering Open Circuit Faults in DTP-PMSM
* Dynamic Estimation Method for the Headway of Virtual Coupling Trains Utilizing the High-Order Extended Kalman Filter-Based Smoother, A
* Dynamic feature extraction and histopathology domain shift alignment for mitosis detection
* Dynamic Hierarchical Convolutional Attention Network for Recognizing Motor Imagery Intention
* Dynamic Light Path and Bidirectional Reflectance Effects on Solar Noise in UAV-Borne Photon-Counting LiDAR
* Dynamic Locomotion Synchronization and Fuzzy Control of a Lower Limb Exoskeleton With Body Weight Support for Active Following Human Operator
* dynamic predictive transformer with temporal relevance regression for action detection, A
* Dynamic semantic prototype perception for text-video retrieval
* Dynamic Slice Resource Management and Information Synchronization Strategy in IoV Based on Digital Twin
* Dynamic Systems Approach to Modeling Human-Machine Rhythm Interaction, A
* Dynamical Threshold-Based Fractional Anisotropic Diffusion for Speckle Noise Removal
* Dynamics of S-cone contributions to the initiation of saccadic and smooth pursuit eye movements
* DyRoNet: Dynamic Routing and Low-Rank Adapters for Autonomous Driving Streaming Perception
* Early Post-Seismic Deformation Revealed After the Wushi (China) Earthquake (Mw = 7.1) Occurred on 22 January 2024
* EasyRet3D: Uncalibrated Multi-View Multi-Human 3D Reconstruction and Tracking
* ECF-YOLOv7-Tiny: Improving Feature Fusion and the Receptive Field for Lightweight Object Detectors
* EchoDFKD: Data-Free Knowledge Distillation for Cardiac Ultrasound Segmentation Using Synthetic Data
* ECINFusion: A Novel Explicit Channel-Wise Interaction Network for Unified Multi-Modal Medical Image Fusion
* EDFF-Unet: An Improved Unet-Based Method for Cloud and Cloud Shadow Segmentation in Remote Sensing Images
* EdgeGaussians - 3D Edge Mapping via Gaussian Splatting
* EDMB: Edge Detector with Mamba
* Effect of stimulus size on chromatic discrimination
* Effective and Efficient Medical Image Segmentation with Hierarchical Context Interaction
* Effective Backdoor Learning on Open-Set Face Recognition Systems
* Effective Global Context Integration for Lightweight 3D Medical Image Segmentation
* Effective Scene Graph Generation by Statistical Relation Distillation
* Effective Yet Fast Early Stopping Metric for Deep Image Prior in Image Denoising, An
* Efficient and Multi-Dimensional Privacy-Preserving Platoon Communication Scheme in Vehicular Networks, An
* Efficient Brain Tumor Prediction Using Pteropus Unicinctus Optimization on Deep Neural Network, An
* Efficient Image Super-Resolution with Feature Interaction Weighted Hybrid Network
* Efficient large-scale vegetation mapping at the formation level using multi-source data: A case study in Beijing, China
* Efficient metric-resolution land cover mapping using open-access low resolution annotations with prototype learning and modified Segment Anything model
* Efficient Progressive Image Compression with Variance-Aware Masking
* Efficient PSInSAR Method for High-Density Urban Areas Based on Regular Grid Partitioning and Connected Component Constraints, An
* Efficient Retinex-Based Framework for Low-Light Image Enhancement Without Additional Networks
* Efficient RGBT Tracking via Multi-Path Mamba Fusion Network
* Efficient Vehicle Selection and Resource Allocation for Knowledge Distillation-Based Federated Learning in UAV-Assisted VEC
* Efficient Video Object Segmentation via Modulated Cross-Attention Memory
* EfficientCrackNet: A Lightweight Model for Crack Segmentation
* EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation
* EFFICIENTMORPH: Parameter-Efficient Transformer-Based Architecture for 3D Image Registration
* Ego-VPA: Egocentric Video Understanding with Parameter-Efficient Adaptation
* EgoCast: Forecasting Egocentric Human Pose in the Wild
* Egocentric and exocentric methods: A short survey
* EgoPoints: Advancing Point Tracking for Egocentric Videos
* EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos
* EI-Nexus: Towards Unmediated and Flexible Inter-Modality Local Feature Extraction and Matching for Event-Image Data
* Eigenpose: Occlusion-Robust 3D Human Mesh Reconstruction
* ElasticLaneNet: An Efficient Geometry-Flexible Lane Detection Framework
* ELBA: Learning by Asking for Embodied Visual Navigation and Task Completion
* Elemental Composite Prototypical Network: Few-Shot Object Detection on Outdoor 3D Point Cloud Scenes
* eLIR-Net: an Efficient AI Solution for Image Retouching
* ELMGS: Enhancing Memory and Computation Scalability Through coMpression for 3D Gaussian Splatting
* Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion Models
* Elucidating the Solution Space of Extended Reverse-Time SDE for Diffusion Models
* Emergence Model of Perception With Global-Contour Precedence Based on Gestalt Theory and Primary Visual Cortex
* Emotion Classification With Visibility Graphs
* EmoVOCA: Speech-Driven Emotional 3D Talking Heads
* EMS-SLAM: Dynamic RGB-D SLAM with Semantic-Geometric Constraints for GNSS-Denied Environments
* ENAF: A Multi-Exit Network with an Adaptive Patch Fusion for Large Image Super Resolution
* Encoder-Agnostic Weakly Supervised Method For Describing Textures, An
* End-to-End Target Speaker Speech Recognition Using Context-Aware Attention Mechanisms for Challenging Enrollment Scenario
* Endoscopic Scoring and Localization in Unconstrained Clinical Trial Videos
* Energy-based pseudo-label refining for source-free domain adaptation
* Energy-Concentrated Transform for Improved Time-Frequency Representation of Seismic Signals, An
* Enhanced Algorithm Based on Dual-Input Feature Fusion ShuffleNet for Synthetic Aperture Radar Operating Mode Recognition, An
* Enhanced CNN-BiLSTM-Attention Model for High-Precision Integrated Navigation During GNSS Outages
* Enhanced Compression Method for Medical Images Using SPIHT Encoder for Fog Computing, An
* Enhanced faster R-CNN based subcutaneous and visceral adipose tissue segmentation from abdominal MRI
* Enhanced generation of automatically labelled image segmentation datasets by advanced style interpreter deep architectures
* Enhanced Three-Dimensional Wind Retrieval Method Based on Genetic Algorithm-Particle Swarm Optimization for Coherent Doppler Wind Lidar, An
* Enhancing Aquifer Reliability and Resilience Assessment in Data-Scarce Regions Using Satellite Data: Application to the Chao Phraya River Basin
* Enhancing brain tumor classification in MRI images: A deep learning-based approach for accurate diagnosis
* Enhancing Crop Type Mapping in Data-Scarce Regions Through Transfer Learning: A Case Study of the Hexi Corridor
* Enhancing Crop Yield Estimation in Spinach Crops Using Synthetic Aperture Radar-Derived Normalized Difference Vegetation Index: A Sentinel-1 and Sentinel-2 Fusion Approach
* Enhancing Distributed Source Coding With Encoder-Centric Frequency Adaptation and Spatial Transformation
* Enhancing Embodied Object Detection with Spatial Feature Memory
* Enhancing Image Layout Control with Loss-Guided Diffusion Models
* Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks
* Enhancing Multi-Flight Unmanned-Aerial-Vehicle-Based Detection of Wheat Canopy Chlorophyll Content Using Relative Radiometric Correction
* Enhancing Novel Object Detection via Cooperative Foundational Models
* Enhancing Predictive Imaging Biomarker Discovery Through Treatment Effect Analysis
* Enhancing Real-Time Object Detection With Optical Flow-Guided Streaming Perception
* Enhancing Real-World Active Speaker Detection With Multi-Modal Extraction Pre-Training
* Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge
* Enhancing Skin Disease Diagnosis: Interpretable Visual Concept Discovery with SAM
* Enhancing the Collaborative Decision-Making Performance of Connected and Autonomous Vehicles: A Multi-Modal Failure-Aware Graph Representation Approach
* Enhancing trust in Large Language Models for streamlined decision-making in military operations
* Enhancing Tumor Edge Consistency in Multimodal MRI Synthesis for Improved Glioma Segmentation
* Enhancing Urban Flood Susceptibility Assessment by Capturing the Features of the Urban Environment
* Enhancing Urban Understanding Through Fine-Grained Segmentation of Very-High-Resolution Aerial Imagery
* Enhancing Vision-Language Few-Shot Adaptation with Negative Learning
* Enhancing Visual Classification Using Comparative Descriptors
* Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer
* Enriched Image Captioning Based on Knowledge Divergence and Focus
* Enriching Local Patterns with Multi-Token Attention for Broad-Sight Neural Networks
* Environmental Influence on NbS (Nature-Based Solution) Mitigation of Diurnal Surface Urban Heat Islands (SUHI)
* Epipolar Attention Field Transformers for Bird's Eye View Semantic Segmentation
* Equivariance-Based Markov Decision Process for Unsupervised Point Cloud Registration
* Equivariant Diffusion Model With A5-Group Neurons for Joint Pose Estimation and Shape Reconstruction
* ERM++: An Improved Baseline for Domain Generalization
* ERUP-YOLO: Enhancing Object Detection Robustness for Adverse Weather Condition by Unified Image-Adaptive Processing
* Estimating Soil Attributes for Yield Gap Reduction in Africa Using Hyperspectral Remote Sensing Data with Artificial Intelligence Methods: An Extensive Review and Synthesis
* Estimating Spatiotemporal Dynamics of Carbon Storage in Robinia pseudoacacia Plantations in the Caijiachuan Watershed Using Sample Plots and Uncrewed Aerial Vehicle-Borne Laser Scanning Data
* Estimating the Effects of Natural and Anthropogenic Activities on Vegetation Cover: Analysis of Zhejiang Province, China, from 2000 to 2022
* Estimation of Chlorophyll Content at Stand and Individual Tree Level by UAV Hyperspectral Combined with LiDAR
* Estimation of surface all-wave net radiation from MODIS data using deep residual neural network based on limited samples
* Evacuation Behavioural Instructions with 3D Motions: Insights from Three Use Cases
* Evaluating Agreement Between Global Satellite Data Products for Forest Monitoring in Madagascar
* Evaluating Arctic Thin Ice Thickness Retrieved from Latest Version of Multisource Satellite Products
* Evaluating saliency scores in point clouds of natural environments by learning surface anomalies
* Evaluating Sensitivity Consistency of Explanations
* Evaluating Shallow Landslide Prediction Mapping by Using Two Different GIS-Based Models: 4SLIDE and SHALSTAB
* Evaluation of an Infrastructure-Based Warning System: A Case Study on Roundabout Driving Behaviors
* Evaluation of Different Methods for Retrieving Temperature and Humidity Profiles in the Lower Atmosphere Using the Atmospheric Sounder Spectrometer by Infrared Spectral Technology
* Evaluation of Modified Reflection Symmetry Decomposition Polarization Features for Sea Ice Classification
* Event-Guided Fusion-Mamba for Context-Aware 3D Human Pose Estimation
* Event-Guided Low-Light Video Semantic Segmentation
* Event-Guided Video Transformer for End-to-End 3D Human Pose Estimation
* Event-Triggered Hybrid Consensus Filter for Distributed Extended Object Tracking, An
* Event-Triggered-Based Distributed Formation Cooperative Tracking Control of Under-Actuated Unmanned Surface Vehicles With Input and State Quantization
* EvoCL: Continual Learning over Evolving Domains
* Evolution and Attribution of Flood Volume in the Source Region of the Yellow River
* Evolutionary History of the Large-Scale Scarp in Jules Verne Crater, Moon
* Exemplar-free class incremental action recognition based on self-supervised learning
* Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos
* Experimental Analysis of the Temporal Variability of the V2V Channel at 5.9 GHz
* Experimental Study on Drivers' Eye Movement Behavior When Using an Automated Lane Change System, An
* Explainability Feature Bands Adaptive Selection for Hyperspectral Image Classification
* Explainable Deep Learning-Enabled Malware Attack Detection for IoT-Enabled Intelligent Transportation Systems
* Explainable Machine Learning Model for Predicting Macroseismic Intensity for Emergency Management, An
* Explainable Spatio-Temporal Inference Network for Car-Sharing Demand Prediction
* Explicit Guidance for Robust Video Frame Interpolation Against Discontinuous Motions
* Explicit Motion Handling and Interactive Prompting for Video Camouflaged Object Detection
* Explicitly Disentangling and Exclusively Fusing for Semi-Supervised Bi-Modal Salient Object Detection
* Exploiting Inter-Sample Information for Long-Tailed Out-of-Distribution Detection
* Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
* Exploratory Driving Performance and Car-Following Modeling for Autonomous Shuttles Based on Field Data
* Exploring dynamic plane representations for neural scene reconstruction
* Exploring Homogeneous and Heterogeneous Consistent Label Associations for Unsupervised Visible-Infrared Person ReID
* Exploring Non-Matching Multiple References for Speech Quality Assessment
* Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization
* Exploring the Causes of Severe Fluctuations in Water Surface Area Using Water Index and Structural Equation Modeling: Evidence from Ebinur Lake, China
* Exploring the Effectiveness of Fusing Synchronous/Asynchronous Airborne Hyperspectral and LiDAR Data for Plant Species Classification in Semi-Arid Mining Areas
* Exploring the Stability Gap in Continual Learning: The Role of the Classification Head
* Exploring the Use of Data in a Digital Twin for the Marine and Coastal Environment
* Exploring the Vegetation Changes in Poyang Lake Wetlands: Succession and Key Drivers over Past 30 Years
* Exploring Token-Level Augmentation in Vision Transformer for Semi-Supervised Semantic Segmentation
* Extended excitation backprop with gradient weighting: A general visualization solution for understanding heterogeneous face recognition
* Extended Object Tracking Using an Orientation Vector Based on Constrained Filtering
* Extensions in channel and class dimensions for attention-based knowledge distillation
* Extinction Coefficient Inversion Algorithm with New Boundary Value Estimation for Horizontal Scanning Lidar
* Extracting High-Discriminative Features for Detecting Double JPEG Compression With the Same Quantization Matrix
* Eye-SCAN: Eye-Movement-Attention-based Spatial Channel Adaptive Network for traffic accident prediction
* F2FLDM: Latent Diffusion Models with Histopathology Pre-Trained Embeddings for Unpaired Frozen Section to FFPE Translation
* F2former: When Fractional Fourier Meets Deep Wiener Deconvolution and Selective Frequency Transformer for Image Deblurring
* Face Anonymization Made Simple
* Facial Expression Recognition with Controlled Privacy Preservation and Feature Compensation
* Fair Domain Generalization with Heterogeneous Sensitive Attributes Across Domains
* FAIR-TAT: Improving Model Fairness Using Targeted Adversarial Training
* Fairer Analysis and Demographically Balanced Face Generation for Fairer Face Verification
* Fake News Detection using Hashtag Context
* FALCON: Fair Face Recognition via Local Optimal Feature Normalization
* FAM-LSTM: Predicting Macroscopic Pedestrian Dynamics Through Data-Driven Method
* Far-Field Earthquake-Induced Crustal Deformation and Mud Volcano Activity in Azerbaijan Based on the InSAR Technique
* FarmSeg_VLM: A farmland remote sensing image segmentation method considering vision-language alignment
* Fast 3D Breast Imaging With a Transmission-Based Microwave System
* Fast and Accurate Direct Position Estimation Using Low-Complexity Correlation and Swarm Intelligence Optimization
* Fast Reinforcement Learning for Resource Optimization in Dynamic Vehicular Communications
* Fast Satellite Selection Method Based on the Multi-Strategy Fusion Grey Wolf Optimization Algorithm for Low Earth Orbit Satellites, A
* Fast UAV Object-Searching in Large-Scale and Complex Environments
* FASTEN: Video Event Localization Based on Audio-Visual Feature Alignment and Multi-Scale Temporal Enhancement
* FASTER: A Font-Agnostic Scene Text Editing and Rendering Framework
* FasterSal: Robust and Real-Time Single-Stream Architecture for RGB-D Salient Object Detection
* FastTalker: Real-time audio-driven talking face generation with 3D Gaussian
* FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing
* FaTNET: Feature-alignment transformer network for human pose transfer
* Fault-Adaptive Traffic Demand Estimation Using Network Flow Dynamics
* FaVoR: Features via Voxel Rendering for Camera Relocalization
* FDS: Feedback-Guided Domain Synthesis with Multi-Source Conditional Diffusion Models for Domain Generalization
* Feasibility of Federated Learning from Client Databases with Different Brain Diseases and MRI Modalities
* Feature Augmentation Based Test-Time Adaptation
* Feature Design for Bridging SAM and CLIP Toward Referring Image Segmentation
* Feature Fusion Transferability Aware Transformer for Unsupervised Domain Adaptation
* Feature Multi-Scale Enhancement and Adaptive Dynamic Fusion Network for Infrared Small Target Detection
* Feature Space Perturbation: A Panacea to Enhanced Transferability Estimation
* Feature-Attention-Mechanism-Based Attack for Deep Robust Watermarking
* Feature-Guided Instance Mining and Task-Aligned Focal Loss for Weakly Supervised Object Detection in Remote Sensing Images
* Feature-Level and Spatial-Level Activation Expansion for Weakly-Supervised Semantic Segmentation
* Feature-PLPD: Feature-Point and Line Points Detection for Real-Time Embedded Visual Odometry-Based Systems
* Feature-Reinforced Ensemble Learning Framework for Space-Based DEM Correction, A
* Federated Source-Free Domain Adaptation for Classification: Weighted Cluster Aggregation for Unlabeled Data
* Federated Voxel Scene Graph for Intracranial Hemorrhage
* Federated-Continual Dynamic Segmentation of Histopathology Guided by Barlow Continuity
* Feedback Attention to Enhance Unsupervised Deep Learning Image Registration in 3D Echocardiography
* FER-Former: Multimodal Transformer for Facial Expression Recognition
* Few Annotated Pixels and Point Cloud Based Weakly Supervised Semantic Segmentation of Driving Scenes
* Few-shot object detection via synthetic features with optimal transport
* Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks
* FILP-3D: Enhancing 3D few-shot class-incremental learning with pre-trained vision-language models
* Finding Dino: A Plug-and-Play Framework for Zero-Shot Detection of Out-of-Distribution Objects Using Prototypes
* Fine-grained Controllable Video Generation via Object Appearance and Context
* Fine-Grained Spatial and Verbal Losses for 3D Visual Grounding
* Fine-Resolution Satellite Remote Sensing Improves Spatially Distributed Snow Modeling to Near Real Time
* Fine-Tuning Image-Conditional Diffusion Models is Easier than you Think
* FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection
* FineView Dataset: A 3D Scanned Multi-View Object Dataset of Fine-Grained Category Instances, The
* Finite-Time Learning-Based Optimal Elliptical Encircling Control for UAVs With Prescribed Constraints
* First-Arrival Tomography for Mountain Tunnel Hazard Assessment Using Unmanned Aerial Vehicle Seismic Source and Enhanced by Supervirtual Interferometry
* FitDiff: Robust Monocular 3D Facial Shape and Reflectance Estimation using Diffusion Models
* Five problems with color constancy metrics: discussion
* Fixed-Time Tracking Control of 3-D Collaborative Double Boom Cranes With Obstacle Avoidance and Prescribed Performance
* FLAIR: A Conditional Diffusion Framework with Applications to Face Video Restoration
* FlashMix: Fast Map-Free LiDAR Localization via Feature Mixing and Contrastive-Constrained Accelerated Training
* FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding
* Flatness Improves Backbone Generalisation in Few-Shot Classification
* FLDet: Faster and Lighter Aerial Object Detector
* Flexible Temperature Parallel Distillation for Dense Object Detection: Make Response-Based Knowledge Distillation Great Again
* Flight Trajectory Control With Network-Oriented Hierarchical Reinforcement Learning for UAVs-Assisted Data Time-Sensitive IoT
* Flowering Time Prediction of Wheat From DIA-MS Data
* FluoNeRF: Fluorescent Novel-View Synthesis Under Novel Light Source Colors
* FM-RTDETR: Small Object Detection Algorithm Based on Enhanced Feature Fusion With Mamba
* FMD: Comprehensive Data Compression in Medical Domain via Fused Matching Distillation
* FN-NET: Adaptive data augmentation network for fine-grained visual categorization
* FocusCLIP: Focusing on Anomaly Regions by Visual-Text Discrepancies
* Focusing on what to Decode and what to Train: SOV Decoding with Specific Target Guided De-Noising and Vision Language Advisor
* FoodMem: Near real-time and precise food video segmentation
* FOR: Finetuning for Object Level Open Vocabulary Image Retrieval
* Force Observer-Based Motion Adaptation and Adaptive Neural Control for Robots in Contact With Unknown Environments
* Forecasting Chlorophyll-a in the Murray-Darling Basin Using Remote Sensing
* Forecasting Cumulonimbus Clouds: Evaluation of New Operational Convective Index Using Lightning and Precipitation Data
* Forensic Iris Image-Based Post-Mortem Interval Estimation
* Forgery-Aware Adaptive Learning With Vision Transformer for Generalized Face Forgery Detection
* Formal Quantification of Spatially Differential Characteristics of PSI-Derived Vertical Surface Deformation Using Regular Triangle Network: A Case Study of Shixi in the Northwest Xuzhou Coalfield
* Formation of Lunar Swirls: Implication from Derived Nanophase Iron Abundance
* Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering
* Foundation X: Integrating Classification, Localization, and Segmentation Through Lock-Release Pretraining Strategy for Chest X-Ray Analysis
* Fourth-Order Dimension Preserved Tensor Completion With Temporal Constraint for Missing Traffic Data Imputation
* Frame by Familiar Frame: Understanding Replication in Video Diffusion Models
* framework for global role-based author name disambiguation, A
* FRAUD-Net: Fraud News Detection Using Sample Uncertainty & Domain Aware Generalized Network
* FreeMix: Open-Vocabulary Domain Generalization of Remote-Sensing Images for Semantic Segmentation
* Frequency Domain Low Complexity Chase Reed-Solomon Decoding Under Blind Recognition Prior Conditions
* Frequency Domain-Based Cross-Layer Feature Aggregation Network for Camouflaged Object Detection
* Frequency-Domain Refinement of Vision Transformers for Robust Medical Image Segmentation Under Degradation
* Frequency-Spatial-Temporal Domain Fusion Network for Remote Sensing Image Change Captioning
* From rice planting area mapping to rice agricultural system mapping: A holistic remote sensing framework for understanding China's complex rice systems
* From Space to Stream: Combining Remote Sensing and In Situ Techniques for Comprehensive Stream Health Assessment
* From Visual Explanations to Counterfactual Explanations with Latent Diffusion
* From visual features to key concepts: A Dynamic and Static Concept-driven approach for video captioning
* FSMT: Few-Shot Object Detection via Multi-Task Decoupled
* FT2TF: First-Person Statement Text-to-Talking Face Generation
* FUN-AD: Fully Unsupervised Learning for Anomaly Detection with Noisy Training Data
* Fusion Method Based on Physical Modes and Satellite Remote Sensing for 3D Ocean State Reconstruction, A
* Fusion Regression
* Fuzzy-Based Optimal Control for an Underactuated Surface Vessel With User-Specified Performance
* GACraterNet: A collaborative geometry-attribute domain network for enhanced detection of Martian impact craters
* Gait Recognition in the Wild: A Large-Scale Benchmark and NAS-Based Baseline
* Gaitcloud: Leveraging Spatial-Temporal Information for Lidar-Based Gait Recognition With a True-3D Gait Representation
* GaitContour: Efficient Gait Recognition Based on a Contour-Pose Representation
* Game-Based Event-Triggered Control for Unmanned Surface Vehicle: Algorithm Design and Harbor Experiment
* GANESH: Generalizable NeRF for Lensless Imaging
* GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space
* GAT-LSTM: A feature point management network with graph attention for feature-based visual SLAM in dynamic environments
* GAUDA: Generative Adaptive Uncertainty-Guided Diffusion-Based Augmentation for Surgical Segmentation
* GauFRe: Gaussian Deformation Fields for Real-Time Dynamic Novel View Synthesis
* Gauging-δ: A Non-Parametric Hierarchical Clustering Algorithm
* Gaussian Déjà-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization Abilities
* GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation
* GazeSearch: Radiology Findings Search Benchmark
* GCESS: A two-phase generative learning framework for estimate molecular expression to cell detection and analysis
* General Class-Balanced Multicentric Dynamic Prototype Pseudo-Labeling for Source-Free Domain Adaptation
* Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models
* Generalizable person re-identification method using bi-stream interactive learning with feature reconstruction
* Generalizable Single-Source Cross-Modality Medical Image Segmentation via Invariant Causal Mechanisms
* Generalizable Single-View Object Pose Estimation by Two-Side Generating and Matching
* Generalized Ambiguity Function for Bistatic FDA Radar Joint Velocity, Range, and Angle Parameters
* Generalized Robot Vision-Language Model via Linguistic Foreground-Aware Contrast
* Generalized Spatiotemporally Weighted Boosted Regression to Predict the Occurrence of Grassland Fires in the Mongolian Plateau, A
* Generalized Tensor Formulation for Hyperspectral Image Super-Resolution Under General Spatial Blurring, A
* GeneralizeFormer: Layer-Adaptive Model Generation Across Test-Time Distribution Shifts
* Generating Large-Scale Origin-Destination Matrix via Progressive Growing Generative Adversarial Networks Model
* Generating Long-Take Videos via Effective Keyframes and Guidance
* Generating Multi-Center Classifier via Conditional Gaussian Distribution
* Generating Visual Explanations from Deep Networks Using Implicit Neural Representations
* Generating visual-adaptive audio representation for audio recognition
* Generation of Complex 3D Human Motion by Temporal and Spatial Composition of Diffusion Models
* Generative compositor for few-shot visual information extraction
* Generative Model-Based Fusion for Improved Few-Shot Semantic Segmentation of Infrared Images
* Generic Vehicle-to-Sensor Calibration Framework, A
* GeoDiffuser: Geometry-Based Image Editing with Diffusion Models
* Geographic Distribution of Lung and Bronchus Cancer Mortality and Elevation in the United States: Exploratory Spatial Data Analysis and Spatial Statistics
* Geographic Object-Oriented Analysis of UAV Multispectral Images for Tree Distribution Mapping in Mangroves
* Geographically Constrained Machine Learning-Based Kernel-Driven Method for Downscaling of All-Weather Land Surface Temperature
* Geographically Weighted Random Forest Based on Spatial Factor Optimization for the Assessment of Landslide Susceptibility
* GeoGuide: Geometric Guidance of Diffusion Models
* Geometrical preservation and correlation learning for multi-source unsupervised domain adaptation
* Geometry and Topology Correction of 3D Building Models with Fragmented and Disconnected Components
* Geometry-Aware Deep Learning for 3D Skeleton-Based Motion Prediction
* Geometry-Aware RWKV for Heterogeneous Light Field Spatial Super-Resolution
* GeoPos: A Minimal Positional Encoding for Enhanced Fine-Grained Details in Image Synthesis Using Convolutional Neural Networks
* Georeferencing Building Information Models for BIM/GIS Integration: A Review of Methods and Tools
* Geospatial Analysis of Regional Disparities in Non-Grain Cultivation: Spatiotemporal Patterns and Driving Mechanisms in Jiangsu, China
* Geospatial Approach to Assess Flash Flood Vulnerability in a Coastal District of Bangladesh: Integrating the Multifaceted Dimension of Vulnerabilities
* Geospatial Framework for Assessing the Suitability and Demand for Agricultural Digital Solutions in Europe: A Tool for Informed Decision-Making
* Geospatial Livestock-Carrying Capacity Model (GLCC) in the Akmola Oblast, Kazakhstan, A
* GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling
* GET3DGS: Generate 3D Gaussians Based on Points Deformation Fields
* GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-Grained Video-Language Learning
* GHOST: Grounded Human Motion Generation with Open Vocabulary Scene-and-Text Contexts
* GL-MCM: Global and Local Maximum Concept Matching for Zero-Shot Out-of-Distribution Detection
* Global Distribution and Local Variation of Pre-Rain Green-Up in Tropical Dryland
* Global Navigation Satellite System-Based Deformation Monitoring of Hydraulic Structures Using a Gated Recurrent Unit-Attention Mechanism
* Global Optical and SAR Image Registration Method Based on Local Distortion Division
* Global SAR Spectral Analysis of Intermediate Ocean Waves: Statistics and Derived Real Aperture Radar Modulation
* Global Variability and Future Projections of Marine Heatwave Onset and Decline Rates
* Global-Guided Focal Neural Radiance Field for Large-Scale Scene Rendering
* Global-local information sensitivity adjustment factor
* GlobalDoc: A Cross-Modal Vision-Language Framework for Real-World Document Image Retrieval and Classification
* GMT: Guided Mask Transformer for Leaf Instance Segmentation
* GMTNet: Dense Object Detection via Global Dynamically Matching Transformer Network
* GNSS Precipitable Water Vapor Prediction for Hong Kong Based on ICEEMDAN-SE-LSTM-ARIMA Hybrid Model
* Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models
* Gradient-based class weighting for unsupervised domain adaptation in dense prediction visual tasks
* Gradient-Guided Joint Representation Loss With Adaptive Neck for Train Crash Detection
* Granular Ball K-Class Twin Support Vector Classifier
* Granular-ball computing-based Random Walk for anomaly detection
* Graph Foundation Models: Concepts, Opportunities and Challenges
* Graph-Jigsaw Conditioned Diffusion Model for Skeleton-Based Video Anomaly Detection
* Grid Partition-Based Dynamic Spatial-Temporal Graph Convolutional Network for Large-Scale Traffic Flow Forecasting
* Grid-Based Hierarchical Representation Method for Large-Scale Scenes Based on Three-Dimensional Gaussian Splatting, A
* GroundingMate: Aiding Object Grounding for Goal-Oriented Vision-and-Language Navigation
* Group commonality graph: Multimodal pedestrian trajectory prediction via deep group features
* Growing-before-pruning: A progressive neural architecture search strategy via group sparsity and deterministic annealing
* GStex: Per-Primitive Texturing of 2D Gaussian Splatting for Decoupled Appearance and Geometry Modeling
* GTA-HDR: A Large-Scale Synthetic Dataset for HDR Image Reconstruction
* GTransPDM: A Graph-Embedded Transformer With Positional Decoupling for Pedestrian Crossing Intention Prediction
* Guardian of the Ensembles: Introducing Pairwise Adversarially Robust Loss for Resisting Adversarial Attacks in DNN Ensembles
* Guess Future Anomalies from Normalcy: Forecasting Abnormal Behavior in Real-World Videos
* Guided progressive learning for room layout estimation: From pixel-level embeddings to refined depth maps
* HandCraft: Anatomically Correct Restoration of Malformed Hands in Diffusion Generated Images
* Harmonizing Attention: Training-free Texture-aware Geometry Transfer
* Harris Hawks Optimization for Soil Water Content Estimation in Ground-Penetrating Radar Waveform Inversion
* HATNet: EEG-Based Hybrid Attention Transfer Learning Network for Train Driver State Detection
* Hausdorff Distance Matching with Adaptive Query Denoising for Rotated Detection Transformer
* HDPNet: Hourglass Vision Transformer with Dual-Path Feature Pyramid for Camouflaged Object Detection
* Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition
* HeightLane: BEV Heightmap Guided 3D Lane Detection
* HeightMapNet: Explicit Height Modeling for End-to-End HD Map Learning
* Hessian-Aware Zeroth-Order Optimization
* Heterogeneity Analysis of Resident Demands and Public Service Facilities in Megacities of China from the Perspective of Urban Health Examination
* Heterogeneous Correlation Aware Regularization for Sequential Confidence Calibration
* Heterogeneous Datasets for Unsupervised Image Anomaly Detection
* Heterogeneous Generative Tokens and Distance-Aware Recovery Network for Occluded Person Re-Identification
* HEX: Hierarchical Emergence Exploitation in Self-Supervised Algorithms
* HexaGen3D: StableDiffusion is One Step Away from Fast and Diverse Text-to-3D Generation
* HFEF2-YOLO: Hierarchical Dynamic Attention for High-Precision Multi-Scale Small Target Detection in Complex Remote Sensing
* HGAT and Multi-Agent RL-Based Method for Multi-Intersection Traffic Signal Control
* Hierarchical boundary feature alignment network for video salient object detection
* Hierarchical Context Measurement Network for Single Hyperspectral Image Super-Resolution
* Hierarchical diffusion models for generating various pattern vehicles in infrared aerial images
* Hierarchical Frequency-Based Upsampling and Refining for HEVC Compressed Video Enhancement
* Hierarchical Light Transformer Ensembles for Multimodal Trajectory Forecasting
* Hierarchical Motion-Enhanced Matching Framework for Few-Shot Action Recognition
* Hierarchical Prescribed-Time Platoon Control for Heterogeneous Vehicles With Actuator Faults and System Uncertainties
* Hierarchy-Based Diagram-Sentence Matching on Dual-Modal Graphs
* High Resolution Crop Type and Rotation Mapping in Farming-Pastoral Ecotone in China Using Multi-Satellite Imagery and Google Earth Engine
* High Resolution Spatially Consistent Global Dataset for CO2 Monitoring, A
* High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer
* High-Order State Space Model for Multi-Modal Accelerated MRI Reconstruction
* High-Pass Kernel Prediction for Efficient Video Deblurring
* High-Precision Multi-Source Fusion Navigation Solutions for Complex and Dynamic Urban Environments
* High-Precision Satellite Clock Offset Estimated by SRIF Based on Epoch-Wise Updated Orbit
* High-Resolution Maps of Left Atrial Displacements and Strains Estimated With 3D Cine MRI Using Online Learning Neural Networks
* High-Resolution, Low-Latency Multi-Satellite Precipitation Merging by Correcting with Weather Radar Network Data
* Highlighting illumination color by three-dimensional perception of scene moderates color constancy decline by systematic surface color change in stimulus background
* Hijacking Vision-and-Language Navigation Agents with Adversarial Environmental Attacks
* Hilbert Transform on Graphs: Let There Be Phase
* Histo-Genomic Knowledge Association for Cancer Prognosis From Histopathology Whole Slide Images
* Historical Coast Snaps: Using Centennial Imagery to Track Shoreline Change
* Holistic High-Resolution Remote Sensing Approach for Mapping Coastal Geomorphology and Marine Habitats, A
* HOPE: A Memory-Based and Composition-Aware Framework for Zero-Shot Learning with Hopfield Network and Soft Mixture of Experts
* HOPE: A Reinforcement Learning-Based Hybrid Policy Path Planner for Diverse Parking Scenarios
* How Accurately and in What Detail Can Land Use and Land Cover Be Mapped Using Copernicus Sentinel and LUCAS 2022 Data?
* HSDA: High-Frequency Shuffle Data Augmentation for Bird's-Eye-View Map Segmentation
* HTACPE: A Hybrid Transformer With Adaptive Content and Position Embedding for Sample Learning Efficiency of Hyperspectral Tracker
* HUPE: Heuristic Underwater Perceptual Enhancement with Semantic Collaborative Learning
* Hybrid Attention Transformers with fast Fourier convolution for light field image super-resolution
* Hybrid Filtering Technique for Accurate GNSS State Estimation
* Hybrid Learning Model of Global-Local Graph Attention Network and XGBoost for Inferring Origin-Destination Flows
* Hybrid Model for Classification of Skin Cancer Images After Segmentation, A
* Hybrid Path Tracking Control for Autonomous Trucks: Integrating Pure Pursuit and Deep Reinforcement Learning With Adaptive Look-Ahead Mechanism
* HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors
* Hyperdimensional Representation for Adaptive Information Association and Memorization
* Hypergraph Based Contextual Relationship Modeling Method for Multimodal Emotion Recognition in Conversation, A
* Hyperspectral Image Classification Using a Multi-Scale CNN Architecture with Asymmetric Convolutions from Small to Large Kernels
* Hyperspectral image classification using hybrid convolutional-based cross-patch retentive network
* Hyperspectral Image Reconstruction Based on Blur-Kernel-Prior and Spatial-Spectral Attention
* Hyperspectral image restoration via the collaboration of low-rank tensor denoising and completion
* I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi-Mask Inpainting
* I Spy with My Little Eye a Minimum Cost Multicut Investigation of Dataset Frames
* I3D-AE-LSTM: A 2-Stream Autoencoder for Action Quality Assessment Using a Newly Created Cricket Batsman Video Dataset
* I3Net: Intensive information interaction network for RGB-T salient object detection
* IceBench: A Benchmark for Deep-Learning-Based Sea-Ice Type Classification
* Identification of Exposed Beachrocks on South China Sea Islands Based on UAV Images, The
* Identify Backdoored Model in Federated Learning via Individual Unlearning
* Identifying the Pockets Most Affected by Temperature Rise and Evaluating the Repercussions on Urban Communities and Their Agricultural Lands
* Identity Curvature Laplace Approximation for Improved Out-of-Distribution Detection
* Identity-aware infrared person image generation and re-identification via controllable diffusion model
* iESTA: Instance-Enhanced Spatial-Temporal Alignment for Video Copy Localization
* IFShip: Interpretable fine-grained ship classification with domain knowledge-enhanced vision-language models
* IIAG-CoFlow: Inter- and Intra-Channel Attention Transformer and Complete Flow for Low-Light Image Enhancement With Application to Night Traffic Monitoring Images
* Image Adaptation for Colour Vision Deficient Viewers Using Vision Transformers
* Image Forgery Localization With State Space Models
* Image is Worth Multiple Words: Multi-Attribute Inversion for Constrained Text-To-Image Synthesis, An
* Image Synthesis Under Limited Data: A Survey and Taxonomy
* Image Terrain Map Model for Texture Filtering, An
* Image-Caption Encoding for Improving Zero-Shot Generalization
* Image-Level Regression for Uncertainty-Aware Retinal Image Segmentation
* Image-text feature learning for unsupervised visible-infrared person re-identification
* Imaging of Leaf Water Patterns of Vitis vinifera Genotypes Infected by Plasmopara viticola
* Imbalanced Medical Image Segmentation With Pixel-Dependent Noisy Labels
* Impact of Climate Change on Oriental Migratory Locust Suitability: A Multi-Source Data and MaxEnt-Based Analysis in Hainan Island
* Impact of field of view on color constancy in virtual reality
* Impact of Mesoscale Eddies on Surface and Subsurface Sound Channels in the Kuroshio Extension, The
* Impact of Residents' Daily Internet Activities on the Spatial Distribution of Online Fraud: An Analysis Based on Mobile Phone Application Usage, The
* Impact of the Densest and Highest-Capacity Reservoirs on the Ecological Environment in the Upper Yellow River Basin of China: From 2000 to 2020, The
* Impact of the May 2024 Extreme Geomagnetic Storm on the Ionosphere and GNSS Positioning
* Impacts of Climate Change and Human Activities on Vegetation Productivity in China
* Impersonation Attack Using Quantum Shor's Algorithm Against Blockchain-Based Vehicular Ad-Hoc Network
* Implicit Image-to-Image Schrödinger Bridge for image restoration
* Importance of hue: color discrimination of three-dimensional objects and two-dimensional discs
* Importance of hue: the effect of saturation on hue-chroma asymmetries
* Importance of Spectral Information, Seasonality, and Topography on Land Cover Classification of Tropical Land Cover Mapping
* Importance-Guided Interpretability and Pruning for Video Transformers in Driver Action Recognition
* Improved Encoder-Decoder Architecture With Human-Like Perception Attention for Monaural Speech Enhancement
* Improved Fading Factor-Based Adaptive Robust Filtering Algorithm for SINS/GNSS Integration with Dynamic Disturbance Suppression, An
* Improved Point Cloud Filtering Algorithm Applies in Complex Urban Environments, An
* Improved Virtual Vehicles Design for On-Ramp Cooperative Merging
* Improving Aboveground Biomass Estimation in Beech Forests with 3D Tree Crown Parameters Derived from UAV-LS
* Improving Accuracy and Generalization for Efficient Visual Tracking
* Improving Adversarial Training From the Perspective of Class-Flipping Distribution
* Improving Conditional Diffusion Models through Re-Noising from Unconditional Diffusion Priors
* Improving Deep Detector Robustness via Detection-Related Discriminant Maximization and Reorganization
* Improving Detail in Pluralistic Image Inpainting with Feature Dequantization
* Improving Faithfulness of Text-to-Image Diffusion Models through Inference Intervention
* Improving imbalanced medical image classification through GAN-based data augmentation methods
* Improving model generalization by on-manifold adversarial augmentation in the frequency domain
* Improving Pelvic MR-CT Image Alignment with Self-Supervised Reference-Augmented Pseudo-CT Generation Framework
* Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling
* Improving the Accuracy of Soil Classification by Using Vis-NIR, MIR, and Their Spectra Fusion
* Improving the Regional Precipitation Simulation Corrected by Satellite Observation Using Quantile Mapping
* Improving Uncertainty Estimation with Confidence-Aware Training Data
* Improving Video Moment Retrieval by Auxiliary Moment-Query Pairs With Hyper-Interaction
* Improving visual grounding in remote sensing images with adaptive modality guidance
* Improving Visual Object Tracking Through Visual Prompting
* Improving Zero-Shot Object-Level Change Detection by Incorporating Visual Correspondence
* Incorporating Task Progress Knowledge for Subgoal Generation in Robotic Manipulation through Image Edits
* Incremental structural adaptation for camouflaged object detection
* InDistill: Information flow-preserving knowledge distillation for model compression
* Individual and Combined Effects of Natural-Human Factors on Forest Fire Frequency in Northeast China, The
* Indoor scene multi-object tracking based on region search and memory buffer pool
* Infant Action Generative Modeling
* Inference Calibration of Vision-Language Foundation Models for Zero-Shot and Few-Shot Learning
* Inferring Past Human Actions in Homes with Abductive Reasoning
* Influence of chromatic properties of background on color constancy for a two-dimensional stimulus
* Influence of Digital Elevation Model Resolution on the Normalized Stream Length-Gradient Index in Intraplate Regions: A Case Study of the Yangsan Fault, Korea
* Influence of Groundwater Management on Land Subsidence Patterns in the Metropolitan Region of Guatemala City: A Multi-Temporal InSAR Analysis, The
* Influences of Environmental Factors on the Microwave Scattering Coefficient from the Sea Surface, The
* Influences of Sampling Design and Model Selection on Predictions of Chemical Compounds in Petroferric Formations in the Brazilian Amazon
* Influencing Factors and Paths of the Coupling Relationship Between Ecosystem Services Supply-Demand and Human Well-Being in the Hexi Regions, Northwest China
* Information Bottleneck Based Self-Distillation: Boosting Lightweight Network for Real-World Super-Resolution
* Information enhancement graph representation learning
* Information Extraction from Heterogeneous Documents Without Ground Truth Labels Using Synthetic Label Generation and Knowledge Distillation
* Information Theoretic Pruning of Coupled Channels in Deep Neural Networks
* Information-Guided Diffusion Model for Downscaling Land Surface Temperature from SDGSAT-1 Remote Sensing Images
* Infrared Small Target Detection Based on Entropy Variation Weighted Local Contrast Measure
* Infrared small target detection based on hypergraph and asymmetric penalty function
* Infrared thermal dense point clouds: A new frontier for remote landslide investigation
* Infrastructure Enabled Guided Navigation for Visually Impaired
* Inherent Trade-Offs Between the Conflicting Aspects of Designing the Compact Global Navigation Satellite System (GNSS) Anti-Interference Array
* Innovation and Research to Support Policies on Sustainable Development Goals: An Integrated ICT Platform for the Definition and Monitoring of Programs in Puglia Region, Italy
* Instance-Warp: Saliency Guided Image Warping for Unsupervised Domain Adaptation
* Instantaneous Frequency Estimation via Ridge Detection in Polynomial Time and Space
* Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-Identification
* Instructive3D: Editing Large Reconstruction Models with Text Instructions
* integrated algorithm to estimate chlorophyll-a concentration in various optical waters using HY-3A CZI, An
* Integrated GPR and Magnetometry Survey of the Roman Fort of Aquis Querquennis (Northwest Iberia), An
* Integrated Optimization of Order Processing and Robot Scheduling in Parts-to-Picker System
* Integrated subset selection and bandwidth estimation algorithm for geographically weighted regression
* Integrating end-to-end multimodal deep learning and domain adaptation for robust facial expression recognition
* Integrating large language models with explainable fuzzy inference systems for trusty steel defect detection
* Integrating LLMs With ITS: Recent Advances, Potentials, Challenges, and Future Directions
* Integrating Machine Learning and Geospatial Data for Mapping Socioeconomic Vulnerability to Urban Natural Hazard
* Integrating the Space of Reflectance Spectra
* Integration of Geospatial Data for the BIM-Based Inventory of a Skatepark: A Case Study, The
* Integration of Hyperspectral Imaging and AI Techniques for Crop Type Mapping: Present Status, Trends, and Challenges
* Integration of Optical and Microwave Satellite Data for Monitoring Vegetation Status in Sorghum Fields
* Intelligent Offloading Balance for Vehicular Edge Computing and Networks
* Intention-Aware Denoising Diffusion Model for Trajectory Prediction
* Interactive Decision-Making Integrating Graph Neural Networks and Model Predictive Control for Autonomous Driving
* Interactive Face Video Coding: A Generative Compression Framework
* Interactive Object Detection for Tiny Objects in Large Remotely Sensed Images
* Interpretable Multi-Sensor Fusion of Optical and SAR Data for GEDI-Based Canopy Height Mapping in Southeastern North Carolina
* Interval Type-2 T-S Fuzzy Robust Anti-Lock Braking Control for Electro-Mechanical Braking System Considering Road Uncertainty and Input Delay
* Invariant Shape Representation Learning for Image Classification
* Inverse Problems with Diffusion Models: A MAP Estimation Perspective
* Inverting the Generation Process of Denoising Diffusion Implicit Models: Empirical Evaluation and a Novel Method
* Investigating Imaging, Annotation and Self-Supervision for the Classification of Continuously Developing Cells in Histological Whole Slide Images
* Investigation of the Application of Measured Meteorological Observations in Real-Time Precise Point Positioning
* Investigation on LLMs' Visual Understanding Ability Using SVG for Image-Text Bridging, An
* InvisMark: Invisible and Robust Watermarking for AI-generated Image Provenance
* IPT-ILR: Image Pyramid Transformer Coupled With Information Loss Regularization for All-in-One Image Restoration
* IR-ADMDet: An Anisotropic Dynamic-Aware Multi-Scale Network for Infrared Small Target Detection
* IRIS-VIS: A New Dataset for Visibility Estimation in an Industrial Environment
* J-Invariant Volume Shuffle for Self-Supervised Cryo-Electron Tomogram Denoising on Single Noisy Volume
* Jeap-BiLSTM Neural Network for Action Recognition, A
* Jitter Error Correction for the HaiYang-3A Satellite Based on Multi-Source Attitude Fusion
* Joint Co-Speech Gesture and Expressive Talking Face Generation Using Diffusion with Adapters
* Joint Optimization Loss Function for Tiny Object Detection in Remote Sensing Images
* Joint Optimization of Carrier Frequency and PRF for Frequency Agile Radar Based on Compressed Sensing
* Joint Optimization of Electric Bus Infrastructure Planning, Fleet Composition, and Charging Schedule Considering Multi-Gun Charging and Compatibility
* Joint Style and Layout Synthesizing: Toward Generalizable Remote Sensing Semantic Segmentation
* Joint-Pixel Inversion for Ground Phase and Forest Height Estimation Using Spaceborne Polarimetric SAR Interferometry
* Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models
* KDC-MAE: Knowledge Distilled Contrastive Mask Auto-Encoder
* Keep and Extent: Unified Knowledge Embedding for Few-Shot Image Generation
* Kinship verification via Frequency Feature Decoupling and Fusion
* Knockoff Branch: Model Stealing Attack via Adding Neurons in the Pre-Trained Model
* Knowledge-Driven Framework for Anatomical Landmark Annotation in Laparoscopic Surgery
* Knowledge-enhanced and structure-enhanced representation learning for protein-ligand binding affinity prediction
* Label Calibration in Source Free Domain Adaptation
* Label Convergence: Defining an Upper Performance Bound in Object Recognition Through Contradictory Annotations
* Label-Augmented Dataset Distillation
* Lake Evolution and Its Response to Urban Expansion in Wuhan City in the Last Hundred Years Based on Historical Maps and Remote Sensing Images
* landmarks-assisted diffusion model with heatmap-guided denoising loss for high-fidelity and controllable facial image generation, A
* Landslide Identification in the Yuanjiang Basin of Northwestern Hunan, China, Using Multi-Temporal Polarimetric InSAR with Comparison to Single-Polarization Results
* Lane Detection for Autonomous Driving: Comprehensive Reviews, Current Challenges, and Future Predictions
* Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation
* Language-Image Consistency Augmentation and Distillation Network for visual grounding
* Large-Scale Hyperspectral Image-Projected Clustering via Doubly Stochastic Graph Learning
* Latency Robust Cooperative Perception Using Asynchronous Feature Fusion
* LATTECLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts
* Layerlink: Bridging remote sensing object detection and large vision models with efficient fine-tuning
* LCNet: Lightweight real-time image classification network based on efficient multipath dynamic attention mechanism and dynamic threshold convolution
* LD-Det: Lightweight Ship Target Detection Method in SAR Images via Dual Domain Feature Fusion
* LDINet: Latent decomposition-interpolation for single image fast-moving objects deblatting
* LDTrack: Dynamic People Tracking by Service Robots Using Diffusion Models
* LEAD: Learning-Enhanced Adaptive Decision-Making for Autonomous Driving in Dynamic Environments
* Learnable Prompting SAM-Induced Knowledge Distillation for Semi-Supervised Medical Image Segmentation
* Learning Anatomy-Disease Entangled Representation
* Learning Deep Illumination-Robust Features from Multispectral Filter Array Images
* Learning Distinguishable Degradation Maps for Unknown Image Super-Resolution
* Learning Dynamic-Sensitivity Enhanced Correlation Filter With Adaptive Second-Order Difference Spatial Regularization for UAV Tracking
* Learning Emotion Category Representation to Detect Emotion Relations Across Languages
* Learning hyperspectral noisy label with global and local hypergraph laplacian energy
* Learning Instance-Specific Parameters of Black-Box Models Using Differentiable Surrogates
* Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision
* Learning Meshing from Delaunay Triangulation for 3D Shape Representation
* Learning Multiple Object States from Actions via Large Language Models
* Learning position-aware implicit neural network for real-world face inpainting
* Learning Probabilistic Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
* Learning Semantic Part-Based Graph Structure for 3D Point Cloud Domain Generalization
* Learning Semi-Supervised Medical Image Segmentation from Spatial Registration
* Learning temporal-aware representation for controllable interventional radiology imaging
* Learning the Power of No: Foundation Models with Negations
* Learning to Count from Pseudo-Labeled Segmentation
* Learning to Identify Seen, Unseen and Unknown in the Open World: A Practical Setting for Zero-Shot Learning
* Learning to Rebalance Multi-Modal Optimization by Adaptively Masking Subnetworks
* Learning to See Low-Light Images via Feature Domain Adaptation
* Learning to Visually Connect Actions and Their Effects
* Learning Under Noisy Labels, Spurious Points, and Diverse Structures: TS40K, a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission Systems
* Learning Unified Distance Metric Across Diverse Data Distributions with Parameter-Efficient Transfer Learning
* Learning Video Salient Object Detection Progressively From Unlabeled Videos
* Learning Visual Grounding from Generative Vision and Language Model
* Learning Visual-Semantic Hierarchical Attribute Space for Interpretable Open-Set Recognition
* Learning with Enriched Inductive Biases for Vision-Language Models
* Learning Without Forgetting for Vision-Language Models
* LEO Satellite Navigation Signal Multi-Dimensional Interference Optimisation Method Based on Hybrid Game Theory
* Leveraging CLIP Encoder for Multimodal Emotion Recognition
* Leveraging Land Cover Priors for Isoprene Emission Super-Resolution
* Leveraging multi-level regularization for efficient Domain Adaptation of Black-box Predictors
* Leveraging Phenology to Assess Seasonal Variations of Plant Communities for Mapping Dynamic Ecosystems
* Leveraging Principal Component Analysis for Data-Driven and Objective Weight Assignment in Spatial Decision-Making Framework for Qanat-Induced Subsidence Susceptibility Assessment in Railway Networks
* Leveraging Vision Language Models for Specialized Agricultural Tasks
* LGSNet: Local-Global Semantics Learning Object Detection
* LiCamPose: Combining Multi-View LiDAR and RGB Cameras for Robust Single-timestamp 3D Human Pose Estimation
* LiDAR-Based Road Cracking Detection: Machine Learning Comparison, Intensity Normalization, and Open-Source WebGIS for Infrastructure Maintenance
* Lifting by Gaussians: A Simple, Fast and Flexible Method for 3D Instance Segmentation
* LiGAR: LiDAR-Guided Hierarchical Transformer for Multi-Modal Group Activity Recognition
* Lightweight Deep Exclusion Unfolding Network for Single Image Reflection Removal, A
* Lightweight RGB-D Salient Object Detection From a Speed-Accuracy Tradeoff Perspective
* Lightweight Strategies for Decision-Making of Autonomous Vehicles in Lane Change Scenarios Based on Deep Reinforcement Learning
* LiLMaps: Learnable Implicit Language Maps
* LIME: Localized Image Editing via Attention Regularization in Diffusion Models
* LIPIDS: Learning-based Illumination Planning In Discretized (Light) Space for Photometric Stereo
* Listen to your gradients: Integrating gradients into deep unfolding networks
* Listen With Seeing: Cross-Modal Contrastive Learning for Audio-Visual Event Localization
* Lithological Classification Using ZY1-02D Hyperspectral Data by Means of Machine Learning and Deep Learning Methods in the Kohat-Pothohar Plateau, Khyber Pakhtunkhwa, Pakistan
* LLaVA-SpaceSGG: Visual Instruct Tuning for Open-Vocabulary Scene Graph Generation with Enhanced Spatial Relations
* LLDiffusion: Learning degradation representations in diffusion models for low-light image enhancement
* LLM-Generated Rewrite and Context Modulation for Enhanced Vision Language Models in Digital Pathology
* LLM-RSPF: Large Language Model-Based Robotic System Planning Framework for Domain Specific Use-cases
* LLS: Local Learning Rule for Deep Neural Networks Inspired by Neural Activity Synchronization
* Local Consistency Guidance: Personalized Stylization Method of Face Video
* Local Gaussian ensemble for arbitrary-scale image super-resolution
* Local Masked Reconstruction for Efficient Self-Supervised Learning on High-Resolution Images
* Local Texture Pattern Estimation for Image Detail Super-Resolution
* Localization Method for UAV Aerial Images Based on Semantic Topological Feature Matching, A
* Localized Gaussian Splatting Editing with Contextual Awareness
* LoCo: Low-Bit Communication Adaptor for Large-Scale Model Training
* LoGAN: A novel local attentive generative adversarial resizable network for detailed 3D reconstruction of the Martian surface using monocular HiRISE images and DTMs
* LogicNet: A Logical Consistency Embedded Face Attribute Learning Network
* Long-Term (2015-2024) Daily PM2.5 Estimation in China by Using XGBoost Combining Empirical Orthogonal Function Decomposition
* Long-Term Ad Memorability: Understanding & Generating Memorable Ads
* Long-Term Impact of Extreme Weather Events on Grassland Growing Season Length on the Mongolian Plateau
* Long-Term Monitoring of Landslide Activity in a Debris Flow Gully Using SBAS-InSAR: A Case Study of Shawan Gully, China
* Looking at Model Debiasing through the Lens of Anomaly Detection
* Loose Social-Interaction Recognition in Real-World Therapy Scenarios
* LORD: Large Models Based Opposite Reward Design for Autonomous Driving
* LoSA: Long-Short-Range Adapter for Scaling End-to-End Temporal Action Localization
* Lost in light field compression: Understanding the unseen pitfalls in computer vision
* Low-Frequency Black-Box Backdoor Attack via Evolutionary Algorithm
* Low-Visibility Scene Enhancement by Isomorphic Dual-Branch Framework With Attention Learning
* LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones
* LPFFNet: Lightweight Prior Feature Fusion Network for SAR Ship Detection
* LQ-Adapter: ViT-Adapter with Learnable Queries for Gallbladder Cancer Detection from Ultrasound Images
* LS-MambaNet: Integrating Large Strip Convolution and Mamba Network for Remote Sensing Object Detection
* LumiGauss: Relightable Gaussian Splatting in the Wild
* Lunar Visual Localization Method Based on Crater Geohash Encoding and Consistency Matching
* LYT-NET: Lightweight YUV Transformer-Based Network for Low-Light Image Enhancement
* M3IF-NSST-MTV: Modified Total variation-based multi-modal medical image fusion using Laplacian energy and morphology in the NSST domain
* M3Track: Meta-Prompt for Multi-Modal Tracking
* MA-SAM: A Multi-Atlas Guided SAM Using Pseudo Mask Prompts Without Manual Annotation for Spine Image Segmentation
* Machine Learning-Based Alfalfa Height Estimation Using Sentinel-2 Multispectral Imagery
* Machine Learning-Based Estimation of foF2 and MUF(3000)F2 Using GNSS Ionospheric TEC Observations
* Machine-Learning-Based Monitoring of Night Sky Brightness Using Sky Quality Meters and Multi-Source Remote Sensing
* MagicFace: Slot-Driven High-Fidelity One-Shot Facial Appearance Editing
* MagicStick: Controllable Video Editing via Control Handle Transformations
* MAGMA: Manifold Regularization for MAEs
* MAIR++: Improving Multi-View Attention Inverse Rendering With Implicit Lighting Representation
* MAISI: Medical AI for Synthetic Imaging
* Make VLM Recognize Visual Hallucination on Cartoon Character Image with Pose Information
* Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds
* Mamba-Based Siamese Network for Remote Sensing Change Detection, A
* Mamba-ST: State Space Model for Efficient Style Transfer
* MambaMeshSeg-Net: A Large-Scale Urban Mesh Semantic Segmentation Method Using a State Space Model with a Hybrid Scanning Strategy
* MambaRecon: MRI Reconstruction with Structured State Space Models
* Mapping and Analyzing Winter Wheat Yields in the Huang-Huai-Hai Plain: A Climate-Independent Perspective
* Mapping Ecosystem Functional Groups in the Republic of Korea Based on the IUCN Global Ecosystem Typology
* Mapping Forest Aboveground Biomass with Phenological Information Extracted from Remote Sensing Images in Subtropical Evergreen Broadleaf Forests
* Mapping Gridded GDP Distribution of China Based on Remote Sensing Data and Machine Learning Methods
* Mapping Individual Tree- and Plot-Level Biomass Using Handheld Mobile Laser Scanning in Complex Subtropical Secondary and Old-Growth Forests
* Mapping of Flood Impacts Caused by the September 2023 Storm Daniel in Thessaly's Plain (Greece) with the Use of Remote Sensing Satellite Data
* Mapping of Monodominant Gilbertiodendron dewevrei Forest Across the Western Congo Basin Using Sentinel-2 Imagery
* Mapping Rice Phenology Using MODIS Products in An Giang Province, Mekong River Delta, Vietnam
* Mapping seamless surface water dynamics over East Africa semimonthly at a 10-meter resolution in 2017-2023 by integrating Sentinel-1/2 data
* Mapping Spatial Inequity in Urban Fire Service Provision: A Moran's I Analysis of Station Pressure Distribution
* Mapping Temperate Grassland Dynamics in China Inner Mongolia (1980s-2010s) Using Multi-Source Data and Deep Neural Network
* Mapping the Spatiotemporal Evolution of Cropland-Related Soil Erosion in China over the Past Four Decades
* Mapping Trails and Tracks in the Boreal Forest Using LiDAR and Convolutional Neural Networks
* Mapping Urban Divides: Analyzing Residential Segregation and Housing Types in a Medium-Sized Romanian City
* Mapping Windthrow Risk in Pinus radiata Plantations Using Multi-Temporal LiDAR and Machine Learning: A Case Study of Cyclone Gabrielle, New Zealand
* Mask-Guided Cross-Modality Fusion Network for Visible-Infrared Vehicle Detection
* MaskBlur: Spatial and Angular Data Augmentation for Light Field Image Super-Resolution
* Masked auto-encoding and scatter-decoupling transformer for polarimetric SAR image classification
* MaskVD: Region Masking for Efficient Video Object Detection
* MATLIT: MAT-Based Cooperative Reinforcement Learning for Urban Traffic Signal Control
* MatSpectNet: Material Segmentation Network with Domain-Aware and Physically-Constrained Hyperspectral Reconstruction
* McCaD: Multi-Contrast MRI Conditioned, Adaptive Adversarial Diffusion Model for High-Fidelity MRI Synthesis
* MDCN-PS: Monocular-Depth-Guided Coarse Normal Attention for Robust Photometric Stereo
* MDFormer: Multi-Scale Downsampling-Based Transformer for Low-Light Image Enhancement
* mDRA: A Multimodal Depression Risk Assessment Model Using Audio and Text
* MDSI: Pluggable Multi-strategy Decoupling with Semantic Integration for RGB-D Gesture Recognition
* Measuring the Validity of Clustering Validation Datasets
* MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
* MemControl: Mitigating Memorization in Diffusion Models via Automated Parameter Selection
* MemFusionMap: Working Memory Fusion for Online Vectorized HD Map Construction
* Memory-efficient Continual Learning with Neural Collapse Contrastive
* Memory-Efficient Pseudo-Labeling for Online Source-Free Universal Domain Adaptation using a Gaussian Mixture Model
* Memory-MambaNav: Enhancing object-goal navigation through integration of spatial-temporal scanning with state space models
* MENTOR: Human Perception-Guided Pretraining for Increased Generalization
* Merging Context Clustering With Visual State Space Models for Medical Image Segmentation
* MES-YOLO: An efficient lightweight maritime search and rescue object detection algorithm with improved feature fusion pyramid network
* Meta-Learning for Color-to-Infrared Cross-Modal Style Transfer
* MetaVIn: Meteorological and Visual Integration for Atmospheric Turbulence Strength Estimation
* Method for Identifying Landslide-Prone Areas Using Multiple Factors and Adaptive Probability Thresholds: A Case Study in Northern Tongren, Longwu River Basin, Qinghai Province, A
* Method for the 3D Reconstruction of Landscape Trees in the Leafless Stage, A
* Metric Compatible Training for Online Backfilling in Large-Scale Retrieval
* MF-LPR2: Multi-frame license plate image restoration and recognition using optical flow
* MFC-Net: Amodal instance segmentation with multi-path fusion and context-awareness
* MFNeRF: Memory Efficient NeRF with Mixed-Feature Hash Table
* MFSM-Net: Multimodal Feature Fusion for the Semantic Segmentation of Urban-Scale Textured 3D Meshes
* MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation
* MFTrans: A Multi-Resolution Fusion Transformer for Robust Tumor Segmentation in Whole Slide Images
* MIF: Multi-source information fusion for few-shot classification with CLIP
* MimicGait: A Model Agnostic approach for Occluded Gait Recognition Using Correlational Knowledge Distillation
* Mind the Map! Accounting for Existing Maps When Estimating Online HDMaps from Sensors
* Mind the Prompt: A Novel Benchmark for Prompt-Based Class-Agnostic Counting
* Minds on the Move: Decoding Trajectory Prediction in Autonomous Driving With Cognitive Insights
* Mineral segmentation using electron microscope images and spectral sampling through multimodal graph neural networks
* MIP-Enhanced Uncertainty-Aware Network for Fast 7T Time-of-Flight MRA Reconstruction
* MIP-GAF: A MLLM-Annotated Benchmark for Most Important Person Localization and Group Context Understanding
* MISF-Net: Modality-Invariant and -Specific Fusion Network for RGB-T Crowd Counting
* MissionGNN: Hierarchical Multimodal GNN-Based Weakly Supervised Video Anomaly Recognition with Mission-Specific Knowledge Graph Generation
* MixDiff: Mixing Natural and Synthetic Images for Robust Self-Supervised Representations
* Mixed Patch Visible-Infrared Modality Agnostic Object Detection
* MLLM as video narrator: Mitigating modality imbalance in video moment retrieval
* MLLM-LLaVA-FL: Multimodal Large Language Model Assisted Federated Learning
* MLLM-Tool: A Multimodal Large Language Model for Tool Agent Learning
* MMFF: Multiview and multi-level feature fusion method within limited sample conditions for SAR image target recognition
* MMGS: Multi-Model Synergistic Gaussian Splatting for Sparse View Synthesis
* Modality mixer exploiting complementary information for multi-modal action recognition
* Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-Based Semantic Segmentation
* Model of Building Changes to Support Comparative Studies and Open Discussions on Densification, A
* Model-Based Convolution Neural Network for 3D Near-Infrared Spectral Tomography
* Modeling and Correction Methods for Positioning Errors in Loran System at Sea
* Modeling Dynamics of Water Balance for Lakes in the Northwest Tibetan Plateau with Satellite-Based Observations
* Modeling Worldwide Tree Biodiversity Using Canopy Structure Metrics from Global Ecosystem Dynamics Investigation Data
* Modified Morphological Component Analysis Method for SAR Image Clutter Suppression
* Moment of Untruth: Dealing with Negative Queries in Video Moment Retrieval
* MONAS-ESNN: Multi-Objective Neural Architecture Search for Efficient Spiking Neural Networks
* MonoPP: Metric-Scaled Self-Supervised Monocular Depth Estimation by Planar-Parallax Geometry in Automotive Applications
* Monte Carlo Neural PDE Solver for Learning PDEs via Probabilistic Representation
* MoonShot: Towards Controllable Video Generation and Editing with Motion-Aware Multimodal Conditions
* MOOSS: Mask-Enhanced Temporal Contrastive Learning for Smooth State Evolution in Visual Reinforcement Learning
* MoRAG - Multi-Fusion Retrieval Augmented Generation for Human Motion
* Motion Intent Analysis-Based Full-Frame Video Stabilization
* MOVE: Effective and Harmless Ownership Verification via Embedded External Features
* Moving Average-Based Variable Projection for Separable Nonlinear Problems
* Moving Target Coherent Integration Method Based on TRCM-KT for UAV-Mounted Through-the-Wall Radar
* Moving-Least-Squares-Enhanced 3D Object Detection for 4D Millimeter-Wave Radar
* MoVis: When 3D Object Detection Is Like Human Monocular Vision
* MPPCAD: Minimum Power Pattern Constrained Adaptive Differential Beamforming
* MRCS-Net: Multi-Radar Clustering Segmentation Networks for Full-Pulse Sequences
* MrgaNet: Multi-scale recursive gated aggregation network for tracheoscopy images
* MRI Reconstruction with Regularized 3D Diffusion Model (R3DM)
* MS-Glance: Bio-Inspired Non-Semantic Context Vectors and Their Applications in Supervising Image Reconstruction
* MSCoTDet: Language-Driven Multi-Modal Fusion for Improved Multispectral Pedestrian Detection
* MSI-NeRF: Linking Omni-Depth with View Synthesis Through Multi-Sphere Image Aided Generalizable Neural Radiance Field
* MSIM: A Multiscale Iteration Method for Aerial Image and Satellite Image Registration
* MSKA: Multi-stream keypoint attention network for sign language recognition and translation
* MT-DyNN: Multi-Teacher Distilled Dynamic Neural Network for Instance-Adaptive Detection in Autonomous Driving
* MT_GAN: A SAR-to-optical image translation method for cloud removal
* MulDeF: A Model-Agnostic Debiasing Framework for Robust Multimodal Sentiment Analysis
* MulModSeg: Enhancing Unpaired Multi-Modal Medical Image Segmentation with Modality-Conditioned Text Embedding and Alternating Training
* Multi Target Localization With Block Orthogonal Least Squares for Multistatic MIMO Radars
* Multi-Aperture Transformers for 3D (MAT3D) Segmentation of Clinical and Microscopic Images
* Multi-channel set polynomial based label regularized graph neural networks against extreme data scarcity
* Multi-Class Textual-Inversion Secretly Yields a Semantic-Agnostic Classifier
* Multi-Directional Dual-Window Method Using Fractional Optimal-Order Fourier Transform for Hyperspectral Anomaly Detection
* Multi-Directional Pyranometer (CUBE-i) for Real-Time Direct and Diffuse Solar Irradiance Decomposition, A
* Multi-Domain Fusion Network for Active Jamming Recognition in Cognitive Radar
* Multi-Expert Adaptive Selection: Task-Balancing for All-in-One Image Restoration
* Multi-Feature Lightweight DeeplabV3+ Network for Polarimetric SAR Image Classification with Attention Mechanism
* Multi-grained contrast for data-efficient unsupervised representation learning
* Multi-Granularity Context Perception Network for Open Set Recognition of Camouflaged Objects
* Multi-Granularity Domain-Adaptive Teacher for Unsupervised Remote Sensing Object Detection
* Multi-granularity interaction and feature recombination network for fine-grained visual classification
* Multi-Hazard Susceptibility Mapping Using Machine Learning Approaches: A Case Study of South Korea
* Multi-HexPlanes: A Lightweight Map Representation for Rendering and 3D Reconstruction
* Multi-information Fusion Graph Convolutional Network for cancer driver gene identification
* Multi-Label Continual Learning for the Medical Domain: A Novel Benchmark
* Multi-level cross-modal attention guided DIBR 3D image watermarking
* Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets
* Multi-modal hypergraph contrastive learning for medical image segmentation
* Multi-Modal Large Language Model with RAG Strategies in Soccer Commentary Generation
* Multi-Modal Large Language Models are Effective Vision Learners
* Multi-Modal Understanding and Generation for Object Tracking
* Multi-Object Feature Extraction in Resonance Region Based on Short-Time Matrix Pencil Method
* Multi-Objective Gray Consistency Correction Method for Mosaicking Regional SAR Intensity Images with Brightness Anomalies, A
* Multi-Objective Multi-Drone Collaborative Routing Problem With Heterogeneous Delivery and Pickup Service
* Multi-Path Feature Extraction and Transformer Feature Enhancement DEM Super-Resolution Reconstruction Network, A
* Multi-Resolution Guided 3D GANs for Medical Image Translation
* Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation
* Multi-Scenario Forecasting of Land Use and Ecosystem Service Values in Coastal Regions: A Case Study of the Chaoshan Area, China
* Multi-Scenario Land Use and Carbon Storage Assessment in the Yellow River Delta Under Climate Change and Resource Development
* Multi-Scenario Simulation of Urban Land Expansion Modes Considering Differences in Spatial Functional Zoning
* Multi-Spectral Image Color Reproduction
* Multi-Stage Control Strategy of IoT-Enabled Unmanned Vehicle Detection Systems
* Multi-stage intermediate fusion for multimodal learning to classify non-small cell lung cancer subtypes from CT and PET
* Multi-Stage Optimization Approach for Satellite Orbit Pursuit-Evasion Games Based on a Coevolutionary Mechanism, A
* Multi-Surrogate-Teacher Assistance for Representation Alignment in Fingerprint-Based Indoor Localization
* Multi-Target Association for UAVs Based on Fused Topology and Visual Features
* Multi-task Learning of Classification and Generation for Set-structured Data
* Multi-Task Supervised Compression Model for Split Computing, A
* Multi-TuneV: Fine-tuning the fusion of multiple modules for video action recognition
* Multi-View Factorizing and Disentangling: A Novel Framework for Incomplete Multi-View Multi-Label Classification
* Multi-View Image Diffusion via Coordinate Noise and Fourier Attention
* Multi-View Test-Time Adaptation for Semantic Segmentation in Clinical Cataract Surgery
* Multi-Year Global Oscillations in GNSS Deformation and Surface Loading Contributions
* Multidimensional Study of the 2023 Beijing Extreme Rainfall: Theme, Location, and Sentiment Based on Social Media Data, A
* Multilevel Feature Cross-Fusion-Based High-Resolution Remote Sensing Wetland Landscape Classification and Landscape Pattern Evolution Analysis
* Multilevel Representation Disentanglement Framework for Multimodal Sentiment Analysis
* Multimodal Emotional Talking Face Generation Based on Action Units
* Multimodal Fusion Learning with Dual Attention for Medical Imaging
* Multimodal Interpretable Depression Analysis Using Visual, Physiological, Audio and Textual Data
* Multimodal large language model for wheat breeding: A new exploration of smart breeding
* Multimodal Prompt-Guided Bidirectional Fusion for Referring Remote Sensing Image Segmentation
* Multiple Model Estimation via Variable Structure With Spatiotemporal Primal-Dual Projection
* Multiscale Skeleton-Based Temporal Action Segmentation Using Hierarchical Temporal Modeling and Prediction Ensemble
* Multispectral Object Detection Enhanced by Cross-Modal Information Complementary and Cosine Similarity Channel Resampling Modules
* Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network
* MuseumMaker: Continual Style Customization Without Catastrophic Forgetting
* Mutual Supervision Framework for Referring Expression Segmentation and Generation, A
* MVAD: A Multiple Visual Artifact Detector for Video Streaming
* MVCF-TMI: A Travel Mode Identification Framework via Contrastive Fusion of Multi-View Trajectory Representations
* MVFNet: Multipurpose Video Forensics Network using Multiple Forms of Forensic Evidence
* MVMD: A Multi-View Approach for Enhanced Mirror Detection
* My3DGen: A Scalable Personalized 3D Generative Model
* NarrAD: Automatic Generation of Audio Descriptions for Movies with Rich Narrative Context
* NAT: Learning to Attack Neurons for Enhanced Adversarial Transferability
* Natural Language-Based Automatic Identification System Trajectory Query Approach Using Large Language Models, A
* Navigating Heterogeneity and Privacy in One-Shot Federated Learning with Diffusion Models
* Navigating Through Whole Slide Images With Hierarchy, Multi-Object, and Multi-Scale Data
* NCAdapt: Dynamic Adaptation with Domain-Specific Neural Cellular Automata for Continual Hippocampus Segmentation
* NCAP: Scene Text Image Super-Resolution with Non-CAtegorical Prior
* nD-histogram technique for querying non-uniformly distributed point cloud data, An
* Near-metameric illumination changes affect visually perceived food attributes
* Near-Real-Time Global Thermospheric Density Variations Unveiled by Starlink Ephemeris
* Near-Real-Time Imaging Algorithm for Focusing Spaceborne SAR Data in Multiple Modes Based on an Embedded GPU, A
* Needles & Haystacks: Dataset and Benchmark for Domain-Agnostic Image-Based Rigid Slice-to-Volume Registration
* Negative-Prompt Inversion: Fast Image Inversion for Editing with Text-Guided Diffusion Models
* neighbor-aware feature enhancement network for crowd counting, A
* NER-Net+: Seeing Motion at Nighttime With an Event Camera
* NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields, The
* NeRF-Det++: Incorporating Semantic Cues and Perspective-Aware Depth Supervision for Indoor Multi-View 3D Detection
* NeRFs are Mirror Detectors: Using Structural Similarity for Multi-View Mirror Scene Reconstruction with 3D Surface Primitives
* Nestedmorph: Enhancing Deformable Medical Image Registration With Nested Attention Mechanisms
* NeuManifold: Neural Watertight Manifold Reconstruction with Efficient and High-Quality Rendering Support
* Neural Graph Map: Dense Mapping with Efficient Loop Closure Integration
* Neural Network Based on Dynamic Collaboration of Flows for Temporal Downscaling
* Neural SDF for Shadow-Aware Unsupervised Structured Light
* Neuroadaptive Admittance Control for Human-Robot Interaction With Human Motion Intention Estimation and Output Error Constraint
* NeuroViG: Integrating Event Cameras for Resource-Efficient Video Grounding
* new approach to color correction and equalization for generating Mars global color image mosaics from Tianwen-1 MoRIC images, A
* New Benchmark and Baseline for Real-Time High-Resolution Image Inpainting on Edge Devices, A
* New CFAR Detector Based on the EM Algorithm, A
* New Derivation of the Formula for the Length of a Loxodrome Arc on a Sphere Using Cylindrical Projections, A
* New Earth System Spatial Grid Extending the Great Circle Arc QTM: The Spherical Geodesic Degenerate Octree Grid, A
* new fast DBSCAN using dual-space analysis and colour integral volume for document image segmentation, A
* New Iterative Weighted Least Squares Algorithm for 1-D SA Localization, A
* New maps of mafic mineral abundances in global mare units on the Moon
* New Method for Single-Site Cloud-to-Ground Lightning Location Based on Tri-Pre Processing
* New Quasi-Linear Integral Transform Between Ocean Wave Spectrum and Phase Spectrum of an XTI-SAR, A
* New Transformer Network for Short-Term Global Sea Surface Temperature Forecasting: Importance of Eddies, A
* New Typification Method for Combined Linear Building Patterns with the Resolution of Spatial Conflicts, A
* Night-Time Traffic Light Recognition Based on Enhancement-Guided Object Detection
* NIR-Assisted Image Denoising: A Selective Fusion Approach and a Real-World Benchmark Dataset
* No Annotations for Object Detection in Art Through Stable Diffusion
* No-Reference Point Cloud Quality Assessment via Graph Convolutional Network
* Noise Radar Waveform Design Using Evolutionary Algorithms and Negentropy Constraint
* Noise-Aware Evaluation of Object Detectors
* Non-Asymptotic Analysis on the Additional Bias of Capon's Method, A
* Non-Cross Diffusion for Semantic Consistency
* Nonlinear Impact of Built Environment on Older Adults' Bus Use Behavior: A Hybrid Model Considering Spatial Heterogeneity
* Nonlinear Phase Reconstruction and Compensation Method Based on Orthonormal Complete Basis Functions in Synthetic Aperture Ladar Imaging Technology
* Nonlinear Schur-Type Audio Signal Parameterization for Convolutional Networks
* Novel Adaptive Fine-Tuning Algorithm for Multimodal Models: Self-Optimizing Classification and Selection of High-Quality Datasets in Remote Sensing, A
* Novel Aerosol Optical Depth Retrieval Method Based on SDAE from Himawari-8/AHI Next-Generation Geostationary Satellite in Hubei Province, A
* Novel Computational Photography for Soft-Focus Effect in Automatic Post Production
* Novel Digital Twin Framework With Hardware-in-the-Loop for Engine Systems, A
* Novel Energy Efficient Single-Capacitor Switching Scheme for SAR ADCs, A
* novel framework for diverse video generation from a single video using frame-conditioned denoising diffusion probabilistic model and ConvNeXt-V2, A
* Novel Framework for Learning Bézier Decomposition From 3D Point Clouds, A
* novel heterogeneous data classification approach combining gradient boosting decision trees and hybrid structure model, A
* Novel Optimal Distributed Nonlinear Filter for Simultaneous State and Unknown Input Estimation in Multi-Sensor Networks, A
* Novel Perspective for Multi-Modal Multi-Label Skin Lesion Classification, A
* Novel Proactive Fault Tolerance Loss Function for Crack Segmentation, A
* Novel Sea Surface Temperature Prediction Model Using DBN-SVR and Spatiotemporal Secondary Calibration, A
* Novel Three-Stage Filtering Identification Algorithm for the Exponential Autoregressive Time-Series Model, A
* Now you see Me: Context-Aware Automatic Audio Description
* NP-Hand: Novel Perspective Hand Image Synthesis Guided by Normals
* NPL-MVPS: Neural Point-Light Multi-View Photometric Stereo
* NPP-VIIRS Nighttime Lights Illustrate the Post-Earthquake Damage and Subsequent Economic Recovery in Hatay Province, Turkey
* NTRENet++: Unleashing the Power of Non-Target Knowledge for Few-Shot Semantic Segmentation
* OccFlowNet: Occupancy Estimation via Differentiable Rendering and Occupancy Flow
* OccLoff: Learning Optimized Feature Fusion for 3D Occupancy Prediction
* Ocean Surface Wind Field Retrieval Simultaneously Using SAR Backscatter and Doppler Shift Measurements
* OEM-HWNet: A Prior Knowledge-Guided Network for Pavement Interlayer Distress Detection Based on Computer Vision Using GPR
* Offloading Model and Algorithm for VANET Broadcast Applications
* OIL-AD: An anomaly detection framework for decision-making sequences
* OmniDiffusion: Reformulating 360 Monocular Depth Estimation Using Semantic and Surface Normal Conditioned Diffusion
* OmniGS: Fast Radiance Field Reconstruction Using Omnidirectional Gaussian Splatting
* On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process
* On Neural BRDFs: A Thorough Comparison of State-of-the-Art Approaches
* On the consistency and stability of vegetation biophysical variables retrievals from Landsat-8/9 and Sentinel-2
* On the Importance of Dual-Space Augmentation for Domain Generalized Object Detection
* On the Possibility of Detecting Evaporation Ducts Through GNSS Reflectometry
* On the representation of sparse stochastic matrices with state embedding
* On the Upper Bounds of Number of Linear Regions and Generalization Error of Deep Convolutional Neural Networks
* On Which Data Distribution (Synthetic or Real) We Should Rely for Soft Biometric Classification
* On-the-Fly Object-aware Representative Point Selection in Point Cloud
* One VLM to Keep it Learning: Generation and Balancing for Data-free Continual Visual Question Answering
* One-Dimensional Convolutional Neural Network for Automated Kimchi Cabbage Downy Mildew Detection Using Aerial Hyperspectral Images
* online adaptive augmentation strategy for cervical cytopathology image recognition, An
* Online Asymmetric Supervised Discrete Cross-Modal Hashing for Streaming Multimedia Data
* Online-LoRA: Task-Free Online Continual Learning via Low Rank Adaptation
* Open and Free Sentinel-2 Mowing Event Data for Austria
* Open-vocabulary generative vision-language models for creating a large-scale remote sensing change detection dataset
* OpenCapBench: A Benchmark to Bridge Pose Estimation and Biomechanics
* OpenCity3D: What do Vision-Language Models Know About Urban Environments?
* Optimal color sets to represent the colors of natural scenes by k-medoids clustering
* Optimal Hierarchical Arithmetic Average Fusion of GM-PHD Filters, An
* Optimal Multisecret Image Sharing Using Lightweight Visual Sign-Cryptography Scheme With Optimal Key Generation for Gray/Color Images
* Optimal Trading of a Charging-Station Company in Auction Markets for Electricity
* Optimization for Paralyzing G2A Communication Network: A DRL-Based Joint Path Planning and Jamming Power Allocation Approach
* Optimization of Rank Losses for Image Retrieval
* Optimization Simulation and Comprehensive Evaluation Coupled with CNN-LSTM and PLUS for Multi-Scenario Land Use in Cultivated Land Reserve Resource Area
* Optimization with Deep Learning Classifier-Based Foliar Disease Classification in Apple Trees Using IoT Network
* Optimization-Based Downscaling of Satellite-Derived Isotropic Broadband Albedo to High Resolution
* Optimizing Dense Visual Predictions Through Multi-Task Coherence and Prioritization
* Optimizing Neural Network Effectiveness via Non-monotonicity Refinement
* Optimizing Unmanned Aerial Vehicle LiDAR Data Collection in Cotton Through Flight Settings and Data Processing
* Optimizing Urban Logistics: Vehicle Routing Problem With Underground Transportation
* Optimizing Vision-Language Model for Road Crossing Intention Estimation
* OPTIMUS: Observing Persistent Transformations in Multi-Temporal Unlabeled Satellite-Data
* Orbital Design Optimization for Large-Scale SAR Constellations: A Hybrid Framework Integrating Fuzzy Rules and Chaotic Sequences
* Ordinal Multiple-instance Learning for Ulcerative Colitis Severity Estimation with Selective Aggregated Transformer
* ORFormer: Occlusion-Robust Transformer for Accurate Facial Landmark Detection
* ORID: Organ-Regional Information Driven Framework for Radiology Report Generation
* Oriented Cell Dataset: A Dataset and Benchmark for Oriented Cell Detection and Applications
* Oriented SAR Ship Detection Based on Edge Deformable Convolution and Point Set Representation
* ORPSD: Outer Rectangular Projection-Based Representation for Oriented Ship Detection in SAR Images
* Orthogonal opponent colour local binary patterns: a new colour-texture descriptor for content-based image retrieval
* OT-VP: Optimal Transport-Guided Visual Prompting for Test-Time Adaptation
* OTCXR: Rethinking Self-supervised Alignment using Optimal Transport for Chest X-ray Analysis
* PACA: Perspective-Aware Cross-Attention Representation for Zero-Shot Scene Rearrangement
* Paddy Field Scale Evapotranspiration Estimation Based on Two-Source Energy Balance Model with Energy Flux Constraints and UAV Multimodal Data
* Paladin: Understanding Video Intentions in Political Advertisement Videos
* Palo: A Polyglot Large Multimodal Model for 5B People
* Panoptic segmentation-based semantic embedding matching model for scene graph generation
* Parameter-Free Deep Multi-Modal Clustering With Reliable Contrastive Learning
* Parametric Approach to Adversarial Augmentation for Cross-Domain Iris Presentation Attack Detection, A
* Parametric Representation of Tropical Cyclone Outer Radical Wind Profile Using Microwave Radiometer Data
* Park Development, Potential Measurement, and Site Selection Study Based on Interpretable Machine Learning: A Case Study of Shenzhen City, China
* Part-aware distillation and aggregation network for human parsing
* Partial consistent adversarial unified framework for unsupervised non-contrast CT cross-domain adaptation and segmentation
* Partial Filter-Sharing: Improved Parameter-sharing Method for Single Image Super-Resolution Networks
* Partial Texture VAE: Color and Texture Encoder for Rock Particle Images
* Passive Microwave Imagers, Their Applications, and Benefits: A Review
* Passive Multisource Tracking via Distributed Sparse Arrays: Homogeneous Data Fusion and Multivariate Adaptation
* PAT: Pixel-wise Adaptive Training for long-tailed segmentation
* Patch Ranking: Token Pruning as Ranking Prediction for Efficient CLIP
* PatchFinder: Leveraging Visual Language Models for Accurate Information Retrieval Using Model Uncertainty
* Pattern Recognition in Urban Maps Based on Graph Structures
* Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation
* PC-GZSL: Prior Correction for Generalized Zero Shot Learning
* PCTrack: Accurate Object Tracking for Live Video Analytics on Resource-Constrained Edge Devices
* Pedestrian detection based on vision-language semantics with global adaptive adjustment
* Pedestrian-Vehicle Interaction Analysis Based on Concept of Dynamic Straight-Right Lane at Signalized Intersection
* Per-Pixel Solution of Multispectral Photometric Stereo
* Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries
* Perceptual Screen Content Image Hashing Using Adaptive Texture and Shape Features
* Performance Evaluation of Inherent Optical Property Algorithms and Identification of Potential Water Quality Indicators Using GCOM-C Data in Eutrophic Lake Kasumigaura, Japan
* Performance of Green Areas in Mitigating the Alteration of Land Surface Temperature in Urban Zones of Lima, Peru
* Performance Prediction of Hybrid Integration Detector for Radar Moderately Fluctuating Rayleigh Targets
* Permissioned Blockchain-Based Quantum-Inspired Edge Intelligence Approach for the Services of Future Internet of Vehicles, The
* Personalised video summarisation using video-text multi-modal fusion
* Personalized Lane-Changing Decision System Based on Improved Stackelberg Game and Traffic Flow Information, A
* Personalized Mixture of Experts for Multi-Site Medical Image Segmentation
* PETALface: Parameter Efficient Transfer Learning for Low-Resolution Face Recognition
* PGRID: Power Grid Reconstruction in Informal Developments Using High-Resolution Aerial Imagery
* Phaseformer: Phase-Based Attention Mechanism for Underwater Image Restoration and Beyond
* Physics-Based Computational Forward Model for Efficient Image Reconstruction in Magnetic Particle Imaging, A
* Physiology-Aware PolySnake for Coronary Vessel Segmentation
* PhysMLE: Generalizable and Priors-Inclusive Multi-Task Remote Physiological Measurement
* PICASSO: A Feed-Forward Framework for Parametric Inference of CAD Sketches via Rendering Self-Supervision
* PICK: Predict and Mask for Semi-supervised Medical Image Segmentation
* Pipeline and NIR-Enhanced Dataset for Parking Lot Segmentation, A
* PivotAlign: Improve Semi-Supervised Learning by Learning Intra-Class Heterogeneity and Aligning with Pivots
* Pix2Poly: A Sequence Prediction Method for End-to-End Polygonal Building Footprint Extraction from Remote Sensing Imagery
* Pixel-Inconsistency Modeling for Image Manipulation Localization
* Pixel-Wise Shuffling with Collaborative Sparsity for Melanoma Hyperspectral Image Classification
* Pixel2Pixel: A Pixelwise Approach for Zero-Shot Single Image Denoising
* PixelQuery: Efficient Distance Range Join Query Technique for Visualization Analysis
* PixSwap: High-Resolution Face Swapping for Effective Reflection of Identity via Pixel-Level Supervision with Synthetic Paired Dataset
* PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices
* Planar Gaussian Splatting
* Planet4Stereo: A Photogrammetric Open-Source Pipeline for Generating Digital Elevation Models for Glacier Change Monitoring Using Low-Cost PlanetScope Satellite Data
* Plant Height and Soil Compaction in Coffee Crops Based on LiDAR and RGB Sensors Carried by Remotely Piloted Aircraft
* PLReMix: Combating Noisy Labels with Pseudo-Label Relaxed Contrastive Representation Learning
* Pluralistic Salient Object Detection
* PMMTalk: Speech-Driven 3D Facial Animation From Complementary Pseudo Multi-Modal Features
* PMNet: Predator-Mimicking Network for Video Camouflaged Object Detection
* PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing
* Point Cloud Color Upsampling with Attention-Based Coarse Colorization and Refinement
* Point-FCW: Transposed-FCW Graph Representation for Point Cloud Classification Using TDA
* Point-GN: A Non-Parametric Network Using Gaussian Positional Encoding for Point Cloud Classification
* Point-JEPA: A Joint Embedding Predictive Architecture for Self-Supervised Learning on Point Cloud
* PointFormer: Keypoint-Guided Transformer for Simultaneous Nuclei Segmentation and Classification in Multi-Tissue Histology Images
* Polarity-Focused Denoising for Event Cameras
* Polarization as Texture: Microscale 3D Shape from Polarized Light Focus
* Policy-Oriented Cognitive Risk Map Modeling for Lane Change via Deep Successor Representation
* PolyReg: Autoregressive Building Outline Regularization via Masked Attention Sequence Generation
* PoolAtnRes: Towards Generalisable Differential Morphing Attack Detection
* Pose-graph optimization for efficient tie-point matching and 3D scene reconstruction from oblique UAV images
* PositiveCoOp: Rethinking Prompting Strategies for Multi-Label Recognition with Partial Annotations
* Post-Little Ice Age Equilibrium-Line Altitude and Temperature Changes in the Greater Caucasus Based on Small Glaciers
* Post-Processing Optimization of the Global 30m Land Cover Dynamic Monitoring Product
* PostoMETRO: Pose Token Enhanced Mesh Transformer for Robust 3D Human Mesh Recovery
* Potential of EnMAP Hyperspectral Imagery for Regional-Scale Soil Organic Matter Mapping
* Power Minimization-Based Secure Beamforming for MISO VLC Having Both Perfect and Imperfect CSI
* Practical Framework for Estimating Façade Opening Rates of Rural Buildings Using Real-Scene 3D Models Derived from Unmanned Aerial Vehicle Photogrammetry, A
* Pre-capture Privacy via Adaptive Single-Pixel Imaging
* Pre-trained Multiple Latent Variable Generative Models are Good Defenders Against Adversarial Attacks
* Pre-trained Trojan Attacks for Visual Recognition
* Precipitation Retrieval from Geostationary Satellite Data Based on a New QPE Algorithm
* Precise Integral in NeRFs: Overcoming the Approximation Errors of Numerical Quadrature
* Precise Prediction Method for Subsurface Temperatures Based on the Rock Resistivity-Temperature Coupling Model, A
* Predefined Time and Prespecified Precision for Bearing-Constrained AAV Swarm
* Predicting Event Memorability Using Personalized Federated Learning
* Predicting Tree-Level Diameter and Volume for Radiata Pine Using UAV LiDAR-Derived Metrics Across a National Trial Series in New Zealand
* Prediction and Feedback Assisted Evolutionary Algorithms for Scheduling Urban Traffic Signals
* Prediction of Sea Surface Chlorophyll-a Concentrations by Remote Sensing and Deep Learning
* Prediction of the Morphological Characteristics of Asymmetric Thaw Plate of Qinghai-Tibet Highway Using Remote Sensing and Large-Scale Geological Survey Data
* Predictive Observer-Based Dual-Rate Prescribed Performance Control for Visual Servoing of Robot Manipulators With View Constraints
* Predictive Sample Assignment for Semantically Coherent Out-of-Distribution Detection
* Pressure-Related Discrepancies in Landsat 8 Level 2 Collection 2 Surface Reflectance Products and Their Correction
* PrevPredMap: Exploring Temporal Modeling with Previous Predictions for Online Vectorized HD Map Construction
* PRGS: Patch-to-Region Graph Search for Visual Place Recognition
* Primary Interannual Variability Modes of Summer Moisture Transports in the Tibetan Plateau
* Prior2Posterior: Model Prior Correction for Long-Tailed Learning
* PRISMA imaging for land covers and surface materials composition in urban and rural areas adopting multiple endmember spectral mixture analysis (MESMA)
* PrivateEye: In-Sensor Privacy Preservation Through Optical Feature Separation
* Probabilistic Site Adaptation for High-Accuracy Solar Radiation Datasets in the Western Sichuan Plateau
* Production and Analysis of a Landslide Susceptibility Map Covering Entire China
* Progressive Invariant Causal Feature Learning for Single Domain Generalization
* Progressive Semantic-Visual Alignment and Refinement for Vision-Language Tracking
* Progressive Skip Connection Improves Consistency of Diffusion-Based Speech Enhancement
* PRoGS: Progressive Rendering of Gaussian Splats
* Prompt-Based Concept Learning for Few-Shot Class-Incremental Learning
* Prompt-Based Modality Alignment for Effective Multi-Modal Object Re-Identification
* Protecting the Copyright of Intelligent Transportation Systems Based on Zernike Moments
* Prototype Imputation Guided Incomplete Multi-View Clustering
* Prototype-augmented mean teacher for robust semi-supervised medical image segmentation
* Pruning One More Token is Enough: Leveraging Latency-Workload Non-Linearities for Vision Transformers on the Edge
* Pruning Sparse Tensor Neural Networks Enables Deep Learning for 3D Ultrasound Localization Microscopy
* PS-YOLO: A Lighter and Faster Network for UAV Object Detection
* PSeqNet: A crop phenology monitoring model accounting for phenological associations
* Pseudo-Plane Regularized Signed Distance Field for Neural Indoor Scene Reconstruction
* Psych-Occlusion: Using Visual Psychophysics for Aerial Detection of Occluded Persons During Search and Rescue
* PTQ4VM: Post-Training Quantization for Visual Mamba
* PULSE: Physiological Understanding with Liquid Signal Extraction
* PureForest: A Large-Scale Aerial Lidar and Aerial Imagery Dataset for Tree Species Classification in Monospecific Forests
* PV-VTT: A Privacy-Centric Dataset for Mission-Specific Anomaly Detection and Natural Language Interpretation
* PVP: Polar Representation Boost for 3D Semantic Occupancy Prediction
* PVT: An Implicit Surface Reconstruction Framework via Point Voxel Geometric-Aware Transformer
* Q-TempFusion: Quantization-Aware Temporal Multi-Sensor Fusion on Bird's-Eye View Representation
* QELDBA: Query-Efficient and Low Distortion Black-Box Attack for Brainprint Recognition
* QformerID: Quaternion Transformer-Based Image Denoising
* Quality Assessment of Operational Fengyun-4B/GIIRS Atmospheric Temperature and Humidity Profile Products
* QuantAttack: Exploiting Quantization Techniques to Attack Vision Transformers
* Quantifying CIE alpha-opic signals in the indoor built environment
* Quantifying the Impact of Vegetation Greening on Evapotranspiration and Its Components on the Tibetan Plateau
* Quantitative and Spatially Explicit Clustering of Urban Grocery Shoppers in Montreal: Integrating Loyalty Data with Synthetic Population
* Query as Supervision: Toward Low-Cost and Robust Video Moment and Highlight Retrieval
* Radiance Field-Based Pose Estimation via Decoupled Optimization Under Challenging Initial Conditions
* RAFNet: Rotation-aware anchor-free framework for geospatial object detection
* RaLiBEV: Radar and LiDAR BEV Fusion Learning for Anchor Box Free Object Detection Systems
* RAM: Interpreting real-world image super-resolution in the industry environment
* Random Forest-Based Precipitation Detection Algorithm for FY-3C/3D MWTS2 over Oceanic Regions, A
* Randomized quaternion tensor UTV decompositions for color image and color video processing
* Rank-revealing fully-connected tensor network decomposition and its application to tensor completion
* Rao and Wald Tests in Nonzero-Mean Non-Gaussian Sea Clutter
* Rapeseed Area Extraction Based on Time-Series Dual-Polarization Radar Vegetation Indices
* Rapid Deformation Identification and Adaptive Filtering with GNSS TDCP Under Different Scenarios and Its Application in Landslide Monitoring
* Rapid Mapping of Rainfall-Induced Landslide Using Multi-Temporal Satellite Data
* Rapid Probabilistic Inundation Mapping Using Local Thresholds and Sentinel-1 SAR Data on Google Earth Engine
* Rapid Test for Accuracy and Bias of Face Recognition Technology, A
* RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone
* RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation
* RayGauss: Volumetric Gaussian-Based Ray Casting for Photorealistic Novel View Synthesis
* RBF-Based Weighted Minimum Likelihood Error Entropy Against Multimodal Noise
* RBMark: Robust and blind video watermark in DT CWT domain
* RC-SODet: Reparameterized dual convolutions and compact feature enhancement for small object detector
* RD-DPP: Rate-Distortion Theory Meets Determinantal Point Process to Diversify Learning Data Samples
* Re-Evaluating Group Robustness via Adaptive Class-Specific Scaling
* Re-identifying People in Video via Learned Temporal Attention and Multi-modal Foundation Models
* Real-Time Railway Obstacle Detection Based on Multitask Perception Learning
* Real-Time Regional Ionospheric Total Electron Content Modeling Using the Extended Kalman Filter
* Real-Time Self-Supervised Ultrasound Image Enhancement Using Test-Time Adaptation for Sophisticated Rotator Cuff Tear Diagnosis
* Real-World Image Reflection Removal: An Ultra-High-Definition Dataset and an Efficient Baseline
* Real-world nighttime image dehazing using contrastive and adversarial learning
* Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models
* Realistic Protocol for Evaluation of Weakly Supervised Object Localization, A
* Reality Check on Pre-training for Exemplar-free Class-Incremental Learning, A
* ReBotNet: Fast Real-Time Video Enhancement
* ReC-TTT: Contrastive Feature Reconstruction for Test-Time Training
* Receiver-Agnostic Radio Frequency Fingerprinting Using a Prototypical Contrastive Domain Adaptation Method
* Recent security challenges and robust techniques in colour image watermarking
* Recipe for Geometry-Aware 3D Mesh Transformers, A
* Recognizing Unseen States of Unknown Objects by Leveraging Knowledge Graphs
* Recoverable Anonymization for Pose Estimation: A Privacy-Enhancing Approach
* Recruiting Teacher IF Modality for Nephropathy Diagnosis: A Customized Distillation Method With Attention-Based Diffusion Network
* Recurrence-Based Vanishing Point Detection
* Reducing the Content Bias for AI-generated Image Detection
* REEDIT: Multimodal Exemplar-Based Image Editing
* ReferSAM: Unleashing Segment Anything Model for Referring Image Segmentation
* Refined Deformable-DETR for SAR Target Detection and Radio Signal Detection
* Refinement of Trend-to-Trend Cross-Calibration Total Uncertainties Utilizing Extended Pseudo Invariant Calibration Sites (EPICS) Global Temporally Stable Target
* Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
* Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird's-Eye-View via Uncertainty Measure
* Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation
* ReFu: Recursive Fusion for Exemplar-Free 3D Class-Incremental Learning
* RefVSR++: Exploiting Reference Inputs for Reference-based Video Super-resolution
* Regional-Level Resource-Saving Model for Winter Road Surface Snow Detection in Extreme Weathers, A
* ReinDiffuse: Crafting Physically Plausible Motions with Reinforced Diffusion Model
* Reinforcement Learning With Model Predictive Control for Highway Ramp Metering
* Relation Inference Enhancement Network for Visual Commonsense Reasoning
* Relation-Guided Versatile Regularization for Federated Semi-Supervised Learning
* Relational Self-Supervised Distillation with Compact Descriptors for Image Copy Detection
* Relaxing Binary Constraints in Contrastive Vision-Language Medical Representation Learning
* Reliable Representation Learning for Incomplete Multi-View Missing Multi-Label Classification
* Reliable-loc: Robust sequential LiDAR global localization in large-scale street scenes based on verifiable cues
* Relighting From a Single Image: Datasets and Deep Intrinsic-Based Architecture
* ReMix: Training Generalized Person Re-Identification on a Mixture of Data
* Remote Blood Pressure Estimation from Facial Videos Using Transfer Learning: Leveraging PPG to rPPG Conversion
* Remote blood pressure estimation using BVP signal features from facial videos
* Remote Sensing Image Segmentation Using Vision Mamba and Multi-Scale Multi-Frequency Feature Fusion
* Remote Sensing of Particle Absorption Coefficient of Pigments Using a Two-Stage Framework Integrating Optical Classification and Machine Learning
* Remote Sensing-Based Detection and Analysis of Slow-Moving Landslides in Aba Prefecture, Southwest China
* Removing Geometric Bias in One-Class Anomaly Detection with Adaptive Feature Perturbation
* ReMP: Reusable Motion Prior for Multi-domain 3D Human Pose Estimation and Motion Inbetweening
* RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird's Eye View Segmentation
* Replay Without Saving: Prototype Derivation and Distribution Rebalance for Class-Incremental Semantic Segmentation
* RepSNet: A Nucleus Instance Segmentation Model Based on Boundary Regression and Structural Re-Parameterization
* Research on Camouflage Target Classification and Recognition Based on Mid Wave Infrared Hyperspectral Imaging
* Research on Effective Radius Retrievals of Aerosol Particles Based on Dual-Wavelength Lidar
* Research on Full-Sky Star Identification Based on Spatial Projection and Reconfigurable Navigation Catalog
* Research on Monitoring Oceanic Precipitable Water Vapor and Short-Term Rainfall Forecasting Using Low-Cost Global Navigation Satellite System Buoy
* Research on Multi-Modal Point Cloud Completion Algorithm Guided by Image Rotation Attention
* Research on Printmaking Image Classification and Creation Based on Convolutional Neural Network
* Research on Ship Following Behavior Based on Data Mining in Arctic Waters
* Research on the Response Mechanism of Vegetation to Drought Stress in the West Liao River Basin, China
* Research on Weighted Fusion Method for Multi-Source Sea Surface Temperature Based on Cloud Conditions
* Response Mechanism of Ecosystem Service Trade-Offs Along an Aridity Gradient in Humid and Semi-Humid Regions: A Case Study of Northeast China, The
* Restricted Label-Based Self-Supervised Learning Using SAR and Multispectral Imagery for Local Climate Zone Classification
* Retaining and Enhancing Pre-trained Knowledge in Vision-Language Models with Prompt Ensembling
* Rethinking Active Domain Adaptation: Balancing Uncertainty and Diversity
* Rethinking Affine Transform for Efficient Image Enhancement: A Color Space Perspective
* Rethinking Cluster-Conditioned Diffusion Models for Label-Free Image Synthesis
* Rethinking Generalizability and Discriminability of Self-Supervised Learning from Evolutionary Game Theory Perspective
* Rethinking Low-Rank Adaptation in Vision: Exploring Head-Level Responsiveness across Diverse Tasks
* Rethinking the sample relations for few-shot classification
* Rethinking transformers with convolution and graph embeddings for few-shot molecular property discovery
* Retrieval Augmented Recipe Generation
* Retrieval of Cloud Ice Water Path from FY-3F MWTS and MWHS
* Retrieval of Dissolved Organic Carbon Storage in Plateau Lakes Based on Remote Sensing and Analysis of Driving Factors: A Case Study of Lake Dianchi
* Retrievals of Biomass Burning Aerosol and Liquid Cloud Properties from Polarimetric Observations Using Deep Learning Techniques
* Retrieving and Reasoning: Multivariate Feature and Attribute Cooperation for Video Anomaly Detection
* Retrieving Inland Water Quality Parameters via Satellite Remote Sensing: Sensor Evaluation, Atmospheric Correction, and Machine Learning Approaches
* Reversible data hiding with automatic contrast enhancement and high embedding capacity based on multi-type histogram modification
* Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression
* Review of High-Sensitivity Tracking Techniques for Satellite Navigation Signals, A
* Review of Machine Learning Applications in Ocean Color Remote Sensing, A
* Revisiting Deep Archetypal Analysis for Phenotype Discovery in High Content Imaging
* Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation
* Revisiting Gradient-Based Uncertainty for Monocular Depth Estimation
* Revisiting Machine Unlearning with Dimensional Alignment
* Revisiting the Role of SMAP Soil Moisture Retrievals in WRF-Chem Dust Emission Simulations over the Western U.S.
* Reviving Poor Object Segmentations in OOD Medical Images using Variational-Deep-PCA Modeling on Segmentation Maps with Sampling-Free Learning
* RGB-D Video Mirror Detection
* RGB2Point: 3D Point Cloud Generation from Single RGB Images
* Ricci curvature discretizations for head pose estimation from a single image
* RiemStega: Covariance-Based Loss for Print-Proof Transmission of Data in Images
* RIGID: Recurrent GAN Inversion and Editing of Real Face Videos and Beyond
* Risk-Aware Vehicle Trajectory Prediction Under Safety-Critical Scenarios
* River Radii: A Comparative National Framework for Remote Monitoring of Environmental Change at River Mouths
* RLita: A Region-Level Image-Text Alignment Method for Remote Sensing Foundation Model
* ROADS: Robust Prompt-Driven Multi-Class Anomaly Detection Under Domain Shift
* Robot Instance Segmentation with Few Annotations for Grasping
* Robust camera-independent color chart localization using YOLO
* Robust Control of Vehicle Platoons Based on a Unified Spacing Policy
* robust framework for mapping complex cropping patterns: The first national-scale 10m map with 10 crops in China using Sentinel 1/2 images, A
* Robust GNSS/INS Tightly Coupled Positioning Using Factor Graph Optimization with P-Spline and Dynamic Prediction
* Robust InSAR-DEM Block Adjustment Method Based on Affine and Polynomial Models for Geometric Distortion, A
* Robust Long-Range Perception Against Sensor Misalignment in Autonomous Vehicles
* Robust Model Predictive Control of a Gait Rehabilitation Exoskeleton With Whole Body Motion Planning and Neuro-Dynamics Optimization
* Robust Novelty Detection Through Style-Conscious Feature Ranking
* Robust pixel-wise detection of road obstacles by integrating composite and real images
* Robust Portrait Image Matting and Depth-of-field Synthesis via Multiplane Images
* Robust Semiparametric Efficient Estimator for Time Delay and Doppler Estimation
* Robust Sequential DeepFake Detection
* Robust trajectory forecasting in autonomous systems using mixtures of Student's T-distributions with T-DistNet
* Robust variance-covariance estimation of tropospheric turbulence improves InSAR capability for monitoring of small tectonic displacements
* Robustly solving PnL problem using Clifford tori
* RoIPoly: Vectorized building outline extraction using vertex and logit embeddings
* RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior
* ROSA: Reconstructing Object Shape and Appearance Textures by Adaptive Detail Transfer
* RouteLAND: An Integrated Method and a Geoprocessing Tool for Characterizing the Dynamic Visual Landscape Along Highways
* RSAPower: Random Style Augmentation Driven Structure Perception Network for Generalized Retinal OCT Fluid Segmentation
* RSGPT: A remote sensing vision language model and benchmark
* RT-DETRv3: Real-Time End-to-End Object Detection with Hierarchical Dense Positive Supervision
* RT3DHVC: A Real-Time Human Holographic Video Conferencing System With a Consumer RGB-D Camera Array
* Rubric-Constrained Figure Skating Scoring
* Rule-Based Multi-Task Deep Learning for Highly Efficient Rice Lodging Segmentation
* S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation
* S3F2Net: Spatial-Spectral-Structural Feature Fusion Network for Hyperspectral Image and LiDAR Data Classification
* S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving
* SACNet: Saliency-Aided Aggregation Consensus Network for RGB-D Co-Salient Object Detection
* SADA: Semantic Adversarial Unsupervised Domain Adaptation for Temporal Action Localization
* SADDLe: Sharpness-Aware Decentralized Deep Learning with Heterogeneous Data
* Saliency supervised masked autoencoder pretrained salient location mining network for remote sensing image salient object detection
* SALVE: A 3D Reconstruction Benchmark of Wounds from Consumer-Grade Videos
* SAM-COD+: SAM-Guided Unified Framework for Weakly-Supervised Camouflaged Object Detection
* SAM-DA: Decoder Adapter for Efficient Medical Domain Adaptation
* SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation
* Sample selection for noisy partial label learning with interactive contrastive learning
* Sample-Cohesive Pose-Aware Contrastive Facial Representation Learning
* SAND: Enhancing Open-Set Neuron Descriptions through Spatial Awareness
* SANPO: A Scene Understanding, Accessibility and Human Navigation Dataset
* Satellite Data Revealed That the Expansion of China's Lakes Is Accompanied by Rising Temperatures and Wider Temperature Differences
* Satellite-Observed Arid Vegetation Greening and Terrestrial Water Storage Decline in the Hexi Corridor, Northwest China
* SatGS: Remote Sensing Novel View Synthesis Using Multi-Temporal Satellite Images with Appearance-Adaptive 3DGS
* SCAGAT: A scene-aware ensemble graph attention network for global PM2.5 pollution mapping via land-atmosphere interactions
* Scalable and Robust Tensor Ring Decomposition for Large-Scale Data With Missing Data and Outliers
* Scale-Aware Crowd Counting Network With Annotation Error Modeling
* Scaled robust linear embedding with adaptive neighbors preserving
* Scene-enhanced multi-scale temporal aware network for video moment retrieval
* Scene-LLM: Extending Language Model for 3D Visual Reasoning
* SCF-CIL: A Multi-Stage Regularization-Based SAR Class-Incremental Learning Method Fused with Electromagnetic Scattering Features
* Scientific Production on GPS Trajectory Clustering: A Bibliometric Analysis
* SCOT: Self-Supervised Contrastive Pretraining for Zero-Shot Compositional Retrieval
* SCRM-Net: Self-Supervised Deep Clustering Feature Representation for Urban 3D Mesh Semantic Segmentation
* SDFC-YOLO: A YOLO-Based Model With Selective Dynamic Feature Compensation for Pavement Distress Detection
* Sea Breeze-Driven Variations in Planetary Boundary Layer Height over Barrow: Insights from Meteorological and Lidar Observations
* SeaDATE: Remedy Dual-Attention Transformer With Semantic Alignment via Contrast Learning for Multimodal Object Detection
* SeaFormer++: Squeeze-Enhanced Axial Transformer for Mobile Visual Recognition
* SeaFree-GS: Reconstructing Underwater 3D Scenes With True Appearances
* Seasonal and Interannual Variations in M2 Tidal Current in Offshore Guangdong
* SeCo-INR: Semantically Conditioned Implicit Neural Representations for Improved Medical Image Super-Resolution
* Secrets of Edge-Informed Contrast Maximization for Event-Based Vision
* Secure reversible privacy protection for face multiple attribute editing
* SEDN: A Spatiotemporal Encoder-Decoder Network for End-to-End Object Removal Forgery Detection in High-Resolution Videos
* See Through Water: Heuristic Modeling Toward Color Correction for Underwater Image Enhancement
* SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark
* Seeing Eye to AI: Comparing Human Gaze and Model Attention in Video Memorability
* SegBuilder: A Semi-Automatic Annotation Tool for Segmentation
* SegDesicNet: Lightweight Semantic Segmentation in Remote Sensing with Geo-Coordinate Embeddings for Domain Adaptation
* Segment Anything Meets Point Tracking
* Segment Anything Model-Based Hyperspectral Image Classification for Small Samples
* Selecting High Forage-Yielding Alfalfa Populations in a Mediterranean Drought-Prone Environment Using High-Throughput Phenotyping
* Selection Method of Massive Point Cluster Using the Delaunay Triangulation to Support Real-Time Visualization, A
* Self-Aligning Depth-Regularized Radiance Fields for Asynchronous RGB-D Sequences
* Self-attention and frequency-augmentation for unsupervised domain adaptation in satellite image-based time series classification
* Self-Distillation Attention for Efficient and Accurate Motion Prediction in Autonomous Driving
* Self-distillation guided Semantic Knowledge Feedback network for infrared-visible image fusion
* Self-Relaxed Joint Training: Sample Selection for Severity Estimation with Ordinal Noisy Labels
* Self-Supervised Anomaly Segmentation via Diffusion Models with Dynamic Transformer UNet
* Self-Supervised Feature Contrastive Learning for Small Weak Object Detection in Remote Sensing
* Self-Supervised Incremental Learning of Object Representations from Arbitrary Image Sets
* Self-Supervised Learning with Probabilistic Density Labeling for Rainfall Probability Estimation
* Self-supervised Learning with Spectral Low-Rank Prior for Hyperspectral Image Reconstruction
* Self-supervised polarization image dehazing method via frequency domain generative adversarial networks
* Self-Supervised Pre-Training with Diffusion Model for Few-Shot Landmark Detection in X-Ray Images
* Self-supervised Shutter Unrolling with Events
* Self-Weighted Multi-View Fuzzy Clustering With Multiple Graph Learning
* SELL: A Method for Low-Light Image Enhancement by Predicting Semantic Priors
* SEM-Net: Efficient Pixel Modelling for Image Inpainting with Spatially Enhanced SSM
* Semantic Clustering of Image Retrieval Databases used for Visual Localization
* Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation
* Semantic Prompting with Image Token for Continual Learning
* Semantic Scene Completion via Semantic-Aware Guidance and Interactive Refinement Transformer
* Semantic Segmentation Method for Automated Indoor 3D Reconstruction based on Architectural-Knowledge-Aware Features
* Semantic-Guided Cross-Attention Network for Change Detection in High-Resolution Remote Sensing Images, A
* Semantically Conditioned Prompts for Visual Recognition Under Missing Modality Scenarios
* Semantically Impactful Image Manipulation Dataset: Characterizing Image Manipulations Using Semantic Significance, A
* Semi-Automatic Extraction of Hedgerows from High-Resolution Satellite Imagery
* Semi-Supervised Echocardiography Video Segmentation via Adaptive Spatio-Temporal Tensor Semantic Awareness and Memory Flow
* Semi-Supervised Object Detection for Remote Sensing Images Using Consistent Dense Pseudo-Labels
* Semiotic-Based Construction of a Large Emotional Image Dataset with Neutral Samples
* SEMU-Net: A Segmentation-Based Corrector for Fabrication Process Variations of Nanophotonics with Microscopic Images
* SenCLIP: Enhancing Zero-Shot Land-Use Mapping for Sentinel-2 with Ground-Level Prompting
* Sensitivity to the number of colors in textures defined by luminance and chromatic contrast
* SensorFlow: Sensor and Image Fused Video Stabilization
* Separating Direct and Global Components from Novel Viewpoints
* Separation of Unknown Features and Samples for Unbiased Source-free Open Set Domain Adaptation
* Session-Guided Attention in Continuous Learning With Few Samples
* Several Points Are All It Takes: Saluting User-Assisted Single Image Reflection Removal
* Severe Disturbance of Aurora on C-Band Sentinel-1 Interferogram at Mid-Latitudes: A Case Study During 11 May 2024
* SFANet: A Ground Object Spectral Feature Awareness Network for Multimodal Remote Sensing Image Semantic Segmentation
* SfM on-the-fly: A robust near real-time SfM for spatiotemporally disordered high-resolution imagery from multiple agents
* SFRADNet: Object Detection Network with Angle Fine-Tuning Under Feature Matching
* SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior
* Shadow Removal Refinement via Material-Consistent Shadow Edges
* Shape-Biased Texture Agnostic Representations for Improved Textureless and Metallic Object Detection and 6D Pose Estimation
* ShapeMorph: 3D Shape Completion via Blockwise Discrete Diffusion
* Shapley Consensus Deep Learning for Ensemble Pruning
* Shared Growth of Graph Neural Networks via Prompted Free-Direction Knowledge Distillation
* Shift Equivariant Pose Network
* SHIP: Structural Hierarchies for Instance-Dependent Partial Labels
* Sifting Through the Haystack - Efficiently Finding Rare Animal Behaviors in Large-Scale Datasets
* Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation
* Sign Language Recognition: A Large-scale Multi-view Dataset and Comprehensive Evaluation
* Signal-to-Noise Ratio Model and Imaging Performance Analysis of Photonic Integrated Interferometric System for Remote Sensing
* Significant Improvement in Short-Term Green-Tide Transport Predictions Using the XGBoost Model
* SIGNN: Star Identification Using Graph Neural Networks
* Similarity Over Factuality: Are we Making Progress on Multimodal Out-of-Context Misinformation Detection?
* Simple One-Step Multi-View Clustering With Fast Similarity and Cluster Structure Learning
* Simple-but-Effective Baseline for Training-Free Class-Agnostic Counting, A
* Simulated annealing-based text clustering
* Simulating Co-Evolution and Knowledge Transfer in Logistic Clusters Using a Multi-Agent-Based Approach
* Simulation and Assessment of Extreme Precipitation in the Pearl River Delta Based on the WRF-UCM Model
* Simulation and Sensitivity Analysis of Remote Sensing Reflectance for Optically Shallow Water Bathymetry
* Simulation of the Carbon Cycle's Spatiotemporal Dynamics in the Hangzhou Forest Ecosystem and How It Responds to Phenology
* Simultaneous planning of standpoints and routing for laser scanning of buildings with network redundancy
* Simultaneous Vibration and Nonlinearity Compensation for One-Period Triangular FMCW Ladar Signal Based on MSST
* SimuScope: Realistic Endoscopic Synthetic Dataset Generation Through Surgical Simulation and Diffusion Models
* Single source domain generalization for palm biometrics
* Single-Group Generalized RGB and RGB-D Co-Salient Object Detection
* Single-Layer Distillation with Fourier Convolutions for Texture Anomaly Detection
* Single-Source Frequency Transform for Cross-Scene Classification of Hyperspectral Image
* SinWaveFusion: Learning a single image diffusion model in wavelet domain
* Situational Scene Graph for Structured Human-Centric Situation Understanding
* Skew-probabilistic neural networks for learning from imbalanced data
* Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects
* Skyeyes: Ground Roaming using Aerial View Images
* SkyLoc: Cross-Modal Global Localization With a Sky-Looking Fish-Eye Camera and OpenStreetMap
* SkyML: A MLaaS Federation Design for Multicloud-Based Multimedia Analytics
* Sli2Vol+: Segmenting 3D Medical Images Based on an Object Estimation Guided Correspondence Flow Network
* SLIDE: A Unified Mesh and Texture Generation Framework with Enhanced Geometric Control and Multi-view Consistency
* SmartKC++: Improving Performance of Smartphone-Based Corneal Topographers
* SMDAF: A Scalable Sidewalk Material Data Acquisition Framework with Bidirectional Cross-Modal Knowledge Distillation
* Snow Cover Trends in the Chilean Andes Derived from 39 Years of Landsat Data and a Projection for the Year 2050
* Social EgoMesh Estimation
* Social-Ecological Factors and Ecosystem Service Trade-Offs/Synergies in Vegetation Change Zones of Qilian Mountain National Park During 2000-2020
* Socially-Informed Reconstruction for Pedestrian Trajectory Forecasting
* SODA: Spectral Orthogonal Decomposition Adaptation for Diffusion Models
* Soft Actor-Critic Deep Reinforcement Learning for Train Timetable Collaborative Optimization of Large-Scale Urban Rail Transit Network Under Dynamic Demand
* Soil Classification Maps for the Lower Tagus Valley Area, Portugal, Using Seismic, Geological, and Remote Sensing Data
* Soil Moisture Inversion Using Multi-Sensor Remote Sensing Data Based on Feature Selection Method and Adaptive Stacking Algorithm
* Soil Organic Carbon Prediction and Mapping in Morocco Using PRISMA Hyperspectral Imagery and Meta-Learner Model
* Solar Multimodal Transformer: Intraday Solar Irradiance Predictor Using Public Cameras and Time Series
* SoundLoc3D: Invisible 3D Sound Source Localization and Classification Using a Multimodal RGB-D Acoustic Camera
* SoundSil-DS: Deep Denoising and Segmentation of Sound-field Images with Silhouettes
* Soybean Lodging Classification and Yield Prediction Using Multimodal UAV Data Fusion and Deep Learning
* SPAC: Sampling-Based Progressive Attribute Compression for Dense Point Clouds
* Space-Time Dynamics of Mortality and Recruitment of Stems and Trees in a Seasonally Dry Tropical Forest: Effect of the 2012-2021 Droughts
* SPACE: SPAtial-Aware Consistency rEgularization for Anomaly Detection in Industrial Applications
* Spaceborne Lightweight and Compact High-Sensitivity Uncooled Infrared Remote Sensing Camera for Wildfire Detection
* SpaGBOL: Spatial-Graph-Based Orientated Localisation
* Sparse Point Clouds Assisted Learned Image Compression
* Sparse-View 3D Reconstruction of Clothed Humans via Normal Maps
* SparseTrack: Multi-Object Tracking by Performing Scene Decomposition Based on Pseudo-Depth
* Spatial Agglomeration Characteristics and Impact Factors of the Cultural and Creative Industries in Harbin
* Spatial Analysis of Urban Expansion and Energy Consumption Using Nighttime Light Data: A Comparative Study of Google Earth Engine and Traditional Methods for Improved Living Spaces
* Spatial and Temporal Characteristics of Mesoscale Eddies in the North Atlantic Ocean Based on SWOT Mission
* Spatial Downscaling of Soil Moisture Product to Generate High-Resolution Data: A Multi-Source Approach over Heterogeneous Landscapes in Kenya
* Spatial Mask-Based Adaptive Robust Training for Video Object Segmentation With Noisy Labels
* Spatial Quality Oriented Rate Control for Volumetric Video Streaming via Deep Reinforcement Learning
* Spatial Residual for Underwater Object Detection
* Spatial Shift in Flood-Drought Severity in the Decades Surrounding 2000 in Xinjiang, China, A
* Spatial-Frequency Combined Transformer for Cloud Removal of Optical Remote Sensing Images, A
* Spatially-Adaptive Hash Encodings for Neural Surface Reconstruction
* Spatio-Temporal Analysis of the Redundancies of Construction Land in the Beijing-Tianjin-Hebei Region (2000-2020)
* Spatio-Temporal Context Prompting for Zero-Shot Action Detection
* Spatio-Temporal Evolution and Susceptibility Assessment of Thaw Slumps Associated with Climate Change in the Hoh Xil Region, in the Hinterland of the Qinghai-Tibet Plateau
* Spatio-Temporal Paths and Influencing Factors of Residential Mobility in Guangzhou: A Micro-Level Perspective of Newly Employed College Graduates
* Spatio-Temporal Patterns and Drivers of the Urban Heat Island Effect in Arid and Semi-Arid Regions of Northern China
* Spatio-Temporal Representation Learning as an Alternative to Traditional Glosses in Sign Language Translation and Production, A
* Spatiotemporal Analysis and Anomalous Trends of Asia AOD (2001-2024): Insights from a Deep Learning Fusion Model and EOF Decomposition
* Spatiotemporal Analysis of Urban Vitality and Its Drivers from a Human Mobility Perspective
* Spatiotemporal Changes in China's Mangroves and Their Possible Impacts on Coastal Water Quality from 1998 to 2018
* Spatiotemporal Changes of Pine Caterpillar Infestation Risk and the Driving Effect of Habitat Factors in Northeast China
* Spatiotemporal Distribution and Evolution of Global World Cultural Heritage, 1972-2024
* Spatiotemporal Dynamics and Evolutionary Relationship Between Urbanization and Eco-Environmental Quality: A Case Study in Hangzhou City, China, The
* Spatiotemporal Dynamics and Future Projections of Carbon Use Efficiency on the Mongolian Plateau: A Remote Sensing and Machine Learning Approach
* Spatiotemporal Dynamics of Forest Carbon Sinks in China's Qinba Mountains: Insights from Sun-Induced Chlorophyll Fluorescence Remote Sensing
* Spatiotemporal Dynamics of Habitat Quality in Semi-Arid Regions: A Case Study of the West Songnen Plain, China
* Spatiotemporal Evolution and Driving Factors of Surface Urban Heat Islands: A Comparative Study of Beijing and Dalian (2003-2023), The
* Spatiotemporal Evolution of Urban Driving Factors and Seasonal Heat Island Response from the Perspective of Local Climate Zones: A Case Study of Xiamen City, China
* Spatiotemporal Fusion of Multi-Temporal MODIS and Landsat-8/9 Imagery for Enhanced Daily 30 m NDVI Reconstruction: A Case Study of the Shiyang River Basin Cropland (2022)
* Spatiotemporal Implicit Neural Representation for Unsupervised Dynamic MRI Reconstruction
* Spatiotemporal Responses of Global Vegetation Growth to Terrestrial Water Storage
* Spatiotemporal Typhoon Damage Assessment: A Multi-Task Learning Method for Location Extraction and Damage Identification from Social Media Texts
* Spatiotemporal U-Net-Based Data Preprocessing Pipeline for Sun-Synchronous Path Planning in Lunar South Polar Exploration, A
* Spatiotemporal Variability of Ozone and Nitrogen Dioxide in the Po Valley Using In Situ Measurements and Model Simulations, The
* Specific Responses to Environmental Factors Cause Discrepancy in the Link Between Solar-Induced Chlorophyll Fluorescence and Transpiration in Three Plantations
* SpectFormer: Frequency and Attention is what you need in a Vision Transformer
* Spectral Contrastive Clustering
* Speech Conv-Mamba: Selective Structured State Space Model With Temporal Dilated Convolution for Efficient Speech Separation
* Speech Enhancement: A Review of Different Deep Learning Methods
* SPIFFNet: A Statistical Prediction Interval-Guided Feature Fusion Network for SAR and Optical Image Classification
* SpiralMLP: A Lightweight Vision MLP Architecture
* Spk2ImgMamba: Spiking Camera Image Reconstruction with Multi-Scale State Space Models
* SplatFace: Gaussian Splat Face Reconstruction Leveraging an Optimizable Surface
* SpotDiffusion: A Fast Approach for Seamless Panorama Generation Over Time
* SRMA-KD: Structured relational multi-scale attention knowledge distillation for effective lightweight cardiac image segmentation
* SSegRef2Surf: Near Real-Time Photogrammetric Flood Monitoring and Refinement of Classified Water Surfaces
* SSHFormer: Optimizing Spectral Reconstruction with a Spatial-Spectral Hybrid Transformer
* SSMamba: Superpixel Segmentation With Mamba
* SSRMF: A sparse spectral reconstruction enhanced matched filter for improving point-source methane emission detection in complex terrain
* Stable Autofocus with Focal Consistency Loss
* State Space Model for Multiobject Full 3-D Information Estimation From RGB-D Images, A
* Static-Dynamic Analytical Framework for Urban Health Resilience Evaluation and Influencing Factor Exploration from the Perspective of Public Health Emergencies: Case Study of 61 Cities in Mainland China
* Statistical Analysis of Maximum Correntropy Criterion Subband Adaptive Filtering Algorithm
* STAY Diffusion: Styled Layout Diffusion Model for Diverse Layout-to-Image Generation
* STBA: Towards Evaluating the Robustness of DNNs for Query-Limited Black-Box Scenario
* Steady-State Performance Analysis of the Nearest Kronecker Product Decomposition Based LMS Adaptive Algorithm
* Steerable Conditional Diffusion for Out-of-Distribution Adaptation in Medical Image Reconstruction
* StegMamba: Distortion-Free Immune-Cover for Multi-Image Steganography With State Space Model
* STELA: Spatial-temporal enhanced learning with an anatomical graph transformer for 3D human pose estimation
* Stepwise Multi-Temporal Interferometric Synthetic Aperture Radar with Partially Coherent Scatterers for Long-Time Series Deformation Monitoring, The
* Stereo Disparity Map Refinement Method Without Training Based on Monocular Segmentation and Surface Normal, A
* STLight: A Fully Convolutional Approach for Efficient Predictive Learning by Spatio-Temporal Joint Processing
* Stochastic Behavior Modeling and Optimal Bidirectional Charging Station Deployment in EV Energy Network
* Stochastic limited memory bundle algorithm for clustering in big data
* Strategic Base Representation Learning via Feature Augmentations for Few-Shot Class Incremental Learning
* Stratified Domain Adaptation: A Progressive Self-Training Approach for Scene Text Recognition
* Stray Light Suppression Design and Test for the Jilin-1 GF04A Satellite Remote Sensing Camera
* Streamable Neural Audio Codec With Residual Scalar-Vector Quantization for Real-Time Communication, A
* Street Legibility and Sustainable Urban Development: Insights from Saudi Arabia's Addressing System
* Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images
* STRIDE: Single-Video Based Temporally Continuous Occlusion-Robust 3D Pose Estimation
* Structural Similarity-Guided Siamese U-Net Model for Detecting Changes in Snow Water Equivalent
* Structure perception and edge refinement network for monocular depth estimation
* Structure-Aware Human Body Reshaping with Adaptive Affinity-Graph Network
* Structured Human Assessment of Text-to-Image Generative Models
* Study of the Non-Linear Relationship Between Urban Morphology and Vitality in Heritage Areas Based on Multi-Source Data and Machine Learning: A Case Study of Dalian, A
* Study on an Anti-Multiple Periodic Frequency Modulation (PFM) Interference Algorithm in Single-Antenna Low-Earth-Orbit Signal-of-Opportunity Positioning Systems, A
* Study on Class Imbalance in Land Use Classification for Soil Erosion in Dry-Hot Valley Regions
* Study on Landslide Hazards Based on Multi-Source Data and GMLCM Approach, The
* Study on the Spatial Distribution Patterns and Driving Forces of Rainstorm-Induced Flash Flood in the Yarlung Tsangpo River Basin
* Style-Pro: Style-Guided Prompt Learning for Generalizable Vision-Language Models
* Subimage Autofocus Bistatic Ground Cartesian Back-Projection Algorithm for Passive Bistatic SAR Based on GEO Satellites, A
* Subjective and Objective Quality Assessment of Non-Uniformly Distorted Omnidirectional Images
* SUM: Saliency Unification Through Mamba for Visual Attention Modeling
* Summary of Recent Advances in the Literature on Machine Learning Techniques for Remote Sensing of Groundwater Dependent Ecosystems (GDEs) from Space, A
* Sun Off, Lights on: Photorealistic Monocular Nighttime Simulation for Robust Semantic Perception
* Super Fragmented Coprime Arrays for DOA Estimation
* Super-Ellipse Formation Tracking of Uncertain Vehicles: A Simplified Reinforcement Learning Energy Optimization Method
* Super-Resolution of Landsat-8 Land Surface Temperature Using Kolmogorov-Arnold Networks with PlanetScope Imagery and UAV Thermal Data
* Super-resolution supporting individual tree detection and canopy stratification using half-meter aerial data
* Supervised Information Mining From Weakly Paired Images for Breast IHC Virtual Staining
* Supervised Semantic Segmentation of Urban Area Using SAR
* Supplementary Material AnonyNoise: Anonymizing Event Data with Smart Noise to Outsmart Re-Identification and Preserve Privacy
* Surface Reflection Suppression Method for Air-Coupled SFCW GPR Systems
* Surface-Continuous Scene Representation for Light Field Depth Estimation via Planarity Prior
* Surface-Dependent Meteorological Responses to a Taklimakan Dust Event During Summer near the Northern Slope of the Tibetan Plateau
* Survey of Sampling Methods for Hyperspectral Remote Sensing: Addressing Bias Induced by Random Sampling, A
* Survival Prediction in Lung Cancer through Multi-Modal Representation Learning
* Sustainable Urban Land Management Based on Earth Observation Data: State of the Art and Trends
* SV-data2vec: Guiding Video Representation Learning with Latent Skeleton Targets
* Swap Path Network for Robust Person Search Pre-training
* Swin-delta: Gradient-Based Image Restoration from Image Sequences using Video Swin-Transformers
* SwinIA: Self-Supervised Blind-Spot Image Denoising Without Convolutions
* SwinNowcast: A Swin Transformer-Based Model for Radar-Based Precipitation Nowcasting
* SWOT-Based Intertidal Digital Elevation Model Extraction and Spatiotemporal Variation Assessment
* SymGraphAU: Prior knowledge based symbolic graph for action unit recognition
* SYNAuG: Exploiting synthetic data for data imbalance problems
* SyncDiff: Diffusion-Based Talking Head Synthesis with Bottlenecked Temporal Visual Prior for Improved Synchronization
* SyncViolinist: Music-Oriented Violin Motion Generation Based on Bowing and Fingering
* SynDRA: Synthetic Dataset for Railway Applications
* SynDroneVision: A Synthetic Dataset for Image-Based Drone Detection
* Synergising machine learning and blockchain for enhanced fraud detection
* Synergistic Impacts of Land Deformation and Rapid Socio-Ecological Changes on Disaster Risk in Indonesian Alluvial Plains Using Multiple Satellite Datasets
* Synergistic Semantic Segmentation and Height Estimation for Monocular Remote Sensing Images via Cross-Task Interaction
* Synthetic Training Datasets for Architectural Conservation: A Deep Learning Approach for Decay Detection
* Systematic Bias of Machine Learning Regression Models and Correction
* Systematic Review into the Application of Ground-Based Interferometric Radar Systems for Bridge Monitoring, A
* systematic review of intermediate fusion in multimodal deep learning for biomedical applications, A
* S^2-Transformer for Mask-Aware Hyperspectral Image Reconstruction
* T2EA: Target-Aware Taylor Expansion Approximation Network for Infrared and Visible Image Fusion
* TA-MSA: A Fine-Tuning Framework for Few-Shot Remote Sensing Scene Classification
* TACLE: Task and Class-Aware Exemplar-Free Semi-Supervised Class Incremental Learning
* TaCOS: Task-Specific Camera Optimization with Simulation
* Talking Head Anime 4: Distillation for Real-Time Performance
* TAM-VT: Transformation-Aware Multi-Scale Video Transformer for Segmentation and Tracking
* Taming a Diffusion Model to Revitalize Remote Sensing Image Super-Resolution
* Target Near-Field Scattering Measurement Technique Utilizing 3D Near-Field Imaging via Cylindrical Scanning, A
* Task Configuration Impacts Annotation Quality and Model Training Performance in Crowdsourced Image Segmentation
* Task-Adapted Learnable Embedded Quantization for Scalable Human-Machine Image Compression
* TaxaBind: A Unified Embedding Space for Ecological Applications
* Taxonomy of Sensors, Calibration and Computational Methods, and Applications of Mobile Mapping Systems: A Comprehensive Review, A
* TempA-VLP: Temporal-Aware Vision-Language Pretraining for Longitudinal Exploration in Chest X-Ray Image
* Temporal aggregation for real-time RGBT tracking via fast decision-level fusion
* Temporal and Spatial Distribution of 2022-2023 River Murray Major Flood Sediment Plume
* Temporal and Spatial Prediction of Column Dust Optical Depth Trend on Mars Based on Deep Learning
* Temporal Denoising of Infrared Images via Total Variation and Low-Rank Bidirectional Twisted Tensor Decomposition
* Temporal Dynamics in Visual Data: Analyzing the Impact of Time on Classification Accuracy
* Temporally Grounding Instructional Diagrams in Unconstrained Videos
* Temporally Streaming Audio-Visual Synchronization for Real-World Videos
* Tensor-Based Privacy Protection Scheme With Multifeature Fusion for Facial Recognition
* Terrain Segmentation Network in Wild Environments With Hybrid Plus Downsampling
* TerrAInav Sim: An Open-Source Simulation of UAV Aerial Imaging from Map-Based Data
* Test-Time Adaptation in Point Clouds: Leveraging Sampling Variation with Weight Averaging
* Test-Time Adaptation of 3D Point Clouds via Denoising Diffusion Models
* Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Vision-Language Models
* Testing the Applicability of Drone-Based Ground-Penetrating Radar for Archaeological Prospection
* Text Change Detection in Multilingual Documents Using Image Comparison
* Text Geolocation Prediction via Self-Supervised Learning
* Text-to-Image Synthesis for Domain Generalization in Face Anti-Spoofing
* Text-Video Knowledge Guided Prompting for Weakly Supervised Temporal Action Localization
* Texture, Shape and Order Matter: A New Transformer Design for Sequential DeepFake Detection
* Texture-Enhanced Deep Learning Network for Cloud Detection of GaoFen/WFV by Integrating an Object-Oriented Dynamic Threshold Labeling Method and Texture-Feature-Enhanced Attention Module, A
* TF-CorrNet: Leveraging Spatial Correlation for Continuous Speech Separation
* TFM2: Training-Free Mask Matching for Open-Vocabulary Semantic Segmentation
* Three-Dimensional Outdoor Pedestrian Road Network Map Construction Based on Crowdsourced Trajectory Data
* Through the Curved Cover: Synthesizing Cover Aberrated Scenes with Refractive Field
* Tidal flat topography mapping with Sentinel time series using cross-modal sample transfer and deep learning
* Tightly Coupled AI-ISP Vision Processor, A
* TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations
* Time Series Analysis of Vegetation Recovery After the Taum Sauk Dam Failure
* Time-Series InSAR Monitoring of Permafrost-Related Surface Deformation at Tiksi Airport: Impacts of Climate Warming and Coastal Erosion on the Northernmost Siberian Mainland
* Time-Series Modeling of Ozone Concentrations Constrained by Residual Variance in China from 2005 to 2020
* TJCMNet: An Efficient Vision-Text Joint Identity Clues Mining Network for Visible-Infrared Person Re-Identification
* TKSF-KAN: Transformer-enhanced oat yield modeling and transferability across major oat-producing regions in China using UAV multisource data
* TLDR: Text Based Last-Layer Retraining for Debiasing Image Classifiers
* TMBO-AOD: Transparent Mask Background Optimization for Accurate Object Detection in Large-Scale Remote-Sensing Images
* To Ask or Not to Ask? Detecting Absence of Information in Vision and Language Navigation
* Token Turing Machines are Efficient Vision Models
* TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm
* Topography-Land Surface Temperature Coupling: A Promising Approach for the Early Identification of Coal Seam Fire Zones
* TORE: Token Recycling in Vision Transformers for Efficient Active Visual Exploration
* Toward an Operational System for Automatically Detecting Xylella fastidiosa in Olive Groves Based on Hyperspectral and Thermal Remote Sensing Data
* Toward Human-Vehicle Collaboration for Automated Vehicles: A Review and Perspective
* Toward Long Video Understanding via Fine-Detailed Video Story Generation
* Toward Physically Stable Motion Generation: A New Paradigm of Human Pose Representation
* Towards a Theoretical Understanding of Semi-Supervised Learning Under Class Distribution Mismatch
* Towards a Training Free Approach for 3D Scene Editing
* Towards Accurate Unified Anomaly Segmentation
* Towards Expressive Spectral-Temporal Graph Neural Networks for Time Series Forecasting
* Towards Generalized Face Anti-Spoofing from a Frequency Shortcut View
* Towards High-fidelity Head Blending with Chroma Keying for Industrial Applications
* Towards human society-inspired decentralized DNN inference
* Towards Multiple-in-One Image Deraining via Scale-Aware Trident Transformer Network
* Towards on-device continual learning with Binary Neural Networks in industrial scenarios
* Towards On-the-Fly Novel Category Discovery in Dynamic Long-Tailed Distributions
* Towards Privacy-Preserving Split Learning for ControlNet
* Towards Real-Time Open-Vocabulary Video Instance Segmentation
* Towards Robust Training via Gradient-Diversified Backpropagation
* Towards Secure and Usable 3D Assets: A Novel Framework for Automatic Visible Watermarking
* Towards trustworthy image super-resolution via symmetrical and recursive artificial neural network
* Towards Unbiased Continual Learning: Avoiding Forgetting in the Presence of Spurious Correlations
* Towards Unsupervised Blind Face Restoration Using Diffusion Prior
* Towards Utilising a Range of Neural Activations for Comprehending Representational Associations
* Towards Zero-shot 3D Anomaly Localization
* TPD-STR: Text Polygon Detection with Split Transformers
* TPNet: A High-Performance and Lightweight Detector for Ship Detection in SAR Imagery
* TPP-Gaze: Modelling Gaze Dynamics in Space and Time with Neural Temporal Point Processes
* TR-Adapter: Parameter-Efficient Transfer Learning for Video Question Answering
* Trace Back and Go Ahead: Completing partial annotation for continual semantic segmentation
* TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models
* Traffic Prediction Based on Formal Concept-Enhanced Federated Graph Learning
* Training-free Medical Image Inverses via Bi-level Guided Diffusion Models
* Trajectory- and Friendship-Aware Graph Neural Network with Transformer for Next POI Recommendation
* Transcranial Photoacoustic Tomography De-Aberrated Using Boundary Elements
* Transferable-Guided Attention Is All You Need for Video Domain Adaptation
* Transferring Foundation Models for Generalizable Robotic Manipulation
* Transformer-based weakly supervised 3D human pose estimation
* Transientangelo: Few-Viewpoint Surface Reconstruction Using Single-Photon Lidar
* Translation-classification loss for SAR image understanding with deep learning
* Treading Towards Privacy-Preserving Table Structure Recognition
* TreeFormer: Single-View Plant Skeleton Estimation via Tree-Constrained Graph Generation
* TRH2TQA: Table Recognition with Hierarchical Relationships to Table Question-Answering on Business Table Images
* Tri-AFLLM: Resource-Efficient Adaptive Asynchronous Accelerated Federated LLMs
* Triple Graph Convolutional Network for Hyperspectral Image Feature Fusion and Classification
* TRNeRF: Restoring Blurry, Rolling Shutter, and Noisy Thermal Images with Neural Radiance Fields
* Tropospheric NO2 Column over Tibet Plateau According to Geostationary Environment Monitoring Spectrometer: Spatial, Seasonal, and Diurnal Variations
* TRUST: Time-Domain Residual Unsupervised Stability Technique for Improved Heart Rate Estimation
* TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On
* Tumor Synthesis Conditioned on Radiomics
* Tuned Contrastive Learning
* Two contrast phenomena inconsistent with illumination assumptions
* Two-Dimensional Successive Variational Mode Decomposition
* Two-Head Loss Function for Deep Average-K Classification, A
* Two-Stage Deep Learning Framework for Individual Tree Crown Detection and Delineation in Mixed-Wood Forests Using High-Resolution Light Detection and Ranging Data
* Two-stream transformer tracking with messengers
* Two-tiered Spatio-temporal Feature Extraction for Micro-expression Classification
* U-MixFormer: UNet-Like Transformer with Mix-Attention for Efficient Semantic Segmentation
* U-N2C: A Dual Memory-Guided Disentanglement Framework for Unsupervised System Matrix Denoising in Magnetic Particle Imaging
* UAL-Bench: The First Comprehensive Unusual Activity Localization Benchmark
* UCDR-Adapter: Exploring Adaptation of Pre-Trained Vision-Language Models for Universal Cross-Domain Retrieval
* UDMC: Unified Decision-Making and Control Framework for Urban Autonomous Driving With Motion Prediction of Traffic Participants
* UGC-Net: Uncertainty-Guided Cost Volume Optimization with Contextual Features for Satellite Stereo Matching
* uLayout: Unified Room Layout Estimation for Perspective and Panoramic Images
* Ultra-Short Baseline Synthetic Aperture Passive Positioning Based on Interferometer Assistance
* Ultra-Wide Swath Synthetic Aperture Radar Imaging System via Chaotic Frequency Modulation Signals and a Random Pulse Repetition Interval Variation Strategy, An
* UncertainBEV: Uncertainty-aware BEV fusion for roadside 3D object detection
* Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation
* Uncertainty Aware Interest Point Detection and Description
* Uncertainty Awareness Enables Efficient Labeling for Cancer Subtyping in Digital Pathology
* Uncertainty estimation using boundary prediction for medical image super-resolution
* Uncertainty-Aware Label Refinement on Hypergraphs for Personalized Federated Facial Expression Recognition
* Uncertainty-Aware Online Extrinsic Calibration: A Conformal Prediction Approach
* Uncertainty-Aware Regularization for Image-to-Image Translation
* Uncertainty-Aware Self-Knowledge Distillation
* Uncertainty-based Data-wise Label Smoothing for Calibrating Multiple Instance Learning in Histopathology Image Classification
* Uncertainty-Guided Cross Attention Ensemble Mean Teacher for Semi-Supervised Medical Image Segmentation
* Uncertainty-Guided Metric Learning Without Labels
* Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion
* Uncrewed Aerial Vehicle-Based Automatic System for Seat Belt Compliance Detection at Stop-Controlled Intersections
* Underwater image quality assessment method via the fusion of visual and structural information
* Underwater image quality evaluation via deep meta-learning: Dataset and objective method
* Underwater Image Restoration Method With Polarization Imaging Optimization Model for Poor Visible Conditions, An
* Underwater Image Restoration Through a Prior Guided Hybrid Sense Approach and Extensive Benchmark Analysis
* Underwater image restoration using Joint Local-Global Polarization Complementary Network
* Undifferenced Ambiguity Resolution for Precise Multi-GNSS Products to Support Global PPP-AR
* Undistorted and Consistent Enhancement of Automotive SAR Image via Multi-Segment-Reweighted Regularization
* UnDIVE: Generalized Underwater Video Enhancement Using Generative Priors
* Uni-SLAM: Uncertainty-Aware Neural Implicit SLAM for Real-Time Dense Indoor Scene Reconstruction
* UniAda: Domain Unifying and Adapting Network for Generalizable Medical Image Segmentation
* UniCanvas: Affordance-Aware Unified Real Image Editing via Customized Text-to-Image Generation
* Unified Deep Learning Model for Global Prediction of Aboveground Biomass, Canopy Height, and Cover from High-Resolution, Multi-Sensor Satellite Imagery
* Unified Framework for Adversarial Patch Attacks Against Visual 3D Object Detection in Autonomous Driving, A
* Unified Framework for Open-World Compositional Zero-Shot Learning
* Unified Model and Survey on Modulation Schemes for Next-Generation Automotive Radar Systems
* Unified Prompt Attack Against Text-to-Image Generation Models
* Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing
* Unifying Low-Resolution and High-Resolution Alignment by Event Cameras for Space-Time Video Super-Resolution
* UniHDSA: A unified relation prediction approach for hierarchical document structure analysis
* UniTMGE: Uniform Text-Motion Generation and Editing Model via Diffusion
* Unleash the Power of Vision-Language Models by Visual Attention Prompt and Multimodal Interaction
* Unleashing Potentials of Vision-Language Models for Zero-Shot HOI Detection
* Unmanned Aerial Vehicle-Based Hyperspectral Imaging for Potato Virus Y Detection: Machine Learning Insights
* Unpaired recurrent learning for real-world video de-hazing
* Unpaired translation of chest X-ray images for lung opacity diagnosis via adaptive activation masks and cross-domain alignment
* Unraveling Aerosol and Low-Level Cloud Interactions Under Multi-Factor Constraints at the Semi-Arid Climate and Environment Observatory of Lanzhou University
* Unsupervised anomaly detection with a temporal continuation, confidence-aware VAE-GAN
* Unsupervised Cross-Domain Polarimetric Synthetic Aperture Radar (PolSAR) Change Monitoring Based on Limited-Label Transfer Learning and Vision Transformer
* Unsupervised Denoising for Signal-Dependent and Row-Correlated Imaging Noise
* Unsupervised Domain Adaptive Visual Question Answering in the Era of Multi-Modal Large Language Models
* Unsupervised Domain Transfer for Object Classification in 3D Point Clouds via Hierarchical Prompt Learning
* Unsupervised multiplex graph representation learning via maximizing coding rate reduction
* Unsupervised Range-Nullspace Learning Prior for Multispectral Images Reconstruction
* Unsupervised Salient Object Detection on Light Field With High-Quality Synthetic Labels
* Unsupervised Semantic Segmentation of Urban Scenes via Cross-Modal Distillation
* Unsupervised Single-Image Intrinsic Image Decomposition with LiDAR Intensity Enhanced Training
* Unsupervised Video Highlight Detection by Learning from Audio and Visual Recurrence
* Unveiling the Drivers of Unplanned Urbanization: A High-Resolution Night Light Development Index Approach for Assessing Regional Inequality and Urban Growth in Dhaka
* Unveiling the Effects of Crop Rotation on Cropland Soil pH Mapping: A Remote Sensing-Based Soil Sample Grouping Strategy
* Unveiling the Spatial Inequality of Accessibility to High-Quality Healthcare Resources in the Beijing-Tianjin-Hebei Urban Agglomeration of China: A Focus on the Impacts of Intercity Patient Mobility
* Unveiling the Spatial Variation in Ecosystem Services Interactions and Their Drivers Within the National Key Ecological Function Zones, China
* Urban Functional Zone Classification Based on High-Resolution Remote Sensing Imagery and Nighttime Light Imagery
* Urban Sprawl Monitoring by VHR Images Using Active Contour Loss and Improved U-Net with Mix Transformer Encoders
* Urban Street Network Configuration and Property Crime: An Empirical Multivariate Case Study
* Urban-Rural Education Divide: A GIS-Based Assessment of the Spatial Accessibility of High Schools in Romania, The
* Use and Effectiveness of Chatbots as Support Tools in GIS Programming Course Assignments
* Use of Radiative Transfer Model for Inter-Satellite Microwave Radiometer Calibration
* Use of Tropospheric Delay in GNSS-Based Climate Monitoring: A Review
* User-in-the-Loop Evaluation of Multimodal LLMs for Activity Assistance
* Using Geodetic Data to Monitor Hydrological Drought at Different Spatial Scales: A Case Study of Brazil and the Amazon Basin
* Using Multi-Angular Spectral Reflection of Dorsiventral Leaves to Improve the Transferability of PLSR Models for Estimating Leaf Biochemical Traits
* Using Pleiades Satellite Imagery to Monitor Multi-Annual Coastal Dune Morphological Changes
* USWformer: Efficient Sparse Wavelet Transformer for Underwater Image Enhancement
* Utilizing Uncertainty in 2D Pose Detectors for Probabilistic 3D Human Mesh Recovery
* UW-GS: Distractor-Aware 3D Gaussian Splatting for Enhanced Underwater Scene Reconstruction
* V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations
* VADet: Multi-Frame LiDAR 3D Object Detection Using Variable Aggregation
* Valid: Variable-Length Input Diffusion for Novel View Synthesis
* Validation of Inland Water Surface Elevation from SWOT Satellite Products: A Case Study in the Middle and Lower Reaches of the Yangtze River
* Variational Bayes Image Restoration With Compressive Autoencoders
* Variational Bayesian Inference Theory of Elasticity and Its Mixed Probabilistic Finite Element Method for Inverse Deformation Solutions in Any Dimension, A
* VDD: Varied Drone Dataset for semantic segmentation
* VecMapLocNet: Vision-based UAV localization using vector maps in GNSS-denied environments
* vector quantized masked autoencoder for audiovisual speech emotion recognition, A
* Vegetation Growth Changes and Their Constraining Effects on Ecosystem Services Under Ecological Restoration in the Shendong Mining Area
* Vegetation Restoration Outpaces Climate Change in Driving Evapotranspiration in the Wuding River Basin
* Vehicle Trajectory Prediction by Integrating Data-Driven and Knowledge-Guided Technique
* VerA: Versatile Anonymization Applicable to Clinical Facial Photographs
* Verriest Lecture: Color vision from pixels to objects, The
* Versatile and Differentiable Hand-Object Interaction Representation, A
* VFM-Depth: Leveraging Vision Foundation Model for Self-Supervised Monocular Depth Estimation
* VG-SSL: Benchmarking Self-Supervised Representation Learning Approaches for Visual Geo-Localization
* VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors
* Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval, A
* Video summarization with temporal-channel visual transformer
* Videogamebunny: Towards Vision Assistants for Video Games
* Viewport-Independent Blind Quality Assessment of AI-Generated Omnidirectional Images via Vision-Language Correspondence
* VIIS: Visible and Infrared Information Synthesis for Severe Low-Light Image Enhancement
* VILLS: Video-Image Learning to Learn Semantics for Person Re-Identification
* VioPose: Violin Performance 4D Pose Estimation by Hierarchical Audiovisual Inference
* VipDiff: Towards Coherent and Diverse Video Inpainting via Training-Free Denoising Diffusion Models
* Viscoelastic Cluster-Constrained PBD-Based Soft Tissue Behavior and Interactive Media Applications for Surgical Simulation
* Visible-Infrared Person Re-Identification With Real-World Label Noise
* Vision Mamba Distillation for Low-Resolution Fine-Grained Image Classification
* Vision-Aware Text Features in Referring Image Segmentation: From Object Understanding to Context Understanding
* Vision-Based Driving Decision Making Using Multi-Action Deep Q Network
* Vision-Based Landing Guidance Through Tracking and Orientation Estimation
* Vision-language foundation model for generalizable nasal disease diagnosis using unlabeled endoscopic records
* vision-language foundation model-based multi-modal retrieval-augmented generation framework for remote sensing lithological recognition, A
* Vision-Language Meets the Skeleton: Progressively Distillation With Cross-Modal Knowledge for 3D Action Representation Learning
* VISIONARY: Novel Spatial-Spectral Attention Mechanism for Hyperspectral Image Denoising
* Visual cues for moisture perception of facial skin: a pilot study on the effects of enhancing high-spatial-frequency components of skin lightness to decrease perceived moisture levels in young Asian observers
* Visual fidelity and full-scale interaction driven network for infrared and visible image fusion
* Visual Prompt Learning of Foundation Models for Post-Disaster Damage Evaluation
* Visual Robustness Benchmark for Visual Question Answering (VQA)
* VisualFusion: Enhancing Blog Content with Advanced Infographic Pipeline
* ViT-KAN Synergistic Fusion: A Novel Framework for Parameter-Efficient Multi-Band PolSAR Land Cover Classification
* VLTP: Vision-Language Guided Token Pruning for Task-Oriented Segmentation
* VM-Gait: Multi-Modal 3D Representation Based on Virtual Marker for Gait Recognition
* VMAs: Video-to-Music Generation via Semantic Alignment in Web Music Videos
* Volcanic Activity Classification Through Semi-Supervised Learning Applied to Satellite Radiance Time Series
* Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images
* VortSDF: 3D Modeling with Centroidal Voronoi Tessellation on Signed Distance Field
* Voxel and deep learning based depth complementation for transparent objects
* Voxel-Based Path Planning for Autonomous Vehicles in Parking Lots
* Vulnerability Assessment of Charging Stations in the Electrified Road Network
* W-Net: A facial feature-guided face super-resolution network
* WAFFLE: Multimodal Floorplan Understanding in the Wild
* Walking to Public Transport: Rethinking Catchment Areas Considering Topography and Surrogate Buffers
* WARLearn: Weather-Adaptive Representation Learning
* WaterVG: Waterway Visual Grounding Based on Text-Guided Vision and mmWave Radar
* Wave-based cross-phase representation for weakly supervised classification
* Waveform Design Using Cauchy-Schwarz Divergence for Target Detection
* Waveform Optimization for Enhancing the Performance of a Scanning Imaging Radar Utilizing a Terahertz Metamaterial Antenna
* Wavelength- and Depth-Aware Deep Image Prior for Blind Hyperspectral Imagery Deblurring with Coarse Depth Guidance
* Weakly supervised camouflaged object detection based on the SAM model and mask guidance
* Webly Supervised Fine-Grained Classification by Integrally Tackling Noises and Subtle Differences
* WeedsGalore: A Multispectral and Multitemporal UAV-Based Dataset for Crop and Weed Segmentation in Agricultural Maize Fields
* Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers
* Wheel-GINS: A GNSS/INS Integrated Navigation System With a Wheel-Mounted IMU
* When Aware Haze Density Meets Diffusion Model for Synthetic-to-Real Dehazing
* When Cars Meet Drones: Hyperbolic Federated Learning for Source-Free Domain Adaptation in Adverse Weather
* When Time Prevails: The Perils of Overlooking Temporal Landscape Evolution in Landslide Susceptibility Predictions
* When Visual State Space Model Meets Backdoor Attacks
* Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers
* Who Brings the Frisbee: Probing Hidden Hallucination Factors in Large Vision-Language Model via Causality Analysis
* Wholly-WOOD: Wholly Leveraging Diversified-Quality Labels for Weakly-Supervised Oriented Object Detection
* WiGNet: Windowed Vision Graph Neural Network
* WINE: Wavelet-Guided GAN Inversion and Editing for High-Fidelity Refinement
* WTDBNet: A Wavelet Transform-Based Dual-Stream Backbone Network for Fine-Grained Ship Detection
* XPose: Towards Extreme Low Light Hand Pose Estimation
* XR-MBT: Multi-Modal Full Body Tracking for XR Through Self-Supervision with Learned Depth Point Cloud Registration
* YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-Time Object Detection
* YOLOv5_CDB: A Global Wind Turbine Detection Framework Integrating CBAM and DBSCAN
* ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset
* Zero-Shot Class Unlearning in CLIP with Synthetic Samples
* Zero-Shot Detection of Out-of-Context Objects Using Foundation Models
* Zerocomp: Zero-Shot Object Compositing from Image Intrinsics via Diffusion
2365 for 2505