2601
* 2D DOA Estimation of Coherent Signals Exploiting Moving Uniform Rectangular Array
* 2D-3D Attention and Entropy for Pose Robust 2D Facial Recognition
* 3D Deep-Learning-Based Segmentation of Human Skin Sweat Glands and Their 3D Morphological Response to Temperature Variations
* 3D Face Reconstruction Error Decomposed: A Modular Benchmark for Fair and Fast Method Evaluation
* 3D Gaussian Splatting Reconstruction from Simulated CT Projections with Geometric Initialization
* 3D human mesh recovery: Comparative review, models, and prospects
* 3D human pose estimation based on a Hybrid approach of Transformer and GCN-Former
* 3D Magnetic Inverse Routine for Single-Segment Magnetic Field Images
* 3D Mesh Convolution-Based Autoencoder for Geometry Compression, A
* 3D Multi-Object Tracking Driven by Multi-Level Association and Intelligent Filtering
* 3D point cloud classification network with hybrid sampling enhancement and point energy attention
* 3D Trajectory and Pickup/Drop-Off Strategy for UAV-Enabled Delivery: Trade-Off Between Time and Energy Minimization
* 3D-sLSCI: Three-Dimensional Surface Laser Speckle Contrast Imaging of Blood Flow Using Phase-Shifting Profilometry
* Diffusion for Layout Control in Text to Image Generation (H4)
* Face Restoration, Facial Image Restoration (H4)
* Non Line of Sight Imaging (H2)
* Screen Content Coding, Compression (H4)
* U-Net, Convolutional Neural Networks (H4)
* A3-TTA: Adaptive Anchor Alignment Test-Time Adaptation for Image Segmentation
* Absolute Radiometric Calibration of CAS500-1/AEISS-C: Reflectance-Based Vicarious Calibration and Cross-Calibration with Sentinel-2/MSI
* ACC: Alternating Complementary Colors for Display Energy Reduction
* Accelerating inter-frame prediction in Versatile Video Coding via deep learning-based mode selection
* Action-to-Action Diffusion Network for Weakly Supervised Temporal Action Localization
* ActRecognition-GPT: Utilizing Multimodal Large Language Models for Spatiotemporal Action Recognition in Nursery Videos
* AdaMulti: An adaptive cascaded multi-modal recognition framework for sports action analysis
* Adapting Foundation Features via Cross-View Contrastive Learning for Unseen Object Pose Estimation
* Adaptive Batch Size Time Evolving Stochastic Gradient Descent for Federated Learning
* Adaptive Fault-Tolerant Perimeter Control for Two-Region Networks With Actuator Faults
* Adaptive fuzzy feature selection with the fusion of data distribution information
* Adaptive Hierarchical Feature Difference Auto-Encoder for Robust RGB-T Object Tracking
* Adaptive High-Frequency Preprocessing for Video Coding
* Adaptive Multi-Modal Visual Tracking With Dynamic Semantic Prompts
* Adaptive Multimodal Fusion via Attention-Guided Feature Selection for Histopathology Image Classification
* Adaptive Non-Linear Graph Filter in Semi-Supervised Graph Based Classification, An
* Adaptive Nonuniform Map Re-weighting Method for Image Dehazing, An
* Adaptive recursive channel selection for robust decoding of motor imagery EEG signal in patients with intracerebral hemorrhage
* Adaptive risk-aware reinforcement learning for safe navigation of unmanned systems
* Adaptive Segmentation-Based Initialization for Steered Mixture of Experts Image Regression
* Adaptive Sequential Bayesian Iterative Learning for Myocardial Motion Estimation on Cardiac Image Sequences
* Adaptive Smoothing of Non-Rectangular Prediction Block Edges in the Wedge Mode of AVM
* Adaptive sparse contrastive learning for unsupervised object re-identification
* Adaptive Voxelization for Transform Coding of 3D Gaussian Splatting Data
* AdaptiveFusion: Adaptive Multi-Modal Multi-View Fusion for 3D Human Body Reconstruction
* ADNet: Delving into generalizable deepfake detection via adaptive expert selection and discrepancy learning
* Advanced Sensor Analytics and Extreme Value Modeling: Dichotomizing Day-Night Variability in Rear-End Collisions on Expressways
* Advancements in Medical Image Classification Through Fine-Tuning Natural Domain Foundation Models
* Advancing Chinese Lip Reading through Contextual Enhancement
* Advancing Limited-Angle CT Reconstruction through Diffusion-Based Sinogram Completion
* Adversarial flow-based generative models for visible-to-Infrared person re-Identification
* Adversarial Image Purification by Explaining Adversarial Detectors
* AeroGen: Ground-to-Air Generalization for Action Recognition
* AesPrompt: Zero-Shot Image Aesthetics Assessment With Multi-Granularity Aesthetic Prompt Learning
* Affective Image Editing: Shaping Emotional Factors via Text Descriptions
* AffectVLM: Contrastive Language-Image Learning with Augmented Textual Prompts for 3D/4D Facial Expression Recognition Using Vision-Language Model
* Affine Correspondences Between Multi-Camera Systems for Relative Pose Estimation
* Afmunet: Adaptive Filter-Based Frequency Modulation UNET For OCTA Segmentation
* AGAFNet: Adaptive Gated Attention Fusion Network for Accurate Nuclei Segmentation and Classification in Histology Images
* AgeDB-30M Dataset: Melanated Faces for Age-Invariant Face Recognition, The
* Aggressive Rejection with Adaptive Gradient for Contaminated Data
* AGVOT: Visual Object Tracking via Cooperation of Aerial and Ground Views
* AI and Smart Sensors Usher in a New Era in Patient Care
* AI-Based Real-Time Fight Detection Through CCTV Cameras
* AI-Driven Vehicle Damage Detection: A Saliency-Based Segmentation Approach
* Air Brake Model With Electronically Controlled Pneumatic for Heavy-Haul Trains, An
* ALCER3D: Adaptive Learning Constraints for Enhanced Retrieval of Complex Indoor 3D Scenarios
* ALDA: Enhancing the transferability of adversarial attacks with attention-guided look-ahead and data augmentation
* Algorithm for Extracting Bathymetry from ICESat-2 Data That Employs Structure and Density Using Concentric Ellipses, An
* Aligning computational and human perceptions of image complexity: A dual-task framework for prediction and localization
* Almost-Surely Convergent Randomly Activated Monotone Operator Splitting Methods
* ALPSB: Adaptive learngene with plastic and stable branches
* Alternative Cardinal Spline for Cubic B-Spline Interpolation, An
* Ambiguity-Aware Point Cloud Segmentation by Adaptive Margin Contrastive Learning
* Analysis of Human Perception in Distinguishing Real and AI-Generated Faces: An Eye-Tracking Based Study
* Analysis of image aesthetics assessment as a positive-unlabelled problem
* Analysis of Image Domain Characteristics of Maritime Rotating Ships for Spaceborne Multichannel SAR
* Analytical Implementation of the Rosenblatt Transformation, An
* Analytical Reconstruction of Human-Scale Dark-Field CT
* Anatomical Attention Alignment Representation for Radiology Report Generation
* Anchor-Based Gravity Alignment for Panoramas
* Anchor-ViT: Spatially-Focused Vision Transformer for Distracted Driving Detection
* Anisotropic Cross-View Texture Transfer With Multi-Reference Non-Local Attention for CT Slice Interpolation, An
* Anti-FT: Towards Practical Deep Leakage From Gradients
* AOD-RSE: Improved Dehazing Network for Object Detection Models in Self-Driving Scenarios
* ARaBIQA: A Novel Blind Image Quality Assessment Model for Augmented Reality
* Arbitrary-scale atmospheric downscaling with mixture of implicit neural networks trained on fixed-scale data
* Are Foundation Models All You Need for Zero-shot Face Presentation Attack Detection?
* Are Minimal Radial Distortion Solvers Really Necessary for Relative Pose Estimation?
* ASK-HOI: Affordance-Scene Knowledge Prompting for Human-Object Interaction Detection
* ASM-DiffConvNet: Physics-Guided Difference Convolution Network for Single-Image Restoration
* Assessing ICESat-2's Capability for Global Mangrove Forest Canopy Measurements
* Assessing the Impact of T-Mart Adjacency Effect Correction on Turbidity Retrieval from Landsat 8/9 and Sentinel-2 Imagery (Case Study: St. Lawrence River, Canada)
* Assessing the Performance of Multiple Satellite-Based Evapotranspiration Models over Tropical Forests
* Astrophotography Turbulence Mitigation Via Generative Models
* Asymmetric modal fusion for multi-modal crowd counting
* Asymmetric Strip Transformer With Position Vectors Embedding for Lane Detection
* AT-PMF: Progressive multi-modal fusion with adversarial training for physiological emotion recognition
* Atmospheric Weighted Average Temperature Enhancement Model for the European Region Considering Daily Variations and Residual Changes in Surface Temperature
* Attention and Mamba-Driven Quality Assessment for Underwater Images
* Attention and mask-guided context fusion network for camouflaged object detection
* Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs
* Attention-driven refinement network for continuity-preserving airway segmentation in class-imbalanced CT
* Attention-Gated U-Net for Robust Cross-Domain Plastic Waste Segmentation Using a UAV-Based Hyperspectral SWIR Sensor
* Attention-Guided Band Pruning for Efficient Hyperspectral Early Grape Leaf Disease Detection
* AttentiveSfP: Leveraging Dualpool-Former and attention mechanisms for accurate shape from polarization
* Attribute-Centric Cross-Modal Alignment for Weakly Supervised Text-Based Person Re-ID
* Attribute-Specified Generation And Style-Transfer Diffusion For Face Recognition Enhancement
* AU-EMO Correlation based zero-shot facial expression recognition with graph convolutional network
* Audio Visual Segmentation through Text Embeddings
* Audio-Guided Video Scene Editing
* Auto-DBPA: Density-Aware Ball-Pivoting Algorithm With Adaptive Radius Using Contextual Bandits for Object and Scene Reconstruction
* Automated Activity Monitoring of Cryptic Species in a Zoo Environment
* Automated Calculation of Rice-Lodging Rates Within a Parcel Area in a Mobile Environment Using Aerial Imagery
* Automated Detection of Submerged Sandbar Crest Using Sentinel-2 Imagery
* Automatic Choroid Segmentation and Thickness Measurement Based on Mixed Attention-Guided Multiscale Feature Fusion Network
* Automatic Extrinsic Calibration Method for mmWave Radar and Camera in Traffic Environment, An
* Automatic Insect Pest Identification and Recognition for Paddy Crops Pest Control
* Automatic Turkish Image Captioning Using Non-Native Deep Caption Generator Models and Neural Machine Translators
* Autoregression-Free Video Prediction Using Diffusion Model for Mitigating Error Propagation
* Avoiding Bias While Pruning Neural Networks: The Case of Image Classification
* AWFusion: An adaptive end-to-end wave-based method for infrared and visible image fusion
* Axial Sphere Loss: Encouraging Open-Space Risk Minimization in Face Identification Tasks
* Background Suppression by Multivariate Gaussian Denoising Diffusion Model for Hyperspectral Target Detection
* BAIT: A New DNN Backdoor Attack Using Inpainted Triggers
* Balancing Optimization Strategies and Practical Goals: An Efficient Scene Text Detector
* BAM: Backdoor defense based on adversarial mitigation
* Batch-Aware Active Learning for Object Detection
* BayesAdapter: Enhanced Uncertainty Estimation in CLIP Few-Shot Adaptation
* Bayesian high-order tensor factorization for learning the hidden low-rank structure
* Bayesian Hybrid Attention Module for Underwater Acoustic Target Recognition, A
* Bayesian Multifractal Image Segmentation
* Bayesian Optimization Based Deep Learning Models for Detection of Forest Fires
* Bayesian Surprise for Small and Sub-Pixel Moving Target Detection
* BCFNet: Bi-temporal collaborative fusion network for multi-modal humor detection
* BD Open LULC Map: High-Resolution Land Use Land Cover Mapping and Benchmarking For Urban Development In Dhaka, Bangladesh
* Benchmark and Evaluation for Real-World Out-of-Distribution Detection Using Vision-Language Models, A
* Benchmark Dataset for Automated Diagnosis and Treatment Planning of Class III Malocclusion Using X-Rays and Profile Photos, A
* Benchmarking MSWEP Precipitation Accuracy in Arid Zones Against Traditional and Satellite Measurements
* Beta Wavelet Induced Multi-Scale Kernel Clustering: A Frequency-Aware Framework for Complex Data Analysis
* beta-DARTS++: Bi-Level Regularization for Proxy-Robust Differentiable Architecture Search
* Bevanet: Bilateral Efficient Visual Attention Network for Real-Time Semantic Segmentation
* Beyond deceptive flatness: Dual-order solution for strengthening adversarial transferability
* Beyond Deep Learning: Agentic AI Framework for Object Detection
* Beyond FACS: Data-driven Facial Expression Dictionaries, with Application to Predicting Autism
* Beyond Meme Templates: Limitations of Visual Similarity Measures in Meme Matching
* Beyond non-expert demonstrations: Outcome-driven action constraint for offline reinforcement learning
* Beyond pillars: Advancing 3D object detection with salient voxel enhancement of liDAR-4D radar fusion
* Beyond Spatial Domain: Multi-View Geo-Localization with Frequency-Based Positive-Incentive Information Screening
* Beyond Static Fusion: A Mixture-of-Experts Framework for Multimodal Breast Cancer Classification
* Bi-Grid Reconstruction for Image Anomaly Detection
* BIAN: Bidirectional interwoven attention network for retinal OCT image classification
* Biased Aerosol Wet Deposition CAM5 Simulations: A Result of Misrepresented Convective-Stratiform Precipitation Partitioning When Benchmarked Against SPCAM
* Bicycle Travel Time Estimation via Dual Graph-Based Neural Networks
* Bicycledualnet: Bicyclegan-Powered Dual Encoder Network for Single Image 3D Reconstruction
* Bidirectional Flow Fields for Sparse Input Novel View Synthesis of Dynamic Scenes
* Bimodal beta mixture distribution for enhanced OOD inner-differentiation in multi-class text classification
* BioGaze: a Framework for Evaluating the Photographic Requirements of the ISO/IEC 39794-5 Standard
* BioVL-QR: Egocentric Biochemical Vision-And-Language Dataset Using Micro QR Codes
* Bits-to-Photon: End-to-End Learned Scalable Point Cloud Compression for Direct Rendering
* Blaze: A Dataset For Wildfire And Burnt Area UAV Image Classification And Segmentation
* Blind compressed image diffusion restoration based on content prior and dense residual connection driven transformer
* Blind Denoising Using Dense in Dense Network with Attention Module
* Blind Multi-Mode Ptychography using a Distributed Probe Estimate
* BLPNet: Boosting localization perception network for foveal avascular zone segmentation
* Boosted Affine Motion Compensation For Geometric Partitioning Mode
* Boosting Active Prompt Learning via Discriminative Self-Training Dual-Curriculum Learning
* Boosting Dataset Distillation With the Assistance of Crucial Samples for Visual Learning
* Boosting Faithful Multi-Modal LLMs via Complementary Visual Grounding
* Boosting Text-To-Image Person Re-Identification With Generative Hard Negative
* Boosting the patch-based self-supervised learning through past-to-present smoothing
* Boosting Tiny Face Detection in Videos with an Integral Score Framework
* Bootstrap Deep Spectral Clustering With Optimal Transport
* Boundary mutual information hashing for cross-modal retrieval
* Bounds on the Natarajan dimension of a class of linear multi-class predictors
* BRACTIVE: A Brain Activation Approach to Human Visual Brain Learning
* Brain foundation models with hypergraph dynamic adapter for brain disease analysis
* Brain3D: Generating 3D Objects from fMRI
* Branch-Splitter multi-granularity feature fusion for local joint-angle estimation
* Breaking Redundancy via 3D Sparse Geometry: 3D-aware Neural Compression for Multi-View Videos
* BreathAI: Transfer Learning-Based Thermal Imaging for Automated Breathing Pattern Recognition
* Bridging Domain Shifts Through Self-Contrastive Learning And Distribution Alignment
* Brief Analysis of the Change Detector by Kervrann et al., A
* BSA-Dehaze: Multi-Scale Bitemporal Fusion and Size-Aware Decoder for Unsupervised Image Dehazing
* BSRPCA: A Simplified Blind Super-Resolved RPCA-Based Approach for Enhancing Blood Flow Estimation
* BTDGNet: A Dual-Guided Camouflaged Object Detection Network Leveraging Boundary and Texture Information
* BuildFunc-MoE: An Adaptive Multimodal Mixture-of-Experts Network for Fine-Grained Building Function Identification
* BVSR-EvD: Blurry Video Space-Time Super-Resolution With Events via Diffusion Models
* CADOT: Cityscape Aerial Image Dataset For Object Detection
* CAE-Net: Generalized deepfake image detection using convolution and attention mechanisms with spatial and frequency domain features
* CAFCL: Class-aware flow-based contrastive learning for out-of-distribution detection
* CAG: Context-Conditional 2D Affordance Generation
* CAIT: Triple-Win Compression Toward High Accuracy, Fast Inference, and Favorable Transferability for ViTs
* Calibrated mixup for imbalanced regression on tabular data
* Calibration of Sparse LiDAR and Camera Based on Spatial Feature Analysis and Adaptive Constraints
* CAM-HRNet: A Multi-Scale Parallel Structure Change Detection Framework for Forest Land
* Camera Pose Estimation in Multi-Object Scenes Using Ray Diffusion and Point Cloud Alignment
* CAMN-FSOD: Class-aware memory network for few-shot infrared object detection
* Can Large Language Models Challenge CNNs in Medical Image Analysis?
* Can Pose Transfer Models Generate Realistic Human Motion?
* CAS-AIR-3D: A Large-scale Low-quality Multi-modal Face Database
* Cas-OVD: Cascaded Open-Vocabulary Detection of Small Objects Using Multi-Refined Region Proposal Network in Autonomous Driving
* Cascaded Local-Nonlocal Pansharpening with Adaptive Channel-Kernel Convolution and Multi-Scale Large-Kernel Attention
* Category-Dependent Learned Image Compression for Smartphone Photography with Standard-Compliant Decoders
* Causal Interventional Prompt Tuning for Few-Shot Out-of-Distribution Generalization
* Causal-Ex: Causal graph-based micro and macro expression spotting
* Causality-Driven Explainable Multimodal Fusion With Visual-Text Parallel Computing for Cloth-Changing Pedestrian Re-Identification
* Causality-Inspired Debiasing Learning for Open World Object Detection
* CauSkelNet: Causal Representation Learning for Human Behaviour Analysis
* CCAI-YOLO: A High-Precision Synthetic Aperture Radar Ship Detection Model Based on YOLOv8n Algorithm
* CDCGM: Composition-specified Dance Choreography Generation from Music
* CDTFusion: Crossing Domain and Task for Infrared and Visible Image Fusion
* CEA-Net: A multi-modal model for corn disease classification with dynamic fusion and cross-layer connection mechanism
* Certainty and Uncertainty Guided Active Domain Adaptation
* CFFCNet: Center-Guided Feature Fusion Completion for Accurate Vehicle Localization and Dimension Estimation from Lidar Point Clouds
* CFFN: Cascaded Feature Fusion Network for Facial Expression Recognition
* CFSM: A Novel Causal Feature Selection Module for Two-Dimensional Out-of-Distribution Generalization
* CGD-MAE: Clip Distillation-Driven Pre-Training Framework for Vehicle Re-Identification
* Chagas Parasite Semi-Supervised Classification in Blood Sample Images Using Different Deep Learning
* Channel-Wise 1D Convolutional U-Net
* Chasing Shadows: Solving Deepfake Detection Benchmarks Using Irrelevant Features Only
* ChatPPG: Computational Analysis and Statistics of Table Tennis Games
* Chlorophyll Retrieval in Sun Glint Region Based on VIIRS Rayleigh-Corrected Reflectance
* CHTMAE: Cross-Modal Hierarchical Temporal-Spatial Masked Autoencoder Model for Micro-Expression Recognition
* CHUG: Crowdsourced User-Generated HDR Video Quality Dataset
* City-Level Pavement Distress Inspection Using Crowdsourced Data of Logistics Vehicles
* CLAFusion: Misaligned infrared and visible image fusion based on contrastive learning and collaborative attention
* Class-aware prototype augmentation and decoupled feature distillation for class-incremental learning
* Class-specific feature reconstruction with pseudo-label for open-set HRRP recognition
* CLIP-AE: Clip-Assisted Cross-View Audio-Visual Enhancement for Unsupervised Temporal Action Localization
* CLIP-driven rain perception: Adaptive deraining with pattern-aware network routing and mask-guided cross-attention
* CLIP-FSQAE: Clip-Guided Finite Scalar Quantized Autoencoder for Few-Shot Anomaly Detection
* CLIP-HandID: Vision-Language Model for Hand-Based Person Identification
* CLIP-SENet: CLIP-Based Semantic Enhancement Network for Vehicle Re-Identification
* CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution
* CLIP-WSDDN: An optimized weakly supervised object detection network with zero supervised classification prior
* Close-to-Optimal Counter Histogram-Based Forensics Using Mean Structural Similarity Index Metric
* Cloud Optical Thickness Retrievals Using Angle Invariant Attention Based Deep Learning Models
* Cluster Contrast for Unsupervised Visual Representation Learning
* Cluster-aware prompt ensemble learning for few-shot vision-language model adaptation
* Clustering Diffusion Model With Frequency-Signal Modulation for Variational Graph Autoencoders
* CMALDD-PTAF: Cross-modal adversarial learning for deepfake detection by leveraging pre-trained models and cross-attention fusion
* CMANet: Context-Aware Mutual Attention Network for Referring Image Segmentation
* CMI-Net: Cross-View Message Token Interaction Network for 3D Shape Recognition
* CMIP: Combining Constructive Model With Improvement Policy for Large-Scale Min-Max Multiple Traveling Salesman Problem
* CMP: Composable Meta Prompt for Sam-Based Cross-Domain Few-Shot Segmentation
* CMTM: Cross-Modal Token Modulation for Unsupervised Video Object Segmentation
* CNN-Based 360° Scene Recognition for Automatic Generation of Omnidirectional Scent Effects
* Co-HSC: Complementary image-mesh fusion for dense human-scene contact estimation
* Coastal Zone Imager Sargassum Index Model Reveals the Change Details of Sargassum in Coastal Waters of China
* COBRA: A Continual Learning Approach to Vision-Brain Understanding
* Coding-Information Based Improvement For In-Loop Filters Beyond VVC
* COFP: A Collaborative Optimization Framework With Polyhedral Feature Extraction for Multi-Weather Image Restoration
* Cognitive and Memory-Driven EEG-Based Authentication: A Multi-Session Approach to Secure Biometric Systems
* Cognitive Task Virtualization for Alzheimer's Diagnosis Using Realistic VR Simulation
* Collaborative Computation in Integrated Sensing, Communication, and Computation System for Autonomous Driving
* Collaborative Learning of Augmentation and Disentanglement for Semi-Supervised Domain Generalized Medical Image Segmentation
* Collision attack and error corrected multimodal-inspired framework for underwater video enhancement
* Color Is Not Enough: Dataset and Method for Identifying Relevant Traffic Lights in Driving Scenes
* ColorGPT: Automatic Colorization with Generative Prompts and Transformer
* Combination Test of NNVC Tools and NN-Inter In VVC
* Combining EEG and MRI in a Multimodal Approach for Parkinson's Disease Detection
* Combining Ground Penetrating Radar and a Terrestrial Laser Scanner to Constrain EM Velocity: A Novel Approach for Masonry Wall Characterization in Cultural Heritage Applications
* Combining In Situ and Remote-Sensing Data to Assess the Spatial Pattern and Changes of Major Grassland Types in Xinjiang, China, Under Climate Change Scenarios
* Combining short-term and long-term memory for robust visual tracking
* Common and Unique Representation Deep Embedded Clustering
* Communication Efficient Over-the-Air Federated Learning With Random FLARE Algorithm
* Communication-Efficient FL With Hybrid Aggregation for the CAVs Over Multiple BSs
* Compact exploration for continuous action reinforcement learning
* Compact Latent Representation for Image Compression (CLRIC)
* Compact Polarimetric CTLR Mode Calibration Method Immune to Faraday Rotation Using Two Dihedral Reflectors, A
* Comparative Analysis of Automatic Speech Recognition Fine-Tuning Strategies for Speech From Cochlear Implant Users
* Comparative Analysis of IR-VIS Image Fusion Methods: Object Detection with YOLO Architectures and Fusion Quality Evaluation, A
* Comparative Assessment of Eight Satellite Precipitation Products over the Complex Terrain of the Lower Yarlung Zangpo Basin: Performance Evaluation and Topographic Influence Analysis
* Comparative Study of DINOv2, I-JEPA, and ViT Embeddings for Unsupervised Anomaly Detection
* Comparison and Evaluation of Multi-Source Evapotranspiration Datasets in the Yarlung Zangbo River Basin
* Comparison of the unmodified Rytov method and the modified Rytov method in obtaining scintillations in various strongly turbulent media
* Comparison of Visual Trackers for Biomechanical Analysis of Running
* Compositional and Mineralogical Diversity of Jezero Western Fan, Mars, Revealed by Elemental Observations
* comprehensive analysis of Mamba for 3D volumetric medical image segmentation, A
* comprehensive approach for image quality assessment using quality-centric embedding and ranking networks, A
* Comprehensive Benchmark for Evaluating Night-time Visual Object Tracking, A
* comprehensive survey of image clustering based on deep learning, A
* Comprehensive Survey of Transformers in Text Recognition: Techniques, Challenges, and Future Directions, A
* Compressed image super-resolution based on invertible degradation and restoration
* Compressing Human Body Video with Interactive Semantics: A Generative Approach
* Compressing Multi-Scale Features with a Channel-Shrinked Single-Branch Architecture
* computationally efficient framework leveraging auxiliary head features for robust cloth-changing person re-identification, A
* Concentration Inequalities for Semidefinite Least Squares Based on Data
* Concept-Based Explanation for Deep Vision Models: A Comprehensive Survey on Techniques, Taxonomy, Applications, and Recent Advances
* Conditional Diffusion Transformer for Unified Distortion Correction and Rectification
* Conditional GAN for Time-to-Peak (TTP) Generation from Non-Contrast MRI Modalities
* Confidence-Aware Agglomeration Classification And Segmentation Of 2D Microscopic Food Crystal Images*
* Confidence-Based Sampling Strategy for Dense Temporal Token Learning in Thermal Infrared Object Tracking, A
* configurable global context reconstruction hybrid detector for enhanced small object detection in UAV aerial imagery, A
* Conformal Compressors
* consistency regularization training method for automatic modulation classification under incomplete information, A
* Consistent Connected Operators Based on Trees of Shapes
* Consistent View Synthesis with Bidirectional Epipolar Attention and Reconstruction
* Constrained GAN-Generated X-Ray CT Data For Self-Supervised And Foundation-Model Segmentation Of Concrete Microstructures
* Constructing adaptive spatial-frequency interactive network with bi-directional adapter for generalizable face forgery detection
* Context-Assisted Low-Light Face Detection through Global and Local Image Enhancement
* Context-aware 3D CNN for action recognition based on semantic segmentation (CARS)
* Context-Aware and Semantic-Synergistic Linguistic Steganalysis for Social Networks
* Context-Aware Simulation with Machine Vision for Industrial Safety
* Context-Based Screening of Autism Risk in Children
* Context-Dependent Anomaly Action Recognition
* Contextloss: Context Information for Topology-Preserving Segmentation
* Continual deep multi-view clustering via contrastive knowledge replay
* Continuous Action Unit Intensity Modeling for Micro-Expression Recognition
* Contourlet Refinement Gate Framework for Thermal Spectrum Distribution Regularized Infrared Image Super-Resolution
* Contrastive Diversity Augmentation for Single Domain Generalization
* Contrastive Learning-Based Deep Embedded Clustering and the TCN-DMAttention Model for Traffic Congestion Prediction
* ConvFuse: A Progressive Convformer Network for Context-Aware Multisensor Image Fusion
* CONXA: A CONvnext and CROSS-attention combination network for Semantic Edge Detection
* Cooperative Perception of Multi-Agents Under the Spatio-Temporal Drift Issue
* CorDis: A Novel Correlation-Based Disentanglement Measure
* Corn Plant Detection Using YOLOv9 Across Different Soil Background Colors, Growth Stages, and UAV Flight Heights
* Cosine Network for Image Super-Resolution, A
* Cost-Efficient Approach to Managing Simultaneous Charging Sessions in Large-Scale EV Stations, A
* COT-AD: Cotton Analysis Dataset
* Coupled Diffusion Posterior Sampling for Unsupervised Hyperspectral and Multispectral Images Fusion
* CQR-UC: A color QR code-based underwater wireless communication method with GAN-based image enhancement
* CRAFT: Contextual Re-Activation of Filters for face recognition Training
* CRB-NCE: An adaptable cohesion rule-based approach to number of clusters estimation
* Critical Contour Prior-Guided Graph Learning With Pose Calibration for Identity-Aware Deepfake Detection
* Cromdbn: Dynamic brain network analysis for neuropsychiatric diseases classification via multi-knowledge integration
* Cross-attention relation network based on metric learning for few-shot specific emitter identification
* Cross-distance near-infrared face recognition
* Cross-Domain Adversarial Structural Deformation Model for Post-Disaster Image Generation
* Cross-Domain Analysis of Cybersickness and Motion Sickness Mitigation Strategies
* Cross-Domain detection of AI-Generated text: Integrating linguistic richness and lexical pair dispersion via deep learning
* Cross-Domain Feature Fusion Network for Nighttime Drone-View Object Detection, A
* Cross-domain Few-shot Classification via Invariant-content Feature Reconstruction
* Cross-Domain Image Steganalysis Based on Frequency-Domain Alignment and Feature Calibration
* Cross-Frequency Attention and Color Contrast Constraint for Remote Sensing Dehazing
* Cross-Modal Attention Guided Enhanced Fusion Network for RGB-T Tracking
* Cross-Modal Attention with Adaptive and Hierarchical Fusion for Robust RGB-T Image Segmentation for Safe Driving
* Cross-modal Emotion-specific Attention model for Multimodal Emotion Recognition
* Cross-Modal Spherical Aggregation for Weakly Supervised Remote Sensing Shadow Removal
* Cross-Modality Abdominal Multi-Organ Segmentation via Source-Free Unsupervised Domain Adaptation
* Cross-Modality Feature Aggregation for Cross-Domain Point Cloud Representation Learning
* Cross-modality masked autoencoder for infrared and visible image fusion
* Cross-Model Face Recognition via Guided Alignment Mutual Decoupled Distillation
* Cross-Regional Detection and Precise GIS Localization of Old Landslides Using High-Resolution Remote Sensing Imagery and YOLOv5
* Cross-scale adaptive transformer with hierarchical feature synergy for aerial small object detection
* Crossdr: Bridging 2D And 3D Features For Diabetic Retinopathy Classification Using Context-Aware Cross-Attention
* CrowdMoGen: Event-Driven Collective Human Motion Generation
* CRPE-Net: Infrared small target detection transformer with cross-layer relative-position embedding
* CS-TRD: a Cross-Section Tree Ring Detection Method
* CSCA: Channel-specific information contrast and aggregation for weakly supervised semantic segmentation
* CTU-Level Rate Control with lambda Optimization Based on Visual Gaze Mechanism for 360-Degree Versatile Video Coding
* Curvature Dedicated Architecture Using Swin Transformers for RGB-D Object Recognition
* Curve: Clip-Utilized Reinforcement Learning for Visual Image Enhancement via Simple Image Processing
* Curvi-Tracker: Curvilinear structure segmentation refinement by iterative tracking
* Custom Condition Generation for Zero-Shot Human-Scene Interactions Synthesis
* CWC-DNERF: Compact Dynamic Neural Radiance Field VIA Discrete Wavelet Transform And Learnable Codebooks
* Cybersickness in VR: State-Of-The-Art and Future Research Agenda
* Cytofusion: A Latent Diffusion-Based Framework for Cytology Classification
* D2TR: Sea Clutter Suppression via Dynamic Dual-Tree Complex Wavelet Selection and Target-Guided Regularization
* Daafnet: Domain Adaptive Augmented Feature Network for Biosignal based Emotion Recognition
* DAE-YOLO: Remote Sensing Small Object Detection Method Integrating YOLO and State Space Models
* DAF-Mamba: Dynamic selective and adaptive fused mamba for cardiac image segmentation
* DAIRNet: Degradation-aware All-in-one Image Restoration Network with cross-channel feature interaction
* DANIM: Domain adaptation network with intermediate domain masking for night-time scene parsing
* Dark Count Removal in Photon-Counting SPAD Arrays
* Darts: Deformable Animation Ready Templates for Clothing Humans
* DAS-Accelerometer Data Fusion With Semi-Supervised Graph Variational Autoencoder for In-Service Train Wheel Flat Detection
* Data-Dependent Rectangular Bounding Processes
* Data-driven bayesian-guided activation functions for multi-task pattern recognition
* Data-Driven Recursive Intra Prediction
* Data-Efficient Semi-Supervised Few-Shot Speaker Verification via Prototype Space Optimization
* DBF-Net: A Dual-Branch Network with Feature Fusion for Ultrasound Image Segmentation
* DBIDM: Implementing blind image separation through a dual branch interactive diffusion model
* dc-GAN: Dual-Conditioned GAN for Face Demorphing From a Single Morph
* DCART: A dual contrastive alignment residual transformer model for visual grounding
* DCM-VideoNet: A Densely-Connected Modulated Decoder Framework for Implicit Neural Video Compression
* DCMAE: A dual-branch contrastive masked autoencoder for 3D object detection
* DCTFormer: A Dual-Branch Transformer With Cloze Tests for Video Anomaly Detection
* DDPTA: Zero-Shot Learning for Skeleton-Based Action Recognition
* Dead Zone Mitigation in Vehicular Platoon via Solar Panel as a Communication Receiver
* Debiasing Framework For Attribute Binding In Diffusion-Based Text-To-Image Generation, A
* Deblurring Images by Huber Lasso
* DECFusion: A lightweight decomposition fusion method for luminance artifact removal in infrared and visible images
* Decision-Making and Planning for Intelligent Vehicle Considering Human Factors: Methods, Challenges, and Prospects
* Decoding Emotions: How Graph Transformer with Adaptive Graph Structure Learning Understands Micro-Expressions
* Decoding UAV Scenes: A Novel Framework for Deep Semantic Segmentation Using U-Net and Transformer Hybrids
* Decoupled self-supervised deep multi-task learning framework for subscriber portrait in smart meter
* Decoupling augmentation bias in prompt learning for vision-language models
* Decoupling representation learning and classifier for long-tailed adversarial training
* Deep CNN Face Matchers Inherently Support Revocable Biometric Templates
* Deep color constancy via a color shift aware conditional diffusion model
* Deep contrastive graph clustering with information preservation
* Deep Learning Based Crab Classification for Marine Pest Monitoring
* Deep Learning Multimodal Fusion-Based Method for Cell and Nucleus Segmentation, A
* Deep Learning Training Framework for Solving Data Explosion Problem in DOA Estimation
* Deep Learning-Based 3D Ocean Current Reconstruction Improved by Vertical Temperature and Salinity
* Deep Learning-Based Automated Diagnosis for Breast Cancer Classification Using Mammogram Analysis
* Deep learning-based detection of autism spectrum disorder and emotion recognition in children
* Deep Learning-Based Diffraction Identification and Uncertainty-Aware Adaptive Weighting for GNSS Positioning in Occluded Environments
* Deep Learning-Based Segmentation of Hysteroscopic Images for Early Detection of Endometrial Cancer
* Deep Low Light Image Enhancement via Multi-Task Learning of Few Shot Exposure Imaging
* Deep Neural Network Parameter Selection via Dataset Similarity Under Meta-Learning Framework
* Deep No-Reference Quality Assessment for Underwater Enhanced Images
* Deep Object Recognition-Based Analysis of Diverse Culinary Landscapes
* Deep positional encoders for graph classification
* Deep Reinforcement Learning-Based Task Offloading With Collaborative Inference in UAV-Assisted Mobile Edge Computing Networks
* Deep Semantic Tuplet-Based Hashing by Hypergraph Modeling for Cross-Modal Retrieval
* Deep semi-supervised learning method based on sample adaptive weights and discriminative feature learning
* Deep semi-supervised relation preserving learning model
* Deep Spectral Analytics Based Soil Nutrient Prediction Using Spatial-Semantic Feature Embedding with Prototype-Guided Perturbation
* Deep Unfolding-Based Image Reconstruction For Quanta Image Sensors
* Deep Unsupervised Despeckling With Unbiased Risk Estimation
* Deep Vision of Mobility for Urban Detection From Street-View Projections
* Deep-Learning Based Quality Assessment in Adaptive Optics Ophthalmoscopy Images
* Deeper Insights Into Deep Graph Convolutional Networks: Stability and Generalization
* DeepFake Detection With Multi-View Fusion and Graph Convolutional Network
* Deforestation Monitoring for Mongolia's Forest-Steppe Ecoregion Through Satellite Images
* deformable registration framework for brain MR images based on a dual-channel fusion strategy using GMamba, A
* Deformable Shape Registration from Inexact Correspondences
* Deformable Spherical Geometry Transformer For Panoramic Semantic Segmentation
* DEGAN-CS: An efficient code search model based on dataenhanced optimization of generative adversarial networks
* Degradation accordant plug-and-play for low-rank tensor recovery
* Degradation-Aware Prompted Transformer for Unified Medical Image Restoration
* Delineating the Distribution Outline of Populus euphratica in the Mainstream Area of the Tarim River Using Multi-Source Thematic Classification Data
* Delving into Pre-training for Domain Transfer: A Broad Study of Pre-training for Domain Generalization and Domain Adaptation
* Delving Into the Secrets of BEV 3D Object Detection in Autonomous Driving: A Comprehensive Survey
* Denoising-enhanced pancreatic segmentation using diverse kernel mutual adaptive learning
* Density-sorted prediction set: Efficient conformal prediction for multi-target regression
* Depth error points optimization for 3D Gaussian Splatting in few-shot synthesis
* Depth-Aware Scoring and Hierarchical Alignment for Multiple Object Tracking
* Depth-Aware YOLO Segmentation: Enhancing Small Object Detection via MiDaS-Based Spatial Reasoning
* Describing Land Cover Changes via Multi-Temporal Remote Sensing Image Captioning Using LLM, ViT, and LoRA
* Design and Optimization of a Hybrid VLC/THz Infrastructure-to-Vehicle Communication System for Intelligent Transportation
* DeskTransfer: Predicting Multi-Scenario Video Stream Throughput in Cloud Desktop Based on Transfer Autoencoder
* Detecting And Mitigating Incoherent Input Of Latent Diffusion Models
* Detecting human-object interactions with image category-guided and query denoising
* Detection Algorithm with Multi-Scale Reconstruction and Feature Compensation for Small and Occluded Objects, A
* Detection and Monitoring of Volcanic Islands in Tonga from Sentinel-2 Data
* Detection of Pavement Defects on Roads using a Multimodal YOLOv8 with Image and IMU Data
* Detection of Screen Usage During Eating Events Among Preschool-Aged Children
* detector-free feature matching method with dual-frequency transformer, A
* Developing distance-based genetic programming classifiers by reconstructing datasets for imbalanced binary classification
* DFT Gaze: Distilled and Fine-Tuned Gaze Estimation for Personalization on Tiny Devices
* DGRGaze: A Difference-Guided Gaze Estimation Framework Based on 6D Rotation Matrix Representation
* DI-Net: Decomposed implicit garment transfer network for digital clothed 3D human
* Dictionary-Based Block Term Decomposition for Third-Order Tensors
* DictRoadNet: A Dictionary-Based RNN With Road Network Module for GPS Trajectory Completion
* Diff-Def: Diffusion-Generated Deformation Fields for Conditional Atlases
* DiffDeMorph: Extending Reference-Free Demorphing to Unseen Faces
* DiffProb: Data Pruning for Face Recognition
* DiffProtect: Generative adversarial examples using diffusion models for facial privacy protection
* Diffuse and Refine Latent Prior with Transformers For Neural ISP
* DIFFUSE2ADAPT: Controlled Diffusion for Synthetic-to-Real Domain Adaptation
* Diffusion Based Shape-Aware Learning with Multi-Scale Context for Segmentation of Tibiofemoral Knee Joint Tissues: an End-to-End Approach
* Diffusion Model for Virtual Try-On Systems, A
* Diffusion Pretraining for Gait Recognition in the Wild
* Diffusion to Confusion: Naturalistic Adversarial Patch Generation Based on Diffusion Model for Object Detector
* Diffusion-Based CT Image Segmentation for Intracerebral Hemorrhage
* DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment
* DiffusionMixNet: Leveraging Foundation Models and Contrastive Learning for Semi-Supervised Polyp Segmentation
* DiffW: Multi-Encoder Based on Conditional Diffusion Model for Robust Image Watermarking
* DiMo: Diffusion transformers for monocular human motion estimation in the world system
* DIRE: Enhancing Facial Expression Recognition Through Domain-Invariant Representation Learning for Robust Generalization
* Direction-Emphasizing Transformer for Road Extraction From Optical Remote Sensing Imagery
* DISCO: A Diffusion Model For Spatial Transcriptomics Data Completion
* Discourse-Aware Language Representation
* Discrete Diffusion Propagated Transformer For Flexible Ureteroscopic Semantic Segmentation
* Discrete-Phase Waveform Design for Desired Ambiguity Functions in Pulse-Doppler MIMO Radar
* Discriminative Image Feature Extraction for Traffic Sign Detection in Road Inspection
* DisenEmo: Learning disentangled emotional representation from facial motion for 3D talking head generation
* Disentangled Denoising and Counterfactual Balance for Multimodal Recommendation
* Disentangled Source-Free Personalization for Facial Expression Recognition with Neutral Target Data
* Dissecting Human Body Representations in Deep Networks Trained for Person Identification
* Distinct Polyp Generator Network for polyp segmentation
* Distortion Classification in Computer Vision Applications: Current Progress, Challenges, and Perspectives
* Distractor suppression Siamese network with task-aware attention for visual tracking
* Distributed Adaptive Tracking Control of an Underactuated High-Speed Train With Completely Unknown System Parameters
* DIVA-VQA: Detecting Inter-Frame Variations in UGC Video Quality
* DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks
* Diversifying Human Pose In Synthetic Data For Aerial-View Human Detection
* Diversity covariance-aware prompt learning for vision-language models
* Diversity-Driven Generative Dataset Distillation Based on Diffusion Model with Self-Adaptive Memory
* Divide-and-Summarize: Enhancing Deep Neural Video Summarization
* DLP-YOLOv9: Model with Fewer Parameters and Higher Precision Based on Improved YOLOv9 in Drone-Captured-Scenarios
* DM-DPR: Diffusion and Mamba-based Degradation Prediction for Blind Face Restoration
* DM-FNet: Unified Multimodal Medical Image Fusion via Diffusion Process-Trained Encoder-Decoder
* DMSO: A Dynamic Momentum-Smoothing Optimizer for Learned Image Compression
* DNMF-AG: A Sparse Deep NMF Model with Adversarial Graph Regularization for Hyperspectral Unmixing
* DOA Estimation Exploiting Compressive Measurements With Mixed-ADCs
* Does Brain Network Construction Choice Matter? an Empirical Study of Individual Networks from Static FDG-PET for Alzheimer's Diagnosis
* Does Hyperspectral Imagery Improve Satellite-Derived Bathymetry? A Case Study from a Posidonia oceanica-Dominated Mediterranean Region
* Does noise in the knowledge graph really harm recommendations?
* Domain divergence minimization for unsupervised domain adaptation cross-modality medical image segmentation
* Domain Generalization for Face Anti-Spoofing via Content-Aware Composite Prompt Engineering
* Domain Transfer Generative Model for New Face Generation
* DP-Net: A 3D Dilated Projection Framework For Precise Fetal Brain Tissue Segmentation
* DPL: Spatial-Conditioned Diffusion Prototype Enhancement for One-Shot Medical Segmentation
* DPM-CLIP: Zero-Shot Multimodal Egocentric Activity Recognition based on Dual-Prediction Mechanism
* DPtSTrip: Adversarially robust learning with distance-aware point-to-set triplet loss
* DR-IAL: Decoupling-to-recoupling guided interaction-aware learning for egocentric action recognition
* Driver State Classification: Identifying High Cognitive Load and Drowsiness Through Driver Performance and Physiology
* Driving Decision-Making at Freeway Weaving Segments Using Relational Graph Attention Network and Deep Reinforcement Learning
* DSCIL: Dynamic selected contrastive instance learning for weakly supervised video anomaly detection
* DSDFormer: An Innovative Transformer-Mamba Framework for Robust High-Precision Driver Distraction Identification
* DSDP: Real-Time Asymmetric Dual-Stream Instance Segmentation Embedding Depth-Predictive Architecture for Enhanced Scene Understanding
* DSFace: Conditional Diffusion Inpainting for Sketch-to-Face Synthesis
* DSFusion: A dual-branch step-by-step fusion network for medical image fusion
* DT-JRD: Deep Transformer-Based Just Recognizable Difference Prediction Model for Video Coding for Machines
* DTFPNet: Temporal and frequency dynamic graph neural network for time series classification
* DTL: Parameter- and Memory-Efficient Disentangled Vision Learning
* DTLS-Inpaint: Yet Another Efficient Image Inpainting with Domain Transfer
* DTSI: Towards faster convergence of query-based detectors for rotated dense aerial images
* DU-Net: A Dual U-Net for semantic text-guided style transfer
* dual branch graphic text detection network based on progressive Domain adaptation, A
* Dual Consistency Matching for Semi-Supervised Semantic Correspondence
* Dual Cross-Image Semantic Consistency with Self-Aware Pseudo Labeling for Semi-Supervised Medical Image Segmentation
* Dual Stream Networks for 3d Human Pose and Shape Estimation
* Dual-Attention based prompt generation and catalyzing for instance-wise continual learning
* Dual-Branch Partial Annotation Learning for Facial Attributes Recognition
* Dual-domain homogeneous fusion with cross-modal mamba and progressive decoder for 3D object detection
* Dual-functional fractal-fractional Sobel operator for efficient image enhancement and edge detection
* Dual-Graph Transformer with Contextual-Support Nodes for Interpersonal Relationship Recognition
* Dual-Level Attention Relearning for Cross-Modality Rotated Object Detection in UAV RGB-Thermal Imagery
* Dual-perception prompt learning: Illumination-adaptive and semantic-aware guidance for backlit image enhancement
* Dual-Reward Guided 2D Mapping Generation Network for JPEG Reversible Data Hiding, A
* Dual-Stream Spatio-Temporal Accident Anticipation and Detection
* DualAdaptNet: Enhancing domain adaptation regression with error accumulation reduction and dual-head heterogeneous learning
* DV-Net: Detecting and Distinguishing Copy-Move Regions From Dual Views
* DWOA-BNC: Discrete whale optimization algorithm for Bayesian network classifier learning and its application
* DXA-Net: Dual-Task Cross-Lingual Alignment Network for Zero-Shot Cross-Lingual Spoken Language Understanding
* DyGLNet: Hybrid global-local feature fusion with dynamic upsampling for medical image segmentation
* Dynamic 3D Gaussian Reconstruction with Specular Reflection
* Dynamic Clustering-driven weakly-supervised online hashing with enhanced similarity
* dynamic hybrid network with attention and mamba for image captioning, A
* Dynamic MAsk-Pruning Strategy for Source-Free Model Intellectual Property Protection
* Dynamic Mesh Coding Using Edge Length-Based Adaptive Subdivision
* Dynamic Mesh Coding With Temporally Consistent UV Atlas Generation
* Dynamic Multi-Level Feature Alignment Method for Domain Adaptive Driver Distraction Detection, A
* Dynamic PET Image Reconstruction via Non-Negative INR Factorization
* Dynamic prototype with discriminative representation for rapid adaptation in new organ segmentation
* Dynamic Spatiotemporal Graph Convolutional Neural Network Based on Congestion Propagation for Traffic Prediction
* Dynamic Visual Speaking Patterns: You Are the Way You Speak
* E2GenF: Universal AIGC image detection based on edge enhanced generalizable features
* E2IGB: Enhanced effective-information-guided class-balanced loss for long-tailed object recognition
* E2MPL: An Enduring and Efficient Meta Prompt Learning Framework for Few-Shot Unsupervised Domain Adaptation
* EAFvision: Real-Time Automated Safety Surveillance in Electric Arc Furnaces Using Deep Learning Models
* EAR-MM: An Efficient Adaptive and Robust Algorithm for Streaming Map Matching
* Early pneumoconiosis recognition from CT via progressive lesion awareness and multi-axis denoising attention mechanisms
* Echocardiogram to CMR Image Synthesis using Generative Models
* Edge Feature Inclusive Variational Graph Autoencoder for Pet-Driven Alzheimer's Diagnosis, An
* Edge-Guided Monocular Absolute Depth Estimation with Diffusion-Based Refinement
* EdgeRegNet: Edge Feature-Based Multimodal Registration Network Between Images and LiDAR Point Clouds
* EdinoGait: Transferring Large Visual Models to Event-Based Vision for Enhancing Gait Recognition
* EEG-driven natural image reconstruction with regional semantic awareness
* EF2lane: Enhanced Feature Fusion 2D Lane Detection Network In 3d Point Cloud
* Effect of Desert Dust Intrusion on the Detection of Marine Heatwaves
* Effective face recognition from video using enhanced social collie optimization-based deep convolutional neural network technique
* Effects of Facial Hair on Face Recognition
* Efficient 6DoF pose estimation for multi-instance objects from a single image
* Efficient and Robust Video Virtual Try-On via Enhanced Multi-Garment Alignment
* Efficient Approximation of Earth Mover's Distance Based on Nearest Neighbor Search
* Efficient Asymmetric Shared Low-Rank Adaptation Based on Selective Scanning Vision Mamba for Medical Imaging Analysis
* Efficient Atlas Generation for Medical Imaging Via Groupwise Latent Diffusion Models
* Efficient Constraining of Transcoding in DNA-Based Image Storage
* Efficient CRB Estimation for Linear Models via Expectation Propagation and Monte Carlo Sampling
* Efficient Feature-Guided Approach for Image Restoration
* Efficient High-Fidelity Global Low-Rank Optimization for Multispectral Demosaicing
* Efficient Implicit Neural Representations for Videos with Feature Modulation
* Efficient industrial anomaly detection via cross-scale distillation with enhanced feature compression
* Efficient Leaf Disease Classification and Segmentation Using Midpoint Normalization Technique and Attention Mechanism
* efficient loop and clique coarsening algorithm for graph classification, An
* Efficient motion-centric CLIP for compressed video action recognition
* Efficient Oriented Object Detection via Wavelet-Based Energy Label Reassignment and Dual Prediction Strategy
* Efficient Random Access Method Using Seed and Inter-Key Frames for Next Generation Video Codec
* Efficient spectral embedding representation approximation for large-scale data clustering
* Efficient Text-to-Image Generation: An Adaptive Step Schedule Controller for Diffusion Models
* Efficient Topology-Aware Motion Planning for AVP in Large-Scale Occupancy Map
* Efficient vision-based occupancy prediction with knowledge distillation
* Efficient4D: Fast Dynamic 3D Object Generation from a Single-view Video
* EICSeg: Universal Medical Image Segmentation via Explicit In-Context Learning
* Elevation-Dependent Glacier Albedo Modelling Using Machine Learning and a Multi-Algorithm Satellite Approach in Svalbard
* EM-Based Multi-Object Tracking With Strong Association Constraints
* Embracing the Power of Known Class Bias in Open Set Recognition from a Reconstruction Perspective
* EmoEEG: A transferable generalist framework for EEG emotion recognition via information bottleneck theory
* EmoGene: Audio-Driven Emotional 3D Talking-Head Generation
* Emomamba: Advancing Dynamic Facial Expression Recognition with Visual and Textual Fusion
* Empathic Risk Companion: Multimodal Vision-Language Fusion with Emotion Prediction Error for Decision Support
* Enabling Controllable, Identity Preserving, Non-Rigid Edits in Human-Centric Images
* Enact: Entropy-Based Clustering of Attention Input for Reducing the Computational Needs of Object Detection Transformers
* End-to-End Automated Screening of Lordosis-Kyphosis-Scoliosis and Vertebral Compression in Salmon X-Ray Images
* End-to-End Full-Page Optical Music Recognition for Pianoform Sheet Music
* End-to-end interactive joint model: Clause-phrase multi-task learning for suicidal ideation cause extraction (SICE) in Chinese Weibo text
* End-to-End Microexpression Detection Using 3D Convolution and LSTM
* end-to-end pipeline for team-aware, pose-aligned augmented reality in cycling broadcasts, An
* End-to-end susceptibility-induced distortion correction for diffusion MRI with unsupervised deep learning
* Energy Efficiency of Video Quality Assessment Metrics
* Energy-Based Distortion-Balancing Parameterization for Open Surfaces
* Energy-Based Generative Models with Morphological Attention Networks for Hyperspectral Image Classification: a Unified Framework
* Enforcing Cooperative Safety for Reinforcement Learning-Based Mixed-Autonomy Platoon Control
* Enhanced Adaptive Confidence Margin for Semi-Supervised Facial Expression Recognition, An
* Enhanced CycleGAN to Derive Temporally Continuous NDVI from Sentinel-1 SAR Images, An
* Enhanced Emphysema Classification in CT Images Using RIU4-LQP and Spatial Texture Features
* Enhanced facial expression manipulation through domain-aware transformation and dual-level classification with expression awarness loss in the CLIP space
* Enhanced Frame Context Initialization for Video Coding Beyond AV1
* Enhanced Geometry and Semantics for Camera-Based 3D Semantic Scene Completion
* Enhanced Graph Convolutional Network with Chebyshev Spectral Graph and Graph Attention for Autism Spectrum Disorder Classification
* Enhanced Multi-Scale Network for Single Image Super-Resolution
* Enhanced Multi-Scale PoseNet for Self-Supervised Monocular Depth Estimation
* Enhanced Small Object Detection Using Multi-Scale Attention for Automated Seabird Detection
* Enhancing 3D point cloud generation via Mamba-based time-varying denoising diffusion
* Enhancing 3D Scene Representation with Structural Dissimilarity-Aware Learning
* Enhancing Adversarial Robustness of Foundation Models Without Data Centralization
* Enhancing Autonomous Driving Perception Under Complex Weather Conditions Through Cyclegan-Based Driving Scene Generation
* Enhancing Brain Source Reconstruction by Initializing 3-D Neural Networks With Physical Inverse Solutions
* Enhancing Breast Cancer Detection Using Multistage Transformer with Positional Encoding and Feature Fusion
* Enhancing CNN-Based Blind Image Quality Assessment via Deep Cross-Layer Pattern Encoding
* Enhancing Domain Generalisability for Lung Nodule Detection: A Hybrid Strategy with Multi-Source Training and MixStyle
* Enhancing graph neural networks on SPD manifolds via cholesky decomposition
* Enhancing Image Deraining Through VLM-Based Data Refinement and Classification
* Enhancing local attention with global information interaction via progressive cluster propagation
* Enhancing Machine Learning-Based GPP Upscaling Error Correction: An Equidistant Sampling Method with Optimized Step Size and Intervals
* Enhancing Medical Vision-Language Models with Rich Textual Descriptions and Multiple Alignments for Chest X-Ray Diagnosis
* Enhancing Multi-Task Learning with Attention Mechanisms
* Enhancing Multiscale Feature Representation For Object-Level Recognition In Masked Image Modeling
* Enhancing point cloud feature representation via historical node state increments in graph neural networks
* Enhancing precipitation detection: A multi-sensor approach using conditional GANs and recurrent networks
* Enhancing skin cancer classification with Soft Attention and genetic algorithm-optimized ensemble learning
* Enhancing spatio-temporal zero-shot action recognition with language-driven description attributes
* Enhancing Unsupervised Domain Adaptation in Semantic Segmentation Through Selective Consensus and Gaussian Mixture Model-Based Pseudo-Labeling
* Enhancing visual inertial odometry with efficient dynamic PerceptionNet and consistency improvement fusion
* Enhancing Visual Question Answering Via Clustered In-Context Sequence Configuration
* Enhancing Visual Re-Ranking Through Denoising Nearest Neighbor Graph via Continuous CRF
* Enhancing VMamba for change detection via lightweight feature interaction and selection
* Enhancing Wide-Angle VR Video Transmission Using Human Perception
* Enhancing Zero-Shot Object-Goal Visual Navigation with target context and appearance awareness
* Entropy-aware dynamic bias watermarking for LLM-generated emotional content
* EPDiff: Erasure Perception Diffusion Model for Unsupervised Anomaly Detection in Preoperative Multimodal Images
* EQUR: Equivariant Uncertainty Quantification and Refinement for Point Cloud Registration
* Erp-Aware Text-To-360 Panorama Diffusion Model
* ErpGS: Equirectangular Image Rendering Enhanced with 3D Gaussian Regularization
* Error Correction for DNA-Based Image Storage
* ESGN-YOLO: Enhancing Multi-Scale Small Object Detection via Efficient Feature Fusion and Adaptive Spatial Modeling
* Estimating Cloud Base Height via Shadow-Based Remote Sensing
* Estimating Soil Moisture Using Multimodal Remote Sensing and Transfer Optimization Techniques
* Estimating Virtual Camera FOV to Reduce Perspective Shape Distortion in 2D-to-3D Face Reconstruction
* Estimation of Object Volume in Aqueous Food Media Using Surface Electric Potential and Neural Network Regression
* Eswindnet: Image Demoiréing Using Multiscale Swin Transformer Layers
* ETLight: An Evolution Transformer for Efficient Traffic Signal Control
* Evaluating Data Quality and Preprocessing Methods to Enhance Skeleton-Based Action Recognition in Retail Environments
* Evaluating Direct Georeferencing of UAV-LiDAR Data Through QGIS Tools: An Application to a Coastal Area
* Evaluating Human Perception of Automatically Created Synthetic Road Networks that Integrate Real-World Cost Factors and Terrain Features
* Evaluating Spherical Gaussian Fuzzy Sets in Image Enhancement
* Evaluating Students' Attention: A Deep Learning Approach
* Evaluating the effect of image quantity on Gaussian Splatting: A statistical perspective
* Evaluating the Hydrological Applicability of Satellite Precipitation Products Using a Differentiable, Physics-Based Hydrological Model in the Xiangjiang River Basin, China
* Evaluation of Urban Nighttime Light Environment Safety Using Integrated Remote Sensing and Perception Modeling
* Event Denoising Based on Iterative Tree-Structured Information Aggregation
* Event-Based Egocentric Human Pose Estimation in Dynamic Environment
* Event-Guided Motion Deblurring with Wavelet-Based Cross-Modal Feature Fusion
* Event-Triggered Regulation of Mixed-Autonomy Traffic Under Varying Traffic Conditions
* EventEgoHands: Event-Based Egocentric 3D Hand Mesh Reconstruction
* Evidence Conflict Sampling for Open-set Active Learning
* EvLight++: Low-Light Video Enhancement With an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More
* EXDF: Explainable Deepfake Detection with Vision-Language Model
* Expanding on the BRIAR Dataset: A Comprehensive Whole Body Biometric Recognition Resource at Extreme Distances and Real-World Scenarios (Collections 1-4)
* Explainable Artificial Intelligence Approach Using Low-Dimensional Visualization and Ensembling Uncertainty Quantification for Rare Chromosomal Aberration Detection in Cytogenetic Imaging
* Explicit semantic guidance for single image reflection removal via perceptual influence modeling
* Explicit Visual Prompting for Universal Foreground Segmentations
* Explicit-Implicit Prompt Injection and Semantic-Guided Latent LoRA for Vision-Language Tracking
* Exploiting Aggregation and Segregation of Representations for Domain Adaptive Human Pose Estimation
* Exploiting Hu invariant moments and deep features for image retrieval
* Exploiting the Benefits of Temporal Information in the Realm of LiDAR Panoptic Segmentation
* Exploring Effective Unfolding Covering Prompt Tuning for Vision Mamba
* Exploring How Soil Moisture Varies with Soil Depth in the Root Zone and Its Rainfall Lag Effect in the Ecotone from the Qinghai-Tibetan Plateau to the Loess Plateau
* Exploring Image-Language Data for Enhanced Soccer Understanding
* Exploring joint embedding predictive architectures for pretraining convolutional neural networks
* Exploring The Potential of Vision-Language Models for Pure-Image and Text-Guided-Image Saliency Prediction
* Exploring the Roles of Ancient Trees in Disturbance and Recovery Processes Using Monthly Landsat Time Series Analysis
* Exploring the Temporal Dynamics of Facial Mimicry in Emotion Processing Using Action Units
* Exploring Vision-Based Features for Detecting Deception in Well-Being: A Cross-Domain Comparison
* Exploring visual language models for driver gaze estimation: A task-based approach to debugging AI
* Extended Node-Specific Distributed Generalized Sidelobe Canceler for Outdoor Wireless Acoustic Sensor Networks
* Extension of Semi-Decoupled Partitioning in Inter Frames
* Extension of Sound Field Image Denoising to High-Frequency Sound Fields by Considering Wavenumber Spectral Loss
* Extensions of Morphological Gradient for Hyperspectral Images
* Eye-Closure-Based Alertness Detection via Adaptive Eye Region Extraction and Deep Learning
* Eyes and Ears: Automated Annotation of Audio Data Using Computer Vision
* F-LGAM: Enhancing Single Domain Generalized Object Detection Through Fourier-Based Local and Global Amplitude Mixup
* F2T2-HIT: A U-Shaped FFT Transformer and Hierarchical Transformer for Reflection Removal
* FA-Net: A Feature Alignment Network for Video-Based Visible-Infrared Person Re-Identification
* Face Forgery Detection With CLIP-Enhanced Multi-Encoder Distillation
* FaceCloak: Learning to Protect Face Templates
* FaceLiVT: Face Recognition Using Linear Vision Transformer with Structural Reparameterization for Mobile Device
* Facial digital markers For hypomimia detection in Parkinson's disease: A systematic review
* Facial Identity Editing: Towards Effective De-Identification
* Facilitate and Scale Up the Creation of 3D Meshes, 6D Category-Based Datasets and Grasping with Generative Models: GenVegeFruits3D
* fair spectral clustering with weighted fairness constraints, A
* Fake Money, Real Threat: Fooling Wavelet-Based Banknote Authentication with AdvGAN
* Family Resemblance or Fraud? Face Morphing Attacks on Kinship Verification
* FarmChanger: A Diffusion-Guided Network for Farmland Change Detection
* Fast and Accurate Outlier-Aware Lidar Super-Resolution for Slam Applications
* Fast Blind Image Deblurring Based on Cross Partial Derivative
* Fast Bounding Box Hierarchy
* Fast Image Vector Quantization Using Sparse Oblique Regression Trees
* Fast Iterative Enhancement for Image Signal Processing
* Fast Low-Artifact Image Generation for Staggered SAR: A Preview-Oriented Method
* Fast online L_0 elastic net subspace clustering via a novel dictionary update strategy
* FastEdit: fast text-guided single-image editing via semantic-aware diffusion fine-tuning
* FC-Render: Adaptive Font- and Color-Aware Text Diffusion Model
* FDFENet: Cropland Change Detection in Remote Sensing Images Based on Frequency Domain Feature Exchange and Multiscale Feature Enhancement
* FDNet: High-frequency disentanglement network with information-theoretic guidance for multivariate time series forecasting
* Feature Disentanglement Based on Dual-Mask-Guided Slot Attention for SAR ATR Across Backgrounds
* Feature Dispersion Adaptation With Pre-Pooling Prototype for Continual Image Classification
* Feature-Enhanced Network-Based Target Detection Method for SAR Images of Ships in Complex Scenes, A
* FeDepthX: A Federated Learning Depth eXperiment
* FeDi: Feature disentanglement for self-supervised learning
* FERGI: Automatic Scoring of User Preferences for Text-to-Image Generation from Spontaneous Facial Expression Reaction
* Few-label blind image quality assessment via samples chosen from new and existing scenes
* Few-Shot Class-Incremental Learning for Efficient SAR Automatic Target Recognition
* Few-Shot Fine-Grained Classification With Foreground-Aware Kernelized Feature Reconstruction Network
* Few-shot Medical Image Segmentation via Boundary-extended Prototypes and Momentum Inference
* FFTDiff: Tuning-free image texture transfer based on diffusion model
* FGA-NN: Film Grain Analysis Neural Network
* Field correlations in jet engine exhaust turbulence
* Field correlations of a Gaussian vortex laser beam in vertical turbulent oceanic links
* Fine-Grained Spatial-Temporal Perception for Gas Leak Segmentation
* Finetuning the Sample Points in Gaussian Filters via Neural Networks
* Fixing Background Misclassification in Few-Shot Object Detection via Product of Experts
* FL-WOSP: Federated learning with walrus optimization for sepsis prediction using MIMIC-III physiological and clinical data
* Flexible Geometric Guidance for Probabilistic Human Pose Estimation with Diffusion Models
* FlowCalib: Targetless Infrastructure LiDAR-Camera Extrinsic Calibration Based on Optical Flow and Scene Flow
* FMG-Det: Foundation Model Guided Robust Object Detection
* Focal Modulation for Image Restoration
* FocusPatch AD: Few-Shot Multi-Class Anomaly Detection With Unified Keywords Patch Prompts
* FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
* Foundation Model-Based Deformable Registration of Multi-Modal Remote Sensing Images
* Four-Decade CDOM Dynamics in Amur River Basin Lakes from Landsat and Machine Learning
* FourierMIL: Fourier Filtering-based Multiple Instance Learning for Whole Slide Image Analysis
* FPQuant: A deep learning-based scalable framework for fingerprint phenomics quantification in large-scale biometric population studies
* FPW: Frequency-Domain Pixel-by-Pixel Watermarking Against Unauthorized Images Used on Training Generative Model
* Fractional-Order Cauchy Penalty With Enhanced Adaptability for Signal Recovery, A
* FreerCustom: Training-Free Multi-Concept Customization for Image and Video Generation
* FreqMixFormerV2: Lightweight Frequency-aware Mixed Transformer for Human Skeleton Action Recognition
* FreqOR: Frequency-guided sampling initialization with attention enhancements for training-free object repositioning
* Frequency Aware Learned Image Compression Using Swin Transformer and Discrete Wavelet Transform
* Frequency Domain Transformation and Loss Adjustment for Enhancing Transferability of Adversarial Examples
* Frequency-Augmented CBAM Attention-Aided YOLOv8 for Object Detection, A
* Frequency-domain multi-regularization-experts fusion for robust non-line-of-sight imaging
* Frequency-Guided Contextual Image Captioning
* Frequency-guided multi-level human action anomaly detection with normalizing flows
* FRFSL: Feature Reconstruction-Based Cross-Domain Few-Shot Learning for Coastal Wetland Hyperspectral Image Classification
* From 2D X-Rays to a 3D Surgical Plan: Progress with AI Reconstruction
* From forgotten to pan-sharpening
* From Laplace to Mellin: A Unified Biorthogonal Transform Framework
* From MSG-SEVIRI to MTG-FCI: Advancing Volcanic Thermal Monitoring from Geostationary Satellites
* From Pixels to Panoramas: A Deep Learning Pipeline for Mineral Image Analysis
* From Small to Large: In-Context Learning as a New Paradigm for Domain Generalization
* Frozen Network Few-Shot Object Detection
* FSAC-IA: A Hierarchical Constructed SAC-IA Algorithm for Point Cloud Alignment Acceleration
* FSATFusion: Frequency-Spatial Attention Transformer for infrared and visible image fusion
* Function-based labels for complementary recommendation: Definition, annotation, and LLM-as-a-Judge
* FUNet: Frequency-Aware and Uncertainty-Guiding Network for Rain-Hazy Image Restoration
* Fungi or Fatal: Ensemble Learning for Mushroom Edibility Classification in the Wild
* Fused Satellite Fire Products Reveal Fire Diurnal Cycles and Improve Fire Emission Estimates over North America and East Asia
* Fusing Enhanced Flux Measurements and Multi-Source Satellite Observations to Improve GPP Estimation for the Qinghai-Tibet Plateau Based on AutoML Techniques
* Fusion of BeiDou and MODIS Precipitable Water Vapor Using the Random Forest Algorithm: A Case Study of Multi-Source Data Synergy in Hunan Province, China
* Fusion of Face and Ear Biometrics for Robust Child Recognition: Insights into Age-Dependent Recognition Trends
* Fusion of Sentinel-2 and Sentinel-3 Images for Producing Daily Maps of Advected Aerosols at Urban Scale
* Future object localization using multi-modal ego-centric video
* Fuzzy cluster-aware contrastive clustering for time series
* Fuzzy neighborhood-based feature selection with missing labels via feature graph matrix and label enhancement
* Fuzzy-Clustering-Based Domain Adaptation for Speech Steganalysis in Dynamic Scenarios
* FVR: Feature variance reduction for post-hoc network calibration
* Game-Theoretic Reinforcement Learning-Based Behavior-Aware Merging in Mixed Traffic
* Game-Theoretical Framework for Safe Decision Making and Control of Mixed Autonomy Vehicles, A
* GAMNet: Graph Attention Mlp-Based Network for 3D Human Pose Estimation
* Garment De-Warping for Virtual Try-on in the Wild
* Gaussian landmarks tracking-based real-time splatting reconstruction model
* GaussianGAN: Real-Time Photorealistic controllable Human Avatars
* GCL-GroW: Graph contrastive learning via group whitening
* GEE-UOD: An Underwater Object Detection Network Based on Global and Edge Information Enhancement
* GELD: A unified neural model for efficiently solving traveling salesman problems across different scales
* Gender Fairness of Machine Learning Algorithms for Pain Detection
* Generalizable poisoning-resistant backdoor detection and removal framework: From dataset perspective
* Generalization-Aware Remote Sensing Change Detection via Domain-Agnostic Learning
* Generalized prompt-driven zero-shot domain adaptive segmentation with feature rectification and semantic modulation
* Generative AI for Virtual Staining in Histopathological Data Analysis
* Generative Approach for Detecting Small Intrusive Foreign Objects in High-Speed Railway Scenario
* Generative Diffusion Model to Solve Inverse Problems for Robust in-NICU Neonatal MRI, A
* Generative Face Video Coding Framework with Disentangled and Consistent Background, A
* Generative Face Video Compression Using Depth Estimation and Compressed Sensing
* Generative image compression by prediction of optimal realism levels
* Generative Personalized Blind Face Restoration Enhanced by Physical Identity
* Generative views recovery and error-guided topological tensor network for incomplete multi-view clustering
* Generic-to-Personalised Learning for Multimodal Image Synthesis With Bidirectional Variational GAN
* Genetic Algorithms for Parameter Optimization for Disparity Map Generation of Radiata Pine Branch Images
* Geometric and topological structure-induced large-scale graph learning for social and information networks
* Geometric Continuity and Consistency Learning for Self-Supervised Point Cloud Completion
* Geometric Mean Improves Loss For Few-Shot Learning
* Geometric Shape Matching for Recovering Protein Conformations from Single-Particle Cryo-EM Data
* Geometry Parametrization Stabilization For Dynamic Mesh Coding
* Geometry Regularized Point Cloud Autoencoder
* GeoScaler: Geometry and Rendering-Aware Downsampling of 3D Mesh Textures
* Geospatial and Deep Learning Approaches for Modeling Floodwater Depth in Urbanized Areas
* GestDoor: Gesture-Based User Authentication for Door Entries Utilizing Wearable IMUs
* Gesture Recognition for Emergencies: Dataset and Cross-Condition Analysis
* GHS-VDG: Graph and Hybrid Spatio-Temporal Attention for Video Diffusion Generation
* GIMMNet: Geometry-Aware Interactive Multi-Modal Network for Semantic Segmentation of High-Resolution Remote Sensing Imagery
* GIP: Gated Interaction Prompt for Parameter Efficient Vision-Language Fine-Tuning
* GL2T-Diff: Medical image translation via spatial-frequency fusion diffusion models
* GlioSurvNet: Multimodal Survival Prediction for Glioblastoma Using Deep Learning and Clinical Variables from Brain MRI
* Global aggregated gradient-guided adversarial attacks for person re-identification
* Global and local Mamba network for multi-modality medical image super-resolution
* Global context guided refinement and aggregation network for lightweight surface defect detection
* Global, Multidecadal Carbon Monoxide (CO) Record from the Sounder AIRS/CrIS System, A
* Global-Local Mamba-Based Dual-Modality Fusion for Hyperspectral and LiDAR Data Classification
* GLOSS: Global-Local Matching Network Towards Outfit Recommendation for Diverse Body Shapes and Scenes
* GM-ABS: Promptable Generalist Model Drives Active Barely Supervised Training in Specialist Model for 3D Medical Image Segmentation
* GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability
* GMD: A Multimodal Framework for AI-Generated Misinformation Detection
* GMOT-Mamba: Mamba-Based Model Prediction For Generic Multiple Object Tracking
* Gop-Level Adaptive Resampling with CNN-based Super Resolution
* GrabDAE: An Innovative Framework for Unsupervised Domain Adaptation Utilizing Grab-Mask and Denoise Auto-Encoder
* Gradient degradation-aware rate control for VVC using Nash equilibrium
* GRAFT-XPCI: Dataset of Synchrotron X-Ray Images for Detection of Acute Cellular Rejection after Heart Transplantation
* Graph Convolutional Network Aggregation For Broad-Spectral Object Detection
* GraphST: Class-imbalanced node classification with semantic relation transfer
* GRASP-Former: A Lightweight Global-Random Sparse Attention for Domain-Aware Multi-Class Obscenity Detection
* Grating Lobes Suppression Method for MIMO Imaging Radar Based on Phase-Coherence-Guided Adaptive Threshold Classification, A
* Green Learning Approach to LDCT Image Restoration, A
* Green Synergy Index for Urban Green Space Assessment Based on Multi-Source Data Integration, A
* Grid-Logat: Grid Based Local And Global Area Transcription For Video Question Answering
* Group Equivariant Morphological Networks
* Group Joint Independent Component Analysis (Group jICA): a Novel Method to Jointly Decompose and Link Simultaneous EEG and fMRI
* GStitch: Spatial-temporal fusion of 3D Gaussian splattings for scalable 3D reconstruction
* GTA-Crime: A Synthetic Dataset and Generation Framework for Fatal Violence Detection with Adversarial Snippet-Level Domain Adaptation
* Guest Editorial: Special Issue for the British Machine Vision Conference (BMVC), 2024 (Glasgow, Scotland, UK)
* Guest Editorial: Special Issue on Visual Datasets
* Guided Detail Filter for AVM
* Guided Diffusion For Class-Conditioned Synthesis and Classification Of Microscopic Blood Cell Images
* HA-Tracker: A Hybrid Architecture Tracker with Spatiotemporal Mamba Motion Model for UAV-Based Video Multi-Object Tracking
* Hallucination Early Detection in Diffusion Models
* Hallucination Elimination and Text Annotation Framework for Large Vision-Language Models in Traffic Scenarios
* Hand-Aware Masked Graph Convolutional Network for Skeleton-based Sign Language Recognition
* Handling Multiple Hypotheses In Coarse-To-Fine Dense Image Matching
* HandOcc: NeRF-based Hand Rendering with Occupancy Networks
* Hands-On: Segmenting Individual Signs from Continuous Sequences
* Hardware Friendly Multi-Hypothesis Cross Component Prediction
* Harnessing Feature Distribution Consistency for Federated Learning with Noisy Labels
* Harnessing the Power of LLMS for Image Aesthetics Assessment Through Semantic and Contextual understanding
* HEL-Net: Heterogeneous Ensemble Learning for comprehensive diabetic retinopathy multi-lesion segmentation via Mamba-UNet
* HFD-Net: A benchmark framework of foreign object detection for high-speed train
* Hi-Mamba: Hierarchical Mamba for Efficient Image Super-Resolution
* Hiding Local Manipulations on SAR Images: A Counter-Forensic Attack
* Hier-EgoPack: Hierarchical Egocentric Video Understanding With Diverse Task Perspectives
* HIER: Heterogeneous Information Bottleneck and Expert Routing for Social Bot Detection
* Hierarchical Contrastive Learning for Precise Whole-Body Anatomical Localization in PET/CT Imaging
* Hierarchical Kernel Decoupling for Graph Convolution: Enhancing Skeleton-Based Action Recognition Through Structured Representation
* Hierarchical Multi-Modal Transformer for Cross-Modal Long Document Classification
* Hierarchical order preserving spectral embedding
* Hierarchical Recursive Interaction and Multi-Stage Goal-Guided Mechanism for Multimodal Trajectory Prediction
* Hierarchical Reinforcement Learning Shared Steering Control Strategy Considering Driver-Vehicle-Road Risk Assessment
* Hierarchical Structure Dependency Whitening for Single-Domain Generalized Infrared Small Target Detection
* High Specificity Guided Cross-Domain Few-Shot Segmentation
* High-Accuracy Identification of Cropping Structure in Irrigation Districts Using Data Fusion and Machine Learning
* High-Capacity Image Steganography via Latent Diffusion Models
* High-Frequency Semantic Enhancement in Compressed Scenarios for Robust Visual and Machine Vision Applications
* High-Order Internal Model-Based Data-Driven Iterative Learning Control of High-Speed Railways Subject to Faded Channels
* High-Precision Camera Distortion Correction: A Decoupled Approach With Rational Functions
* High-quality controlled clustering expert networks
* High-Resolution Open-Vocabulary Object 6D Pose Estimation
* High-Rise Building Area Extraction Based on Prior-Embedded Dual-Branch Neural Network
* HMS^2Net: Heterogeneous Multimodal State Space Network via CLIP for Dynamic Scene Classification in Livestreaming
* Holism To Atomism: Enhancing The Vision-Language Alignment For Cross-Modal Few-Shot Learning
* Holistic Coreset Selection for Data Efficient Image Quality Assessment
* HoloDx: Knowledge- and Data-Driven Multimodal Diagnosis of Alzheimer's Disease
* HomE: A Homogeneous Ensemble Framework for Dynamic Hand Gesture Recognition
* How Cloud Feedbacks Modulate the Tibetan Plateau Thermal Forcing: A Lead-Lag Perspective
* HSBS: Comprehensive Boosting Of Facial Expression Recognition Via Hierarchical Semantic And Batch-Wise Similarity
* Hue Are You? Can Skin Depigmentation Affect Face Recognition Performance?
* HUGS-Net: A Lightweight and Unified Network for Adverse Weather Image Denoising
* Human or Machine: A Novel Deep Learning Framework for Autonomous Driver Identification Based on Vehicle Trajectories
* Human Pose Estimation Under Occlusion: A Data Restoration Framework Using GANs
* Human-in-the-loop adaptation in group activity feature learning for team sports video retrieval
* hybrid active contour model driven by local region-based self-organizing map for infrared image segmentation, A
* Hybrid Deep Learning and Handcrafted Feature Fusion for Mammographic Breast Cancer Classification
* Hybrid SIFT-SNN for Efficient Anomaly Detection of Traffic Flow-Control Infrastructure
* Hybrid texture-structural learning for hyperspectral image classification
* Hybrid-stage association with dynamicity adaptation and enhanced cues for multi-object tracking and segmentation
* Hyperparameter Optimization Method for Affine Projection Algorithm Based on Deep Unrolling
* HyperPoint: Multimodal 3D foundation model in hyperbolic space
* HySaM: An improved hybrid SAM and Mask R-CNN for underwater instance segmentation
* I&S-ViT: An Inclusive & Stable Method for Post-Training ViTs Quantization
* iClickSeg: Interactive click segmentation for zero-shot cross-category 3D part segmentation
* ICP-3DGS: SFM-Free 3D Gaussian Splatting for Large-Scale Unbounded Scenes
* ICPL-ReID: Identity-Conditional Prompt Learning for Multi-Spectral Object Re-Identification
* ICTNet: Image Complexity-Aware Two-Branch Network With Enhanced Decoding for Real-Time Segmentation
* ID-Booth: Identity-consistent Face Generation with Diffusion Models
* ID-Guard: A Universal Framework for Combating Facial Manipulation via Breaking Identification
* ID-TTA: Classifier-Free Test Time Adaptation for Metric Learning
* Identification and Segmentation of Internal Solitary Waves in the East China Sea: A TransUNet Approach Using Multi-Source Satellite Imagery
* iHDR: Iterative HDR Imaging With Arbitrary Number Of Exposures
* Illumination Spectrum Estimation for Multispectral Images Using Illuminant Prior
* illumination-robust feature decomposition approach for low-light crowd counting, An
* IlluSign: Illustrating Sign Language Videos by Leveraging the Attention Mechanism
* iLSU-T: an Open Dataset for Uruguayan Sign Language Translation
* Image Motion Blur Removal In The Temporal Dimension With Video Diffusion Models
* Image-based Morphological Characterization of Filamentous Biological Structures with Non-constant Curvature Shape Feature
* IMMix: Class-Imbalanced node classification via prototypical selective mixup augmentation
* Impact of Sunglasses on One-to-Many Facial Identification Accuracy
* Implicit authentication method based on image temporal features
* Implicit Neural Compression of Point Clouds
* Implicit Object Recognition via Reinforcement Learning in Out-Of-Domain Scenarios
* Improve Real-Time Flood Segmentation by Encoding and Distilling Foreground Information
* Improved Cervical Cell Detection Model Based on Hybrid-Domain Feature Pyramid Network
* Improved ORB-SLAM2 Algorithm Based on Extended Kalman Filtering and Particle Swarm Optimization, An
* Improved Representation Learning for Unconstrained Face Recognition
* Improved UNet++ Based on Kolmogorov-Arnold Convolutions
* Improving Fine-Grained Understanding for Retrieval in Human Motion and Text
* Improving Infrared Small Target Detection With GAN-Driven Data Augmentation
* Improving Learning of New Diseases Through Knowledge-Enhanced Initialization for Federated Adapter Tuning
* Improving Mobility in NDN-Based VANET: A Deep Reinforcement Learning Approach With Deep Prioritization
* Improving Multi-Organ Segmentation in Abdomen CT Images Incorporating Shape Priors and Spatial Information in Deep Learning
* Improving Novel View Synthesis of 360° Scenes in Extremely Sparse Views by Jointly Training Hemisphere Sampled Synthetic Images
* Improving Open-World Class-Agnostic Object Detectors via Feature Distillation with Student-Aware Adaptation
* Improving pseudo-labelling for semi-supervised single-class instance segmentation via mask symmetry scoring
* Improving Pseudo-Labels Selection Using Domain Priors for Semi-Supervised Detection in Capsule Endoscopy
* Improving the Performance of Compressive Spectral Imaging with Bayer Color Filter Array
* Improving the Stability and Efficiency of Diffusion Models for Content Consistent Super-Resolution
* Improving Yolov8 For Fast Few-Shot Object Detection By Dinov2 Distillation
* In Vivo 4D X-Ray Dark-Field Lung Imaging in Mice
* In-Orbit Assessment of Image Quality Metrics for the LuTan-1 SAR Satellite Constellation
* In2Out: Fine-Tuning Video Inpainting Model for Video Outpainting Using Hierarchical Discriminator
* Incomplete Modalities Restoration via Hierarchical Adaptation for Robust Multimodal Segmentation
* IndicSideFace: A Dataset for Advancing Deepfake Detection on Side-Face Perspectives of Indian Subjects
* Individuation of 3D Perceptual Units from Neurogeometry of Binocular Cells
* Influence of irregular particles on the propagation of polarized pulsed laser beams through turbid underwater environments
* InfoARD: Enhancing Adversarial Robustness Distillation With Attack-Strength Adaptation and Mutual-Information Maximization
* Information-Driven Complementarity and Consistency Mining for Multi-View Clustering
* Infrared and visible image fusion model based on source image interaction
* Infrared remote sensing small ship target detection method based on spatial-semantic enhancement and feature reconstruction neck
* Initial Results of Site-Specific Assessment of Cereal Leaf Beetle (Oulema melanopus L.) Damage Using RGB Images by UAV
* Instance-wise distribution control of text-to-image diffusion models
* Instant 3DCG Dance Generation System Based on Music and Dance Composition
* Integrated Design of Mobile Battery-Swapping and Charging Services for Electric Vehicles
* Integrating GAN-Generated SAR and Optical Imagery for Building Damage Mapping
* Integration of InSAR and GNSS Data: Improved Precision and Spatial Resolution of 3D Deformation
* intelligent multimodal medical image registration using hybrid meta-heuristic optimization with transformer-based residual UNet, An
* Inter-Trial Coherence Reveals Enhanced Synchrony During Mantra Listening
* Interacting Factors Controlling Total Suspended Matter Dynamics and Transport Mechanisms in a Major River-Estuary System
* Interactive feature fusion for camera-radar-based vehicle segmentation in bird's-eye view
* Interactive Sign Language Question Answering Framework and Dataset for Barrier-Free Exhibition Service
* Interference Mitigation in Automotive Radar Systems: A Current State Survey and Future Trends
* Intermodal correlation modeling for incomplete multi-modal learning in land use and land cover classification
* Interpretable image classification based on antifactual data
* Interpretable Multi-View Representation Learning Towards Complex Scenes: From Homogeneity to Heterogeneity
* Interpretable Transformer-Based Framework for Monitoring Dissolved Inorganic Nitrogen and Phosphorus in Jiangsu-Zhejiang-Shanghai Offshore, An
* Interpreting the Trispectrum as the Cross-Spectrum of the Wigner-Ville Distribution
* Intervention-Based Mixup Augmentation for Multiple Instance Learning in Whole-Slide Image Survival Analysis
* Intra-modal consistency for image-text retrieval through soft-label distillation
* Introducing the short-time fourier Kolmogorov Arnold network: A dynamic graph CNN approach for tree species classification in 3D point clouds
* Invariants of Color Images to N-Fold Symmetric Out-of-Focus Blur
* Inverse Scattering for Schrödinger Equation in the Frequency Domain via Data-Driven Reduced Order Modeling
* Investigating Data Replication in Medical Synthetic Image Generation with Diffusion Models
* Investigating Robustness of Unsupervised Stylegan Image Restoration
* Investigating Role of Big Five Personality Traits in Audio-Visual Rapport Estimation
* Investigating Social Biases in Multimodal LLMs
* Investigating Uncertainty Weighting for Multi-Task Learning: Insights and Analytical Alternative
* InvJND: Just Noticeable Difference Estimation via Deep Invertible Network
* IO-LIO: Information-Oriented Voxel Mapping for Efficient and Precise LiDAR-Inertial Odometry
* IPF-RDA: An Information-Preserving Framework for Robust Data Augmentation
* IRNet: Iterative Refinement Network for Noisy Partial Label Learning
* Is Perturbation-Based Image Protection Disruptive to Image Editing?
* Iterative Filtering and Smoothing with Optical Flow Prediction Models
* Iterative mutual voting matching for efficient and accurate Structure-from-Motion
* Iterative optimal transport for multimodal image registration
* Iterative Self-Improvement of Vision Language Models for Image Scoring and Self-Explanation
* IterDiff: Training-Free Iterative Face Editing Via Efficient Clip-Guided Memory Bank
* Ivory: Adversarial Purification of Obfuscated Faces to Extract Soft-Biometrics using Diffusion Transformers
* J-CaPA: Joint Channel and Pyramid Attention Improves Medical Image Segmentation
* JAM: A Comprehensive Model for Age Estimation, Verification, and Comparability
* JanusGAN: GANs Disentangled Editing with Two Discriminators
* JFDet: Joint Fusion and Detection for Multimodal Remote Sensing Imagery
* Joint Deblurring and Destriping for Infrared Remote Sensing Images with Edge Preservation and Ringing Suppression
* Joint Deep-Unfolding Optimization Learning for Depth Map Arbitrary-Scale Super-Resolution
* Joint distribution alignment on Lie group manifolds for domain adaptation
* Joint Enhancement and Bandwidth Extension for Radar Through-Barrier Speech Acquisition
* Joint Geometry-Attribute Point Cloud Compression with Spatial Context Mining and Dual-Class Attribute Loss
* Joint JPEG Compression and Encryption With DC Groups' Random Cross-Permutation and ZRVs' Inter-Block Permutation
* Joint Optimization of Primary and Secondary Transforms Using Rate-Distortion Optimized Transform Design
* Joint Optimization of Vehicle and Pedestrian Traffic Signals Using Multi-Objective Deep Reinforcement Learning
* Joint Super-Resolution and Segmentation for Low-Resolution Brain MRI Analysis
* Judging From Support-Set: A New Way To Utilize Few-Shot Segmentation For Segmentation Refinement Process
* JustRAIGS: Justified Referral in AI Glaucoma Screening Challenge
* KANM^2L: Enhancing Multi-Modal Recommendation With KAN and Dilated Attention
* Keypoint detection in Tai Chi Chuan Essence via Waist and Limbs Feature Separation
* Keypoint Estimation for Real-Time Pinus Radiata Cutpoint Detection
* Knowledge and experience for visible-infrared person re-identification
* Knowledge Distillation Between 2D and 3D Vision Transformers for Point Cloud Quality Assessment
* Knowledge Distillation for Resource Efficient Classification with CLIP-Guided Specialist Networks
* Knowledge distillation meets video foundation models: A video saliency prediction case study
* Knowledge Is What You Need For Active Object Tracking
* Knowledge Refinement For Unsupervised Lifelong Person Re-Identification
* Knowledge-Aware Diffusion-Enhanced Multimedia Recommendation
* Knowledge-Enhanced Dynamic Scene Graph Attention Network for Fake News Video Detection
* Knowledge-Enhanced Graph Contrastive Learning for Recommendations
* KPLNet: Keypoint prototype learning for zero image semantic correspondence
* KuRALS: Ku-Band Radar Datasets for Multi-Scene Long-Range Surveillance with Baselines and Loss Design
* L1-Norm Redundant Delaunay Phase Unwrapping and Gradient Correction
* Label-informed knowledge integration: Advancing visual prompt for VLMs adaptation
* Lagrangian Motion Fields for Long-Term Motion Generation
* Land Cover Type Classification Using High-Resolution Orthophotomaps and Convolutional Neural Networks: Case Study of Tatra National Park
* Landmark-Based Fast LIP Reading with CTC Loss
* Language-Dominated Fusion and Self-Distillation for Multimodal Sentiment Analysis with Incomplete Modalities
* Langvision-Lora-Nas: Neural Architecture Search for Variable Lora Rank In Vision Language Models
* Laplacian-Mamba: Mamba-Based Laplacian Pyramid Enhancement Network for Unpaired High-Definition Images
* Large (Vision) Language Models for Autonomous Vehicles: Current Trends and Future Directions
* Large Vision-Language Models are Generalist Solvers For Pathology Tasks
* large-scale drone based thermal infrared benchmark and inception transformer network for crowd counting, A
* Large-Scale Pre-Trained Models Empowering Phrase Generalization in Temporal Sentence Localization
* LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving
* LargeSceneGaussian: High-Efficiency 3D Gaussian Splatting for Large-Scale Scene Reconstruction
* Layer-Adaptive-Augmentation-Based Graph Contrastive Learning With Feature Decorrelation
* LCFusion: Infrared and visible image fusion network based on local contour enhancement
* Learnable Time-Frequency Transform and Ridge Separation
* Learned Hybrid Video Coding for Human Perception and Multiple Machine Vision Tasks
* Learned Video Compression with Spatial Correlation Priors and Hierarchical Temporal Attention
* Learning a Filtered Backprojection Reconstruction Method for Photoacoustic Computed Tomography With Hemispherical Measurement Geometries
* Learning clique-based inter-class affinity for compositional zero-shot learning
* Learning Compact Representations With an Information Bottleneck for Camouflaged Object Detection
* Learning contrastive feature representations for facial action unit detection
* Learning Dynamic Graph Embeddings With Neural Controlled Differential Equations
* Learning Efficient and Adaptive Cross-Channel Dependencies for Weakly-Supervised Object Detection
* Learning Efficient Meshflow and Optical Flow From Event Cameras
* Learning from History: Task-agnostic Model Contrastive Learning for Image Restoration
* Learning from PU Data Using Disentangled Representations
* Learning Generalization From Various Unaware Degradations for Blind Hyperspectral Image Super-Resolution via Transparent Diffusion Model
* Learning Geometry-Aware Representation for Gaze Estimation
* Learning Knowledge-Based Prompts for Robust 3D Mask Presentation Attack Detection
* Learning Roles With Emergent Social Value Orientations
* Learning-based Human Relighting: A Survey
* Learning-Based Steering Estimation for Motorcycles via Visual-Inertial Fusion
* Learning-by-generation: Enhancing gaze estimation via controllable generative data and two-stage training
* LeMoRe: Learn More Details for Lightweight Semantic Segmentation
* LENet: A Semantic Segmentation Network for Complex Landforms in Remote Sensing Imagery via Axial Semantic Modeling and Deformation-Aware Compensation
* LETTER: Self-Harmonized Representation Learning for Multimodal Recommendation
* Leveraging Complementary Attention Maps in Vision Transformers for OCT Image Analysis
* Leveraging Depth Foundation Models in Self Supervised Monocular Depth Estimation
* Leveraging Pupil Facial Fusion for Enhanced Micro-Expression Recognition
* LFT-Net: A lightweight frequency-based transformer for low-light enhancement and exposure correction
* LGD: Leveraging generative descriptions for zero-shot referring image segmentation
* Lift-PCAC: Lifting Based Point Cloud Attribute Compression
* LiftFormer: Lifting and Frame Theory Based Monocular Depth Estimation Using Depth and Edge Oriented Subspace Representation
* Lightweight Attention-Enhanced Multi-Scale Detector for Robust Small Object Detection in UAV
* Lightweight Environment Vector Map Framework and Its Fast Lane-Level Navigation Strategy for Autonomous Vehicles, A
* Lightweight Hybrid Gabor Deep Learning Approach and its Application to Medical Image Classification, A
* Lightweight Image Super-Resolution Preprocessor for Jpeg Compression
* Lightweight LiDAR-Based Cooperative Localization Model for Asymmetric Leader-Follower Cooperative Driving Automation System
* Lightweight macro-pixel quality enhancement network for light field images compressed by versatile video coding
* Lightweight Temporal Contextual Fine-Tuning Method of Large Multimodal Model for Video Moment Retrieval
* LighTwSVM: Efficient linear nonparallel classifier for millions of data
* LigTomDet: Knowledge distillation in a new lightweight tomato disease detection model in planting fields
* Linea: Fast and Accurate Line Detection using Scalable Transformers
* Linking model intervention to causal interpretation in model explanation
* Lip Enhancement and Multi-View Simulation for Robust Visual Speech Recognition in MAVSR 2025
* Liquid: Language Models are Scalable and Unified Multi-Modal Generators
* Listening for You: Enhancing Speech Image Retrieval via Target Speaker Extraction
* LLHA-Net: A hierarchical attention network for two-view correspondence learning
* LLIE-Face: A multi-modal dataset for low-light facial image enhancement
* LLM-Driven Medical Report Generation via Communication-Efficient Heterogeneous Federated Learning
* LLM-informed global-local contextualization for zero-shot food detection
* LNet: Lightweight Network for Driver Attention Estimation via Scene and Gaze Consistency
* Local Analysis of Iterative Reconstruction from Discrete Generalized Radon Transform Data in the Plane
* Local and Global Structure-Guided No-Reference Point Cloud Quality Assessment
* Local attention and contrastive clustering network for sign language recognition
* Local Refinement and Global Strengthening Network for Vehicle Re-Identification
* Locally Shuffled Low Rank Column-Wise Sensing
* LoGA-Attack: Local geometry-aware adversarial attack on 3D point clouds
* Long-Short Exposure Fusion With Event Data For Low-Light Video Enhancement
* Long-Short Match for Lost Control in UAV Multi-Object Tracking
* Long-Tailed Continual Learning for Visual Food Recognition
* LoRA Patching: Exposing the Fragility of Proactive Defenses Against Deepfakes
* Lorentz Transformation Neural Network
* LoTeR: Localized text prompt refinement for zero-shot referring image segmentation
* Low-Light Image Enhancement Using a Retinex-Based Variational Model with Weighted L_p Norm Constraint
* Low-Rank Adaptation of Pre-Trained Vision Backbones for Energy-Efficient Image Coding For Machines
* LRCPN: A Lightweight Parallel Scheme for Underwater Acoustic Modulation Recognition
* LURE: An Unsupervised Denoising Framework for Multiplicative Lognormal Noise
* LVMF3D: Large Vision Model Boosting Multimodal Fusion for Indoor 3D Object Detection
* M-AIDE: Mechanistic Agentic Interpretability for Decoding Empathy in Language Models
* M3C: Resist Agnostic Attacks by Mitigating Consistent Class Confusion Prior
* M3D: A Benchmark Dataset and Model for Microscopic 3D Shape Reconstruction
* Machine learning and transformers for thyroid carcinoma diagnosis
* Machine Learning Models for Predicting Post-Wildfire Methane Emissions in Australia Using Multivariate Data
* Machine Learning-Based Decoding Energy Modeling for VVC Streaming
* Mamba-Based Global Correlation Learning for Light Field Spatial Super-Resolution
* Mamba-Based Progressive-Recovery Framework for Multimodal Low Light Image Enhancement
* Mamba-Driven Comprehensive Context Learning for Zero-Shot HOI Detection
* Mamba-SF: Monocular Scene Flow Learning with State Space Models
* Mambafusion: State-space model-driven object-scene fusion for multi-modal 3D object detection
* Manifold learning based on locally linear embedding for symmetric positive definite matrix
* MarsTerrNet: A U-Shaped Dual-Backbone Framework with Feature-Guided Loss for Martian Terrain Segmentation
* Mask-RadarNet: Enhancing Radar Object Detection With Spatio-Temporal Context
* Masked Text Pre-Training for Scene Text Detection
* Matching ambiguity-resilient multi-view stereo via adaptive patch deformation
* MBFI-Net: Multi-Branch Feature Interaction Network for Semantic Change Detection
* MCE: Towards a general framework for handling missing modalities under imbalanced missing rates
* MCM: A Multi-Agent Collaborative Multimodal Framework For Traditional Chinese Medicine Diagnosis
* MDCM: A multi-granularity disentanglement and cross-modal synergy-based model for sentiment analysis
* MDT-FI: Mask-Guided Dual-Branch Transformer With Texture and Structure Feature Interaction for Image Inpainting
* Measuring Anxiety Levels with Head Motion Patterns in Severe Depression Population
* Measuring Distortion Strength with Dewarping Diffusion Models in Anomaly Detection
* Measuring Teacher Empathy in a Virtual Reality Scenario Simulating Racial Bias
* MedKI: Knowledge Dual Injections for Medical Visual Question Answering
* Metaformer-like Convolutional Neural Networks and Learnable Decision Fusion for SAR Ship Classification
* METAH2: A Snapshot Metasurface HDR Hyperspectral Camera
* Metalwork: A Synthetic Dataset and Baseline for Stereo Matching of Metal Workpieces
* METAREG: Robust Camera Parameter Estimation by Leveraging Noisy Camera Extrinsics
* Method of Convolutional Neural Networks for Lithological Classification Using Multisource Remote Sensing Data
* MFA-Net: Motion Field Adaptive Network for Skeleton-Based Action Recognition
* MFB-SAC: A Multi-Scale Frequency and Boundary-Enhanced SAM for Cell Segmentation
* MFDiff: Diffusion Probabilistic Model for Medical Image Segmentation with Multi-Scale Features and Frequency-Aware Attention
* MFF-Net: Flood Detection from SAR Images Using Multi-Frequency and Fuzzy Uncertainty Fusion
* MFHS: Mutual consistency learning-based foundation model integrates hypergraph for semi-supervised medical image segmentation
* MFJLN: Multi-Frequency Feature Joint Learning Network for Rain Removal
* MFU-Net: A Novel Deep Learning Framework for Unmixing Method for Sentinel-2 Imagery of Invasive Serrated Tussock (Nassella Trichotoma)
* MGP-KAD: Multimodal Geometric Priors and Kolmogorov-Arnold Decoder for Single-View 3d Reconstruction in Complex Scenes
* Micro-Expression Analysis Based on Self-Adaptive Pseudo-Labeling and Residual Connected Channel Attention Mechanisms
* MIP-CLIP: Multimodal Independent Prompt CLIP for Action Recognition
* Mirror Feature-Aware Generative Adversarial Network for RGB-T Salient Object Detection
* Mitigating bias in Few Shot Class Incremental Learning with Feature Augmentation and Logits Mix-up
* Mitigating Bottlenecks Caused by Freeway Exiting Flows' Merging Maneuvers to Hard Shoulder: An Integrated Proactive Control
* Mix-Based Training Strategies for Learning Implicit Neural Representations
* MKGPL: graph prompt learning with multi-view knowledge for few-shot recognition
* MLP Fusion: Revisiting Convolutional Networks with Transformer-Based Insights
* MM-IML: Multi-Modal Image Forgery Detection and Localization
* MMP-2k: A Benchmark Multi-Labeled Macro Photography Image Quality Assessment Database
* MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Language Models
* MOAT: Multi-Scale Group Interaction Transformer for Trajectory Prediction in Crowded Scenes
* Mobile Robot Navigation Method Based on Multiple External Cameras in Crowded Environment
* MobilityGPT: Enhanced Human Mobility Modeling With a GPT Model
* Modality-Aware Diffusion Distillation Network for Sentiment Analysis in Missing Modalities
* MODE: A model-agnostic framework for object detection under adverse weather conditions
* Model Synthesis for Zero-Shot Model Attribution
* Model-Data Dual-Driven Method for Mode-Switching Radar Target Detection
* Model-Free Test Time Adaptation for Out-of-Distribution Detection
* Modeling and Analysis of Car-Following Behavior Based on Macro-Micro Coupling
* Modeling Localized PPG for Blood Pressure Forecasting With MoE and Quantile Regression
* Modeling Moso Bamboo Tree Density and Aboveground Biomass Using Multi-Site UAV-LiDAR Data
* Modelling the Remote Sensing Reflectance for the Sea Surface Layer Using Empirical Inherent Optical Properties
* modular augmented reality framework for real-time clinical data visualization and interaction, A
* Modular System for Human Action Detection Combining YOLO and Transformer-Based Video Understanding, A
* MolRL: Self-supervised molecular image representation learning via graph structure bootstrapping
* Monitoring Glacier Debris Flows and Dammed Lakes Using Multiple Satellite Images in the Badswat Watershed, Northern Karakoram
* MONSTR: Model-Oriented Neutron Strain Tomographic Reconstruction
* Mosaic-SR: An Adaptive Multi-Step Super-Resolution Method For Low-Resolution 2d Barcodes
* MoTiC: momentum tightness and contrast for few-shot class-incremental learning
* Motion control of 3-DoF delta robot using adaptive neuro fuzzy inference system
* Motion-Aware Reconstruction for Video Snapshot Compressive Imaging
* Moving Forward with BWC: The Faleb Dataset for Multimodal Image Analysis
* MPEG Edgebreaker: An Efficient Static and Dynamic Mesh Codec in MPEG V-DMC
* MS-DSCLNet: A Multi-Scale Dual-Stream Contrastive Learning Network for Image Tampering Detection and Localization
* MS-RAFT-3D: A Multi-Scale Architecture for Recurrent Image-Based Scene Flow
* MSBC-Segformer: An automatic segmentation model of clinical target volume and organs at risk in CT images for radiotherapy after breast-conserving surgery
* MSCANet: Multi-Scale Spatial-Channel Attention Network for Urbanization Intelligent Monitoring
* MSDP-Net: Multi-scale distribution perception network for rotating object detection in remote sensing
* MSG-CLIP: Enhancing CLIP's ability to learn fine-grained structural associations through multi-modal scene graph alignment
* MSMCD: A Multi-Stage Mamba Network for Geohazard Change Detection
* MSSG: Multi-Scale Speaker Graph Network for Active Speaker Detection
* MSTDNet: Multi-scale traffic object detection network with smooth information perception
* MSTSGM: A multi-scale temporal-spatial guided model for image deblurring
* MTD-Net: A robust multi-task discriminative network for choroidal neovascularization segmentation
* MTFM: Multi-Teacher Feature Matching for Cross-Dataset and Cross-Architecture Adversarial Robustness Transfer in Remote Sensing Applications
* Multi-Agent Deep Reinforcement Learning for Safe Autonomous Driving With RICS-Assisted MEC
* Multi-Agent-Based Approaches for Cooperative Traffic Management in C-ITS: Systematic Literature Review (SLR)
* Multi-Channel Convolutional Neural Network Model for Detecting Active Landslides Using Multi-Source Fusion Images, A
* Multi-Class Part Parsing Based on Multi-Class Boundaries
* Multi-Class Smoothed Hinge Loss Function in Pre-Training for Transfer Learning
* Multi-Domain Biometric Recognition using Body Embeddings
* multi-expert framework for enhancing multimodal large language models in industrial anomaly detection, A
* Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
* Multi-Granularity Query Network With Adaptive Category Feature Embedding for Behavior Recognition
* Multi-Granularity Scene-Aware Graph Convolution Method for Weakly Supervised Person Search, A
* Multi-Graph Spatio-Temporal Network for Traffic Accident Risk Forecasting
* Multi-Head Attention Residual Unfolded Network for Model-Based Pansharpening
* Multi-Kernel Maximum Asymmetric Correntropy Criterion: Foundation and Analysis
* Multi-Layer End-to-End 360° Image Compression, A
* Multi-Level and Multi-Modal Action Anticipation
* Multi-Level Contrastive Learning for Multimodal Sentiment Analysis
* Multi-level ensemble feature selection for omics data
* Multi-level spectral-spatial mutual learning for pansharpening
* Multi-Level Statistical Model Guidance Improves Generalization for Biometric Synthetic Face Detection
* Multi-Matrix Completion: A Novel Framework for Structurally Missing Elements
* Multi-Method Explainability Evaluation of a Graph-Based Neural Network for Alertness Detection, A
* Multi-modal cooperative fusion network for dual-stream RGB-D salient object detection
* Multi-Modal Knowledge-Driven Approach for Generalized Zero-shot Video Classification, A
* Multi-model co-training for medical image segmentation with limited annotation
* Multi-Objective Heterogeneous Fleet Vehicle Routing Problem: Formulation and Algorithm
* Multi-Perspective Information Fusion Network for Remote Sensing Segmentation
* Multi-Res-3DGS: Multi-Resolution 3d Gaussian Splatting Bound with a Subdivided Mesh Sequence
* Multi-Satellite Image Matching and Deep Learning Segmentation for Detection of Daytime Sea Fog Using GK2A AMI and GK2B GOCI-II
* Multi-scale interleaved transformer network for image deraining
* Multi-Scale Spatial-Frequency Features Representation and Learnable Cross Modal Feature Fusion in DeepFake Detection
* multi-scale U-shaped transformer neural network for low-light image enhancement, A
* Multi-Sensor Hybrid Modeling of Urban Solar Irradiance via Perez-Ineichen and Deep Neural Networks
* Multi-Source Remote Sensing Identification Framework for Coconut Palm Mapping, A
* multi-spatiotemporal joint next-POI travel sequence recommendation method based on federated learning, A
* Multi-Stage Group Interaction and Cross-Domain Fusion Network for Real-Time Smoke Segmentation
* Multi-Task Learning for Hierarchical Professional Gesture Recognition: State-Space Modeling for Task Temporal Dependencies
* Multi-Teacher Knowledge Distillation for Efficient Object Segmentation
* Multi-view 3D model recognition via multi-label and multi-level fusion with bidirectional GRU
* Multi-View Amodal Instance Segmentation Based on 3d Representation
* Multi-View Dispersion Entropy on Graphs: Application to the Detection of Cerebral Palsy After Neonatal Stroke
* Multi-view fuzzy C-means clustering via multi-objective slime mould and cooperative learning
* Multi-View Graph Approach for Morphometry-Aware Semantic Image Segmentation, A
* Multi-view graph pooling via dominant sets for graph classification
* Multi-View Knowledge Guided Semantic Prototype Learning for Generalized Zero-Shot Action Recognition
* Multi-view unsupervised feature selection with unified measurement of consistency and diversity
* Multi-year long-term person re-identification using gait and HAR features
* Multidimensional Media Adaptation Framework for Live Holographic Communication, A
* Multifidelity-Based Ant Colony Optimization Algorithm for Capacitated Electric Vehicle Routing Problems, A
* Multilingual Text-to-Image Person Retrieval via Bidirectional Relation Reasoning and Aligning
* MultiMAE Meets Earth Observation: Pre-Training Multi-Modal Multi-Task Masked Autoencoders for Earth Observation Tasks
* Multimodal alignment of event and text streams in spiking neural networks for human action recognition
* Multimodal Cell Context Instruction Tuning for Conditional DNA Regulatory Sequence Generation with Large Language Models
* Multimodal Classification and Out-of-Distribution Detection for Multimodal Intent Understanding
* Multimodal Cross-Attention for Range of Motion Assessment
* Multimodal driver behavior recognition based on frame-adaptive convolution and feature fusion
* Multimodal Multi-Graph Fusion Learning for Alzheimer's Disease Diagnosis
* Multimodal Re-Ranking for Heterogeneous Face Re-Identification
* Multimodal-LLM Agent For Text-Driven Multi-Attribute Face Editing
* Multiple Instance Learning Framework with Masked Hard Instance Mining for Gigapixel Histopathology Image Analysis
* Multiple object stitching for unsupervised representation learning
* Multiple weather degraded image restoration based on multi-component decomposition
* Multiplicative Reweighting for Robust Neural Network Optimization
* Multiscale Attention-Based Deep Learning Method for DCE-MRI Breast Tumor Segmentation, A
* MultiSensor-Home: A Wide-area Multi-modal Multi-view Dataset for Action Recognition and Transformer-based Sensor Fusion
* MultiVeg: A Very High-Resolution Benchmark for Deep Learning-Based Multi-Class Vegetation Segmentation
* MUSIC: Multi-coil unified sparsity regularization using inter-slice correlation for arterial spin labeling MRI denoising
* MUVOD: A Novel Multi-View Video Object Segmentation Dataset and a Benchmark for 3D Segmentation
* MVD-NeRF: Multi-View Deblurring Neural Radiance Fields from Defocused Images
* MVDCNN: A Multi-View Deep Convolutional Network with Feature Fusion for Robust Sonar Image Target Recognition
* MVFormer: Multi-View Point Cloud Transformer for 3D Mechanical Component Recognition
* My Emotion on your face: The use of Facial Keypoint Detection to preserve Emotions in Latent Space Editing
* Nagisa: A reversible privacy preservation scheme against facial soft-biometric attributes recognition
* NAR-DIFF: A Noise-Adaptive Reflectance Diffusion Model for Low-Light Image Enhancement
* NASSBLiF: No-Reference Light Field Image Quality Assessment Via Neighborhood Attention and Scale Swin
* Natural image stitching using depth maps
* Naturalistic physical adversarial camouflage for object detection via differentiable rendering and style learning
* NeB-SLAM: Neural blocks-based salable RGB-D SLAM for unknown scenes
* Neighbor-Aware Feature-Driven Motion Compensation for Learned Video Compression
* NEPose: A novel benchmark dataset with an improved framework for vision-based nasal endoscope pose estimation
* Nested evolution for interactively fusing feature agents and learning ensembled classifier agents
* Neural Prediction Errors as a Unified Cue for Abstract Visual Reasoning
* NeuralSEIR: Modeling uncertainty in non-pharmaceutical interventions with neural epidemic dynamics
* New Change Detection Method for Heterogeneous Remote Sensing Images Via an Automatic Differentiable Adversarial Search, A
* New Method for Detecting Automated Mapping Anomalies in Himalayan Glacial Lakes from Satellite Images, A
* New Multi-Source Distributed Transfer Learning Framework
* Next Chain Prediction: A Generative Recommendation Model With Sequence-Chain Attention
* Next-Stitch Counting in Crochet Swatches via Multi-Class Semantic Segmentation
* NiCI-Pruning: Enhancing Diffusion Model Pruning via Noise in Clean Image Guidance
* Nissl-Stained Histological Slice Image Completion Based on Generated Masks
* No-Reference Stitched Wide Field of View Light Field Image Quality Assessment via Structured Representation and Progressive Learning
* No-Reference Textured Mesh Quality Assessment Using Graph-Based Features
* Noise perturbation augmentation based dual-branch alignment network for cross-domain hyperspectral image classification
* Noise-Robust Approach Using Dynamic Graph Neural Networks for Bus Passenger Flow Prediction, A
* Noise-to-Noise Training Approach for Robust Motion-Compensated Processing in Cardiac-Gated Images, A
* Noisy Label Refinement with Semantically Reliable Synthetic Images
* Non-Invasive Neonatal Jaundice Detection via Two-Phase Self-Supervised Learning and Vision Transformer
* Non-Local N2V: Improving N2V Networks for Spatially Correlated Noise
* Non-Rigid Motion Correction for MRI Reconstruction via Coarse-to-Fine Diffusion Models
* Non-Uniform Illumination Image Restoration for Deep-Sea Exploration with A New Scattering Model
* Nonintrusive Watermarking for CycleGAN
* Nonlinear Elasticity Model in Computer Vision, A
* Nonlinear Modifications of Transform Coefficients in VVC Intra Coding
* Nonlinear Water-Heat Thresholds, Human Amplification, and Adaptive Governance of Grassland Degradation Under Climate Change
* Nonnegative Matrix Factorization in Dimensionality Reduction: A Survey
* Novel Adaptive Low-Rank Matrix Approximation Method for Image Compression and Reconstruction, A
* Novel AI Framework for Breast Cancer Molecular Biomarker Response Score Detection on Cells Level Using Marker-Based Watershed Segmentation and Machine Learning Classifiers, A
* Novel Automated System for Pathological Lung Segmentation Using Modified Local Binary Patterns and Hierarchical Transformers, A
* Novel Criterion for Hankel Norm Performance in Digital Filters With Quantization and Overflow, A
* Novel Data-Focusing Method for Highly Squinted MEO SAR Based on Spatially Variable Spectrum and NUFFT 2D Resampling, A
* Novel Downsampling Strategy Based on Information Complementarity for Medical Image Segmentation, A
* novel dynamic graph attention aggregation network for multivariate time series classification, A
* Novel Explainable AI-Based System For Improved Prediction of Breast Cancer Response to Neoadjuvant Chemotherapy, A
* Novel Game Graphics Quality Evaluation Model Using Saliency and Resolution Information, A
* Novel Hybrid Model Based on VMD-KAN-Informer for Railway Traction Power Grid Short-Term Load Forecasting, A
* Novel Method and Dataset for Depth-Guided Image Deblurring From Smartphone Lidar, A
* Novel Method for Reducing Uncertainty in Subglacial Topography: Implications for Greenland Ice Sheet Volume and Stability, A
* Novel Robust Reversible Watermarking Scheme Using Fractional-Order Polar Complex Exponential Transform, A
* novel weakly supervised immunohistochemical cell segmentation method via counting labels, A
* NSGA-II-XGBoost Machine Learning Approach for High-Precision Cropland Identification in Highland Areas: A Case Study of Xundian County, Yunnan, China, An
* NuclSeg-v2.0: Nuclei segmentation using semi-supervised stain deconvolution with real-time user feedback
* OASIS: Object-guided Attention for Text-conditional Diffusion Synthesis of Human Interaction Sequences
* Object Detection and Fruit Tree Growth Stage Identification Via YOLO with Inverted and Swin Transformer Blocks
* Object-Guided Semi-Supervised Bird's-Eye View 3D Object Detection With 3D Box Refinement
* Object-IR: Leveraging object consistency and mesh deformation for self-supervised image retargeting
* Objective, Absolute and Hue-Aware Metrics for Intrinsic Image Decomposition on Real-World Scenes: A Proof of Concept
* Oblique Decision Trees as an Image Model for Cubist Image Restyling
* Observer-Based Prescribed-Time Resilient Control for 2-D Plane Heterogeneous Vehicular Platooning System With Hybrid Communication Threats
* OFVL-MS++: Once for visual localization across multiple scenes via a two-stage framework
* Oilseed Flax Yield Prediction in Arid Gansu, China Using a CNN-Informer Model and Multi-Source Spatio-Temporal Data
* Omnidirectional image quality assessment using frequency-domain information
* OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation
* On color differences in context
* On the correlations between geometric metrics and fairness in pruning CNN
* On the Impact of Natural Guide-Star Asterism Geometry on Atmospheric Tomography
* On the relevance of patch-based extraction methods for monocular depth estimation
* One Face, Many Views: Cross-View Consistency of Facial Action Unit Analysis in Multi-Camera Settings
* One-shot unsupervised industrial anomaly detection: Enhanced performance under extreme data scarcity
* One-stage Framework for Thyroid Nodule Detection with Mixup and Negative Sample Utilization
* Online Continual Learning of Diffusion Models: Multi-Mode Adaptive Generative Distillation
* Online graph based transforms for intra-predicted imaging data
* Online multi-label classification under noisy and changing label distribution
* Online Prototype Angular Balanced Self-Distillation for Non-Ideal Annotation in Remote Sensing Image Segmentation
* Open-Vocabulary Object Detection for High-Resolution Remote Sensing Images
* OpenFace 3.0: A Lightweight Multitask System for Comprehensive Facial Behavior Analysis
* OpenRR-1k: A Scalable Dataset for Real-World Reflection Removal
* OPONeRF: One-Point-One NeRF for Robust Few-shot Rendering
* optical remote sensing ship detection model based on feature diffusion and higher-order relationship modeling, An
* Optimal Ice Particle Models of Different Cloud Types for Radiative Transfer Simulation at 183 GHz Frequency Band
* Optimal Transport-Based Domain Alignment as a Preprocessing Step for Federated Learning
* Optimal-Coupling-Observer AV Motion Control Securing Comfort in the Presence of Cyber Attacks
* Optimization model for sign language recognition using hybrid convolution networks
* Optimization of Urban Emergency Multimodal Transportation Scheduling With UAV-Ground Traffic Coordination
* Optimized Approach for Methane Spectral Feature Extraction Under High-Humidity Conditions, An
* Optimized Learned Image Compression for Facial Expression Recognition
* Optimized temporal inductive path neural network based early-stage detection of autism spectrum disorders
* Optimizing CO2 Concentrations and Emissions Based on the WRF-Chem Model Integrated with the 3DVAR and EAKF Methods
* Optimizing In-Context Learning for Efficient Full Conformal Prediction
* Optimizing Unnormalized Statistical Models Through Compositional Optimization
* Organoid-ICLIP: Class Imbalance-Aware Vision-Language Learning for Organoid Mitosis Classification
* Oriented Object Detection Based On Composite Trigonometric Function Coder
* Orthogonal Constrained Minimization with Tensor L_2,p Regularization for HSI Denoising and Destriping
* Orthogonal Decoupling Contrastive Regularization: Toward Uncorrelated Feature Decoupling for Unpaired Image Restoration
* Oulu Remote-photoplethysmography Presentation Attacks Database (OR-PAD)
* Out-of-Distribution Sample Selection Generated by Diffusion Model toward Model Generalization
* Outlier-Robust KalmanNet: Neural Network Aided Kalman Filtering Based on Huber Loss
* Output-Feedback Safety-Critical Path-Guided Herding Control of MIMO Nonlinear Agents Based on Finite-Time Neural Predictor
* Overlooked Factors in Continual Zero-Shot Learning: Inflexible Semantic Prototypes, Simplistic Loss Functions, and SGD Noise
* P-Norm Based Fractional-Order Robust Subband Adaptive Filtering Algorithm for Impulsive Noise and Noisy Input
* P-RoPE: A polar-based rotary position embedding for polar transformed images in rotation-invariant tasks
* PADNet: Progressive-Difference-Aware Feature Reconstruction Mechanism for Anomaly Detection
* PAFNet: A Parallel Attention Fusion Network for Water Body Extraction of Remote Sensing Images
* PanoTPS-Net: Panoramic room layout estimation via thin plate spline transformation
* Pansharpened WorldView-3 Imagery and Machine Learning for Detecting Mal secco Disease in a Citrus Orchard
* Parallel consensus transformer for local feature matching
* Parallel-Based Fast Coding Mode Decision for Intra Coding in VVC SCC
* Partial label feature selection with dynamic streaming labels
* Partition Map-Based Fast Block Partitioning for VVC Inter Coding
* Partitioned observation network for camouflaged object detection
* PASS: Peer-agreement based sample selection for training with instance dependent noisy labels
* PatchNeRF: Patch-based Neural Radiance Fields for real time view synthesis in wide-scale scenes
* Pathological Region Inpainting in MRI Data Using Generative AI
* PawPrint: Whose Footprints are These? Identifying Animal Individuals by their Footprints
* Pay more attention to dark regions for faster shadow detection
* PCNet3D++: A pillar-based cascaded 3D object detection model with an enhanced 2D backbone
* PDD-AGENT: Multimodal Large Language Model-Driven AI Agent for Enhanced Plant Disease Diagnosis
* PDP-FedKD: Personalized Differential Privacy With Adaptive Budget Selection in Heterogeneous Federated Learning
* PE-ViT: Parameter-efficient vision transformer with dimension-adaptive experts and economical attention
* Peepers and Pixels: Human Recognition Accuracy on Low Resolution Faces
* Perface: Metric Learning in Perceptual Facial Similarity for Enhanced Face Anonymization
* Performance Assessment of Satellite-Based Rainfall Products in the Abbay Basin, Ethiopia
* Performance Evaluation of Deep Learning for Tree Branch Segmentation in Autonomous Forestry Systems
* Petri Net-Based Resource Failure and Recovery Strategy for Design and Control of Resilient Intersections, A
* PetsRS - a Dataset and Benchmark for Pet Recognition on a Climate Disaster Scenario
* Phase structure function and spatial coherence in underwater Rayleigh-Benard turbulence: experimental characterization
* Physics-Constrained Deep Learning with Adaptive Z-R Relationship for Accurate and Interpretable Quantitative Precipitation Estimation
* Physics-Guided Smoothing Method For Material Modeling With Digital Image Correlation (DIC) Measurements, A
* PiercingEye: Dual-Space Video Violence Detection With Hyperbolic Vision-Language Guidance
* PillarID: Rethinking Backbone Network Designs for Pillar-Based 3D Object Detection in Infrastructure Point Cloud
* Pioneering Facial Expression Generation from sEMG Signals with Diffusion Models
* PIT-QMM: A Large Multimodal Model for No-Reference Point Cloud Quality Assessment
* PixelBoost: Leveraging Brownian Motion for Realistic-Image Super-Resolution
* PixelShuffler: A Simple Image Translation through Pixel Rearrangement
* Place recognition for visual assistive localization under challenging visual appearance variations
* Player Perceptions of Path-First Procedural Content Generation Level Design for 3D Platformer Games
* Plug-and-Play Model-Agnostic Embedding Enhancement Approach for Explainable Recommendation, A
* Plug-and-Play Priors as a Score-Based Method
* PMDAv2: Multi-scale prototype matching for domain adaptive semantic segmentation
* Point cloud accumulation via multi-dimensional pseudo label and progressive instance association
* Point Cloud Pretraining Dataset Effects on MaskPoint Classification Performance
* Polarimetric SAR Salt Crust Classification via Autoencoded and Attention-Enhanced Feature Representation
* Polarization Denoising and Demosaicking: Dataset and Baseline Method
* Policy Gradient-Based Optimal Subset Selection for Few-Shot Vision-Language Learning
* Pose Estimation of Artwork Characters with Series and Parallel Dilated Convolution And Style Channel Attention
* Pose-Free 3D Gaussian Splatting via Shape-Ray Estimation
* Pose-Invariant Face Recognition via Feature-Space Pose Frontalization
* PoseMoE: Mixture-of-Experts Network for Monocular 3D Human Pose Estimation
* PosePilot: A Web-Based Application for Human Motion Data Analysis and Visualization
* Position-rotation graph and elevation partitioning strategy for traffic police gesture recognition
* Post-Fire Restauration in Mediterranean Watersheds: Coupling WiMMed Modeling with LiDAR-Landsat Vegetation Recovery
* Power Cost Comparison of Neural-Network Compression Methods for Satellite Imagery
* Prediction of Vitamin D Deficiency Using Machine Learning, Deep Learning, and a Hybrid Model
* Preserving instance-level characteristics for multi-instance generation
* Primal-Dual Splitting Algorithm with Convex Combination and Larger Step Sizes for Composite Monotone Inclusion Problems, A
* Principled Diffusion Posterior Sampling for Inverse Problem with Mixed Poisson-Gaussian Noise, A
* PRISM-Occ: Path-Routed Integrated Sparse Mixture-of-Experts for Multi-Modal BEV Occupancy Prediction
* Privacy-aware knowledge distillation for retinal scans de-identification through adversarial perturbations
* Privacy-Preserving CNN Inference for Image Super-Resolution Cross Multiple Ciphertexts
* Privacy-Preserving Face Recognition Scheme Based on Secure Data Storage and Secret Splitting
* Privacy-Preserving Person Re-Identification from Temporal Sequences with Transformer and Hungarian Optimization
* Proactive Collaborative Perception for CAVs: A Multi-Agent Reinforcement Learning Method
* Probabilistic Sampling with Frobenius Norm for Action Recognition
* Probabilistic Temporal Masked Attention for Cross-View Online Action Detection
* Probabilistically Aligned View-Unaligned Clustering With Adaptive Template Selection
* Progressive Cross-Validation Learning for Signal Classification with Noisy Labels
* Progressive Distillation Attention for Robust Left Ventricular Ejection Fraction Estimation
* Progressive Distillation for Incremental Learning in Corneal Confocal Microscopy Segmentation
* Progressive Learning of Instance-Level Proxy Semantics for Few-Shot Action Recognition
* Progressive Text-Semantic-Aware Generative Adversarial Network for Image Fusion
* Projection Difference-Guided Geometry Quality Enhancement for Video-Based Point Cloud Compression
* Prompt-guided dual-channel attention model predicts brain activation from functional and structural profiles
* Prompt-oriented and frequency-regularized schrödinger bridge for unpaired rain streaks and raindrops removal
* Propagation Based Recycling Contrastive Learning for Coupled Noisy Visible-Infrared Person Re-Identification
* Propagation of Laguerre-Gaussian beams in anisotropic atmospheric turbulence: analysis via two analytical and a computational method
* Protein interaction pattern recognition using heterogeneous semantics mining and hierarchical graph representation
* Prototype-based scatter learning for smoke segmentation
* Prototype-Driven Multi-View Attribute-Missing Graph Clustering
* PRTF: Polar Space Represented Multi-View 3D Object Detection With Temporal Fusion Enhancement
* PRVR: Partially Relevant Video Retrieval
* PseR: Pseudo-Label Refinement for Point-Supervised Temporal Action Detection
* Pseudo labels approach to interpretable self-guided subspace clustering
* PSF-SRDN: Point Spread Function-Aware Speckle Reducing Diffusion Network
* PSG-MCANet: Multi-order cross-attention modeling for multimodal fusion based on punning semantic guidance
* Psychology-informed safety attributes recognition in dense crowds
* Purified Zero-Shot Sketch-Based Image Retrieval
* Push the limit of scene text recognition using character and text length guided text super-resolution
* PV3M-YOLO: A triple attention-enhanced model for detecting pedestrians and vehicles in UAV-enabled smart transport networks
* PyraSegNet: A Novel Framework for Thermal Facial Image Segmentation
* P^2M: Progressive Perspective Mining for Referring Video Object Segmentation
* Quadratic Equality Constrained Least Squares: Low-Complexity ADMM for Global Optimality
* Quality Evaluation of AI-Generated Images: Subjective Study and Objective Methodology
* Quality Versus Sparsity in Image Recovery by Dictionary Learning Using Iterative Shrinkage
* Quanta Diffusion
* Quanta-Slomo: Single Photon Camera Guided 100x Video Frame Interpolation
* Quantized DiT with Hadamard transformation: A technical report
* Quantum jellyfish search optimizer applied in high-precision wrapper feature selection
* Quantum-Enhanced Cancer Detection for Histopathologic Images
* Query expansion with topic-aware in-context learning and vocabulary projection for open-domain dense retrieval
* Question-Guided Multigranular Visual Augmentation for Knowledge-Based Visual Question Answering
* R-FGDepth: Towards foundation models for recurrent depth learning with frequency-Guided initialization and refinement
* R-RNet: Probability-Driven Networks for Pedestrian Trajectory Prediction
* RA-GCN: Residual attention based graph convolutional network for multi-label pattern image retrieval
* Radar-Camera Fused Multi-Object Tracking: Online Calibration and Common Feature
* Radius-Aligned Training and Rotated IOU Metrics for Pedestrian Detection in Top-View Fisheye Images
* Rank-based transformation algorithm for image contrast adjustment
* Rapid Object Modeling Initialization for Vector Quantized-Variational AutoEncoder
* Rate-Distortion Optimization with Non-Reference Metrics for UGC Compression
* Rate-Distortion Optimized Chroma Quantization for Point Cloud Compression
* RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-Plane Networks
* RAW: Region Attention-Weighted Guided Network with Inter-Region Exchange for AMD Grading
* RDG-GS: Relative Depth Guidance with Gaussian Splatting for Real-time Sparse-View 3D Rendering
* Re-Purposing Segment Anything For Skeleton Action Localization
* React: Reference-Based Anime Colorization Transformer
* Reading Between the Lines: How Eye-Tracking Data can Inform Reading Strategies for Large Language Models
* Real projection algorithms for generalized low-rank approximation of large-scale quaternion matrix in color image processing
* Real-Scene Image Dehazing via Laplacian Pyramid-Based Conditional Diffusion Model
* Real-Time Detection and Classification of Drones, Vehicles, and Humans from Radar Data Using Deep Learning
* Real-Time Detection of Road Defects Using YOLO Architectures: A Comparative Study
* Real-time habitat mapping with YOLOv8: A multi-threaded approach to biodiversity preservation
* Real-Time Semantic Video Communication with Temporally Consistent And Controllable Diffusion Models
* Real-Time Traffic Accident Anticipation with Feature Reuse
* RealCustom++: Representing Images as Real Textual Word for Real-Time Customization
* Realistic Skin Trouble Simulation Via Image Generation Models
* REC-GCN: Robust ensemble clustering with graph convolutional networks
* Reconstruction-Segmentation Framework for Robust Tree Cover Mapping in North Korea Using Time-Series Reconstruction Autoencoders, A
* Recovering and Classifying Upper Limb Impairment Trajectories After Stroke
* REDNet: Reliable Evidential Discounting Network for Multi-Modality Medical Image Segmentation
* RefComp: A Reference-Guided Unified Framework for Unpaired Point Cloud Completion
* Referring Video Object Segmentation With Cross-Modality Proxy Queries
* Refined Leaf Area Index Retrieval in Yellow River Delta Coastal Wetlands: UAV-Borne Hyperspectral and LiDAR Data Fusion and SHAP-Correlation-Integrated Machine Learning
* Refinement Assessment of Soil Conservation Service and Analysis of Its Trade-Off/Synergy with Other Key Services in the Guizhou Plateau Based on Satellite-UAV-Ground Systems
* Regional decay attention for image shadow removal
* regularized deep self-expression feature augmentation network for few-shot unconstrained palmprint recognition, A
* Regularized evidential neural networks for deep active learning
* ReID: Re-ranking through image description for object re-identification
* reinforcement learning framework for energy-optimal UAV path planning in wind fields, A
* Reinforcement Learning-Based Attack Generator for Testing the Security of Connected and Autonomous Vehicles, A
* Reinforcement Learning-Based Decentralized Control Strategy for Eco-Safe Mixed Platooning With CAVs and HDVs, A
* Release the Potential of Memory Buffer in Continual Learning: A Dynamic System Perspective
* Reliable Crack Evolution Monitoring from UAV Remote Sensing: Bridging Detection and Temporal Dynamics
* Reliable Exploration Strategy for Unsupervised Person Re-Identification
* Remembering CIFAR-10 images with the entropic associative memory
* Remote Respiration Measurement with RGB Cameras: A Review and Benchmark
* Remote Sensing Extraction and Spatiotemporal Change Analysis of Time-Series Terraces in Complex Terrain on the Loess Plateau Based on a New Swin Transformer Dual-Branch Deformable Boundary Network (STDBNet)
* Remote Sensing Image Super-Resolution for Heritage Sites Using a Temporal Invariance-Aware Training Strategy
* Remote Sensing in Mining-Related Eco-Environmental Monitoring and Assessment
* Remote Sensing Interpretation of Soil Elements via a Feature-Reinforcement Multiscale-Fusion Network
* Remote Sensing Target Detector with Multi Scale Attention Mechanism
* Remote Sensing-Driven Dynamic Risk Assessment Model for Cyclical Glacial Lake Outbursts: A Case Study of Merzbacher Lake, A
* renaissance of explicit motion information mining from transformers for action recognition, A
* REPAIR: Rank Correlation and Noisy Pair Half-Replacing With Memory for Noisy Correspondence
* Research on Net Ecosystem Exchange Estimation Model for Alpine Ecosystems Based on Multimodal Feature Fusion: A Case Study of the Babao River Basin, China
* Resilient Multi-Agent Reinforcement Learning for Tiered Mixed Autonomy
* Resolving sentiment discrepancy for multimodal sentiment detection via semantics completion and decomposition
* Restoration of partially damaged fingerprints using a partial differential equation
* Rethinking Artifact Mitigation in HDR Reconstruction: From Detection to Optimization
* Rethinking Class-Incremental Learning From a Dynamic Imbalanced Learning Perspective
* Rethinking Image Histogram Matching for Image Classification
* Rethinking normalization strategies and convolutional kernels for multimodal image fusion
* Rethinking Vision Transformer for Large-Scale Fine-Grained Image Retrieval
* Retina adaptation network for low-light image enhancement
* Retinex-based variational model for low-light image enhancement with noise transformation, A
* Retinex-Based Variational Model with A Nonlocal Gradient-Type Constraint for Low-Light Image Enhancement, A
* Retrieval-augmented image harmonization
* Retrieving Woody Components from Time-Series Gap-Fraction and Multispectral Satellite Observations over Deciduous Forests
* Reverse Distillation Based Detection of Anomalies on a Newly Developed Fabric Dataset
* Reversible Column Disentangled Augmentation Tricks for Graph Contrastive Learning
* Reversible Data Hiding in Encrypted Polygonal Faces Using Vertex Index Similarity
* Review and Perspectives on Pedestrian Trajectory Prediction for Safe Transportation
* Revisited Visual Saliency Detection with Deep Learning: A Review of Recent Advancements
* Revisiting color-event based tracking: A unified network, dataset, and metric
* Revisiting Deformable Convolution on Graphs: Large-Range Modeling and Robustness
* Revisiting Representation Learning and Identity Adversarial Training for Facial Behavior Understanding
* Revisiting the Intrusion Detection in In-Vehicle Networks
* Revisiting the representation learning in long-tailed medical image classification
* Reward-Adaptation: A Novel Test-Time Adaptation Method With Reward Model
* RGAL: Node-adaptive training strategies for reinforced graph adversarial learning
* RGC-Bent: A Novel Dataset for Bent Radio Galaxy Classification
* rho-NeRF: Leveraging Attenuation Priors in Neural Radiance Field for 3d Computed Tomography Reconstruction
* Ridgeformer: Mutli-Stage Contrastive Training for Fine-Grained Cross-Domain Fingerprint Recognition
* Risk-Controlled Multimodal Emotion Coaching for Autism Support Using Self-Supervised Vision and Speech Encoders
* RL-GTN: A reinforced divergence-optimized graph transformer network for skeleton-based action recognition
* RMPT: Retrieval-based multimodal prompt tuning for event detection
* RN-Sam: Road Network-Aided Sam Optimization for Road Segmentation In Satellite Imagery
* RobotFlags: AI-Powered Semaphore Interacting Between Chatbot and Humanoid Robot
* robust and efficient approach using Aggregated-FlexiNet for interpretable musculoskeletal radiograph classification, A
* Robust and flexible multi-view subspace clustering with nuclear norm
* robust and secure video recovery scheme with deep compressive sensing, A
* Robust Character Stroke Segmentation For Diverse Fonts Via Contour Matching and Chain Propagation
* Robust Deepfake Detection for Electronic Know Your Customer Systems Using Registered Images
* Robust Estimation of Bump Height for Wafer-Level Packaging Using Optical Triangulation
* Robust ISAR Autofocus for Maneuvering Ships Using Centerline-Driven Adaptive Partitioning and Resampling
* Robust Low-Rank and Spatio-Temporal Regularization Framework for Moving-Vehicle Detection in Satellite Videos
* Robust Multi-Label Learning with Human-Guided and Foundation Model-Aided Crowd Framework
* Robust Multimodal Representation Learning with Information Bottleneck and Balanced Fusion for Alzheimers Disease Classification
* Robust Noisy Label Learning via Two-Stream Sample Distillation
* Robust Object Detection for UAVs in Foggy Environments with Spatial-Edge Fusion and Dynamic Task Alignment
* Robust outlier elimination trace ratio LDA for dimensionality reduction
* Robust Temporal Action Localization With Meta Boundary Refinement
* Robustness of Deep Learning-Based for Acute Lymphoblastic Leukemia Detection and Classification
* Rollout-Guided Token Pruning for Efficient Video Understanding
* Rotation-Invariant Game State Evaluation via Board Tensor Canonicalisation
* rPPG-NDCL: Unsupervised Remote Physiological Measurement Via Noise-Disentangled Contrastive Learning
* RSOS-Net: Real-Time Surface Obstacle Segmentation Network for Uncrewed Waterborne Vehicles
* RSSRGAN: A Residual Separable Generative Adversarial Network for Remote Sensing Image Super-Resolution Reconstruction
* RT-X Net: RGB-Thermal Cross Attention Network for Low-Light Image Enhancement
* RUL: Region Uncertainty Learning for Robust Face Recognition
* RVGCL: Towards robust recommendation via graph contrastive learning with variational inference
* S3VD Self-Supervised Spatial Video Downsampling Loss: A Method for Training Video FPN Denoising Networks
* SADet: A semantic-aware tiny object detection network against missed detection
* SafeBench: A Safety Evaluation Framework for Multimodal Large Language Models
* SAIMNet: Object Detection Based on Semantic Alignment of Infrared Image and Microwave Non-image Information Fusion
* Salience Adjustment for Context-Based Emotion Recognition
* SAM 2-Driven Self-Training for Mammogram Segmentation: Zero-Shot Mask Generation Via Pseudo-Video
* SAM-Based Leaf Segmentation with Morphological Quality Assessment for Enhanced Plant Disease Detection
* SAM-FireAdapter: An adapter for fire segmentation with SAM
* Sample-Level Prototypical Federated Learning
* SAR and Visible Image Fusion via Retinex-Guided SAR Reconstruction
* SAR target recognition based on CNN with 2-D dual-tree complex wavelet transform decomposition
* SAR-to-Optical Remote Sensing Image Translation Method Based on InternImage and Cascaded Multi-Head Attention
* SarAdapter: Prioritizing Attention on Semantic-Aware Representative Tokens for Enhanced Medical Image Segmentation
* SAREval: A Multi-Dimensional and Multi-Task Benchmark for Evaluating Visual Language Models on SAR Image Understanding
* SAST: Semantic-Aware stylized Text-to-Image generation
* Satellite Views of Long-Term Variations in pCO2 on the Changjiang River Estuary and the Adjacent East China Sea (1998-2024)
* Satellite-Based Assessment of Marine Environmental Indicators and Their Variability in the South Pacific Island Regions: A National-Scale Perspective
* Satellite-Driven Evaluation of Ecological Environmental Quality Based on the PSR Framework
* SatViT-Seg: A Transformer-Only Lightweight Semantic Segmentation Model for Real-Time Land Cover Mapping of High-Resolution Remote Sensing Imagery on Satellites
* Saw-Monodetr: Shape-Aware Adaptive Weighted Transformer for Monocular 3d Object Detection
* SCAFNet: Multimodal Stroke Medical Image Synthesis and Fusion Network Based on Self Attention and Cross Attention
* Scalable Image Compression Using Conditional Diffusion Model in Human-Machine Hybrid Vision, A
* Scalable Multi-View Clustering via Bipartite Graph Consensus Filtering
* Scale-aware adaptive supervised network with limited medical annotations
* Scale-Aware Attention and Multi-Modal Prompt Learning With Fusion Adapter for RGBT Tracking
* Schedule-Robust Continual Learning
* SCIGS: 3D Gaussians Splatting from A Snapshot Compressive Image
* SCL-GAN: Spatially-Correlative Lightweight GAN for Efficient and High-Fidelity Thermal-Visible Face Synthesis
* Scribble-Guided Diffusion for Training-Free Text-to-Image Generation
* SDFCNet: A Spatial-Domain and Frequency-Domain Collaborative Network for Building Extraction in High-Resolution Remote Sensing Images
* SDR-GAIN: A High Real-Time Occluded Pedestrian Pose Completion Method for Autonomous Driving
* SEAGNet: Spatial-Epipolar-Angular-Global feature learning for light field super-resolution
* Seeing Beyond the Airways: Asthma Prediction via Cross-Attention on Dual Retinal Modalities
* SegMamba-V2: Long-Range Sequential Modeling Mamba for General 3-D Medical Image Segmentation
* Segment-Attention Augmented Dual-Contrastive Aggregation Learning for Unsupervised Visible-Infrared Person Re-Identification
* Segmentation for Early Tumor Detection in Mammograms Via Temporal Discrepancy Analysis and Dynamic Loss Weighting
* SEGMN: A structure-enhanced graph matching network for graph similarity learning
* Self Characterized Fusion Network for Prognosis of Brain Diseases
* Self-distilled learning of adaptive interval 3D lookup tables on real-time image enhancement
* Self-Expert Imitation With Purifying Latent Feature for Generalization in Visual Reinforcement Learning
* Self-Guided Discriminative Locality Preserving Projections
* Self-Referencing Adapt-Then-Combine Information Diffusion Scheme for Distributed PHD Filtering
* Self-Supervised LiDAR Desnowing with 3D-KNN Blind-Spot Networks
* Self-supervised multi-scale uniform motion deblurring via alternating optimization
* Self-Supervised Skeleton-Based Action Representation Learning: A Benchmark and Beyond
* SelfieAvatar: Real-time Head Avatar reenactntment from a Selfie Video
* SelfMAD: Enhancing Generalization and Robustness in Morphing Attack Detection via Self-Supervised Learning
* Semantic Concentration for Self-Supervised Dense Representations Learning
* Semantic Context Re-Mining for Multimodal Guided Human-Object Interaction Detection
* Semantic Prototype-Guided Sampling for Long-Tailed Generalized Category Discovery
* Semantic Segmentation of Typical Oceanic and Atmospheric Phenomena in SAR Images Based on Modified Segformer
* Semantic-guided occlusion simulation based local feature semantic expansion network for person re-identification, A
* Semantics- and Physics-Guided Generative Network for Radar HRRP Generalized Zero-Shot Recognition
* Semantics-Aware Spatial-Temporal Dynamic Graph Transformer Network for On-Street Parking Occupancy Prediction
* Semantics-Guided Generative Image Compression
* Semi-supervised crowd counting from unlabeled data
* Semi-Supervised Infrared Meibomian Gland Segmentation with Intra-Patient Registration and Feature Supervision
* Semi-Supervised Seafloor Habitat Classification: A Pseudo-Labeling Framework
* Semi-supervised semantic segmentation meets masked modeling: Fine-grained locality learning matters in consistency regularization
* Sensor Distance Learning For Cross-Camera Color Constancy
* Sentence-Level Lip-Reading with Integrated Synthetic Data and Speaker Normalization
* Sentiment analysis and risk early-warning system for cross-border M&A based on natural language processing
* Separating Domain-Private Classes for Universal Unsupervised Cross-Domain 3D Model Retrieval
* Session Class Prototype Incremental Learning (SCPIL): Mitigating Catastrophic Forgetting with Distance-Based Prototype Learning
* SF-VQA: Saliency Fragments No-Reference Video Quality Assessment
* SFEformer: Frequency-enhanced model for wind speed prediction
* SFS-NeRF: Enhancing Geometry Consistency in Few-Shot Novel View Synthesis Through Surface-Aware Neural Rendering
* SGCNet: Silhouette Guided Cascaded Network for Multi-Modal Image Fusion
* SGNet: Style-Guided Network With Temporal Compensation for Unpaired Low-Light Colonoscopy Video Enhancement
* ShadowMamba: State-space model with boundary-region selective scan for shadow removal
* Shallow Neural Network Training via Atomic Norms and Semidefinite Programming
* Shape Reconstruction of Foreground and Background in Scenes with Translucent Objects Based on Coding Curves
* Shape-Aware Refinement of Deep Learning Detections from UAS Imagery for Tornado-Induced Treefall Mapping
* Sheep Facial Pain Assessment Under Weighted Graph Neural Networks
* Shielding Latent Face Representations From Privacy Attacks
* Shift-Invariant Unsupervised Pansharpening Based on Diffusion Model
* Ship Incremental Recognition Framework via Unknown Extraction and Joint Optimization Learning, A
* Shuffle PatchMix Augmentation with Confidence-Margin Weighted Pseudo-Labels for Enhanced Source-Free Domain Adaptation
* SiamDiff: A Diffusion-Driven Siamese Network for Scale-Aware Anti-UAV Tracking
* Siamese Feature Decoupling and Adaptive Prototype Alignment for Clothes Changed Person Re-Identification
* Siavatar: Animatable 3D Gaussian Avatar from a Single Image
* SignDiff: Diffusion Model for American Sign Language Production
* Similarity Normalization and Strong Geometric Augmentation for Local Feature Matching Under Large Scale and Rotation Changes
* Similarity Shuffled Criss-Cross Transformer With Angle Loss for Image-Text Matching
* Simple Alternating Framework for ReLU-Based Nonlinear Tensor Decomposition, A
* Simple Self-Organizing Map With Vision Transformers
* Simple Zero-Shot Image Dehazing
* SimPRL: A Simple Contrastive Learning for Path Representation Learning by Joint GPS Trajectories and Road Paths
* Simulation and Analysis of Sea Surface Skin Temperature Diurnal Variation Using a One-Dimensional Mixed Layer Model and Himawari-8 Data
* SingingHead: A Large-Scale 4D Dataset for Singing Head Animation
* Single Snapshot Distillation for Phase Coded Mask Design in Phase Retrieval
* Single Voter Spreading for Efficient Correspondence Grouping and 3D Registration
* Single-ToF-LiDAR-based plane information recognition
* Sinogram Inpainting with Physics-Guided Latent Diffusion Model for Synchrotron Light Sources
* Skeleton-based Geometric Deep Neural Network for Alzheimer's Disease Mice Behavioral Analysis, A
* SkeletonX: Data-Efficient Skeleton-Based Action Recognition via Cross-Sample Feature Aggregation
* Sketch to Stylized-Image: A Two-Stage Approach for Artistic Image Generation
* Skin Cancer Classification Using Extended 5 Channel (I-RGB-U) Images Generated From RGB Images
* SLICE: Synthetic Caption-Trained Lightweight Image Captioner for Edge Devices
* Small or large superpixel graphs? Gaussian influence walk with rebound can assist
* SMC++: Masked Learning of Unsupervised Video Semantic Compression
* SmokeAttack: Physically-based adversarial smoke for LiDAR point cloud detectors
* SmoothFace: Class-Conditional Label Smoothing for Synthetic-based Face Recognition
* SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation
* Soft Gradient Boosting With Learnable Feature Transforms for Sequential Regression
* SOI-Net: Structural Optimization-Inspired Interpretable Network for Incomplete Multi-View Clustering
* Soil Organic Matter Prediction by Fusing Supervised-Derived VisNIR Variables with Multispectral Remote Sensing
* Source(s) of the Smooth Caloris Exterior Plains on Mercury: Mapping, Remote Analyses, and Scenarios for Future Testing with BepiColombo Data
* Source-free domain adaptation via multimodal space-guided alignment
* Sparse kernel k-means for high-dimensional data
* Sparse R-CNN OBB: Ship Target Detection in SAR Images Based on Oriented Sparse Learnable Proposals
* Sparse subspace learning based redundancy-aware unsupervised feature selection
* Sparse2DGS: Sparse-View Surface Reconstruction Using 2D Gaussian Splatting with Dense Point Cloud
* Sparsity-Driven Parallel Imaging Consistency for Improved Self-Supervised MRI Reconstruction
* Spatial coherence loss: All objects matter in salient and camouflaged object detection Spatial coherence loss: All objects matter in salient and camouflaged object detection
* Spatial Distribution and Characteristics of Debris-Covered Glaciers in Xinjiang Based on CGI-XJ2020
* Spatial-Spectral Consistency: A Semi-Supervised Approach for Multispectral Scene Classification
* Spatially continuous dual optimization on compactness function for image segmentation
* Spatio-Temporal Data Enhanced Vision-Language Model for Traffic Scene Understanding
* Spatio-Temporal Feature Learning Fusion and Visual Scene Endpoint Prediction for Pedestrian Trajectory Prediction
* Spatio-temporal side tuning pre-trained foundation models for video-based pedestrian attribute recognition
* Spatio-temporal transformers for action unit classification with event cameras
* Spatiotemporal Dynamics and Budget of Particulate Organic Carbon in China's Marginal Seas Based on MODIS-Aqua
* Spatiotemporal Dynamics and Driving Factors of Vegetation Gross Primary Productivity in a Typical Coastal City: A Case Study of Zhanjiang, China
* Spatiotemporal Face Alignment for Generalizable Deepfake Detection
* Spatiotemporal-Decoupled Training: Enhancing Car-Following Behavior Modeling With Cross-Spatiotemporal Generalization
* SPC TO 3d: Novel View Synthesis from Binary SPC VIA I2I Translation
* Special section: CIARP-24
* Specific Emitter Identification by Edge Pattern Detection and Incremental Open-World Learning
* Spectral Characterization of Nine Urban Tree Species in Southern Wisconsin
* Spectral Mixing Augmentation for Preventing False Positives from Hyperspectral Anomaly Detection
* Spectral Unmixing of Airborne and Ground-Based Imaging Spectroscopy for Pigment-Specific FAPAR and Sun-Induced Fluorescence Interpretation
* Spectral-aware Global Fusion for RGB-Thermal Semantic Segmentation
* Spectrum-guided feature enhancement network for event person re-identification
* Spike-Inspired Adaptive Spatial Suppression Framework for Large-Scale Landslide Extraction, A
* Splitter: Faster Inference through Channel Partitioning and Feature Fusion
* SS-NeRF: Shine-sphere rendering for neural radiance fields
* SSPD: Spatial-Spectral Prior Decoupling Model for Spectral Snapshot Compressive Imaging
* ST-GRIT: Spatio-Temporal Graph Transformer For Internal Ice Layer Thickness Prediction
* Stable-Invertible Graph Convolutional Networks for Label-Efficient Skeleton-Based Recognition
* StableIdentity: Inserting Anybody Into Anywhere at First Sight
* Stacked one-vs-one (SOvO): A new approach for multi-class classification for sEMG recognition
* State transition difference prediction for deep reinforcement learning
* StegFlow: Flow-Based High-Frequency Distribution Mapping Network for Multi-Image Steganography
* Stencil: Subject-Driven Generation with Context Guidance
* Step-Wise Distribution-Aligned Style Prompt Tuning for Source-Free Cross-Domain Few-Shot Learning
* STMixer: Spatial-Temporal Mixer for Continuous Sign Language Recognition
* Structural entropy guided relation extraction on adaptive graph structure
* Structure and sensitivity in 3D human pose similarity quantification and estimation
* Structure-aware filter using self-guided information
* Structure-Based Drug Design with Geometric Deep Learning: A Comprehensive Survey
* Structure-from-motion in micro-image domain for uncalibrated plenoptic 2.0 cameras
* Structure-Guided Diffusion Transformer for Low-Light Image Enhancement
* Structured Instruction Parsing and Scene Alignment For UAV Vision-Language Navigation
* Study on an Intelligent Screening Method for Polycystic Ovary Syndrome Based on Deep PhysicsInformed Neural Network
* StyleShot: A Snapshot on Any Style
* Subspace Training Mitigates Gradient Noise Vulnerability
* Sufficient Conditions for Convergence of RHT and RHTP Algorithms Based on RIC of Order 2S
* SUP-Net: Slow-Time Upsampling Network for Aliasing Removal in Doppler Ultrasound
* Super-resolution time-frequency decomposition with hyperlets for neural spike analysis
* Super-Resolving Digital Terrain Models Using a Modified RCAN
* Superdirective Beamforming Method Based on Spherical Harmonic Expansion in the Waveguide Environment
* Supervised Contrastive Learning for Indoor Point Cloud Oversegmentation
* Supervised learning for low-resource isolated glyph recognition in palm leaf manuscripts
* Supervisory feedback for high-resolution low-textured large-scale multi-view stereo
* Suppression of Parasitic Peaks on CFOSAT SWIM Wave Spectra Based on a Specific Parametric Method
* Survey of Defenses Against AI-Generated Visual Media: Detection, Disruption, and Authentication, A
* Survey of Small Sea-Surface Target Detection for Maritime Search and Rescue, A
* Survey on Deep Face Restoration: From Non-blind to Blind and Beyond
* Survey on Privacy-Preserving Computing in the Automotive Domain, A
* Survey on Proactive Deepfake Defense: Disruption and Watermarking, A
* survey on video emotion recognition: Segmentation, classification, and explainable AI techniques, A
* Survey on Video Temporal Grounding With Multimodal Large Language Model, A
* SVA: Towards speech-Enabled vision-Language-Action model
* SwimVG: Step-Wise Multimodal Fusion and Adaption for Visual Grounding
* Swinscale-LFVS: Parallel Feature Integration for Light Field View Synthesis
* SynTaskNet: A synergistic multi-task network for joint segmentation and classification of small anatomical structures in ultrasound imaging
* SynthAorta: A 3D Mesh Dataset of Parametrized Physiological Healthy Aortas
* Synthetic Chest X-Ray Augmentation via Generative Variational Autoencoding for Pneumonia Detection
* Synthetic Faces, Real Gains: Improving Age and Gender Classification through Generative Data
* Systematic Literature Review on Vehicular Collaborative Perception: A Computer Vision Perspective, A
* Systematic Review of Methodological Advances in Glacier-Velocity Retrieval with an Emphasis on Debris-Covered Glaciers, A
* Systematic Review of the Use of Augmented Reality in Pedestrian Navigation, A
* Systematic Review on Remote Sensing of Dryland Ecological Integrity: Improvement in the Spatiotemporal Monitoring of Vegetation Is Required, A
* T2RIAD: A Two-Stage Framework for Truck Re-ID With Domain Adversarial and Distillation Learning
* Tackling Ambiguity From Perspectives of Uncertainty Inference and Affinity Diversification for Weakly Supervised Semantic Segmentation
* Tagsim: Topic-Informed Attention Guided Similarity Metric for Image Caption Comparison
* Target Driven Adaptive Loss for Infrared Small Target Detection
* Targetless Extrinsic Calibration of Fisheye Cameras Using Vehicle Detection and Monodepth Alignment in Cylindrical Image Space
* Task-Driven Underwater Image Enhancement via Hierarchical Semantic Refinement
* Task-Specific Spatiotemporal Context-Aware Decoupling for Occluded Video Object Detection
* Taylornet: Rethinking monomial-based graph neural networks with taylor expansion
* TCP: Text-Guided Cascade Network for Pedestrian Crossing Intention Prediction
* TDI-TFFNet: Infusing time dependent images and two-stream feature fusion network for gymnastic activity recognition
* Teach Me Sign: Stepwise Prompting LLM for Sign Language Production
* Temperature Governs the Elevation Dependency of Snow Cover Changes in the Upper Reaches of the Yarkand River Basin
* Temporal knowledge graph reasoning with local-global evolutionary patterns
* Temporal prompt guided visual-text-object alignment for zero-shot video captioning
* Temporal Prompt Learning With Depth Memory for Video Mirror Detection
* Tensor band restricted thresholding algorithms for affine tensor rank minimization
* Tensor Completion Framework by Graph Refinement for Incomplete Multi-View Clustering
* Tensor wheel completion with parallel matrix factorization and group smoothness for hyperspectral image recovery
* Tensor-Based Privacy-Aware Driving Route Navigation Based on Cloud-Fog-Edge Calculative User-Vehicle-Road Preferences
* TerraFly-Forensics: A Dataset for Forensic Detection of Generated Map Images with Quality Assessment of Generative Models
* Test-Time Augmentation for Pose-invariant Face Recognition
* Test-Time Vocabulary Adaptation for Language-Driven Object Detection
* Testing Peepers on Pixels: A Demo of Human Recognition Accuracy for Low Resolution Faces
* Tetrahedral molecular pretraining for enhanced property prediction
* Text-Centric multimodal sentiment analysis with asymmetric fine-tuning
* Text-guided weakly supervised framework for dynamic facial expression recognition
* Text-Injected Discriminative Model for Remote Sensing Visual Grounding
* Text-to-Floorplan Synthesis via Graph-Conditioned Diffusion Processes
* Texture- and Shape-Based Adversarial Attacks for Overhead Image Vehicle Detection
* Texturing Endoscopic 3D Stomach via Neural Radiance Field Under Uneven Lighting
* TG-TSGNet: A Text-Guided Arbitrary-Resolution Terrain Scene Generation Network
* Theft model-based black-box adversarial attack in embedding space
* Thermal Deformation Correction for the FY-4A LMI
* Thinning Methods and Assimilation Applications for FY-4B/GIIRS Observations
* Time-Efficient Uncertainty Estimation Based on Target Networks in Deep Reinforcement Learning
* Tiny Faces, Big Trouble: Evaluating Super-Resolution for Face Recognition
* TLRR-TF: A fast tensor low-rank representation via tri-factorization
* To Fold or Not to Fold: Graph Regularized Tensor Train for Visual Data Completion
* To Skip or to Mask? A Study of the Adversarial Purification Spectrum
* Token Calibration for Transformer-Based Domain Adaptation
* Tool-Assisted Annotation of Seafloor Sediment-Linked Features Using Weakly Supervised Semantic Segmentation
* Topology-Aware Modeling for Unsupervised Simulation-to-Reality Point Cloud Recognition
* Toroidal Adaptive Intensity and Spectrum Updating Image Reconstruction for Fourier Ptychographic Microscopy
* Toward Free-Form Local Feature Matching
* Toward Real-Time BCI Authentication for Enhanced Security in Collaborative Systems
* Toward Size-Invariant Salient Object Detection: A Generic Evaluation
* Toward Thermal Infrared Image Colorization via Large Kernel Convolution and Patch-Wise Graph Contrastive Learning
* Toward Unified Co-Speech Gesture Generation via Hierarchical Implicit Periodicity Learning
* Towards All-Time, All-Weather Fod Detection Through Generative AI
* Towards Certified Object Detectors: Certified Runway Detection Using Yolo
* Towards Controllable Real Image Denoising With Camera Parameters
* Towards Dark-Field X-ray Microscopy Through Coherent Encoding
* Towards Effective and Robust Unlearnable Examples Against Object Detection
* Towards Fair and Robust Face Parsing for Generative AI: A Multi-Objective Approach
* Towards Image Copy Detection at E-Commerce Scale
* Towards Invisible Decision-Based Adversarial Attacks Against Visual Object Tracking
* Towards Iris Presentation Attack Detection with Foundation Models
* Towards maximizing feature efficiency: All-in-one image restoration via radial basis attention
* Towards ML-based Assessment of Synthetic Characters Heads
* Towards Open-set Face Anti-spoofing with Unseen Attack Synthesis
* Towards Reliable Disaster Detection: Comparing Semantic and Heuristic Filters for Multimodal Data
* Towards robust and inversion-free randomized neural networks: The XG-RVFL framework
* Towards robust and reliable multi-modal 3D segmentation of multiple sclerosis lesions
* Towards Robust Text-Guided Image Compression Under Modality Missing
* Towards Test Time Adaptation in Low Dose Computed Tomography Denoising Via Bias Modulation
* Towards Trustworthy Disaster Severity Scoring: Combining Semantic Alignment and Chain-of-Thought LLMs
* Towards Zero-Shot Differential Morphing Attack Detection with Multimodal Large Language Models
* TP-IoAV: A Tri-Party Cloud Data Protection Scheme for Internet of Autonomous Vehicle Coupled With Chaotic Biometric Cryptography
* TPEech: Target Speaker Extraction and Noise Suppression With Historical Dialogue Text Cues
* Track2Net: A Fast Lightweight Model With Keypoint Alignment and Track Anchors Identification for Railway Line Tracking
* Traditional Approach for Color Constancy and Color Assimilation Illusions with Its Applications to Low-Light Image Enhancement, A
* Training A Phase Detection Autofocus Model Using Hybrid Labels
* Trajectory Planning for Autonomous Driving in Transportation Systems Based on Deep Reinforcement Learning and Spatio-Temporal Voxels
* Trajectory-guided Motion Perception for Facial Expression Quality Assessment in Neurological Disorders
* Transductive One-Shot Learning Meet Subspace Decomposition
* TransFA: Transformer-based representation for face attribute evaluation
* TransFace++: Rethinking the Face Recognition Paradigm With a Focus on Accuracy, Efficiency, and Security
* Transfer Learning from Visual Speech Recognition to Mouthing Recognition in German Sign Language
* Transform Set Merging for Neural Network-Based Intra Prediction in beyond VVC
* Transformer Augmented Multi-Resolution Hash Encoding in Diffusion Model for 3D Point Cloud Denoising
* Transformer tracking with high-low frequency attention
* Transformer-Based Approaches to Description Sequence Generation for Chinese Characters
* Transparent and Lightweight Tumor-Aware MRI Super-Resolution Framework to Enhance Prostate Cancer Detection, A
* Tree of Shapes Computation Algorithm for Massively Parallel Architectures, A
* Tri-modal fusion for dynamic hand gesture recognition: Integrating RGB, depth, and skeleton data
* TriDGNet: Triple Feature Encoder-Based Dual Granularity Graph Learning Network for Enhanced Travel Time Estimation
* TriGAN-SiaMT: A triple-segmentor adversarial network with bounding box priors for semi-supervised brain lesion segmentation
* Triqa: Image Quality Assessment by Contrastive Pretraining on Ordered Distortion Triplets
* Trust-Guided Approach to MR Image Reconstruction With Side Information, A
* TrustSkin: A Fairness Pipeline for Trustworthy Facial Affect Analysis Across Skin Tone
* TSFormer: Efficient Ultra-High-Definition Image Restoration via Trusted Min-p
* TSLDSeg: A texture-aware and semantic-enhanced latent diffusion model for medical image segmentation
* Tuning-Free High-Resolution Video Diffusion With Spatial-Temporal Latent Grouping
* TURBIT: Generating Turbid Underwater Images with Diffusion and Differential Transformers
* Two-stage diffusion for hands and articulated objects interaction synthesis
* Two-Stage Framework For Enhanced Hyperspectral Anomaly Detection
* two-stage tone mapping network based on attention mechanism for high dynamic range images, A
* U-Shaped Network with Hybrid Convolution and Block Calculation for Road Extraction
* UAV Remote Sensing of Submerged Marine Heritage: The Tirpitz Wreck Site, Håkøya, Norway
* UAV-based multimodal object detection via feature enhancement and dynamic gated fusion
* UAV-Based Remote Sensing Methods in the Structural Assessment of Remediated Landfills
* UIL-AQA: Uncertainty-Aware Clip-Level Interpretable Action Quality Assessment
* Ultrafast High-Flux Single-Photon Lidar Simulator via Neural Mapping
* UltraSeP: Sequence-aware pre-training for echocardiography probe movement guidance
* UMCL: Unimodal-generated Multimodal Contrastive Learning for Cross-compression-rate Deepfake Detection
* uncertainty advantage: Enhancing large language models' reliability through chain of uncertainty reasoning, The
* Uncertainty-Aware and Decoupled Distillation for Semantic Segmentation
* Uncertainty-Driven Sampling for Efficient Pairwise Comparison Subjective Assessment
* Unconstrained Body Recognition at Altitude and Range: Comparing Four Approaches
* Understanding the adversarial robustness of deep learning-based single-pixel imaging
* Underwater Acoustic Channel Estimation via Accelerated TMSBL With KSVD-Based Denoising and Robust Initialization
* Underwater image compression for human and machine visions with hybrid priors embedding
* UNet-Like Transformer Network for Camouflaged Object Detection, A
* UniAlign: A Universal Cross-Modality Knowledge Alignment Framework for Fine-Grained Action Recognition
* Unified Cross-Modal Medical Image Synthesis With Hierarchical Mixture of Product-of-Experts
* Unified Masked Jigsaw Puzzle Framework for Vision and Language Models, A
* Unified multi-modality conditional latent diffusion model for point cloud generation, A
* Unified Multimodal Vessel Trajectory Prediction With Explainable Navigation Intention
* Unified Transformer-Based Framework with Pretraining for Whole Body Grasping Motion Generation, A
* Unifying and Conquering Adversarial Attacks against Deep Face Recognition
* UniGait: A Unified Transformer-based Multitask Framework for Gait Analysis in the Wild
* UniqueSplat: View-Conditioned 3D Gaussian Splatting for Generalizable 3D Reconstruction
* UniSOT: A Unified Framework for Multi-Modality Single Object Tracking
* Unleashing the Potential of Hierarchical Region Clues for Open-Vocabulary Multi-Label Classification
* Unlocking A New Paradigm In Robustness For Multi-Step Facial Forgery Detection
* Unlocking Cross-Domain Synergies for Domain Adaptive Semantic Segmentation
* Unlocking human intent perception through multimodal large models
* Unpredictable Trajectory Optimization for UAV-Assisted Anti-Jamming Data Collection
* Unraveling Urban Mobility: A Domain Knowledge-Free Trajectory Classification Using Gramian Angular Fields
* Unraveling Vanishing Point And Calibrating Tiny Objects For Semantic Scene Completion
* Unrevealed Threats: Adversarial Robustness Analysis of Underwater Image Enhancement Models
* Unrolling Nonconvex Graph Total Variation for Image Denoising
* Unsharp-Inspired Adversarial Point Cloud Perturbation via Low-Rank Approximation
* Unsupervised Brain Lesion Segmentation Using Posterior Distributions Learned by Subspace-Based Generative Model
* Unsupervised Deep Learning for Anomaly Detection in Automotive Trucks: A Survey
* Unsupervised multi-modal domain adaptation for RGB-T Semantic Segmentation
* Unsupervised Point Cloud Reconstruction via Recurrent Multi-Step Moving Strategy
* Unsupervised Robust Domain Adaptation: Paradigm, Theory and Algorithm
* US-Loss: Integrating Uncertainty Estimation in the Loss Function of Image Segmentation
* Useg-PanoDepth: Unified 360° Depth Estimation for Indoor and Outdoor Scenes With Semantic Assistance
* User-in-the-Loop View Sampling with Error Peaking Visualization
* USformer: A U-Shaped Structure Transformer for RGB-Thermal Semantic Segmentation and Traffic Scene Understanding
* Using Cross-Domain Detection Loss to Infer Multi-Scale Information for Improved Tiny Head Tracking
* USRNet: A simple yet effective Underwater Scene Restoration Network
* USTC-TD: A Test Dataset and Benchmark for Image and Video Coding in 2020s
* Utilization of Diffusion Models for Noise Reduction in Ultrasound Images
* Utilizing Keypoint R-CNN for Automated Root Angulation Detection in OPGs
* Value-Based Parallel Update MCTS Method for Multi-Agent Cooperative Decision-Making of Connected and Automated Vehicles, A
* Variable priority for unsupervised variable selection
* Variable Rate Learned Wavelet Video Coding Using Temporal Layer Adaptivity
* variational Bayesian approach for multimodal multi-instance classification, A
* Variational graph filter autoencoder for uncovering community structure in multiplex networks
* Vehicle Dynamics Embedded World Models for Autonomous Driving
* Vehicle Visual Perception Under Low Visibility Road Environments Based on AoP&DoP Multi-Polarization Parameter Characterization
* Vertical Monitoring of Chlorophyll-a and Phycocyanin Concentrations High-Latitude Inland Lakes Using Sentinel-3 OLCI
* Vertically Resolved Supercooled Liquid Water over the North China Plain Revealed by Ground-Based Synergetic Measurements
* Veta-Gs: View-Dependent Deformable 3d Gaussian Splatting for Thermal Infrared Novel-View Synthesis
* Vicinal Gaussian Transform: Rethinking Source-Free Domain Adaptation Through Source-Informed Label Consistency
* VIDA: Unsupervised Visible-to-Infrared Domain Adaptation for Object Detection Using Large Vision Language Model
* Video Face Super-Resolution With High-Precision Identity Preservation
* Video Individual Counting with Implicit One-to-Many Matching
* Video is Worth a Thousand Images: Exploring the Latest Trends in Long Video Generation
* Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
* Video-Based Assessment of Bradykinesia in Ataxia-Teleangiectasia Patients
* Viewpoint-Dependent 3D Visual Grounding for Mobile Robots
* Viewport-Patch Extraction Enhanced 360° Video Quality Assessment
* Virtual Reference Frame-Based Inter Prediction for MPEG Enhanced G-PCC
* Visibility-Based Geometry Pruning of Neural Plenoptic Scene Representations
* Vision Language Model Interpretability with Concept Guided Decoding
* vision-based framework and dataset for human behavior understanding in industrial assembly lines, A
* Vision-Based Mobile App GUI Testing: A Survey
* Visionary Co-Driver: Enhancing Driver Perception of Potential Risks With LLM and HUD
* VisionScores - A System-Segmented Image Score Dataset for Deep Learning Tasks
* Visual Artificial Intelligence: Unlocking Efficiency with Psychovisual Models
* Visual Encoders for Generalized Chromosome Recognition
* Visual Keyword Spotting with Multi-Encoder for MAVSR 2025
* Visual Measurement and Uncertainty Prediction of Insulator Thickness in Insulated Rail Joints
* Visual object tracking via adaptive feature fusion and two-stage channel selection
* Visual Prompt Aided Single Shot Object Part Segmentation
* Visual Prompting Through Image Mines
* Visual Question Answering Using Multimodal Data Augmentation for Hausa
* Visual reasoning consistency and robustness analysis of multimodal LLMs
* Visual saliency fixation via deeply tri-layered multi blended trans-encoder framework
* VisualCent: Visual Human Analysis using Dynamic Centroid Representation
* Visualization of Breast Cancer Using Contrast-Enhanced Optical Coherence Elastography Based on Tissue Heterogeneity
* ViTA-PAR: Visual And Textual Attribute Alignment With Attribute Prompting For Pedestrian Attribute Recognition
* ViV-ReID: Bidirectional Structural-Aware Spatial-Temporal Graph Networks on Large-Scale Video-Based Vessel Re-Identification Dataset
* VLMAR: Maritime scene anomaly detection via retrieval-augmented vision-language models
* VQIT-GNN: A collaborative knowledge transfer for node-level structure imbalance
* VQM4HAS: A Real-Time Quality Metric for HEVC Videos in HTTP Adaptive Streaming
* VRScout: Towards Real-Time, Autonomous Testing of Virtual Reality Games
* VSIS-RDPA: Verifiable Secret Image Sharing Based on Polynomial Interpolation for Resisting Dishonest Participant Attacks
* WAM-Net: Wavelet-Based Adaptive Multi-scale Fusion Network for fine-grained action recognition
* Warmer Start to Active Learning with Adaptive Gaussian Mixture Models for Skin Lesion Segmentation, A
* Wasserstein-Space-Based Framework for Processing Fiber Orientation Geometry in Diffusion MRI, A
* Watermarking Diffusion Models By Constructing Generative Classifiers
* WaveE2VID: Frequency-Aware Event-Based Video Reconstruction
* Wavelet Packing for Self-Supervised Monocular Depth Estimation
* Wavelet-Based Denoising Transformer With Fourier Adjustment for UAV Nighttime Tracking
* Wavelet-based learning and optimized sampling for image deraining
* Weakly Supervised Defect Localization with Residual Features
* Weakly-Supervised Nuclei Segmentation Integrating Hybrid Decoder and Graph-Based Spatial Modeling
* Wearable-Derived Behavioral and Physiological Biomarkers for Classifying Unipolar and Bipolar Depression Severity
* Weighted Average Prediction for Region Adaptive Hierarchical Transform in Solid Geometry Point Cloud Compression
* Weighted Total Variation for Hyperspectral Image Denoising Based on Hyper-Laplacian Scale Mixture Distribution
* WF-MDC: Enhancing few-shot website fingerprinting via multiplicative distribution calibration
* What2Keep: A communication-efficient collaborative perception framework for 3D detection via keeping valuable information
* When 512X512 is Not Enough: Local Degradation-Aware Multi-Diffusion for Extreme Image Super-Resolution
* When Deep Learning Meets Broad Learning: A Unified Framework for Change Detection with Synthetic Aperture Radar Images
* When Mamba meets CNN: A hybrid architecture for skin lesion segmentation
* When Multi-Focus Image Fusion Meets Nonlinear Spiking Neural P Systems
* Which Image Quality Measure is Optimal for Ultrasound Imaging?
* Wide-Area Spectrum Sensing for Space Targets Based on Low-Earth Orbit Satellite Constellations: A SRFlow Model for Electromagnetic Spectrum Map Reconstruction
* Winter Sea-Surface-Temperature Memory in the East/Japan Sea Under the Arctic Oscillation: Time-Integrated Forcing, Coupled Hot Spots, and Predictability Windows
* Wireless Channel as a Sensor: An Anti-Electromagnetic Interference Vehicle Detection Method Based on Wireless Sensing Technology
* Wonder3D++: Cross-Domain Diffusion for High-Fidelity 3D Generation From a Single Image
* Would a Simple Res-UNet Unsupervised Domain Adaptation Solve DubaiSat2 Delineated Labels?
* X265-PVMAF: A Real-Time Perceptual Video Quality Metric for HEVC Video Encoding
* Y-LIChess: Live and Interactive Over-The-Board Chess Recognition and Play with Yolo
* YOLO and SGBM Integration for Autonomous Tree Branch Detection and Depth Estimation in Radiata Pine Pruning Applications
* YOLO-Based Waste Detection in Smart Waste Management for a Cleaner Future: A Review
* YOLO-CMFM: A Visible-SAR Multimodal Object Detection Method Based on Edge-Guided and Gated Cross-Attention Fusion
* YOLO-VG: Enhancing Multi-Stage Feature Interaction for Visual Grounding
* YOLOFeat: Unified Object Detection and Feature Extraction for Multi-Object Tracking
* YOLOv11-SAFM: Enhancing Landslide Detection in Complex Mountainous Terrain Through Spatial Feature Adaptation
* Your Face, Your Privacy: Combating Unauthorized Usage
* Zero-shot egocentric action recognition via chain-of-imagination prompts and inertial strengthening adaptor
* Zero-Shot Learning for Limited Photon Budget Denoising in Structured Illumination Microscopy
* Zero-Shot Pseudo Labels Generation Using Sam and Clip for Semi-Supervised Semantic Segmentation
* Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models
* ZVIR: Zero-shot implicit deep image prior with prior activation for infrared and visible image fusion
1964 for 2601