Update Dates 2601

2601 * 2D DOA Estimation of Coherent Signals Exploiting Moving Uniform Rectangular Array
* 2D-3D Attention and Entropy for Pose Robust 2D Facial Recognition
* 3D Face Reconstruction Error Decomposed: A Modular Benchmark for Fair and Fast Method Evaluation
* 3D Gaussian Splatting Reconstruction from Simulated CT Projections with Geometric Initialization
* 3D Magnetic Inverse Routine for Single-Segment Magnetic Field Images
* 3D Mesh Convolution-Based Autoencoder for Geometry Compression, A
* 3D Multi-Object Tracking Driven by Multi-Level Association and Intelligent Filtering
* 3D point cloud classification network with hybrid sampling enhancement and point energy attention
* 3D Trajectory and Pickup/Drop-Off Strategy for UAV-Enabled Delivery: Trade-Off Between Time and Energy Minimization
* Diffusion for Layout Control in Text to Image Generation (H4)
* Face Restoration, Facial Image Restoration (H4)
* U-Net, Convolutional Neural Networks (H4)
* A3-TTA: Adaptive Anchor Alignment Test-Time Adaptation for Image Segmentation
* ACC: Alternating Complementary Colors for Display Energy Reduction
* Action-to-Action Diffusion Network for Weakly Supervised Temporal Action Localization
* ActRecognition-GPT: Utilizing Multimodal Large Language Models for Spatiotemporal Action Recognition in Nursery Videos
* Adapting Foundation Features via Cross-View Contrastive Learning for Unseen Object Pose Estimation
* Adaptive Fault-Tolerant Perimeter Control for Two-Region Networks With Actuator Faults
* Adaptive fuzzy feature selection with the fusion of data distribution information
* Adaptive Hierarchical Feature Difference Auto-Encoder for Robust RGB-T Object Tracking
* Adaptive High-Frequency Preprocessing for Video Coding
* Adaptive Multimodal Fusion via Attention-Guided Feature Selection for Histopathology Image Classification
* Adaptive Non-Linear Graph Filter in Semi-Supervised Graph Based Classification, An
* Adaptive Nonuniform Map Re-weighting Method for Image Dehazing, An
* Adaptive Segmentation-Based Initialization for Steered Mixture of Experts Image Regression
* Adaptive Smoothing of Non-Rectangular Prediction Block Edges in the Wedge Mode of AVM
* Adaptive sparse contrastive learning for unsupervised object re-identification
* Adaptive Voxelization for Transform Coding of 3D Gaussian Splatting Data
* AdaptiveFusion: Adaptive Multi-Modal Multi-View Fusion for 3D Human Body Reconstruction
* Advanced Sensor Analytics and Extreme Value Modeling: Dichotomizing Day-Night Variability in Rear-End Collisions on Expressways
* Advancements in Medical Image Classification Through Fine-Tuning Natural Domain Foundation Models
* Advancing Chinese Lip Reading through Contextual Enhancement
* Advancing Limited-Angle CT Reconstruction through Diffusion-Based Sinogram Completion
* Adversarial flow-based generative models for visible-to-Infrared person re-Identification
* Adversarial Image Purification by Explaining Adversarial Detectors
* AeroGen: Ground-to-Air Generalization for Action Recognition
* AffectVLM: Contrastive Language-Image Learning with Augmented Textual Prompts for 3D/4D Facial Expression Recognition Using Vision-Language Model
* Afmunet: Adaptive Filter-Based Frequency Modulation UNET For OCTA Segmentation
* AgeDB-30M Dataset: Melanated Faces for Age-Invariant Face Recognition, The
* Aggressive Rejection with Adaptive Gradient for Contaminated Data
* AGVOT: Visual Object Tracking via Cooperation of Aerial and Ground Views
* AI and Smart Sensors Usher in a New Era in Patient Care
* AI-Based Real-Time Fight Detection Through CCTV Cameras
* AI-Driven Vehicle Damage Detection: A Saliency-Based Segmentation Approach
* Air Brake Model With Electronically Controlled Pneumatic for Heavy-Haul Trains, An
* ALCER3D: Adaptive Learning Constraints for Enhanced Retrieval of Complex Indoor 3D Scenarios
* ALDA: Enhancing the transferability of adversarial attacks with attention-guided look-ahead and data augmentation
* Almost-Surely Convergent Randomly Activated Monotone Operator Splitting Methods
* ALPSB: Adaptive learngene with plastic and stable branches
* Alternative Cardinal Spline for Cubic B-Spline Interpolation, An
* Analysis of Human Perception in Distinguishing Real and AI-Generated Faces: An Eye-Tracking Based Study
* Analysis of image aesthetics assessment as a positive-unlabelled problem
* Anatomical Attention Alignment Representation for Radiology Report Generation
* Anchor-Based Gravity Alignment for Panoramas
* Anchor-ViT: Spatially-Focused Vision Transformer for Distracted Driving Detection
* Anti-FT: Towards Practical Deep Leakage From Gradients
* AOD-RSE: Improved Dehazing Network for Object Detection Models in Self-Driving Scenarios
* ARaBIQA: A Novel Blind Image Quality Assessment Model for Augmented Reality
* Are Foundation Models All You Need for Zero-shot Face Presentation Attack Detection?
* ASM-DiffConvNet: Physics-Guided Difference Convolution Network for Single-Image Restoration
* Astrophotography Turbulence Mitigation Via Generative Models
* Asymmetric modal fusion for multi-modal crowd counting
* Asymmetric Strip Transformer With Position Vectors Embedding for Lane Detection
* AT-PMF: Progressive multi-modal fusion with adversarial training for physiological emotion recognition
* Attention and Mamba-Driven Quality Assessment for Underwater Images
* Attention-Guided Band Pruning for Efficient Hyperspectral Early Grape Leaf Disease Detection
* AttentiveSfP: Leveraging Dualpool-Former and attention mechanisms for accurate shape from polarization
* Attribute-Specified Generation And Style-Transfer Diffusion For Face Recognition Enhancement
* AU-EMO Correlation based zero-shot facial expression recognition with graph convolutional network
* Audio Visual Segmentation through Text Embeddings
* Automated Activity Monitoring of Cryptic Species in a Zoo Environment
* Automatic Extrinsic Calibration Method for mmWave Radar and Camera in Traffic Environment, An
* Automatic Insect Pest Identification and Recognition for Paddy Crops Pest Control
* Automatic Turkish Image Captioning Using Non-Native Deep Caption Generator Models and Neural Machine Translators
* Autoregression-Free Video Prediction Using Diffusion Model for Mitigating Error Propagation
* Avoiding Bias While Pruning Neural Networks: The Case of Image Classification
* AWFusion: An adaptive end-to-end wave-based method for infrared and visible image fusion
* Axial Sphere Loss: Encouraging Open-Space Risk Minimization in Face Identification Tasks
* BAIT: A New DNN Backdoor Attack Using Inpainted Triggers
* BAM: Backdoor defense based on adversarial mitigation
* Batch-Aware Active Learning for Object Detection
* Bayesian high-order tensor factorization for learning the hidden low-rank structure
* Bayesian Multifractal Image Segmentation
* Bayesian Optimization Based Deep Learning Models for Detection of Forest Fires
* Bayesian Surprise for Small and Sub-Pixel Moving Target Detection
* BCFNet: Bi-temporal collaborative fusion network for multi-modal humor detection
* BD Open LULC Map: High-Resolution Land Use Land Cover Mapping and Benchmarking For Urban Development In Dhaka, Bangladesh
* Benchmark and Evaluation for Real-World Out-of-Distribution Detection Using Vision-Language Models, A
* Benchmark Dataset for Automated Diagnosis and Treatment Planning of Class III Malocclusion Using X-Rays and Profile Photos, A
* Beta Wavelet Induced Multi-Scale Kernel Clustering: A Frequency-Aware Framework for Complex Data Analysis
* Bevanet: Bilateral Efficient Visual Attention Network for Real-Time Semantic Segmentation
* Beyond deceptive flatness: Dual-order solution for strengthening adversarial transferability
* Beyond Deep Learning: Agentic AI Framework for Object Detection
* Beyond FACS: Data-driven Facial Expression Dictionaries, with Application to Predicting Autism
* Beyond Meme Templates: Limitations of Visual Similarity Measures in Meme Matching
* Beyond non-expert demonstrations: Outcome-driven action constraint for offline reinforcement learning
* Beyond Static Fusion: A Mixture-of-Experts Framework for Multimodal Breast Cancer Classification
* Bi-Grid Reconstruction for Image Anomaly Detection
* Bicycle Travel Time Estimation via Dual Graph-Based Neural Networks
* Bicycledualnet: Bicyclegan-Powered Dual Encoder Network for Single Image 3D Reconstruction
* Bidirectional Flow Fields for Sparse Input Novel View Synthesis of Dynamic Scenes
* BioGaze: a Framework for Evaluating the Photographic Requirements of the ISO/IEC 39794-5 Standard
* BioVL-QR: Egocentric Biochemical Vision-And-Language Dataset Using Micro QR Codes
* Bits-to-Photon: End-to-End Learned Scalable Point Cloud Compression for Direct Rendering
* Blaze: A Dataset For Wildfire And Burnt Area UAV Image Classification And Segmentation
* Blind Denoising Using Dense in Dense Network with Attention Module
* Blind Multi-Mode Ptychography using a Distributed Probe Estimate
* Boosted Affine Motion Compensation For Geometric Partitioning Mode
* Boosting Dataset Distillation With the Assistance of Crucial Samples for Visual Learning
* Boosting Faithful Multi-Modal LLMs via Complementary Visual Grounding
* Boosting Text-To-Image Person Re-Identification With Generative Hard Negative
* Boosting Tiny Face Detection in Videos with an Integral Score Framework
* Boundary mutual information hashing for cross-modal retrieval
* Brain foundation models with hypergraph dynamic adapter for brain disease analysis
* Branch-Splitter multi-granularity feature fusion for local joint-angle estimation
* BreathAI: Transfer Learning-Based Thermal Imaging for Automated Breathing Pattern Recognition
* Bridging Domain Shifts Through Self-Contrastive Learning And Distribution Alignment
* Brief Analysis of the Change Detector by Kervrann et al., A
* BSRPCA: A Simplified Blind Super-Resolved RPCA-Based Approach for Enhancing Blood Flow Estimation
* BTDGNet: A Dual-Guided Camouflaged Object Detection Network Leveraging Boundary and Texture Information
* BVSR-EvD: Blurry Video Space-Time Super-Resolution With Events via Diffusion Models
* CADOT: Cityscape Aerial Image Dataset For Object Detection
* CAFCL: Class-aware flow-based contrastive learning for out-of-distribution detection
* CAG: Context-Conditional 2D Affordance Generation
* Calibrated mixup for imbalanced regression on tabular data
* Calibration of Sparse LiDAR and Camera Based on Spatial Feature Analysis and Adaptive Constraints
* Camera Pose Estimation in Multi-Object Scenes Using Ray Diffusion and Point Cloud Alignment
* Can Large Language Models Challenge CNNs in Medical Image Analysis?
* Can Pose Transfer Models Generate Realistic Human Motion?
* Category-Dependent Learned Image Compression for Smartphone Photography with Standard-Compliant Decoders
* Causality-Driven Explainable Multimodal Fusion With Visual-Text Parallel Computing for Cloth-Changing Pedestrian Re-Identification
* Causality-Inspired Debiasing Learning for Open World Object Detection
* CauSkelNet: Causal Representation Learning for Human Behaviour Analysis
* CDCGM: Composition-specified Dance Choreography Generation from Music
* Certainty and Uncertainty Guided Active Domain Adaptation
* CFFN: Cascaded Feature Fusion Network for Facial Expression Recognition
* CGD-MAE: Clip Distillation-Driven Pre-Training Framework for Vehicle Re-Identification
* Chagas Parasite Semi-Supervised Classification in Blood Sample Images Using Different Deep Learning
* Channel-Wise 1D Convolutional U-Net
* Chasing Shadows: Solving Deepfake Detection Benchmarks Using Irrelevant Features Only
* ChatPPG: Computational Analysis and Statistics of Table Tennis Games
* CHTMAE: Cross-Modal Hierarchical Temporal-Spatial Masked Autoencoder Model for Micro-Expression Recognition
* CHUG: Crowdsourced User-Generated HDR Video Quality Dataset
* City-Level Pavement Distress Inspection Using Crowdsourced Data of Logistics Vehicles
* Class-aware prototype augmentation and decoupled feature distillation for class-incremental learning
* Class-specific feature reconstruction with pseudo-label for open-set HRRP recognition
* CLIP-AE: Clip-Assisted Cross-View Audio-Visual Enhancement for Unsupervised Temporal Action Localization
* CLIP-FSQAE: Clip-Guided Finite Scalar Quantized Autoencoder for Few-Shot Anomaly Detection
* CLIP-HandID: Vision-Language Model for Hand-Based Person Identification
* CLIP-SENet: CLIP-Based Semantic Enhancement Network for Vehicle Re-Identification
* CLIP-WSDDN: An optimized weakly supervised object detection network with zero supervised classification prior
* Close-to-Optimal Counter Histogram-Based Forensics Using Mean Structural Similarity Index Metric
* Cloud Optical Thickness Retrievals Using Angle Invariant Attention Based Deep Learning Models
* Cluster Contrast for Unsupervised Visual Representation Learning
* Cluster-aware prompt ensemble learning for few-shot vision-language model adaptation
* CMIP: Combining Constructive Model With Improvement Policy for Large-Scale Min-Max Multiple Traveling Salesman Problem
* CMP: Composable Meta Prompt for Sam-Based Cross-Domain Few-Shot Segmentation
* CMTM: Cross-Modal Token Modulation for Unsupervised Video Object Segmentation
* Co-HSC: Complementary image-mesh fusion for dense human-scene contact estimation
* Coding-Information Based Improvement For In-Loop Filters Beyond VVC
* COFP: A Collaborative Optimization Framework With Polyhedral Feature Extraction for Multi-Weather Image Restoration
* Cognitive and Memory-Driven EEG-Based Authentication: A Multi-Session Approach to Secure Biometric Systems
* Cognitive Task Virtualization for Alzheimer's Diagnosis Using Realistic VR Simulation
* Collaborative Computation in Integrated Sensing, Communication, and Computation System for Autonomous Driving
* Collision attack and error corrected multimodal-inspired framework for underwater video enhancement
* Color Is Not Enough: Dataset and Method for Identifying Relevant Traffic Lights in Driving Scenes
* ColorGPT: Automatic Colorization with Generative Prompts and Transformer
* Combination Test of NNVC Tools and NN-Inter In VVC
* Combining EEG and MRI in a Multimodal Approach for Parkinson's Disease Detection
* Common and Unique Representation Deep Embedded Clustering
* Communication Efficient Over-the-Air Federated Learning With Random FLARE Algorithm
* Communication-Efficient FL With Hybrid Aggregation for the CAVs Over Multiple BSs
* Compact exploration for continuous action reinforcement learning
* Compact Latent Representation for Image Compression (CLRIC)
* Comparative Analysis of Automatic Speech Recognition Fine-Tuning Strategies for Speech From Cochlear Implant Users
* Comparative Analysis of IR-VIS Image Fusion Methods: Object Detection with YOLO Architectures and Fusion Quality Evaluation, A
* Comparative Study of DINOv2, I-JEPA, and ViT Embeddings for Unsupervised Anomaly Detection
* Comparison of the unmodified Rytov method and the modified Rytov method in obtaining scintillations in various strongly turbulent media
* Comparison of Visual Trackers for Biomechanical Analysis of Running
* comprehensive survey of image clustering based on deep learning, A
* Comprehensive Survey of Transformers in Text Recognition: Techniques, Challenges, and Future Directions, A
* Compressed image super-resolution based on invertible degradation and restoration
* Compressing Human Body Video with Interactive Semantics: A Generative Approach
* Compressing Multi-Scale Features with a Channel-Shrinked Single-Branch Architecture
* Concentration Inequalities for Semidefinite Least Squares Based on Data
* Conditional Diffusion Transformer for Unified Distortion Correction and Rectification
* Conditional GAN for Time-to-Peak (TTP) Generation from Non-Contrast MRI Modalities
* Confidence-Aware Agglomeration Classification And Segmentation Of 2D Microscopic Food Crystal Images*
* Confidence-Based Sampling Strategy for Dense Temporal Token Learning in Thermal Infrared Object Tracking, A
* Conformal Compressors
* consistency regularization training method for automatic modulation classification under incomplete information, A
* Consistent Connected Operators Based on Trees of Shapes
* Consistent View Synthesis with Bidirectional Epipolar Attention and Reconstruction
* Constrained GAN-Generated X-Ray CT Data For Self-Supervised And Foundation-Model Segmentation Of Concrete Microstructures
* Context-Assisted Low-Light Face Detection through Global and Local Image Enhancement
* Context-Aware and Semantic-Synergistic Linguistic Steganalysis for Social Networks
* Context-Aware Simulation with Machine Vision for Industrial Safety
* Context-Based Screening of Autism Risk in Children
* Context-Dependent Anomaly Action Recognition
* Contextloss: Context Information for Topology-Preserving Segmentation
* Continual deep multi-view clustering via contrastive knowledge replay
* Continuous Action Unit Intensity Modeling for Micro-Expression Recognition
* Contrastive Learning-Based Deep Embedded Clustering and the TCN-DMAttention Model for Traffic Congestion Prediction
* ConvFuse: A Progressive Convformer Network for Context-Aware Multisensor Image Fusion
* Cooperative Perception of Multi-Agents Under the Spatio-Temporal Drift Issue
* CorDis: A Novel Correlation-Based Disentanglement Measure
* Cost-Efficient Approach to Managing Simultaneous Charging Sessions in Large-Scale EV Stations, A
* COT-AD: Cotton Analysis Dataset
* CRAFT: Contextual Re-Activation of Filters for face recognition Training
* Critical Contour Prior-Guided Graph Learning With Pose Calibration for Identity-Aware Deepfake Detection
* Cromdbn: Dynamic brain network analysis for neuropsychiatric diseases classification via multi-knowledge integration
* Cross-attention relation network based on metric learning for few-shot specific emitter identification
* Cross-Domain Adversarial Structural Deformation Model for Post-Disaster Image Generation
* Cross-Domain Analysis of Cybersickness and Motion Sickness Mitigation Strategies
* Cross-Domain Feature Fusion Network for Nighttime Drone-View Object Detection, A
* Cross-Domain Image Steganalysis Based on Frequency-Domain Alignment and Feature Calibration
* Cross-Frequency Attention and Color Contrast Constraint for Remote Sensing Dehazing
* Cross-Modal Attention Guided Enhanced Fusion Network for RGB-T Tracking
* Cross-Modal Attention with Adaptive and Hierarchical Fusion for Robust RGB-T Image Segmentation for Safe Driving
* Cross-modal Emotion-specific Attention model for Multimodal Emotion Recognition
* Cross-Modality Abdominal Multi-Organ Segmentation via Source-Free Unsupervised Domain Adaptation
* Cross-modality masked autoencoder for infrared and visible image fusion
* Crossdr: Bridging 2D And 3D Features For Diabetic Retinopathy Classification Using Context-Aware Cross-Attention
* CRPE-Net: Infrared small target detection transformer with cross-layer relative-position embedding
* CS-TRD: a Cross-Section Tree Ring Detection Method
* CTU-Level Rate Control with lambda Optimization Based on Visual Gaze Mechanism for 360-Degree Versatile Video Coding
* Curvature Dedicated Architecture Using Swin Transformers for RGB-D Object Recognition
* Curve: Clip-Utilized Reinforcement Learning for Visual Image Enhancement via Simple Image Processing
* Custom Condition Generation for Zero-Shot Human-Scene Interactions Synthesis
* CWC-DNERF: Compact Dynamic Neural Radiance Field VIA Discrete Wavelet Transform And Learnable Codebooks
* Cybersickness in VR: State-Of-The-Art and Future Research Agenda
* Cytofusion: A Latent Diffusion-Based Framework for Cytology Classification
* D2TR: Sea Clutter Suppression via Dynamic Dual-Tree Complex Wavelet Selection and Target-Guided Regularization
* Daafnet: Domain Adaptive Augmented Feature Network for Biosignal based Emotion Recognition
* DAF-Mamba: Dynamic selective and adaptive fused mamba for cardiac image segmentation
* Dark Count Removal in Photon-Counting SPAD Arrays
* Darts: Deformable Animation Ready Templates for Clothing Humans
* DAS-Accelerometer Data Fusion With Semi-Supervised Graph Variational Autoencoder for In-Service Train Wheel Flat Detection
* Data-Driven Recursive Intra Prediction
* DBF-Net: A Dual-Branch Network with Feature Fusion for Ultrasound Image Segmentation
* dc-GAN: Dual-Conditioned GAN for Face Demorphing From a Single Morph
* DCART: A dual contrastive alignment residual transformer model for visual grounding
* DCM-VideoNet: A Densely-Connected Modulated Decoder Framework for Implicit Neural Video Compression
* DCTFormer: A Dual-Branch Transformer With Cloze Tests for Video Anomaly Detection
* Dead Zone Mitigation in Vehicular Platoon via Solar Panel as a Communication Receiver
* Debiasing Framework For Attribute Binding In Diffusion-Based Text-To-Image Generation, A
* Deblurring Images by Huber Lasso
* Decision-Making and Planning for Intelligent Vehicle Considering Human Factors: Methods, Challenges, and Prospects
* Decoding Emotions: How Graph Transformer with Adaptive Graph Structure Learning Understands Micro-Expressions
* Decoding UAV Scenes: A Novel Framework for Deep Semantic Segmentation Using U-Net and Transformer Hybrids
* Decoupled self-supervised deep multi-task learning framework for subscriber portrait in smart meter
* Decoupling augmentation bias in prompt learning for vision-language models
* Decoupling representation learning and classifier for long-tailed adversarial training
* Deep CNN Face Matchers Inherently Support Revocable Biometric Templates
* Deep contrastive graph clustering with information preservation
* Deep Learning Based Crab Classification for Marine Pest Monitoring
* Deep Learning-Based Automated Diagnosis for Breast Cancer Classification Using Mammogram Analysis
* Deep Learning-Based Segmentation of Hysteroscopic Images for Early Detection of Endometrial Cancer
* Deep No-Reference Quality Assessment for Underwater Enhanced Images
* Deep Object Recognition-Based Analysis of Diverse Culinary Landscapes
* Deep Reinforcement Learning-Based Task Offloading With Collaborative Inference in UAV-Assisted Mobile Edge Computing Networks
* Deep Spectral Analytics Based Soil Nutrient Prediction Using Spatial-Semantic Feature Embedding with Prototype-Guided Perturbation
* Deep Unfolding-Based Image Reconstruction For Quanta Image Sensors
* Deep Unsupervised Despeckling With Unbiased Risk Estimation
* Deep Vision of Mobility for Urban Detection From Street-View Projections
* Deep-Learning Based Quality Assessment in Adaptive Optics Ophthalmoscopy Images
* Deforestation Monitoring for Mongolia's Forest-Steppe Ecoregion Through Satellite Images
* Deformable Shape Registration from Inexact Correspondences
* Deformable Spherical Geometry Transformer For Panoramic Semantic Segmentation
* DEGAN-CS: An efficient code search model based on dataenhanced optimization of generative adversarial networks
* Degradation accordant plug-and-play for low-rank tensor recovery
* Degradation-Aware Prompted Transformer for Unified Medical Image Restoration
* Delving Into the Secrets of BEV 3D Object Detection in Autonomous Driving: A Comprehensive Survey
* Denoising-enhanced pancreatic segmentation using diverse kernel mutual adaptive learning
* Density-sorted prediction set: Efficient conformal prediction for multi-target regression
* Depth-Aware Scoring and Hierarchical Alignment for Multiple Object Tracking
* Depth-Aware YOLO Segmentation: Enhancing Small Object Detection via MiDaS-Based Spatial Reasoning
* Design and Optimization of a Hybrid VLC/THz Infrastructure-to-Vehicle Communication System for Intelligent Transportation
* DeskTransfer: Predicting Multi-Scenario Video Stream Throughput in Cloud Desktop Based on Transfer Autoencoder
* Detecting And Mitigating Incoherent Input Of Latent Diffusion Models
* Detection Algorithm with Multi-Scale Reconstruction and Feature Compensation for Small and Occluded Objects, A
* Detection of Pavement Defects on Roads using a Multimodal YOLOv8 with Image and IMU Data
* Detection of Screen Usage During Eating Events Among Preschool-Aged Children
* DFT Gaze: Distilled and Fine-Tuned Gaze Estimation for Personalization on Tiny Devices
* DGRGaze: A Difference-Guided Gaze Estimation Framework Based on 6D Rotation Matrix Representation
* DI-Net: Decomposed implicit garment transfer network for digital clothed 3D human
* Dictionary-Based Block Term Decomposition for Third-Order Tensors
* DictRoadNet: A Dictionary-Based RNN With Road Network Module for GPS Trajectory Completion
* DiffDeMorph: Extending Reference-Free Demorphing to Unseen Faces
* DiffProb: Data Pruning for Face Recognition
* Diffuse and Refine Latent Prior with Transformers For Neural ISP
* DIFFUSE2ADAPT: Controlled Diffusion for Synthetic-to-Real Domain Adaptation
* Diffusion Based Shape-Aware Learning with Multi-Scale Context for Segmentation of Tibiofemoral Knee Joint Tissues: an End-to-End Approach
* Diffusion Model for Virtual Try-On Systems, A
* Diffusion Pretraining for Gait Recognition in the Wild
* Diffusion to Confusion: Naturalistic Adversarial Patch Generation Based on Diffusion Model for Object Detector
* Diffusion-Based CT Image Segmentation for Intracerebral Hemorrhage
* DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment
* DiffusionMixNet: Leveraging Foundation Models and Contrastive Learning for Semi-Supervised Polyp Segmentation
* DIRE: Enhancing Facial Expression Recognition Through Domain-Invariant Representation Learning for Robust Generalization
* Direction-Emphasizing Transformer for Road Extraction From Optical Remote Sensing Imagery
* DISCO: A Diffusion Model For Spatial Transcriptomics Data Completion
* Discrete Diffusion Propagated Transformer For Flexible Ureteroscopic Semantic Segmentation
* Discriminative Image Feature Extraction for Traffic Sign Detection in Road Inspection
* DisenEmo: Learning disentangled emotional representation from facial motion for 3D talking head generation
* Disentangled Denoising and Counterfactual Balance for Multimodal Recommendation
* Disentangled Source-Free Personalization for Facial Expression Recognition with Neutral Target Data
* Dissecting Human Body Representations in Deep Networks Trained for Person Identification
* Distortion Classification in Computer Vision Applications: Current Progress, Challenges, and Perspectives
* Distributed Adaptive Tracking Control of an Underactuated High-Speed Train With Completely Unknown System Parameters
* DIVA-VQA: Detecting Inter-Frame Variations in UGC Video Quality
* Diversifying Human Pose In Synthetic Data For Aerial-View Human Detection
* Diversity-Driven Generative Dataset Distillation Based on Diffusion Model with Self-Adaptive Memory
* Divide-and-Summarize: Enhancing Deep Neural Video Summarization
* DLP-YOLOv9: Model with Fewer Parameters and Higher Precision Based on Improved YOLOv9 in Drone-Captured-Scenarios
* DM-DPR: Diffusion and Mamba-based Degradation Prediction for Blind Face Restoration
* DM-FNet: Unified Multimodal Medical Image Fusion via Diffusion Process-Trained Encoder-Decoder
* DMSO: A Dynamic Momentum-Smoothing Optimizer for Learned Image Compression
* Does Brain Network Construction Choice Matter? an Empirical Study of Individual Networks from Static FDG-PET for Alzheimer's Diagnosis
* Does noise in the knowledge graph really harm recommendations?
* Domain divergence minimization for unsupervised domain adaptation cross-modality medical image segmentation
* Domain Transfer Generative Model for New Face Generation
* DP-Net: A 3D Dilated Projection Framework For Precise Fetal Brain Tissue Segmentation
* DPL: Spatial-Conditioned Diffusion Prototype Enhancement for One-Shot Medical Segmentation
* DPM-CLIP: Zero-Shot Multimodal Egocentric Activity Recognition based on Dual-Prediction Mechanism
* DR-IAL: Decoupling-to-recoupling guided interaction-aware learning for egocentric action recognition
* Driver State Classification: Identifying High Cognitive Load and Drowsiness Through Driver Performance and Physiology
* Driving Decision-Making at Freeway Weaving Segments Using Relational Graph Attention Network and Deep Reinforcement Learning
* DSCIL: Dynamic selected contrastive instance learning for weakly supervised video anomaly detection
* DSDFormer: An Innovative Transformer-Mamba Framework for Robust High-Precision Driver Distraction Identification
* DSDP: Real-Time Asymmetric Dual-Stream Instance Segmentation Embedding Depth-Predictive Architecture for Enhanced Scene Understanding
* DSFace: Conditional Diffusion Inpainting for Sketch-to-Face Synthesis
* DTFPNet: Temporal and frequency dynamic graph neural network for time series classification
* DTLS-Inpaint: Yet Another Efficient Image Inpainting with Domain Transfer
* DTSI: Towards faster convergence of query-based detectors for rotated dense aerial images
* dual branch graphic text detection network based on progressive Domain adaptation, A
* Dual Stream Networks for 3d Human Pose and Shape Estimation
* Dual-Attention based prompt generation and catalyzing for instance-wise continual learning
* Dual-Branch Partial Annotation Learning for Facial Attributes Recognition
* Dual-domain homogeneous fusion with cross-modal mamba and progressive decoder for 3D object detection
* Dual-functional fractal-fractional Sobel operator for efficient image enhancement and edge detection
* Dual-Graph Transformer with Contextual-Support Nodes for Interpersonal Relationship Recognition
* Dual-perception prompt learning: Illumination-adaptive and semantic-aware guidance for backlit image enhancement
* Dual-Stream Spatio-Temporal Accident Anticipation and Detection
* DualAdaptNet: Enhancing domain adaptation regression with error accumulation reduction and dual-head heterogeneous learning
* Dynamic 3D Gaussian Reconstruction with Specular Reflection
* Dynamic Clustering-driven weakly-supervised online hashing with enhanced similarity
* Dynamic Mesh Coding Using Edge Length-Based Adaptive Subdivision
* Dynamic Mesh Coding With Temporally Consistent UV Atlas Generation
* Dynamic Multi-Level Feature Alignment Method for Domain Adaptive Driver Distraction Detection, A
* Dynamic PET Image Reconstruction via Non-Negative INR Factorization
* Dynamic Spatiotemporal Graph Convolutional Neural Network Based on Congestion Propagation for Traffic Prediction
* Dynamic Visual Speaking Patterns: You Are the Way You Speak
* E2MPL: An Enduring and Efficient Meta Prompt Learning Framework for Few-Shot Unsupervised Domain Adaptation
* EAFvision: Real-Time Automated Safety Surveillance in Electric Arc Furnaces Using Deep Learning Models
* EAR-MM: An Efficient Adaptive and Robust Algorithm for Streaming Map Matching
* Early pneumoconiosis recognition from CT via progressive lesion awareness and multi-axis denoising attention mechanisms
* Echocardiogram to CMR Image Synthesis using Generative Models
* Edge Feature Inclusive Variational Graph Autoencoder for Pet-Driven Alzheimer's Diagnosis, An
* Edge-Guided Monocular Absolute Depth Estimation with Diffusion-Based Refinement
* EdgeRegNet: Edge Feature-Based Multimodal Registration Network Between Images and LiDAR Point Clouds
* EEG-driven natural image reconstruction with regional semantic awareness
* EF2lane: Enhanced Feature Fusion 2D Lane Detection Network In 3d Point Cloud
* Effects of Facial Hair on Face Recognition
* Efficient and Robust Video Virtual Try-On via Enhanced Multi-Garment Alignment
* Efficient Asymmetric Shared Low-Rank Adaptation Based on Selective Scanning Vision Mamba for Medical Imaging Analysis
* Efficient Atlas Generation for Medical Imaging Via Groupwise Latent Diffusion Models
* Efficient Constraining of Transcoding in DNA-Based Image Storage
* Efficient Feature-Guided Approach for Image Restoration
* Efficient High-Fidelity Global Low-Rank Optimization for Multispectral Demosaicing
* Efficient Implicit Neural Representations for Videos with Feature Modulation
* Efficient Leaf Disease Classification and Segmentation Using Midpoint Normalization Technique and Attention Mechanism
* efficient loop and clique coarsening algorithm for graph classification, An
* Efficient motion-centric CLIP for compressed video action recognition
* Efficient Random Access Method Using Seed and Inter-Key Frames for Next Generation Video Codec
* Efficient spectral embedding representation approximation for large-scale data clustering
* Efficient Text-to-Image Generation: An Adaptive Step Schedule Controller for Diffusion Models
* Efficient Topology-Aware Motion Planning for AVP in Large-Scale Occupancy Map
* Efficient vision-based occupancy prediction with knowledge distillation
* EM-Based Multi-Object Tracking With Strong Association Constraints
* EmoEEG: A transferable generalist framework for EEG emotion recognition via information bottleneck theory
* EmoGene: Audio-Driven Emotional 3D Talking-Head Generation
* Emomamba: Advancing Dynamic Facial Expression Recognition with Visual and Textual Fusion
* Empathic Risk Companion: Multimodal Vision-Language Fusion with Emotion Prediction Error for Decision Support
* Enabling Controllable, Identity Preserving, Non-Rigid Edits in Human-Centric Images
* Enact: Entropy-Based Clustering of Attention Input for Reducing the Computational Needs of Object Detection Transformers
* End-to-End Automated Screening of Lordosis-Kyphosis-Scoliosis and Vertebral Compression in Salmon X-Ray Images
* End-to-End Microexpression Detection Using 3D Convolution and LSTM
* Energy Efficiency of Video Quality Assessment Metrics
* Energy-Based Distortion-Balancing Parameterization for Open Surfaces
* Energy-Based Generative Models with Morphological Attention Networks for Hyperspectral Image Classification: a Unified Framework
* Enforcing Cooperative Safety for Reinforcement Learning-Based Mixed-Autonomy Platoon Control
* Enhanced Emphysema Classification in CT Images Using RIU4-LQP and Spatial Texture Features
* Enhanced Frame Context Initialization for Video Coding Beyond AV1
* Enhanced Graph Convolutional Network with Chebyshev Spectral Graph and Graph Attention for Autism Spectrum Disorder Classification
* Enhanced Multi-Scale Network for Single Image Super-Resolution
* Enhanced Multi-Scale PoseNet for Self-Supervised Monocular Depth Estimation
* Enhanced Small Object Detection Using Multi-Scale Attention for Automated Seabird Detection
* Enhancing 3D Scene Representation with Structural Dissimilarity-Aware Learning
* Enhancing Adversarial Robustness of Foundation Models Without Data Centralization
* Enhancing Autonomous Driving Perception Under Complex Weather Conditions Through Cyclegan-Based Driving Scene Generation
* Enhancing Breast Cancer Detection Using Multistage Transformer with Positional Encoding and Feature Fusion
* Enhancing CNN-Based Blind Image Quality Assessment via Deep Cross-Layer Pattern Encoding
* Enhancing Domain Generalisability for Lung Nodule Detection: A Hybrid Strategy with Multi-Source Training and MixStyle
* Enhancing graph neural networks on SPD manifolds via cholesky decomposition
* Enhancing Image Deraining Through VLM-Based Data Refinement and Classification
* Enhancing local attention with global information interaction via progressive cluster propagation
* Enhancing Medical Vision-Language Models with Rich Textual Descriptions and Multiple Alignments for Chest X-Ray Diagnosis
* Enhancing Multi-Task Learning with Attention Mechanisms
* Enhancing Multiscale Feature Representation For Object-Level Recognition In Masked Image Modeling
* Enhancing point cloud feature representation via historical node state increments in graph neural networks
* Enhancing spatio-temporal zero-shot action recognition with language-driven description attributes
* Enhancing Unsupervised Domain Adaptation in Semantic Segmentation Through Selective Consensus and Gaussian Mixture Model-Based Pseudo-Labeling
* Enhancing Visual Question Answering Via Clustered In-Context Sequence Configuration
* Enhancing Visual Re-Ranking Through Denoising Nearest Neighbor Graph via Continuous CRF
* Enhancing VMamba for change detection via lightweight feature interaction and selection
* Enhancing Wide-Angle VR Video Transmission Using Human Perception
* EQUR: Equivariant Uncertainty Quantification and Refinement for Point Cloud Registration
* Erp-Aware Text-To-360 Panorama Diffusion Model
* ErpGS: Equirectangular Image Rendering Enhanced with 3D Gaussian Regularization
* Error Correction for DNA-Based Image Storage
* Estimating Virtual Camera FOV to Reduce Perspective Shape Distortion in 2D-to-3D Face Reconstruction
* Estimation of Object Volume in Aqueous Food Media Using Surface Electric Potential and Neural Network Regression
* Eswindnet: Image Demoiréing Using Multiscale Swin Transformer Layers
* ETLight: An Evolution Transformer for Efficient Traffic Signal Control
* Evaluating Data Quality and Preprocessing Methods to Enhance Skeleton-Based Action Recognition in Retail Environments
* Evaluating Human Perception of Automatically Created Synthetic Road Networks that Integrate Real-World Cost Factors and Terrain Features
* Evaluating Spherical Gaussian Fuzzy Sets in Image Enhancement
* Evaluating Students' Attention: A Deep Learning Approach
* Event Denoising Based on Iterative Tree-Structured Information Aggregation
* Event-Based Egocentric Human Pose Estimation in Dynamic Environment
* Event-Guided Motion Deblurring with Wavelet-Based Cross-Modal Feature Fusion
* Event-Triggered Regulation of Mixed-Autonomy Traffic Under Varying Traffic Conditions
* EventEgoHands: Event-Based Egocentric 3D Hand Mesh Reconstruction
* EXDF: Explainable Deepfake Detection with Vision-Language Model
* Expanding on the BRIAR Dataset: A Comprehensive Whole Body Biometric Recognition Resource at Extreme Distances and Real-World Scenarios (Collections 1-4)
* Explainable Artificial Intelligence Approach Using Low-Dimensional Visualization and Ensembling Uncertainty Quantification for Rare Chromosomal Aberration Detection in Cytogenetic Imaging
* Exploiting Aggregation and Segregation of Representations for Domain Adaptive Human Pose Estimation
* Exploring Effective Unfolding Covering Prompt Tuning for Vision Mamba
* Exploring Image-Language Data for Enhanced Soccer Understanding
* Exploring The Potential of Vision-Language Models for Pure-Image and Text-Guided-Image Saliency Prediction
* Exploring the Temporal Dynamics of Facial Mimicry in Emotion Processing Using Action Units
* Exploring Vision-Based Features for Detecting Deception in Well-Being: A Cross-Domain Comparison
* Extended Node-Specific Distributed Generalized Sidelobe Canceler for Outdoor Wireless Acoustic Sensor Networks
* Extension of Semi-Decoupled Partitioning in Inter Frames
* Extension of Sound Field Image Denoising to High-Frequency Sound Fields by Considering Wavenumber Spectral Loss
* Extensions of Morphological Gradient for Hyperspectral Images
* Eye-Closure-Based Alertness Detection via Adaptive Eye Region Extraction and Deep Learning
* Eyes and Ears: Automated Annotation of Audio Data Using Computer Vision
* F-LGAM: Enhancing Single Domain Generalized Object Detection Through Fourier-Based Local and Global Amplitude Mixup
* F2T2-HIT: A U-Shaped FFT Transformer and Hierarchical Transformer for Reflection Removal
* FA-Net: A Feature Alignment Network for Video-Based Visible-Infrared Person Re-Identification
* Face Forgery Detection With CLIP-Enhanced Multi-Encoder Distillation
* FaceCloak: Learning to Protect Face Templates
* FaceLiVT: Face Recognition Using Linear Vision Transformer with Structural Reparameterization for Mobile Device
* Facial digital markers For hypomimia detection in Parkinson's disease: A systematic review
* Facial Identity Editing: Towards Effective De-Identification
* Facilitate and Scale Up the Creation of 3D Meshes, 6D Category-Based Datasets and Grasping with Generative Models: GenVegeFruits3D
* Fake Money, Real Threat: Fooling Wavelet-Based Banknote Authentication with AdvGAN
* Family Resemblance or Fraud? Face Morphing Attacks on Kinship Verification
* Fast and Accurate Outlier-Aware Lidar Super-Resolution for Slam Applications
* Fast Blind Image Deblurring Based on Cross Partial Derivative
* Fast Bounding Box Hierarchy
* Fast Image Vector Quantization Using Sparse Oblique Regression Trees
* Fast Iterative Enhancement for Image Signal Processing
* FastEdit: fast text-guided single-image editing via semantic-aware diffusion fine-tuning
* FC-Render: Adaptive Font- and Color-Aware Text Diffusion Model
* FeDepthX: A Federated Learning Depth eXperiment
* FeDi: Feature disentanglement for self-supervised learning
* FERGI: Automatic Scoring of User Preferences for Text-to-Image Generation from Spontaneous Facial Expression Reaction
* Few-label blind image quality assessment via samples chosen from new and existing scenes
* Few-Shot Class-Incremental Learning for Efficient SAR Automatic Target Recognition
* FGA-NN: Film Grain Analysis Neural Network
* Field correlations in jet engine exhaust turbulence
* Field correlations of a Gaussian vortex laser beam in vertical turbulent oceanic links
* Fine-Grained Spatial-Temporal Perception for Gas Leak Segmentation
* Flexible Geometric Guidance for Probabilistic Human Pose Estimation with Diffusion Models
* FlowCalib: Targetless Infrastructure LiDAR-Camera Extrinsic Calibration Based on Optical Flow and Scene Flow
* FMG-Det: Foundation Model Guided Robust Object Detection
* Foundation Model-Based Deformable Registration of Multi-Modal Remote Sensing Images
* FPW: Frequency-Domain Pixel-by-Pixel Watermarking Against Unauthorized Images Used on Training Generative Model
* FreqMixFormerV2: Lightweight Frequency-aware Mixed Transformer for Human Skeleton Action Recognition
* Frequency Aware Learned Image Compression Using Swin Transformer and Discrete Wavelet Transform
* Frequency Domain Transformation and Loss Adjustment for Enhancing Transferability of Adversarial Examples
* Frequency-Augmented CBAM Attention-Aided YOLOv8 for Object Detection, A
* Frequency-Guided Contextual Image Captioning
* Frequency-guided multi-level human action anomaly detection with normalizing flows
* From 2D X-Rays to a 3D Surgical Plan: Progress with AI Reconstruction
* From forgotten to pan-sharpening
* From Laplace to Mellin: A Unified Biorthogonal Transform Framework
* From Pixels to Panoramas: A Deep Learning Pipeline for Mineral Image Analysis
* Frozen Network Few-Shot Object Detection
* FSAC-IA: A Hierarchical Constructed SAC-IA Algorithm for Point Cloud Alignment Acceleration
* FUNet: Frequency-Aware and Uncertainty-Guiding Network for Rain-Hazy Image Restoration
* Fungi or Fatal: Ensemble Learning for Mushroom Edibility Classification in the Wild
* Fusion of Face and Ear Biometrics for Robust Child Recognition: Insights into Age-Dependent Recognition Trends
* Fuzzy neighborhood-based feature selection with missing labels via feature graph matrix and label enhancement
* Game-Theoretic Reinforcement Learning-Based Behavior-Aware Merging in Mixed Traffic
* Game-Theoretical Framework for Safe Decision Making and Control of Mixed Autonomy Vehicles, A
* GAMNet: Graph Attention Mlp-Based Network for 3D Human Pose Estimation
* Garment De-Warping for Virtual Try-on in the Wild
* GaussianGAN: Real-Time Photorealistic controllable Human Avatars
* GCL-GroW: Graph contrastive learning via group whitening
* GEE-UOD: An Underwater Object Detection Network Based on Global and Edge Information Enhancement
* Gender Fairness of Machine Learning Algorithms for Pain Detection
* Generalizable poisoning-resistant backdoor detection and removal framework: From dataset perspective
* Generalization-Aware Remote Sensing Change Detection via Domain-Agnostic Learning
* Generative AI for Virtual Staining in Histopathological Data Analysis
* Generative Approach for Detecting Small Intrusive Foreign Objects in High-Speed Railway Scenario
* Generative Diffusion Model to Solve Inverse Problems for Robust in-NICU Neonatal MRI, A
* Generative Face Video Coding Framework with Disentangled and Consistent Background, A
* Generative Face Video Compression Using Depth Estimation and Compressed Sensing
* Generative image compression by prediction of optimal realism levels
* Generative Personalized Blind Face Restoration Enhanced by Physical Identity
* Generative views recovery and error-guided topological tensor network for incomplete multi-view clustering
* Genetic Algorithms for Parameter Optimization for Disparity Map Generation of Radiata Pine Branch Images
* Geometric Continuity and Consistency Learning for Self-Supervised Point Cloud Completion
* Geometric Mean Improves Loss For Few-Shot Learning
* Geometric Shape Matching for Recovering Protein Conformations from Single-Particle Cryo-EM Data
* Geometry Parametrization Stabilization For Dynamic Mesh Coding
* Geometry Regularized Point Cloud Autoencoder
* GeoScaler: Geometry and Rendering-Aware Downsampling of 3D Mesh Textures
* GestDoor: Gesture-Based User Authentication for Door Entries Utilizing Wearable IMUs
* Gesture Recognition for Emergencies: Dataset and Cross-Condition Analysis
* GHS-VDG: Graph and Hybrid Spatio-Temporal Attention for Video Diffusion Generation
* GIP: Gated Interaction Prompt for Parameter Efficient Vision-Language Fine-Tuning
* GlioSurvNet: Multimodal Survival Prediction for Glioblastoma Using Deep Learning and Clinical Variables from Brain MRI
* Global aggregated gradient-guided adversarial attacks for person re-identification
* GLOSS: Global-Local Matching Network Towards Outfit Recommendation for Diverse Body Shapes and Scenes
* GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability
* GMD: A Multimodal Framework for AI-Generated Misinformation Detection
* GMOT-Mamba: Mamba-Based Model Prediction For Generic Multiple Object Tracking
* Gop-Level Adaptive Resampling with CNN-based Super Resolution
* GrabDAE: An Innovative Framework for Unsupervised Domain Adaptation Utilizing Grab-Mask and Denoise Auto-Encoder
* GRAFT-XPCI: Dataset of Synchrotron X-Ray Images for Detection of Acute Cellular Rejection after Heart Transplantation
* Graph Convolutional Network Aggregation For Broad-Spectral Object Detection
* GraphST: Class-imbalanced node classification with semantic relation transfer
* GRASP-Former: A Lightweight Global-Random Sparse Attention for Domain-Aware Multi-Class Obscenity Detection
* Green Learning Approach to LDCT Image Restoration, A
* Grid-Logat: Grid Based Local And Global Area Transcription For Video Question Answering
* Group Equivariant Morphological Networks
* Group Joint Independent Component Analysis (Group jICA): a Novel Method to Jointly Decompose and Link Simultaneous EEG and fMRI
* GTA-Crime: A Synthetic Dataset and Generation Framework for Fatal Violence Detection with Adversarial Snippet-Level Domain Adaptation
* Guided Detail Filter for AVM
* Guided Diffusion For Class-Conditioned Synthesis and Classification Of Microscopic Blood Cell Images
* Hallucination Elimination and Text Annotation Framework for Large Vision-Language Models in Traffic Scenarios
* Hand-Aware Masked Graph Convolutional Network for Skeleton-based Sign Language Recognition
* Handling Multiple Hypotheses In Coarse-To-Fine Dense Image Matching
* HandOcc: NeRF-based Hand Rendering with Occupancy Networks
* Hands-On: Segmenting Individual Signs from Continuous Sequences
* Hardware Friendly Multi-Hypothesis Cross Component Prediction
* Harnessing Feature Distribution Consistency for Federated Learning with Noisy Labels
* Harnessing the Power of LLMS for Image Aesthetics Assessment Through Semantic and Contextual understanding
* Hi-Mamba: Hierarchical Mamba for Efficient Image Super-Resolution
* Hiding Local Manipulations on SAR Images: A Counter-Forensic Attack
* Hierarchical Kernel Decoupling for Graph Convolution: Enhancing Skeleton-Based Action Recognition Through Structured Representation
* Hierarchical Multi-Modal Transformer for Cross-Modal Long Document Classification
* Hierarchical Recursive Interaction and Multi-Stage Goal-Guided Mechanism for Multimodal Trajectory Prediction
* Hierarchical Reinforcement Learning Shared Steering Control Strategy Considering Driver-Vehicle-Road Risk Assessment
* High Specificity Guided Cross-Domain Few-Shot Segmentation
* High-Frequency Semantic Enhancement in Compressed Scenarios for Robust Visual and Machine Vision Applications
* High-Order Internal Model-Based Data-Driven Iterative Learning Control of High-Speed Railways Subject to Faded Channels
* High-quality controlled clustering expert networks
* Holism To Atomism: Enhancing The Vision-Language Alignment For Cross-Modal Few-Shot Learning
* Holistic Coreset Selection for Data Efficient Image Quality Assessment
* HomE: A Homogeneous Ensemble Framework for Dynamic Hand Gesture Recognition
* HSBS: Comprehensive Boosting Of Facial Expression Recognition Via Hierarchical Semantic And Batch-Wise Similarity
* Hue Are You? Can Skin Depigmentation Affect Face Recognition Performance?
* HUGS-Net: A Lightweight and Unified Network for Adverse Weather Image Denoising
* Human or Machine: A Novel Deep Learning Framework for Autonomous Driver Identification Based on Vehicle Trajectories
* Human Pose Estimation Under Occlusion: A Data Restoration Framework Using GANs
* hybrid active contour model driven by local region-based self-organizing map for infrared image segmentation, A
* Hybrid Deep Learning and Handcrafted Feature Fusion for Mammographic Breast Cancer Classification
* Hybrid SIFT-SNN for Efficient Anomaly Detection of Traffic Flow-Control Infrastructure
* Hybrid texture-structural learning for hyperspectral image classification
* Hyperparameter Optimization Method for Affine Projection Algorithm Based on Deep Unrolling
* ICP-3DGS: SFM-Free 3D Gaussian Splatting for Large-Scale Unbounded Scenes
* ICTNet: Image Complexity-Aware Two-Branch Network With Enhanced Decoding for Real-Time Segmentation
* ID-Booth: Identity-consistent Face Generation with Diffusion Models
* ID-TTA: Classifier-Free Test Time Adaptation for Metric Learning
* iHDR: Iterative HDR Imaging With Arbitrary Number Of Exposures
* Illumination Spectrum Estimation for Multispectral Images Using Illuminant Prior
* IlluSign: Illustrating Sign Language Videos by Leveraging the Attention Mechanism
* iLSU-T: an Open Dataset for Uruguayan Sign Language Translation
* Image Motion Blur Removal In The Temporal Dimension With Video Diffusion Models
* IMMix: Class-Imbalanced node classification via prototypical selective mixup augmentation
* Impact of Sunglasses on One-to-Many Facial Identification Accuracy
* Implicit authentication method based on image temporal features
* Implicit Object Recognition via Reinforcement Learning in Out-Of-Domain Scenarios
* Improve Real-Time Flood Segmentation by Encoding and Distilling Foreground Information
* Improved Cervical Cell Detection Model Based on Hybrid-Domain Feature Pyramid Network
* Improved ORB-SLAM2 Algorithm Based on Extended Kalman Filtering and Particle Swarm Optimization, An
* Improved Representation Learning for Unconstrained Face Recognition
* Improved UNet++ Based on Kolmogorov-Arnold Convolutions
* Improving Fine-Grained Understanding for Retrieval in Human Motion and Text
* Improving Infrared Small Target Detection With GAN-Driven Data Augmentation
* Improving Mobility in NDN-Based VANET: A Deep Reinforcement Learning Approach With Deep Prioritization
* Improving Multi-Organ Segmentation in Abdomen CT Images Incorporating Shape Priors and Spatial Information in Deep Learning
* Improving Novel View Synthesis of 360° Scenes in Extremely Sparse Views by Jointly Training Hemisphere Sampled Synthetic Images
* Improving Open-World Class-Agnostic Object Detectors via Feature Distillation with Student-Aware Adaptation
* Improving Pseudo-Labels Selection Using Domain Priors for Semi-Supervised Detection in Capsule Endoscopy
* Improving the Performance of Compressive Spectral Imaging with Bayer Color Filter Array
* Improving the Stability and Efficiency of Diffusion Models for Content Consistent Super-Resolution
* Improving Yolov8 For Fast Few-Shot Object Detection By Dinov2 Distillation
* In2Out: Fine-Tuning Video Inpainting Model for Video Outpainting Using Hierarchical Discriminator
* Incomplete Modalities Restoration via Hierarchical Adaptation for Robust Multimodal Segmentation
* IndicSideFace: A Dataset for Advancing Deepfake Detection on Side-Face Perspectives of Indian Subjects
* Individuation of 3D Perceptual Units from Neurogeometry of Binocular Cells
* Influence of irregular particles on the propagation of polarized pulsed laser beams through turbid underwater environments
* Information-Driven Complementarity and Consistency Mining for Multi-View Clustering
* Infrared and visible image fusion model based on source image interaction
* Instance-wise distribution control of text-to-image diffusion models
* Instant 3DCG Dance Generation System Based on Music and Dance Composition
* Integrated Design of Mobile Battery-Swapping and Charging Services for Electric Vehicles
* intelligent multimodal medical image registration using hybrid meta-heuristic optimization with transformer-based residual UNet, An
* Inter-Trial Coherence Reveals Enhanced Synchrony During Mantra Listening
* Interactive feature fusion for camera-radar-based vehicle segmentation in bird's-eye view
* Interactive Sign Language Question Answering Framework and Dataset for Barrier-Free Exhibition Service
* Interference Mitigation in Automotive Radar Systems: A Current State Survey and Future Trends
* Intermodal correlation modeling for incomplete multi-modal learning in land use and land cover classification
* Interpretable image classification based on antifactual data
* Interpreting the Trispectrum as the Cross-Spectrum of the Wigner-Ville Distribution
* Introducing the short-time fourier Kolmogorov Arnold network: A dynamic graph CNN approach for tree species classification in 3D point clouds
* Invariants of Color Images to N-Fold Symmetric Out-of-Focus Blur
* Inverse Scattering for Schrödinger Equation in the Frequency Domain via Data-Driven Reduced Order Modeling
* Investigating Data Replication in Medical Synthetic Image Generation with Diffusion Models
* Investigating Robustness of Unsupervised Stylegan Image Restoration
* Investigating Role of Big Five Personality Traits in Audio-Visual Rapport Estimation
* Investigating Social Biases in Multimodal LLMs
* IO-LIO: Information-Oriented Voxel Mapping for Efficient and Precise LiDAR-Inertial Odometry
* Is Perturbation-Based Image Protection Disruptive to Image Editing?
* Iterative Filtering and Smoothing with Optical Flow Prediction Models
* Iterative optimal transport for multimodal image registration
* Iterative Self-Improvement of Vision Language Models for Image Scoring and Self-Explanation
* IterDiff: Training-Free Iterative Face Editing Via Efficient Clip-Guided Memory Bank
* Ivory: Adversarial Purification of Obfuscated Faces to Extract Soft-Biometrics using Diffusion Transformers
* J-CaPA: Joint Channel and Pyramid Attention Improves Medical Image Segmentation
* JAM: A Comprehensive Model for Age Estimation, Verification, and Comparability
* JanusGAN: GANs Disentangled Editing with Two Discriminators
* Joint Deep-Unfolding Optimization Learning for Depth Map Arbitrary-Scale Super-Resolution
* Joint distribution alignment on Lie group manifolds for domain adaptation
* Joint Enhancement and Bandwidth Extension for Radar Through-Barrier Speech Acquisition
* Joint Geometry-Attribute Point Cloud Compression with Spatial Context Mining and Dual-Class Attribute Loss
* Joint Optimization of Primary and Secondary Transforms Using Rate-Distortion Optimized Transform Design
* Joint Optimization of Vehicle and Pedestrian Traffic Signals Using Multi-Objective Deep Reinforcement Learning
* Joint Super-Resolution and Segmentation for Low-Resolution Brain MRI Analysis
* Judging From Support-Set: A New Way To Utilize Few-Shot Segmentation For Segmentation Refinement Process
* Keypoint detection in Tai Chi Chuan Essence via Waist and Limbs Feature Separation
* Keypoint Estimation for Real-Time Pinus Radiata Cutpoint Detection
* Knowledge and experience for visible-infrared person re-identification
* Knowledge Distillation Between 2D and 3D Vision Transformers for Point Cloud Quality Assessment
* Knowledge Distillation for Resource Efficient Classification with CLIP-Guided Specialist Networks
* Knowledge Is What You Need For Active Object Tracking
* Knowledge Refinement For Unsupervised Lifelong Person Re-Identification
* Knowledge-Aware Diffusion-Enhanced Multimedia Recommendation
* KPLNet: Keypoint prototype learning for zero image semantic correspondence
* L1-Norm Redundant Delaunay Phase Unwrapping and Gradient Correction
* Landmark-Based Fast LIP Reading with CTC Loss
* Language-Dominated Fusion and Self-Distillation for Multimodal Sentiment Analysis with Incomplete Modalities
* Langvision-Lora-Nas: Neural Architecture Search for Variable Lora Rank In Vision Language Models
* Laplacian-Mamba: Mamba-Based Laplacian Pyramid Enhancement Network for Unpaired High-Definition Images
* Large (Vision) Language Models for Autonomous Vehicles: Current Trends and Future Directions
* Large Vision-Language Models are Generalist Solvers For Pathology Tasks
* LargeSceneGaussian: High-Efficiency 3D Gaussian Splatting for Large-Scale Scene Reconstruction
* Learnable Time-Frequency Transform and Ridge Separation
* Learned Hybrid Video Coding for Human Perception and Multiple Machine Vision Tasks
* Learned Video Compression with Spatial Correlation Priors and Hierarchical Temporal Attention
* Learning Efficient and Adaptive Cross-Channel Dependencies for Weakly-Supervised Object Detection
* Learning from PU Data Using Disentangled Representations
* Learning Generalization From Various Unaware Degradations for Blind Hyperspectral Image Super-Resolution via Transparent Diffusion Model
* Learning Geometry-Aware Representation for Gaze Estimation
* Learning-based Human Relighting: A Survey
* Learning-Based Steering Estimation for Motorcycles via Visual-Inertial Fusion
* Learning-by-generation: Enhancing gaze estimation via controllable generative data and two-stage training
* LeMoRe: Learn More Details for Lightweight Semantic Segmentation
* Leveraging Complementary Attention Maps in Vision Transformers for OCT Image Analysis
* Leveraging Depth Foundation Models in Self Supervised Monocular Depth Estimation
* Leveraging Pupil Facial Fusion for Enhanced Micro-Expression Recognition
* LGD: Leveraging generative descriptions for zero-shot referring image segmentation
* Lift-PCAC: Lifting Based Point Cloud Attribute Compression
* LiftFormer: Lifting and Frame Theory Based Monocular Depth Estimation Using Depth and Edge Oriented Subspace Representation
* Lightweight Attention-Enhanced Multi-Scale Detector for Robust Small Object Detection in UAV
* Lightweight Environment Vector Map Framework and Its Fast Lane-Level Navigation Strategy for Autonomous Vehicles, A
* Lightweight Image Super-Resolution Preprocessor for Jpeg Compression
* Lightweight LiDAR-Based Cooperative Localization Model for Asymmetric Leader-Follower Cooperative Driving Automation System
* Lightweight Temporal Contextual Fine-Tuning Method of Large Multimodal Model for Video Moment Retrieval
* LighTwSVM: Efficient linear nonparallel classifier for millions of data
* LigTomDet: Knowledge distillation in a new lightweight tomato disease detection model in planting fields
* Linea: Fast and Accurate Line Detection using Scalable Transformers
* Lip Enhancement and Multi-View Simulation for Robust Visual Speech Recognition in MAVSR 2025
* Listening for You: Enhancing Speech Image Retrieval via Target Speaker Extraction
* Local Analysis of Iterative Reconstruction from Discrete Generalized Radon Transform Data in the Plane
* Local and Global Structure-Guided No-Reference Point Cloud Quality Assessment
* Local Refinement and Global Strengthening Network for Vehicle Re-Identification
* Long-Short Exposure Fusion With Event Data For Low-Light Video Enhancement
* LoRA Patching: Exposing the Fragility of Proactive Defenses Against Deepfakes
* Lorentz Transformation Neural Network
* Low-Rank Adaptation of Pre-Trained Vision Backbones for Energy-Efficient Image Coding For Machines
* LURE: An Unsupervised Denoising Framework for Multiplicative Lognormal Noise
* LVMF3D: Large Vision Model Boosting Multimodal Fusion for Indoor 3D Object Detection
* M-AIDE: Mechanistic Agentic Interpretability for Decoding Empathy in Language Models
* Machine Learning Models for Predicting Post-Wildfire Methane Emissions in Australia Using Multivariate Data
* Machine Learning-Based Decoding Energy Modeling for VVC Streaming
* Mamba-Based Global Correlation Learning for Light Field Spatial Super-Resolution
* Mamba-SF: Monocular Scene Flow Learning with State Space Models
* Manifold learning based on locally linear embedding for symmetric positive definite matrix
* Mask-RadarNet: Enhancing Radar Object Detection With Spatio-Temporal Context
* Masked Text Pre-Training for Scene Text Detection
* Matching ambiguity-resilient multi-view stereo via adaptive patch deformation
* MCE: Towards a general framework for handling missing modalities under imbalanced missing rates
* MCM: A Multi-Agent Collaborative Multimodal Framework For Traditional Chinese Medicine Diagnosis
* Measuring Anxiety Levels with Head Motion Patterns in Severe Depression Population
* Measuring Distortion Strength with Dewarping Diffusion Models in Anomaly Detection
* Measuring Teacher Empathy in a Virtual Reality Scenario Simulating Racial Bias
* MedKI: Knowledge Dual Injections for Medical Visual Question Answering
* METAH2: A Snapshot Metasurface HDR Hyperspectral Camera
* Metalwork: A Synthetic Dataset and Baseline for Stereo Matching of Metal Workpieces
* METAREG: Robust Camera Parameter Estimation by Leveraging Noisy Camera Extrinsics
* MFA-Net: Motion Field Adaptive Network for Skeleton-Based Action Recognition
* MFB-SAC: A Multi-Scale Frequency and Boundary-Enhanced SAM for Cell Segmentation
* MFHS: Mutual consistency learning-based foundation model integrates hypergraph for semi-supervised medical image segmentation
* MFJLN: Multi-Frequency Feature Joint Learning Network for Rain Removal
* MFU-Net: A Novel Deep Learning Framework for Unmixing Method for Sentinel-2 Imagery of Invasive Serrated Tussock (Nassella Trichotoma)
* MGP-KAD: Multimodal Geometric Priors and Kolmogorov-Arnold Decoder for Single-View 3d Reconstruction in Complex Scenes
* MIP-CLIP: Multimodal Independent Prompt CLIP for Action Recognition
* Mirror Feature-Aware Generative Adversarial Network for RGB-T Salient Object Detection
* Mitigating Bottlenecks Caused by Freeway Exiting Flows' Merging Maneuvers to Hard Shoulder: An Integrated Proactive Control
* Mix-Based Training Strategies for Learning Implicit Neural Representations
* MKGPL: graph prompt learning with multi-view knowledge for few-shot recognition
* MLP Fusion: Revisiting Convolutional Networks with Transformer-Based Insights
* MM-IML: Multi-Modal Image Forgery Detection and Localization
* MMP-2k: A Benchmark Multi-Labeled Macro Photography Image Quality Assessment Database
* MOAT: Multi-Scale Group Interaction Transformer for Trajectory Prediction in Crowded Scenes
* Mobile Robot Navigation Method Based on Multiple External Cameras in Crowded Environment
* MobilityGPT: Enhanced Human Mobility Modeling With a GPT Model
* Modality-Aware Diffusion Distillation Network for Sentiment Analysis in Missing Modalities
* Model Synthesis for Zero-Shot Model Attribution
* Modeling and Analysis of Car-Following Behavior Based on Macro-Micro Coupling
* Modular System for Human Action Detection Combining YOLO and Transformer-Based Video Understanding, A
* MolRL: Self-supervised molecular image representation learning via graph structure bootstrapping
* MONSTR: Model-Oriented Neutron Strain Tomographic Reconstruction
* Mosaic-SR: An Adaptive Multi-Step Super-Resolution Method For Low-Resolution 2d Barcodes
* Motion control of 3-DoF delta robot using adaptive neuro fuzzy inference system
* Motion-Aware Reconstruction for Video Snapshot Compressive Imaging
* Moving Forward with BWC: The Faleb Dataset for Multimodal Image Analysis
* MPEG Edgebreaker: An Efficient Static and Dynamic Mesh Codec in MPEG V-DMC
* MS-DSCLNet: A Multi-Scale Dual-Stream Contrastive Learning Network for Image Tampering Detection and Localization
* MS-RAFT-3D: A Multi-Scale Architecture for Recurrent Image-Based Scene Flow
* MSDP-Net: Multi-scale distribution perception network for rotating object detection in remote sensing
* MSTDNet: Multi-scale traffic object detection network with smooth information perception
* MSTSGM: A multi-scale temporal-spatial guided model for image deblurring
* MTD-Net: A robust multi-task discriminative network for choroidal neovascularization segmentation
* Multi-Agent Deep Reinforcement Learning for Safe Autonomous Driving With RICS-Assisted MEC
* Multi-Agent-Based Approaches for Cooperative Traffic Management in C-ITS: Systematic Literature Review (SLR)
* Multi-Class Part Parsing Based on Multi-Class Boundaries
* Multi-Class Smoothed Hinge Loss Function in Pre-Training for Transfer Learning
* Multi-Domain Biometric Recognition using Body Embeddings
* multi-expert framework for enhancing multimodal large language models in industrial anomaly detection, A
* Multi-Graph Spatio-Temporal Network for Traffic Accident Risk Forecasting
* Multi-Layer End-to-End 360° Image Compression, A
* Multi-Level and Multi-Modal Action Anticipation
* Multi-Level Contrastive Learning for Multimodal Sentiment Analysis
* Multi-level spectral-spatial mutual learning for pansharpening
* Multi-Level Statistical Model Guidance Improves Generalization for Biometric Synthetic Face Detection
* Multi-Method Explainability Evaluation of a Graph-Based Neural Network for Alertness Detection, A
* Multi-Objective Heterogeneous Fleet Vehicle Routing Problem: Formulation and Algorithm
* Multi-Res-3DGS: Multi-Resolution 3d Gaussian Splatting Bound with a Subdivided Mesh Sequence
* Multi-Scale Spatial-Frequency Features Representation and Learnable Cross Modal Feature Fusion in DeepFake Detection
* Multi-Task Learning for Hierarchical Professional Gesture Recognition: State-Space Modeling for Task Temporal Dependencies
* Multi-Teacher Knowledge Distillation for Efficient Object Segmentation
* Multi-View Amodal Instance Segmentation Based on 3d Representation
* Multi-View Dispersion Entropy on Graphs: Application to the Detection of Cerebral Palsy After Neonatal Stroke
* Multi-View Graph Approach for Morphometry-Aware Semantic Image Segmentation, A
* Multi-view graph pooling via dominant sets for graph classification
* Multi-View Knowledge Guided Semantic Prototype Learning for Generalized Zero-Shot Action Recognition
* Multi-view unsupervised feature selection with unified measurement of consistency and diversity
* Multi-year long-term person re-identification using gait and HAR features
* Multifidelity-Based Ant Colony Optimization Algorithm for Capacitated Electric Vehicle Routing Problems, A
* MultiMAE Meets Earth Observation: Pre-Training Multi-Modal Multi-Task Masked Autoencoders for Earth Observation Tasks
* Multimodal Cell Context Instruction Tuning for Conditional DNA Regulatory Sequence Generation with Large Language Models
* Multimodal Classification and Out-of-Distribution Detection for Multimodal Intent Understanding
* Multimodal Cross-Attention for Range of Motion Assessment
* Multimodal Re-Ranking for Heterogeneous Face Re-Identification
* Multimodal-LLM Agent For Text-Driven Multi-Attribute Face Editing
* Multiple object stitching for unsupervised representation learning
* Multiple weather degraded image restoration based on multi-component decomposition
* Multiplicative Reweighting for Robust Neural Network Optimization
* Multiscale Attention-Based Deep Learning Method for DCE-MRI Breast Tumor Segmentation, A
* MultiSensor-Home: A Wide-area Multi-modal Multi-view Dataset for Action Recognition and Transformer-based Sensor Fusion
* My Emotion on your face: The use of Facial Keypoint Detection to preserve Emotions in Latent Space Editing
* Nagisa: A reversible privacy preservation scheme against facial soft-biometric attributes recognition
* NAR-DIFF: A Noise-Adaptive Reflectance Diffusion Model for Low-Light Image Enhancement
* NASSBLiF: No-Reference Light Field Image Quality Assessment Via Neighborhood Attention and Scale Swin
* Natural image stitching using depth maps
* Naturalistic physical adversarial camouflage for object detection via differentiable rendering and style learning
* NeB-SLAM: Neural blocks-based salable RGB-D SLAM for unknown scenes
* Neighbor-Aware Feature-Driven Motion Compensation for Learned Video Compression
* NEPose: A novel benchmark dataset with an improved framework for vision-based nasal endoscope pose estimation
* NeuralSEIR: Modeling uncertainty in non-pharmaceutical interventions with neural epidemic dynamics
* New Multi-Source Distributed Transfer Learning Framework
* Next-Stitch Counting in Crochet Swatches via Multi-Class Semantic Segmentation
* NiCI-Pruning: Enhancing Diffusion Model Pruning via Noise in Clean Image Guidance
* Nissl-Stained Histological Slice Image Completion Based on Generated Masks
* No-Reference Textured Mesh Quality Assessment Using Graph-Based Features
* Noise perturbation augmentation based dual-branch alignment network for cross-domain hyperspectral image classification
* Noise-Robust Approach Using Dynamic Graph Neural Networks for Bus Passenger Flow Prediction, A
* Noise-to-Noise Training Approach for Robust Motion-Compensated Processing in Cardiac-Gated Images, A
* Noisy Label Refinement with Semantically Reliable Synthetic Images
* Non-Invasive Neonatal Jaundice Detection via Two-Phase Self-Supervised Learning and Vision Transformer
* Non-Local N2V: Improving N2V Networks for Spatially Correlated Noise
* Non-Rigid Motion Correction for MRI Reconstruction via Coarse-to-Fine Diffusion Models
* Non-Uniform Illumination Image Restoration for Deep-Sea Exploration with A New Scattering Model
* Nonintrusive Watermarking for CycleGAN
* Nonlinear Elasticity Model in Computer Vision, A
* Nonlinear Modifications of Transform Coefficients in VVC Intra Coding
* Nonnegative Matrix Factorization in Dimensionality Reduction: A Survey
* Novel Adaptive Low-Rank Matrix Approximation Method for Image Compression and Reconstruction, A
* Novel AI Framework for Breast Cancer Molecular Biomarker Response Score Detection on Cells Level Using Marker-Based Watershed Segmentation and Machine Learning Classifiers, A
* Novel Automated System for Pathological Lung Segmentation Using Modified Local Binary Patterns and Hierarchical Transformers, A
* Novel Downsampling Strategy Based on Information Complementarity for Medical Image Segmentation, A
* novel dynamic graph attention aggregation network for multivariate time series classification, A
* Novel Explainable AI-Based System For Improved Prediction of Breast Cancer Response to Neoadjuvant Chemotherapy, A
* Novel Game Graphics Quality Evaluation Model Using Saliency and Resolution Information, A
* Novel Hybrid Model Based on VMD-KAN-Informer for Railway Traction Power Grid Short-Term Load Forecasting, A
* Novel Method and Dataset for Depth-Guided Image Deblurring From Smartphone Lidar, A
* novel weakly supervised immunohistochemical cell segmentation method via counting labels, A
* OASIS: Object-guided Attention for Text-conditional Diffusion Synthesis of Human Interaction Sequences
* Object Detection and Fruit Tree Growth Stage Identification Via YOLO with Inverted and Swin Transformer Blocks
* Object-Guided Semi-Supervised Bird's-Eye View 3D Object Detection With 3D Box Refinement
* Object-IR: Leveraging object consistency and mesh deformation for self-supervised image retargeting
* Objective, Absolute and Hue-Aware Metrics for Intrinsic Image Decomposition on Real-World Scenes: A Proof of Concept
* Oblique Decision Trees as an Image Model for Cubist Image Restyling
* Observer-Based Prescribed-Time Resilient Control for 2-D Plane Heterogeneous Vehicular Platooning System With Hybrid Communication Threats
* OFVL-MS++: Once for visual localization across multiple scenes via a two-stage framework
* On color differences in context
* On the correlations between geometric metrics and fairness in pruning CNN
* On the Impact of Natural Guide-Star Asterism Geometry on Atmospheric Tomography
* One Face, Many Views: Cross-View Consistency of Facial Action Unit Analysis in Multi-Camera Settings
* One-stage Framework for Thyroid Nodule Detection with Mixup and Negative Sample Utilization
* Online Continual Learning of Diffusion Models: Multi-Mode Adaptive Generative Distillation
* Online graph based transforms for intra-predicted imaging data
* OpenFace 3.0: A Lightweight Multitask System for Comprehensive Facial Behavior Analysis
* OpenRR-1k: A Scalable Dataset for Real-World Reflection Removal
* Optimal Transport-Based Domain Alignment as a Preprocessing Step for Federated Learning
* Optimal-Coupling-Observer AV Motion Control Securing Comfort in the Presence of Cyber Attacks
* Optimization model for sign language recognition using hybrid convolution networks
* Optimization of Urban Emergency Multimodal Transportation Scheduling With UAV-Ground Traffic Coordination
* Optimized Learned Image Compression for Facial Expression Recognition
* Optimized temporal inductive path neural network based early-stage detection of autism spectrum disorders
* Optimizing In-Context Learning for Efficient Full Conformal Prediction
* Organoid-ICLIP: Class Imbalance-Aware Vision-Language Learning for Organoid Mitosis Classification
* Oriented Object Detection Based On Composite Trigonometric Function Coder
* Orthogonal Constrained Minimization with Tensor L_2,p Regularization for HSI Denoising and Destriping
* Out-of-Distribution Sample Selection Generated by Diffusion Model toward Model Generalization
* Output-Feedback Safety-Critical Path-Guided Herding Control of MIMO Nonlinear Agents Based on Finite-Time Neural Predictor
* Overlooked Factors in Continual Zero-Shot Learning: Inflexible Semantic Prototypes, Simplistic Loss Functions, and SGD Noise
* P-Norm Based Fractional-Order Robust Subband Adaptive Filtering Algorithm for Impulsive Noise and Noisy Input
* PADNet: Progressive-Difference-Aware Feature Reconstruction Mechanism for Anomaly Detection
* PanoTPS-Net: Panoramic room layout estimation via thin plate spline transformation
* Parallel-Based Fast Coding Mode Decision for Intra Coding in VVC SCC
* Partial label feature selection with dynamic streaming labels
* Partitioned observation network for camouflaged object detection
* Pathological Region Inpainting in MRI Data Using Generative AI
* PawPrint: Whose Footprints are These? Identifying Animal Individuals by their Footprints
* PDD-AGENT: Multimodal Large Language Model-Driven AI Agent for Enhanced Plant Disease Diagnosis
* PDP-FedKD: Personalized Differential Privacy With Adaptive Budget Selection in Heterogeneous Federated Learning
* Peepers and Pixels: Human Recognition Accuracy on Low Resolution Faces
* Perface: Metric Learning in Perceptual Facial Similarity for Enhanced Face Anonymization
* Performance Evaluation of Deep Learning for Tree Branch Segmentation in Autonomous Forestry Systems
* Petri Net-Based Resource Failure and Recovery Strategy for Design and Control of Resilient Intersections, A
* PetsRS - a Dataset and Benchmark for Pet Recognition on a Climate Disaster Scenario
* Phase structure function and spatial coherence in underwater Rayleigh-Benard turbulence: experimental characterization
* Physics-Guided Smoothing Method For Material Modeling With Digital Image Correlation (DIC) Measurements, A
* PillarID: Rethinking Backbone Network Designs for Pillar-Based 3D Object Detection in Infrastructure Point Cloud
* Pioneering Facial Expression Generation from sEMG Signals with Diffusion Models
* PIT-QMM: A Large Multimodal Model for No-Reference Point Cloud Quality Assessment
* PixelShuffler: A Simple Image Translation through Pixel Rearrangement
* Player Perceptions of Path-First Procedural Content Generation Level Design for 3D Platformer Games
* Plug-and-Play Priors as a Score-Based Method
* Point Cloud Pretraining Dataset Effects on MaskPoint Classification Performance
* Polarization Denoising and Demosaicking: Dataset and Baseline Method
* Policy Gradient-Based Optimal Subset Selection for Few-Shot Vision-Language Learning
* Pose Estimation of Artwork Characters with Series and Parallel Dilated Convolution And Style Channel Attention
* Pose-Free 3D Gaussian Splatting via Shape-Ray Estimation
* Pose-Invariant Face Recognition via Feature-Space Pose Frontalization
* PoseMoE: Mixture-of-Experts Network for Monocular 3D Human Pose Estimation
* PosePilot: A Web-Based Application for Human Motion Data Analysis and Visualization
* Power Cost Comparison of Neural-Network Compression Methods for Satellite Imagery
* Prediction of Vitamin D Deficiency Using Machine Learning, Deep Learning, and a Hybrid Model
* Primal-Dual Splitting Algorithm with Convex Combination and Larger Step Sizes for Composite Monotone Inclusion Problems, A
* Principled Diffusion Posterior Sampling for Inverse Problem with Mixed Poisson-Gaussian Noise, A
* Privacy-Preserving CNN Inference for Image Super-Resolution Cross Multiple Ciphertexts
* Privacy-Preserving Face Recognition Scheme Based on Secure Data Storage and Secret Splitting
* Privacy-Preserving Person Re-Identification from Temporal Sequences with Transformer and Hungarian Optimization
* Proactive Collaborative Perception for CAVs: A Multi-Agent Reinforcement Learning Method
* Probabilistic Sampling with Frobenius Norm for Action Recognition
* Probabilistic Temporal Masked Attention for Cross-View Online Action Detection
* Progressive Cross-Validation Learning for Signal Classification with Noisy Labels
* Progressive Distillation Attention for Robust Left Ventricular Ejection Fraction Estimation
* Progressive Text-Semantic-Aware Generative Adversarial Network for Image Fusion
* Projection Difference-Guided Geometry Quality Enhancement for Video-Based Point Cloud Compression
* Prompt-guided dual-channel attention model predicts brain activation from functional and structural profiles
* Propagation Based Recycling Contrastive Learning for Coupled Noisy Visible-Infrared Person Re-Identification
* Propagation of Laguerre-Gaussian beams in anisotropic atmospheric turbulence: analysis via two analytical and a computational method
* Protein interaction pattern recognition using heterogeneous semantics mining and hierarchical graph representation
* Prototype-based scatter learning for smoke segmentation
* Prototype-Driven Multi-View Attribute-Missing Graph Clustering
* PRTF: Polar Space Represented Multi-View 3D Object Detection With Temporal Fusion Enhancement
* Pseudo labels approach to interpretable self-guided subspace clustering
* PSF-SRDN: Point Spread Function-Aware Speckle Reducing Diffusion Network
* PSG-MCANet: Multi-order cross-attention modeling for multimodal fusion based on punning semantic guidance
* PyraSegNet: A Novel Framework for Thermal Facial Image Segmentation
* P^2M: Progressive Perspective Mining for Referring Video Object Segmentation
* Quadratic Equality Constrained Least Squares: Low-Complexity ADMM for Global Optimality
* Quality Versus Sparsity in Image Recovery by Dictionary Learning Using Iterative Shrinkage
* Quanta Diffusion
* Quanta-Slomo: Single Photon Camera Guided 100x Video Frame Interpolation
* Quantum jellyfish search optimizer applied in high-precision wrapper feature selection
* Quantum-Enhanced Cancer Detection for Histopathologic Images
* R-RNet: Probability-Driven Networks for Pedestrian Trajectory Prediction
* RA-GCN: Residual attention based graph convolutional network for multi-label pattern image retrieval
* Radar-Camera Fused Multi-Object Tracking: Online Calibration and Common Feature
* Radius-Aligned Training and Rotated IOU Metrics for Pedestrian Detection in Top-View Fisheye Images
* Rank-based transformation algorithm for image contrast adjustment
* Rapid Object Modeling Initialization for Vector Quantized-Variational AutoEncoder
* Rate-Distortion Optimization with Non-Reference Metrics for UGC Compression
* Rate-Distortion Optimized Chroma Quantization for Point Cloud Compression
* RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-Plane Networks
* RAW: Region Attention-Weighted Guided Network with Inter-Region Exchange for AMD Grading
* Re-Purposing Segment Anything For Skeleton Action Localization
* React: Reference-Based Anime Colorization Transformer
* Reading Between the Lines: How Eye-Tracking Data can Inform Reading Strategies for Large Language Models
* Real projection algorithms for generalized low-rank approximation of large-scale quaternion matrix in color image processing
* Real-Time Detection and Classification of Drones, Vehicles, and Humans from Radar Data Using Deep Learning
* Real-Time Detection of Road Defects Using YOLO Architectures: A Comparative Study
* Real-Time Semantic Video Communication with Temporally Consistent And Controllable Diffusion Models
* Real-Time Traffic Accident Anticipation with Feature Reuse
* Realistic Skin Trouble Simulation Via Image Generation Models
* REC-GCN: Robust ensemble clustering with graph convolutional networks
* Recovering and Classifying Upper Limb Impairment Trajectories After Stroke
* RefComp: A Reference-Guided Unified Framework for Unpaired Point Cloud Completion
* Referring Video Object Segmentation With Cross-Modality Proxy Queries
* ReID: Re-ranking through image description for object re-identification
* Reinforcement Learning-Based Attack Generator for Testing the Security of Connected and Autonomous Vehicles, A
* Reinforcement Learning-Based Decentralized Control Strategy for Eco-Safe Mixed Platooning With CAVs and HDVs, A
* Remembering CIFAR-10 images with the entropic associative memory
* Remote Respiration Measurement with RGB Cameras: A Review and Benchmark
* Remote Sensing Target Detector with Multi Scale Attention Mechanism
* renaissance of explicit motion information mining from transformers for action recognition, A
* Resilient Multi-Agent Reinforcement Learning for Tiered Mixed Autonomy
* Resolving sentiment discrepancy for multimodal sentiment detection via semantics completion and decomposition
* Restoration of partially damaged fingerprints using a partial differential equation
* Rethinking Artifact Mitigation in HDR Reconstruction: From Detection to Optimization
* Rethinking Image Histogram Matching for Image Classification
* Retina adaptation network for low-light image enhancement
* Retinex-based variational model for low-light image enhancement with noise transformation, A
* Retinex-Based Variational Model with A Nonlocal Gradient-Type Constraint for Low-Light Image Enhancement, A
* Retrieval-augmented image harmonization
* Reverse Distillation Based Detection of Anomalies on a Newly Developed Fabric Dataset
* Reversible Column Disentangled Augmentation Tricks for Graph Contrastive Learning
* Reversible Data Hiding in Encrypted Polygonal Faces Using Vertex Index Similarity
* Review and Perspectives on Pedestrian Trajectory Prediction for Safe Transportation
* Revisited Visual Saliency Detection with Deep Learning: A Review of Recent Advancements
* Revisiting color-event based tracking: A unified network, dataset, and metric
* Revisiting Representation Learning and Identity Adversarial Training for Facial Behavior Understanding
* Revisiting the Intrusion Detection in In-Vehicle Networks
* Revisiting the representation learning in long-tailed medical image classification
* Reward-Adaptation: A Novel Test-Time Adaptation Method With Reward Model
* RGAL: Node-adaptive training strategies for reinforced graph adversarial learning
* RGC-Bent: A Novel Dataset for Bent Radio Galaxy Classification
* rho-NeRF: Leveraging Attenuation Priors in Neural Radiance Field for 3d Computed Tomography Reconstruction
* Ridgeformer: Mutli-Stage Contrastive Training for Fine-Grained Cross-Domain Fingerprint Recognition
* Risk-Controlled Multimodal Emotion Coaching for Autism Support Using Self-Supervised Vision and Speech Encoders
* RL-GTN: A reinforced divergence-optimized graph transformer network for skeleton-based action recognition
* RMPT: Retrieval-based multimodal prompt tuning for event detection
* RN-Sam: Road Network-Aided Sam Optimization for Road Segmentation In Satellite Imagery
* RobotFlags: AI-Powered Semaphore Interacting Between Chatbot and Humanoid Robot
* robust and efficient approach using Aggregated-FlexiNet for interpretable musculoskeletal radiograph classification, A
* Robust Character Stroke Segmentation For Diverse Fonts Via Contour Matching and Chain Propagation
* Robust Deepfake Detection for Electronic Know Your Customer Systems Using Registered Images
* Robust Estimation of Bump Height for Wafer-Level Packaging Using Optical Triangulation
* Robust Multi-Label Learning with Human-Guided and Foundation Model-Aided Crowd Framework
* Robust Multimodal Representation Learning with Information Bottleneck and Balanced Fusion for Alzheimers Disease Classification
* Robust Noisy Label Learning via Two-Stream Sample Distillation
* Robust Temporal Action Localization With Meta Boundary Refinement
* Robustness of Deep Learning-Based for Acute Lymphoblastic Leukemia Detection and Classification
* Rollout-Guided Token Pruning for Efficient Video Understanding
* Rotation-Invariant Game State Evaluation via Board Tensor Canonicalisation
* rPPG-NDCL: Unsupervised Remote Physiological Measurement Via Noise-Disentangled Contrastive Learning
* RSOS-Net: Real-Time Surface Obstacle Segmentation Network for Uncrewed Waterborne Vehicles
* RT-X Net: RGB-Thermal Cross Attention Network for Low-Light Image Enhancement
* RUL: Region Uncertainty Learning for Robust Face Recognition
* S3VD Self-Supervised Spatial Video Downsampling Loss: A Method for Training Video FPN Denoising Networks
* SADet: A semantic-aware tiny object detection network against missed detection
* SAIMNet: Object Detection Based on Semantic Alignment of Infrared Image and Microwave Non-image Information Fusion
* Salience Adjustment for Context-Based Emotion Recognition
* SAM 2-Driven Self-Training for Mammogram Segmentation: Zero-Shot Mask Generation Via Pseudo-Video
* SAM-Based Leaf Segmentation with Morphological Quality Assessment for Enhanced Plant Disease Detection
* SAR target recognition based on CNN with 2-D dual-tree complex wavelet transform decomposition
* Saw-Monodetr: Shape-Aware Adaptive Weighted Transformer for Monocular 3d Object Detection
* Scalable Image Compression Using Conditional Diffusion Model in Human-Machine Hybrid Vision, A
* Scalable Multi-View Clustering via Bipartite Graph Consensus Filtering
* Scale-aware adaptive supervised network with limited medical annotations
* SCIGS: 3D Gaussians Splatting from A Snapshot Compressive Image
* SCL-GAN: Spatially-Correlative Lightweight GAN for Efficient and High-Fidelity Thermal-Visible Face Synthesis
* Scribble-Guided Diffusion for Training-Free Text-to-Image Generation
* SDFCNet: A Spatial-Domain and Frequency-Domain Collaborative Network for Building Extraction in High-Resolution Remote Sensing Images
* SDR-GAIN: A High Real-Time Occluded Pedestrian Pose Completion Method for Autonomous Driving
* Seeing Beyond the Airways: Asthma Prediction via Cross-Attention on Dual Retinal Modalities
* Segment-Attention Augmented Dual-Contrastive Aggregation Learning for Unsupervised Visible-Infrared Person Re-Identification
* Segmentation for Early Tumor Detection in Mammograms Via Temporal Discrepancy Analysis and Dynamic Loss Weighting
* SEGMN: A structure-enhanced graph matching network for graph similarity learning
* Self Characterized Fusion Network for Prognosis of Brain Diseases
* Self-distilled learning of adaptive interval 3D lookup tables on real-time image enhancement
* Self-Expert Imitation With Purifying Latent Feature for Generalization in Visual Reinforcement Learning
* Self-Referencing Adapt-Then-Combine Information Diffusion Scheme for Distributed PHD Filtering
* SelfieAvatar: Real-time Head Avatar reenactntment from a Selfie Video
* SelfMAD: Enhancing Generalization and Robustness in Morphing Attack Detection via Self-Supervised Learning
* Semantic Context Re-Mining for Multimodal Guided Human-Object Interaction Detection
* Semantic Prototype-Guided Sampling for Long-Tailed Generalized Category Discovery
* Semantic-guided occlusion simulation based local feature semantic expansion network for person re-identification, A
* Semantics-Aware Spatial-Temporal Dynamic Graph Transformer Network for On-Street Parking Occupancy Prediction
* Semantics-Guided Generative Image Compression
* Semi-supervised crowd counting from unlabeled data
* Semi-Supervised Infrared Meibomian Gland Segmentation with Intra-Patient Registration and Feature Supervision
* Semi-Supervised Seafloor Habitat Classification: A Pseudo-Labeling Framework
* Semi-supervised semantic segmentation meets masked modeling: Fine-grained locality learning matters in consistency regularization
* Sensor Distance Learning For Cross-Camera Color Constancy
* Sentence-Level Lip-Reading with Integrated Synthetic Data and Speaker Normalization
* Sentiment analysis and risk early-warning system for cross-border M&A based on natural language processing
* Session Class Prototype Incremental Learning (SCPIL): Mitigating Catastrophic Forgetting with Distance-Based Prototype Learning
* SF-VQA: Saliency Fragments No-Reference Video Quality Assessment
* SFEformer: Frequency-enhanced model for wind speed prediction
* SFS-NeRF: Enhancing Geometry Consistency in Few-Shot Novel View Synthesis Through Surface-Aware Neural Rendering
* Shallow Neural Network Training via Atomic Norms and Semidefinite Programming
* Shape Reconstruction of Foreground and Background in Scenes with Translucent Objects Based on Coding Curves
* Sheep Facial Pain Assessment Under Weighted Graph Neural Networks
* Shielding Latent Face Representations From Privacy Attacks
* Shuffle PatchMix Augmentation with Confidence-Margin Weighted Pseudo-Labels for Enhanced Source-Free Domain Adaptation
* Siamese Feature Decoupling and Adaptive Prototype Alignment for Clothes Changed Person Re-Identification
* Siavatar: Animatable 3D Gaussian Avatar from a Single Image
* SignDiff: Diffusion Model for American Sign Language Production
* Similarity Normalization and Strong Geometric Augmentation for Local Feature Matching Under Large Scale and Rotation Changes
* Similarity Shuffled Criss-Cross Transformer With Angle Loss for Image-Text Matching
* Simple Self-Organizing Map With Vision Transformers
* Simple Zero-Shot Image Dehazing
* SimPRL: A Simple Contrastive Learning for Path Representation Learning by Joint GPS Trajectories and Road Paths
* Single Snapshot Distillation for Phase Coded Mask Design in Phase Retrieval
* Single-ToF-LiDAR-based plane information recognition
* Sinogram Inpainting with Physics-Guided Latent Diffusion Model for Synchrotron Light Sources
* Skeleton-based Geometric Deep Neural Network for Alzheimer's Disease Mice Behavioral Analysis, A
* SkeletonX: Data-Efficient Skeleton-Based Action Recognition via Cross-Sample Feature Aggregation
* Sketch to Stylized-Image: A Two-Stage Approach for Artistic Image Generation
* Skin Cancer Classification Using Extended 5 Channel (I-RGB-U) Images Generated From RGB Images
* SLICE: Synthetic Caption-Trained Lightweight Image Captioner for Edge Devices
* Small or large superpixel graphs? Gaussian influence walk with rebound can assist
* SmoothFace: Class-Conditional Label Smoothing for Synthetic-based Face Recognition
* Soft Gradient Boosting With Learnable Feature Transforms for Sequential Regression
* Sparse kernel k-means for high-dimensional data
* Sparse R-CNN OBB: Ship Target Detection in SAR Images Based on Oriented Sparse Learnable Proposals
* Sparse subspace learning based redundancy-aware unsupervised feature selection
* Sparse2DGS: Sparse-View Surface Reconstruction Using 2D Gaussian Splatting with Dense Point Cloud
* Sparsity-Driven Parallel Imaging Consistency for Improved Self-Supervised MRI Reconstruction
* Spatial-Spectral Consistency: A Semi-Supervised Approach for Multispectral Scene Classification
* Spatially continuous dual optimization on compactness function for image segmentation
* Spatio-Temporal Data Enhanced Vision-Language Model for Traffic Scene Understanding
* Spatio-Temporal Feature Learning Fusion and Visual Scene Endpoint Prediction for Pedestrian Trajectory Prediction
* Spatiotemporal Face Alignment for Generalizable Deepfake Detection
* Spatiotemporal-Decoupled Training: Enhancing Car-Following Behavior Modeling With Cross-Spatiotemporal Generalization
* SPC TO 3d: Novel View Synthesis from Binary SPC VIA I2I Translation
* Spectral Mixing Augmentation for Preventing False Positives from Hyperspectral Anomaly Detection
* Spectral-aware Global Fusion for RGB-Thermal Semantic Segmentation
* Spectrum-guided feature enhancement network for event person re-identification
* Splitter: Faster Inference through Channel Partitioning and Feature Fusion
* SS-NeRF: Shine-sphere rendering for neural radiance fields
* SSPD: Spatial-Spectral Prior Decoupling Model for Spectral Snapshot Compressive Imaging
* ST-GRIT: Spatio-Temporal Graph Transformer For Internal Ice Layer Thickness Prediction
* Stable-Invertible Graph Convolutional Networks for Label-Efficient Skeleton-Based Recognition
* StableIdentity: Inserting Anybody Into Anywhere at First Sight
* Stacked one-vs-one (SOvO): A new approach for multi-class classification for sEMG recognition
* StegFlow: Flow-Based High-Frequency Distribution Mapping Network for Multi-Image Steganography
* Stencil: Subject-Driven Generation with Context Guidance
* STMixer: Spatial-Temporal Mixer for Continuous Sign Language Recognition
* Structural entropy guided relation extraction on adaptive graph structure
* Structure-Based Drug Design with Geometric Deep Learning: A Comprehensive Survey
* Structure-Guided Diffusion Transformer for Low-Light Image Enhancement
* Structured Instruction Parsing and Scene Alignment For UAV Vision-Language Navigation
* Study on an Intelligent Screening Method for Polycystic Ovary Syndrome Based on Deep PhysicsInformed Neural Network
* Sufficient Conditions for Convergence of RHT and RHTP Algorithms Based on RIC of Order 2S
* Super-resolution time-frequency decomposition with hyperlets for neural spike analysis
* Supervised Contrastive Learning for Indoor Point Cloud Oversegmentation
* Supervised learning for low-resource isolated glyph recognition in palm leaf manuscripts
* Survey of Defenses Against AI-Generated Visual Media: Detection, Disruption, and Authentication, A
* Survey of Small Sea-Surface Target Detection for Maritime Search and Rescue, A
* Survey on Deep Face Restoration: From Non-blind to Blind and Beyond
* Survey on Privacy-Preserving Computing in the Automotive Domain, A
* Survey on Proactive Deepfake Defense: Disruption and Watermarking, A
* survey on video emotion recognition: Segmentation, classification, and explainable AI techniques, A
* SwimVG: Step-Wise Multimodal Fusion and Adaption for Visual Grounding
* Swinscale-LFVS: Parallel Feature Integration for Light Field View Synthesis
* Synthetic Chest X-Ray Augmentation via Generative Variational Autoencoding for Pneumonia Detection
* Synthetic Faces, Real Gains: Improving Age and Gender Classification through Generative Data
* Systematic Literature Review on Vehicular Collaborative Perception: A Computer Vision Perspective, A
* Systematic Review of the Use of Augmented Reality in Pedestrian Navigation, A
* T2RIAD: A Two-Stage Framework for Truck Re-ID With Domain Adversarial and Distillation Learning
* Tackling Ambiguity From Perspectives of Uncertainty Inference and Affinity Diversification for Weakly Supervised Semantic Segmentation
* Tagsim: Topic-Informed Attention Guided Similarity Metric for Image Caption Comparison
* Target Driven Adaptive Loss for Infrared Small Target Detection
* Targetless Extrinsic Calibration of Fisheye Cameras Using Vehicle Detection and Monodepth Alignment in Cylindrical Image Space
* Task-Specific Spatiotemporal Context-Aware Decoupling for Occluded Video Object Detection
* Taylornet: Rethinking monomial-based graph neural networks with taylor expansion
* TCP: Text-Guided Cascade Network for Pedestrian Crossing Intention Prediction
* TDI-TFFNet: Infusing time dependent images and two-stream feature fusion network for gymnastic activity recognition
* Teach Me Sign: Stepwise Prompting LLM for Sign Language Production
* Temporal knowledge graph reasoning with local-global evolutionary patterns
* Tensor band restricted thresholding algorithms for affine tensor rank minimization
* Tensor Completion Framework by Graph Refinement for Incomplete Multi-View Clustering
* Tensor wheel completion with parallel matrix factorization and group smoothness for hyperspectral image recovery
* Tensor-Based Privacy-Aware Driving Route Navigation Based on Cloud-Fog-Edge Calculative User-Vehicle-Road Preferences
* TerraFly-Forensics: A Dataset for Forensic Detection of Generated Map Images with Quality Assessment of Generative Models
* Test-Time Augmentation for Pose-invariant Face Recognition
* Test-Time Vocabulary Adaptation for Language-Driven Object Detection
* Testing Peepers on Pixels: A Demo of Human Recognition Accuracy for Low Resolution Faces
* Tetrahedral molecular pretraining for enhanced property prediction
* Text-to-Floorplan Synthesis via Graph-Conditioned Diffusion Processes
* Texture- and Shape-Based Adversarial Attacks for Overhead Image Vehicle Detection
* Texturing Endoscopic 3D Stomach via Neural Radiance Field Under Uneven Lighting
* TG-TSGNet: A Text-Guided Arbitrary-Resolution Terrain Scene Generation Network
* Time-Efficient Uncertainty Estimation Based on Target Networks in Deep Reinforcement Learning
* Tiny Faces, Big Trouble: Evaluating Super-Resolution for Face Recognition
* TLRR-TF: A fast tensor low-rank representation via tri-factorization
* To Skip or to Mask? A Study of the Adversarial Purification Spectrum
* Tool-Assisted Annotation of Seafloor Sediment-Linked Features Using Weakly Supervised Semantic Segmentation
* Toroidal Adaptive Intensity and Spectrum Updating Image Reconstruction for Fourier Ptychographic Microscopy
* Toward Real-Time BCI Authentication for Enhanced Security in Collaborative Systems
* Toward Thermal Infrared Image Colorization via Large Kernel Convolution and Patch-Wise Graph Contrastive Learning
* Towards All-Time, All-Weather Fod Detection Through Generative AI
* Towards Certified Object Detectors: Certified Runway Detection Using Yolo
* Towards Controllable Real Image Denoising With Camera Parameters
* Towards Dark-Field X-ray Microscopy Through Coherent Encoding
* Towards Effective and Robust Unlearnable Examples Against Object Detection
* Towards Fair and Robust Face Parsing for Generative AI: A Multi-Objective Approach
* Towards Image Copy Detection at E-Commerce Scale
* Towards Invisible Decision-Based Adversarial Attacks Against Visual Object Tracking
* Towards Iris Presentation Attack Detection with Foundation Models
* Towards ML-based Assessment of Synthetic Characters Heads
* Towards Open-set Face Anti-spoofing with Unseen Attack Synthesis
* Towards Reliable Disaster Detection: Comparing Semantic and Heuristic Filters for Multimodal Data
* Towards robust and inversion-free randomized neural networks: The XG-RVFL framework
* Towards Robust Text-Guided Image Compression Under Modality Missing
* Towards Test Time Adaptation in Low Dose Computed Tomography Denoising Via Bias Modulation
* Towards Trustworthy Disaster Severity Scoring: Combining Semantic Alignment and Chain-of-Thought LLMs
* Towards Zero-Shot Differential Morphing Attack Detection with Multimodal Large Language Models
* TP-IoAV: A Tri-Party Cloud Data Protection Scheme for Internet of Autonomous Vehicle Coupled With Chaotic Biometric Cryptography
* TPEech: Target Speaker Extraction and Noise Suppression With Historical Dialogue Text Cues
* Track2Net: A Fast Lightweight Model With Keypoint Alignment and Track Anchors Identification for Railway Line Tracking
* Training A Phase Detection Autofocus Model Using Hybrid Labels
* Trajectory Planning for Autonomous Driving in Transportation Systems Based on Deep Reinforcement Learning and Spatio-Temporal Voxels
* Trajectory-guided Motion Perception for Facial Expression Quality Assessment in Neurological Disorders
* Transductive One-Shot Learning Meet Subspace Decomposition
* TransFA: Transformer-based representation for face attribute evaluation
* Transfer Learning from Visual Speech Recognition to Mouthing Recognition in German Sign Language
* Transform Set Merging for Neural Network-Based Intra Prediction in beyond VVC
* Transformer Augmented Multi-Resolution Hash Encoding in Diffusion Model for 3D Point Cloud Denoising
* Transformer-Based Approaches to Description Sequence Generation for Chinese Characters
* Transparent and Lightweight Tumor-Aware MRI Super-Resolution Framework to Enhance Prostate Cancer Detection, A
* Tree of Shapes Computation Algorithm for Massively Parallel Architectures, A
* Tri-modal fusion for dynamic hand gesture recognition: Integrating RGB, depth, and skeleton data
* TriDGNet: Triple Feature Encoder-Based Dual Granularity Graph Learning Network for Enhanced Travel Time Estimation
* Triqa: Image Quality Assessment by Contrastive Pretraining on Ordered Distortion Triplets
* TrustSkin: A Fairness Pipeline for Trustworthy Facial Affect Analysis Across Skin Tone
* TURBIT: Generating Turbid Underwater Images with Diffusion and Differential Transformers
* Two-Stage Framework For Enhanced Hyperspectral Anomaly Detection
* UAV-based multimodal object detection via feature enhancement and dynamic gated fusion
* Ultrafast High-Flux Single-Photon Lidar Simulator via Neural Mapping
* UltraSeP: Sequence-aware pre-training for echocardiography probe movement guidance
* Uncertainty-Driven Sampling for Efficient Pairwise Comparison Subjective Assessment
* Unconstrained Body Recognition at Altitude and Range: Comparing Four Approaches
* Understanding the adversarial robustness of deep learning-based single-pixel imaging
* Underwater image compression for human and machine visions with hybrid priors embedding
* UNet-Like Transformer Network for Camouflaged Object Detection, A
* Unified multi-modality conditional latent diffusion model for point cloud generation, A
* Unified Multimodal Vessel Trajectory Prediction With Explainable Navigation Intention
* Unified Transformer-Based Framework with Pretraining for Whole Body Grasping Motion Generation, A
* Unifying and Conquering Adversarial Attacks against Deep Face Recognition
* UniGait: A Unified Transformer-based Multitask Framework for Gait Analysis in the Wild
* UniqueSplat: View-Conditioned 3D Gaussian Splatting for Generalizable 3D Reconstruction
* Unleashing the Potential of Hierarchical Region Clues for Open-Vocabulary Multi-Label Classification
* Unlocking A New Paradigm In Robustness For Multi-Step Facial Forgery Detection
* Unlocking human intent perception through multimodal large models
* Unpredictable Trajectory Optimization for UAV-Assisted Anti-Jamming Data Collection
* Unraveling Urban Mobility: A Domain Knowledge-Free Trajectory Classification Using Gramian Angular Fields
* Unraveling Vanishing Point And Calibrating Tiny Objects For Semantic Scene Completion
* Unrevealed Threats: Adversarial Robustness Analysis of Underwater Image Enhancement Models
* Unrolling Nonconvex Graph Total Variation for Image Denoising
* Unsupervised Deep Learning for Anomaly Detection in Automotive Trucks: A Survey
* US-Loss: Integrating Uncertainty Estimation in the Loss Function of Image Segmentation
* User-in-the-Loop View Sampling with Error Peaking Visualization
* USformer: A U-Shaped Structure Transformer for RGB-Thermal Semantic Segmentation and Traffic Scene Understanding
* Using Cross-Domain Detection Loss to Infer Multi-Scale Information for Improved Tiny Head Tracking
* Utilization of Diffusion Models for Noise Reduction in Ultrasound Images
* Utilizing Keypoint R-CNN for Automated Root Angulation Detection in OPGs
* Value-Based Parallel Update MCTS Method for Multi-Agent Cooperative Decision-Making of Connected and Automated Vehicles, A
* Variable priority for unsupervised variable selection
* Variable Rate Learned Wavelet Video Coding Using Temporal Layer Adaptivity
* variational Bayesian approach for multimodal multi-instance classification, A
* Variational graph filter autoencoder for uncovering community structure in multiplex networks
* Vehicle Dynamics Embedded World Models for Autonomous Driving
* Vehicle Visual Perception Under Low Visibility Road Environments Based on AoP&DoP Multi-Polarization Parameter Characterization
* Veta-Gs: View-Dependent Deformable 3d Gaussian Splatting for Thermal Infrared Novel-View Synthesis
* VIDA: Unsupervised Visible-to-Infrared Domain Adaptation for Object Detection Using Large Vision Language Model
* Video Individual Counting with Implicit One-to-Many Matching
* Video is Worth a Thousand Images: Exploring the Latest Trends in Long Video Generation
* Video-Based Assessment of Bradykinesia in Ataxia-Teleangiectasia Patients
* Viewpoint-Dependent 3D Visual Grounding for Mobile Robots
* Virtual Reference Frame-Based Inter Prediction for MPEG Enhanced G-PCC
* Vision Language Model Interpretability with Concept Guided Decoding
* Vision-Based Mobile App GUI Testing: A Survey
* Visionary Co-Driver: Enhancing Driver Perception of Potential Risks With LLM and HUD
* VisionScores - A System-Segmented Image Score Dataset for Deep Learning Tasks
* Visual Artificial Intelligence: Unlocking Efficiency with Psychovisual Models
* Visual Encoders for Generalized Chromosome Recognition
* Visual Keyword Spotting with Multi-Encoder for MAVSR 2025
* Visual Measurement and Uncertainty Prediction of Insulator Thickness in Insulated Rail Joints
* Visual object tracking via adaptive feature fusion and two-stage channel selection
* Visual Prompt Aided Single Shot Object Part Segmentation
* Visual Prompting Through Image Mines
* Visual Question Answering Using Multimodal Data Augmentation for Hausa
* Visual reasoning consistency and robustness analysis of multimodal LLMs
* VisualCent: Visual Human Analysis using Dynamic Centroid Representation
* ViTA-PAR: Visual And Textual Attribute Alignment With Attribute Prompting For Pedestrian Attribute Recognition
* ViV-ReID: Bidirectional Structural-Aware Spatial-Temporal Graph Networks on Large-Scale Video-Based Vessel Re-Identification Dataset
* VQIT-GNN: A collaborative knowledge transfer for node-level structure imbalance
* VQM4HAS: A Real-Time Quality Metric for HEVC Videos in HTTP Adaptive Streaming
* VRScout: Towards Real-Time, Autonomous Testing of Virtual Reality Games
* Warmer Start to Active Learning with Adaptive Gaussian Mixture Models for Skin Lesion Segmentation, A
* Watermarking Diffusion Models By Constructing Generative Classifiers
* WaveE2VID: Frequency-Aware Event-Based Video Reconstruction
* Wavelet Packing for Self-Supervised Monocular Depth Estimation
* Wavelet-Based Denoising Transformer With Fourier Adjustment for UAV Nighttime Tracking
* Weakly Supervised Defect Localization with Residual Features
* Weakly-Supervised Nuclei Segmentation Integrating Hybrid Decoder and Graph-Based Spatial Modeling
* Wearable-Derived Behavioral and Physiological Biomarkers for Classifying Unipolar and Bipolar Depression Severity
* Weighted Average Prediction for Region Adaptive Hierarchical Transform in Solid Geometry Point Cloud Compression
* WF-MDC: Enhancing few-shot website fingerprinting via multiplicative distribution calibration
* When 512X512 is Not Enough: Local Degradation-Aware Multi-Diffusion for Extreme Image Super-Resolution
* Which Image Quality Measure is Optimal for Ultrasound Imaging?
* Wireless Channel as a Sensor: An Anti-Electromagnetic Interference Vehicle Detection Method Based on Wireless Sensing Technology
* Would a Simple Res-UNet Unsupervised Domain Adaptation Solve DubaiSat2 Delineated Labels?
* X265-PVMAF: A Real-Time Perceptual Video Quality Metric for HEVC Video Encoding
* Y-LIChess: Live and Interactive Over-The-Board Chess Recognition and Play with Yolo
* YOLO and SGBM Integration for Autonomous Tree Branch Detection and Depth Estimation in Radiata Pine Pruning Applications
* YOLO-Based Waste Detection in Smart Waste Management for a Cleaner Future: A Review
* YOLO-VG: Enhancing Multi-Stage Feature Interaction for Visual Grounding
* YOLOFeat: Unified Object Detection and Feature Extraction for Multi-Object Tracking
* Your Face, Your Privacy: Combating Unauthorized Usage
* Zero-shot egocentric action recognition via chain-of-imagination prompts and inertial strengthening adaptor
* Zero-Shot Pseudo Labels Generation Using Sam and Clip for Semi-Supervised Semantic Segmentation
* ZVIR: Zero-shot implicit deep image prior with prior activation for infrared and visible image fusion
1259 for 2601

Index for "2"


Last update: 8-Jan-26 14:22:03
Use price@usc.edu for comments.