Keith Price Bibliography update Details for 2210

Update Dates 2210

2210 * *Affective Behavior Analysis In-the-Wild
* *AgriVision: Agriculture-Vision: Challenges and Opportunities for Computer Vision in Agriculture
* *AI City Challenge
* *Art of Robustness: Devil and Angel in Adversarial Machine Learning, The
* *Autonomous Driving
* *Biometrics
* *Bridging the Gap Between Computational Photography and Visual Recognition
* *Computer Vision for Fashion, Art and Design
* *Computer Vision for Microscopy Image Analysis
* *Computer Vision for Physiological Measurement
* *Computer Vision in Sports
* *Continual Learning in Computer Vision
* *CVPR
* *Deep Learning for Geometric Computing
* *EarthVision: Large Scale Computer Vision for Remote Sensing Imagery
* *Efficient Deep Learning for Computer Vision
* *Embedded Vision
* *Fair, Data-Efficient and Trusted Computer Vision
* *Federated Learning for Computer Vision
* *Gaze Estimation and Prediction in the Wild
* *Human-Centered Intelligent Services: Safe and Trustworthy
* *Image Matching: Local Features and Beyond
* *International Conference on Distributed Smart Cameras
* *Joint Ego4D and Egocentric Perception, Interaction & Computing (Ego4D-EPIC)
* *LatinX in CV Research
* *Learned Image Compression
* *Learning With Limited Labelled Data for Image and Video Understanding
* *Media Forensics
* *Mobile AI
* *Multimodal Learning and Applications
* *Neural Architecture Search: Lightweight NAS Challenge (NAS)
* *New Trends in Image Restoration and Enhancement
* *Omnidirectional Computer Vision in Research and Industry
* *Open-Domain Retrieval Under Multi-Modal Settings
* *Perception Beyond the Visible Spectrum
* *Precognition: Seeing Through the Future
* *Robustness in Sequential Data
* *Sketch-Oriented Deep Learning
* *Video Action Detection: Analysing Limitations and Challenges
* *Vision Datasets Understanding
* *Vision for All Seasons: Adverse Weather and Lighting Conditions
* *VOCVALC: Visual Odometry and Computer Vision Applications Based on Location Clues - With a Focus on Mobile Platform Applications
* *Women in Computer Vision
* 2D Compressed Sensing Using Nonlocal Low-Rank Prior Reconstruction for Cipher-Image Coding
* 360-Attack: Distortion-Aware Perturbations from Perspective-Views
* 360MonoDepth: High-Resolution 360° Monocular Depth Estimation
* 3D Ball Localization From A Single Calibrated Image
* 3D Common Corruptions and Data Augmentation
* 3D human tongue reconstruction from single in-the-wild images
* 3D LiDAR Aided GNSS NLOS Mitigation in Urban Canyons
* 3D LoD2 and LoD3 Modeling of Buildings with Ornamental Towers and Turrets Based on LiDAR Data
* 3D Mesh-Based Lifting-and-Projection Network for Human Pose Transfer, A
* 3D Moments from Near-Duplicate Photos
* 3D Photo Stylization: Learning to Generate Stylized Novel Views from a Single Image
* 3D Point Cloud Instance Segmentation of Lettuce Based on PartNet
* 3D Room Layout Recovery Generalizing across Manhattan and Non-Manhattan Worlds
* 3D Scene Painting via Semantic Image Synthesis
* 3D Search Based Hybrid Optimal Trajectory Planning for Autonomous LHD in Turning Maneuvers
* 3D Shape Reconstruction from 2D Images with Disentangled Attribute Flow
* 3D Shape Variational Autoencoder Latent Disentanglement via Mini-Batch Feature Swapping for Bodies and Faces
* 3D-aware Image Synthesis via Learning Structural and Textural Representations
* 3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection
* 3D-VField: Adversarial Augmentation of Point Clouds for Domain Generalization in 3D Object Detection
* 3DAC: Learning Attribute Compression for Point Clouds
* 3DeformRS: Certifying Spatial Deformations on Point Clouds
* 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
* 3DRRDB: Super Resolution of Multiple Remote Sensing Images using 3D Residual in Residual Dense Blocks
* 3MASSIV: Multilingual, Multimodal and Multi-Aspect dataset of Social Media Short Videos
* 3PSDF: Three-Pole Signed Distance Function for Learning Surfaces with Arbitrary Topologies
* 6th AI City Challenge, The
* A(DP)^2SGD: Asynchronous Decentralized Parallel Stochastic Gradient Descent With Differential Privacy
* A-ViT: Adaptive Tokens for Efficient Vision Transformer
* A3D: Studying Pretrained Representations with Programmable Datasets
* AAFormer: A Multi-Modal Transformer Network for Aerial Agricultural Images
* AAGAN: Accuracy-Aware Generative Adversarial Network for Supervised Tasks
* AARGNN: An Attentive Attributed Recurrent Graph Neural Network for Traffic Flow Prediction Considering Multiple Dynamic Factors
* Abandoning the Bayer-Filter to See in the Dark
* ABAW: Valence-Arousal Estimation, Expression Recognition, Action Unit Detection & Multi-Task Learning Challenges
* ABCNet v2: Adaptive Bezier-Curve Network for Real-Time End-to-End Text Spotting
* ABO: Dataset and Benchmarks for Real-World 3D Object Understanding
* ABPN: Adaptive Blend Pyramid Network for Real-Time Local Retouching of Ultra High-Resolution Photo
* Absolute Accuracy Assessment of Lidar Point Cloud Using Amorphous Objects
* AC-WGAN-GP: Generating Labeled Samples for Improving Hyperspectral Image Classification with Small-Samples
* Accelerating DETR Convergence via Semantic-Aligned Matching
* Accelerating Video Object Segmentation with Compressed Video
* Acceleration Strategies for MR-STAT: Achieving High-Resolution Reconstructions on a Desktop PC Within 3 Minutes
* Accurate 3D Body Shape Regression using Metric and Semantic Attributes
* Accurate 3D Hand Pose Estimation for Whole-Body 3D Human Mesh Estimation
* Accurate Maritime Radio Propagation Loss Prediction Approach Employing Neural Networks, An
* accurate stereo matching method based on color segments and edges, An
* Accurately Stable Q-Compensated Reverse-Time Migration Scheme for Heterogeneous Viscoelastic Media
* ACPL: Anti-curriculum Pseudo-labelling for Semi-supervised Medical Image Classification
* Acquiring a Dynamic Light Field through a Single-Shot Coded Image
* ActAR: Actor-Driven Pose Embeddings for Video Action Recognition
* Action unit detection by exploiting spatial-temporal and label-wise attention with transformer
* Action-State Joint Learning-Based Vehicle Taillight Recognition in Diverse Actual Traffic Scenes
* Active Fault Trace Identification Using a LiDAR High-Resolution DEM: A Case Study of the Central Yangsan Fault, Korea
* Active Learning by Feature Mixing
* Active Learning for Open-set Annotation
* Active Object Detection with Epistemic Uncertainty and Hierarchical Information Aggregation
* Active Teacher for Semi-Supervised Object Detection
* ActiveZero: Mixed Domain Learning for Active Stereovision with Zero Annotation
* Activity and Kinematics of Two Adjacent Freeze-Thaw-Related Landslides Revealed by Multisource Remote Sensing of Qilian Mountain
* AdaFace: Quality Adaptive Margin for Face Recognition
* AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition
* AdaInt: Learning Adaptive Intervals for 3D Lookup Tables on Real-time Image Enhancement
* ADAM Challenge: Detecting Age-Related Macular Degeneration from Fundus Images
* AdaMixer: A Fast-Converging Query-Based Object Detector
* ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts
* Adaptive Authority Allocation Approach for Shared Steering Control System
* Adaptive Bitrate Quantization Scheme Without Codebook for Learned Image Compression
* Adaptive Differential Filters for Fast and Communication-Efficient Federated Learning
* Adaptive Early-Learning Correction for Segmentation from Noisy Annotations
* Adaptive Feature Consolidation Network for Burst Super-Resolution
* Adaptive Feature Denoising Based Deep Convolutional Network for Single Image Super-Resolution
* Adaptive feedback connection with a single-level feature for object detection
* Adaptive Gating for Single-Photon 3D Imaging
* Adaptive gradients and weight projection based on quantized neural networks for efficient image classification
* Adaptive Hierarchical Representation Learning for Long-Tailed Object Detection
* Adaptive momentum variance for attention-guided sparse adversarial attacks
* Adaptive Packet Coding for Reliable Underwater Acoustic Communications
* Adaptive preference transfer for personalized IoT entity recommendation
* Adaptive Short-Temporal Induced Aware Fusion Network for Predicting Attention Regions Like a Driver
* Adaptive Spatiotemporal Dependence Learning for Multi-Mode Transportation Demand Prediction
* Adaptive Trajectory Prediction via Transferable GNN
* Adaptive Viewpoint Feature Enhancement-Based Binocular Stereoscopic Image Saliency Detection
* AdaptPose: Cross-Dataset Adaptation for 3D Human Pose Estimation by Learnable Motion Generation
* ADAS: A Direct Adaptation Strategy for Multi-Target Domain Adaptive Semantic Segmentation
* AdaSTE: An Adaptive Straight-Through Estimator to Train Binary Neural Networks
* AdaViT: Adaptive Vision Transformers for Efficient Image Recognition
* ADeLA: Automatic Dense Labeling with Attention for Viewpoint Shift in Semantic Segmentation
* Adiabatic Quantum Computing for Multi Object Tracking
* ADS-B-Based Spatiotemporal Alignment Network for Airport Video Object Segmentation
* advanced bidirectional reflectance factor (BRF) spectral approach for estimating flavonoid content in leaves of Ginkgo plantations, An
* Advanced Operational Approach for Tropical Cyclone Center Estimation Using Geostationary-Satellite-Based Water Vapor and Infrared Channels, An
* Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions
* Adversarial Eigen Attack on BlackBox Models
* Adversarial Machine Learning Attacks Against Video Anomaly Detection Systems
* Adversarial Parametric Pose Prior
* Adversarial Reciprocal Points Learning for Open Set Recognition
* Adversarial Robustness through the Lens of Convolutional Filters
* Adversarial Sample Attack and Defense Method for Encrypted Traffic Data
* Adversarial scratches: Deployable attacks to CNN classifiers
* Adversarial Texture for Fooling Person Detectors in the Physical World
* AEDNet: Adaptive Edge-Deleting Network For Subgraph Matching
* AEGNN: Asynchronous Event-based Graph Neural Networks
* Aerial and Ground Multi-Agent Cooperative Location Framework in GNSS-Challenged Environments, An
* Aerosols on the Tropical Island of La Reunion (21°S, 55°E): Assessment of Climatology, Origin of Variability and Trend
* Aesthetic Text Logo Synthesis via Content-aware Layout Inferring
* Affine Medical Image Registration with Coarse-to-Fine Vision Transformer
* Affinity Attention Graph Neural Network for Weakly Supervised Semantic Segmentation
* Agent-Based Traffic Recommendation System: Revisiting and Revising Urban Traffic Management Strategies, An
* AGO-Net: Association-Guided 3D Point Cloud Object Detection Network
* AIM: an Auto-Augmenter for Images and Meshes
* AirObject: A Temporally Evolving Graph Embedding for Object Identification
* AKB-48: A Real-World Articulated Object Knowledge Base
* Aladdin: Joint Atlas Building and Diffeomorphic Registration Learning with Pairwise Alignment
* Alias-and-Separate: Wideband Speech Coding Using Sub-Nyquist Sampling and Speech Separation
* Align and Prompt: Video-and-Language Pre-training with Entity Prompts
* Align Representations with Base: A New Approach to Self-Supervised Learning
* Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification
* AlignMixup: Improving Representations By Interpolating Aligned Features
* AlignQ: Alignment Quantization with ADMM-based Correlation Preservation
* All-Higher-Stages-In Adaptive Context Aggregation for Semantic Edge Detection
* All-In-One Image Restoration for Unknown Corruption
* All-photon Polarimetric Time-of-Flight Imaging
* Alleviating Representational Shift for Continual Fine-tuning
* Alleviating Semantics Distortion in Unsupervised Low-Level Image-to-Image Translation via Structure Consistency Constraint
* Alpha Matte Generation from Single Input for Portrait Matting
* AME: Attention and Memory Enhancement in Hyper-Parameter Optimization
* Amodal Panoptic Segmentation
* Amodal Segmentation through Out-of-Task and Out-of-Distribution Generalization with a Bayesian Model
* Analysis and Extensions of Adversarial Training for Video Classification
* Analysis of Diurnal Evolution of Cloud Properties and Convection Tracking over the South China Coastal Area
* Analysis of Global Sea Level Change Based on Multi-Source Data
* Analysis of Local Site Effects in the Medimurje Region (North Croatia) and Its Consequences Related to Historical and Recent Earthquakes
* Analysis of Spatiotemporal Variation and Drivers of Ecological Quality in Fuzhou Based on RSEI
* Analysis of Super-Net Heuristics in Weight-Sharing NAS, An
* Analysis of Temporal Tensor Datasets on Product Grassmann Manifold
* Analysis of the Anomalous Environmental Response to the 2022 Tonga Volcanic Eruption Based on GNSS
* Analysis of the Information Entropy on Traffic Flows
* Analysis of the Ionospheric Irregularities and Phase Scintillation at Low and Middle Latitudes Based on Swarm Observations
* Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding
* Anisotropic Convolutional Neural Networks for RGB-D Based Semantic Scene Completion
* AnoDDPM: Anomaly Detection with Denoising Diffusion Probabilistic Models using Simplex Noise
* Anomaly Detection in Autonomous Driving: A Survey
* Anomaly Detection via Reverse Distillation from One-Class Embedding
* Answering knowledge-based visual questions via the exploration of Question Purpose
* ANT: Adapt Network Across Time for Efficient Video Processing
* Anti-Jamming Method and Implementation for GNSS Receiver Based on Array Antenna Rotation
* Anti-phishing technique based on dynamic image captcha using multi secret sharing scheme
* AnyFace: Free-style Text-to-Face Synthesis and Manipulation
* AP-BSN: Self-Supervised Denoising for Real-World Images via Asymmetric PD and Blind-Spot Network
* APES: Articulated Part Extraction from Sprite Sheets
* Appearance and Structure Aware Robust Deep Visual Graph Matching: Attack, Defense and Beyond
* Application of DNN for radar micro-doppler signature-based human suspicious activity recognition
* Application of Hyperspectral Image Clustering Based on Texture-Aware Superpixel Technique in Deep Sea, An
* Application of Machine Learning in Forecasting the Impact of Mining Deformation: A Case Study of Underground Copper Mines in Poland
* Application of Multi-Source Data for Mapping Plantation Based on Random Forest Algorithm in North China
* Application of Random Forest Algorithm on Tornado Detection
* Application of Time-Domain Airborne Electromagnetic Method to the Study of Qingchengzi Ore Concentration Area in China
* Approximate Analytical Solution for the Top-of-Atmosphere Spectral Reflectance of Atmosphere: Underlying Snow System over Antarctica, The
* APRIL: Finding the Achilles' Heel on Privacy for Vision Transformers
* AquaGAN: Restoration of Underwater Images
* AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance Fields
* Arbitrary-Scale Image Synthesis
* Arch-Graph: Acyclic Architecture Relation Predictor for Task-Transferable Neural Architecture Search
* ARCS: Accurate Rotation and Correspondence Search
* Are Multimodal Transformers Robust to Missing Modality?
* Area Under the ROC Curve Maximization for Metric Learning
* ARIA: Adversarially Robust Image Attribution for Content Provenance
* ART-Point: Improving Rotation Robustness of Point Cloud Classifiers via Adversarial Rotation
* ArtiBoost: Boosting Articulated 3D Hand-Object Pose Estimation via Online Exploration and Synthesis
* Artificial Intelligence for Dunhuang Cultural Heritage Protection: The Project and the Dataset
* Artificial Intelligence-Based Energy Efficient Communication System for Intelligent Reflecting Surface-Driven VANETs
* Artistic Style Discovery with Independent Components
* Artistic Style Novel View Synthesis Based on A Single Image
* ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization
* Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities
* Assessing Driving Styles in Commercial Motor Vehicle Drivers After Take-Over Conditions in Highly Automated Vehicles
* Assessing Height Variations in Qinghai-Tibet Plateau from Time-Varying Gravity Data and Hydrological Model
* Assessing Spatiotemporal Changes of SDG Indicators at the Neighborhood Level in Guilin, China: A Geospatial Big Data Approach
* Assessing the 2022 Flood Impacts in Queensland Combining Daytime and Nighttime Optical and Imaging Radar Data
* Assessing the Impact of Neighborhood Size on Temporal Convolutional Networks for Modeling Land Cover Change
* Assessing the Performance of the Satellite-Based Precipitation Products (SPP) in the Data-Sparse Himalayan Terrain
* Assessment and Calibration of ERA5 Severe Winds in the Atlantic Ocean Using Satellite Data
* Assessment of crop traits retrieved from airborne hyperspectral and thermal remote sensing imagery to predict wheat grain protein content
* Assessment of Human-Induced Effects on Sea/Brackish Water Chlorophyll-a Concentration in Ha Long Bay of Vietnam with Google Earth Engine
* Assessment of Iran's Mangrove Forest Dynamics (1990-2020) Using Landsat Time Series
* Assessment of Material Layers in Building Walls Using GeoRadar
* Assessment of Sentinel-2-MSI Atmospheric Correction Processors and In Situ Spectrometry Waters Quality Algorithms
* Asymmetric Information Distillation Network for Lightweight Super Resolution
* Atmospheric Correction Model for Water-Land Boundary Adjacency Effects in Landsat-8 Multispectral Images and Its Impact on Bathymetric Remote Sensing
* Atmospheric turbulence degraded image restoration using a modified dilated convolutional network
* ATPFL: Automatic Trajectory Prediction Model Design under Federated Learning Framework
* ATPS: An AI Based Trust-Aware and Privacy-Preserving System for Vehicle Managements in Sustainable VANETs
* Attention Concatenation Volume for Accurate and Efficient Stereo Matching
* Attention Consistency on Visual Corruptions for Single-Source Domain Generalization
* Attention in Attention: Modeling Context Correlation for Efficient Video Classification
* Attention in Reasoning: Dataset, Analysis, and Modeling
* Attention transfer from human to neural networks for road object detection in winter
* Attention-based Method for Multi-label Facial Action Unit Detection, An
* Attention-Guided Collaborative Counting
* Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network
* Attentive Fine-Grained Structured Sparsity for Image Restoration
* Attenuating Catastrophic Forgetting by Joint Contrastive and Incremental Learning
* AttGGCN Model: A Novel Multi-Sensor Fault Diagnosis Method for High-Speed Train Bogie
* Attributable Visual Similarity Learning
* Attribute Group Editing for Reliable Few-shot Image Generation
* Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-shot Learning
* Audio-Adaptive Activity Recognition Across Video Domains
* Audio-driven Neural Gesture Reenactment with Video Motion Graphs
* Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis
* Audiovisual Generalised Zero-shot Learning with Cross-modal Attention and Language
* Auditing Privacy Defenses in Federated Learning via Generative Gradient Leakage
* Aug-NeRF: Training Stronger Neural Radiance Fields with Triple-Level Physically-Grounded Augmentations
* AugLy: Data Augmentations for Adversarial Robustness
* Augmentation Invariance and Adaptive Sampling in Semantic Segmentation of Agricultural Aerial Images
* Augmentation of Atmospheric Turbulence Effects on Thermal Adapted Object Detection Models
* Augmented Geometric Distillation for Data-Free Incremental Person ReID
* Auto Arborist Dataset: A Large-Scale Benchmark for Multiview Urban Forest Monitoring Under Domain Shift, The
* Auto calibration of multi-camera system for human pose estimation
* Autoencoders: A Comparative Analysis in the Realm of Anomaly Detection
* Autofocus for Event Cameras
* AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation
* AutoLoss-GMS: Searching Generalized Margin-based Softmax Loss Function for Person Re-identification
* AutoLoss-Zero: Searching Loss Functions from Scratch for Generic Tasks
* Automated Health Estimation of Capsicum annuum L. Crops by Means of Deep Learning and RGB Aerial Images
* Automated Progressive Learning for Efficient Training of Vision Transformers
* Automated Radiographic Report Generation Purely on Transformer: A Multicriteria Supervised Approach
* Automated Small River Mapping (ASRM) for the Qinghai-Tibet Plateau Based on Sentinel-2 Satellite Imagery and MERIT DEM
* Automatic Clustering for Unsupervised Risk Diagnosis of Vehicle Driving for Smart Road
* Automatic Color Image Stitching Using Quaternion Rank-1 Alignment
* Automatic Defect Detection of Pavement Diseases
* Automatic dottization of Arabic text (Rasms) using deep recurrent neural networks
* Automatic Drift-Measurement-Data-Processing Method with Digital Ionosondes, An
* Automatic emergency braking/anti-lock braking system coordinated control with road adhesion coefficient estimation for heavy vehicle
* Automatic Extrinsic Calibration Method for LiDAR and Camera Sensor Setups
* Automatic Itinerary Planning Using Triple-Agent Deep Reinforcement Learning
* Automatic Liver Tumor Segmentation on Dynamic Contrast Enhanced MRI Using 4D Information: Deep Learning Model Based on 3D Convolution and Convolutional LSTM
* Automatic Registration for Panoramic Images and Mobile LiDAR Data Based on Phase Hybrid Geometry Index Features
* Automatic Relation-aware Graph Network Proliferation
* Automatic signboard detection and localization in densely populated developing cities
* Automatic Supraglacial Lake Extraction in Greenland Using Sentinel-1 SAR Images and Attention-Based U-Net
* Automatic Synthesis of Diverse Weak Supervision Sources for Behavior Analysis
* AutoMine: An Unmanned Mine Dataset
* Autonomous Intersection Crossing With Vehicle Location Uncertainty
* Autonomous Vehicle Cut-In Algorithm for Lane-Merging Scenarios via Policy-Based Reinforcement Learning Nested Within Finite-State Machine
* Autonomous Vehicle Intelligent System: Joint Ride-Sharing and Parcel Delivery Strategy
* Autoregressive Image Generation using Residual Quantization
* AutoRF: Learning 3D Object Radiance Fields from Single View Observations
* AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation
* AUV-Net: Learning Aligned UV Maps for Texture Transfer and Synthesis
* Auxiliary Learning for Self-Supervised Video Representation via Similarity-based Knowledge Distillation
* AuxMix: Semi-Supervised Learning with Unconstrained Unlabeled Data
* Awareness on Present and Future Trajectory of Vehicle Using Multiple Hypotheses in the Mixed Traffic of Intersection
* AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval
* AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception
* B-cos Networks: Alignment is All We Need for Interpretability
* Back to Reality: Weakly-supervised 3D Object Detection with Shape-Guided Label Enhancement
* Back-compatible Color QR Codes for colorimetric applications
* Backdoor Attacks on Self-Supervised Learning
* Background Activation Suppression for Weakly Supervised Object Localization
* Backscattering Analysis Utilizing Relaxed Hierarchical Equivalent Source Algorithm (RHESA) for Scatterers in Vegetation Medium
* Bacon: Band-Limited Coordinate Networks for Multiscale Scene Representation
* Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory
* Balanced and Hierarchical Relation Learning for One-shot Object Detection
* Balanced Contrastive Learning for Long-Tailed Visual Recognition
* Balanced MSE for Imbalanced Visual Regression
* Balanced Multimodal Learning via On-the-fly Gradient Modulation
* BaLeNAS: Differentiable Architecture Search via the Bayesian Learning Rule
* Bandits for Structure Perturbation-based Black-box Attacks to Graph Neural Networks with Theoretical Guarantees
* BANMo: Building Animatable 3D Neural Models from Many Casual Videos
* BARC: Learning to Regress 3D Dog Shape from Images by Exploiting Breed Information
* BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment
* BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning
* Bayesian Deep Learning for Aircraft Hard Landing Safety Assessment
* Bayesian Invariant Risk Minimization
* Bayesian Kernelized Matrix Factorization for Spatiotemporal Traffic Data Imputation and Kriging
* Bayesian Nonparametric Submodular Video Partition for Robust Anomaly Detection
* BCI: Breast Cancer Immunohistochemical Image Generation through Pyramid Pix2pix
* BCOT: A Markerless High-Precision 3D Object Tracking Benchmark
* BDC-GAN: Bidirectional Conversion Between Computer-Generated and Natural Facial Images for Anti-Forensics
* BE-STI: Spatial-Temporal Integrated Network for Class-agnostic Motion Prediction with Bidirectional Enhancement
* Beach Profile, Water Level, and Wave Runup Measurements Using a Standalone Line-Scanning, Low-Cost (LLC) LiDAR System
* BEHAVE: Dataset and Method for Tracking Human Object Interactions
* Benchmarking human face similarity using identical twins
* Bending Graphs: Hierarchical Shape Matching using Gated Optimal Transport
* Bending Reality: Distortion-aware Transformers for Adapting to Panoramic Semantic Segmentation
* Best of Both Worlds: Combining Model-based and Nonparametric Approaches for 3D Human Body Estimation, The
* beta-DARTS: Beta-Decay Regularization for Differentiable Architecture Search
* Better pseudo-label: Joint domain-aware label and dual-classifier for semi-supervised domain generalization
* Better Trigger Inversion Optimization in Backdoor Scanning
* BEVT: BERT Pretraining of Video Transformers
* Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds
* Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
* Beyond Cross-view Image Retrieval: Highly Accurate Vehicle Localization Using Satellite Image
* Beyond Fixation: Dynamic Window Visual Transformer
* Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-Based Beam Search
* Beyond Semantic to Instance Segmentation: Weakly-Supervised Instance Segmentation via Semantic Knowledge Transfer and Self-Refinement
* Beyond Supervised vs. Unsupervised: Representative Benchmarking and Analysis of Image Representation Learning
* BH2I-GAN: Bidirectional Hashcode-to-Image Translation using Multi-Generative Multi-Adversarial Nets
* Bi-Directional Object-Context Prioritization Learning for Saliency Ranking
* Bi-level Alignment for Cross-Domain Crowd Counting
* Bi-level Doubly Variational Learning for Energy-based Latent Variable Models
* Bibliometric Analysis of IEEE T-ITS Literature Between 2010 and 2019, A
* Bidirectional Motion Estimation with Cyclic Cost Volume for High Dynamic Range Imaging
* Big Geospatial Data and Data-Driven Methods for Urban Dengue Risk Forecasting: A Review
* BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations
* BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
* BigDL 2.0: Seamless Scaling of AI Pipelines from Laptops to Distributed Cluster
* Bijective Mapping Network for Shadow Removal
* Bilateral Video Magnification Filter
* Biomass Calculations of Individual Trees Based on Unmanned Aerial Vehicle Multispectral Imagery and Laser Scanning Combined with Terrestrial Laser Scanning in Complex Stands
* BIOSIG 2021 Special issue on efficient, reliable, and privacy-friendly biometrics
* Bird-Borne Samplers for Monitoring CO2 and Atmospheric Physical Parameters
* Black-Box Test-Time Shape REFINEment for Single View 3D Reconstruction
* Blended Diffusion for Text-driven Editing of Natural Images
* Blind Face Restoration via Integrating Face Shape and Generative Priors
* Blind Image Super-resolution with Elaborate Degradation Modeling on Noise and Kernel
* Blind Non-Uniform Motion Deblurring using Atrous Spatial Pyramid Deformable Convolution and Deblurring-Reblurring Consistency
* Blind Restoration of Atmospheric Turbulence-Degraded Images Based on Curriculum Learning
* Blind2Unblind: Self-Supervised Image Denoising with Visible Blind Spots
* Block-based progressive visual cryptography scheme with uniform progressive recovery and consistent background
* Block-NeRF: Scalable Large Scene Neural View Synthesis
* Blockchain in Digital Twins-Based Vehicle Management in VANETs
* Blockchain-Based Emergency Message Transmission Protocol for Cooperative VANET, A
* Blockchain-Enabled Conditional Decentralized Vehicular Crowdsensing System
* Blood Vessel Segmentation from Low-Contrast and Wide-Field Optical Microscopic Images of Cranial Window by Attention-Gate-Based Network
* Blood-contaminated endoscopic image restoration based on residual VQ-VAE with cascaded structure
* Blueprint Separable Residual Network for Efficient Image Super-Resolution
* BNUDC: A Two-Branched Deep Neural Network for Restoring Images from Under-Display Cameras
* BNV-Fusion: Dense 3D Reconstruction using Bi-level Neural Volume Fusion
* BodyGAN: General-purpose Controllable Neural Human Body Generation
* BodyMap: Learning Full-Body Dense Correspondence Map
* BokehMe: When Neural Rendering Meets Classical Rendering
* Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions
* BoosterNet: Improving Domain Generalization of Deep Neural Nets using Culpability-Ranked Features
* Boosting 3D Object Detection by Simulating Multimodality on Point Clouds
* Boosting Black-Box Attack with Partially Transferred Conditional Adversarial Distribution
* Boosting Crowd Counting via Multifaceted Attention
* Boosting Robustness of Image Matting with Context Assembling and Strong Data Augmentation
* Boosting View Synthesis with Residual Transfer
* BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation
* Bootstrapped Representation Learning for Skeleton-Based Action Recognition
* Bootstrapping ViTs: Towards Liberating Vision Transformers from Pre-training
* Both Style and Fog Matter: Cumulative Domain Adaptation for Semantic Foggy Scene Understanding
* Boundary TextSpotter: Toward Arbitrary-Shaped Scene Text Spotting
* Boundary-aware Image Inpainting with Multiple Auxiliary Cues
* Bounded Adversarial Attack on Deep Content Features
* Bounding-box deep caibration for high performance face detection
* Box-Grained Reranking Matching for Multi-Camera Multi-Target Tracking
* BoxeR: Box-Attention for 2D and 3D Transformers
* BP-triplet net for unsupervised domain adaptation: A Bayesian perspective
* BppAttack: Stealthy and Efficient Trojan Attacks against Deep Neural Networks via Image Quantization and Contrastive Adversarial Learning
* Brain Connectivity Based Graph Convolutional Networks and Its Application to Infant Age Prediction
* Brain-inspired Multilayer Perceptron with Spiking Neurons
* Brain-Supervised Image Editing
* Brand New Dance Partner: Music-Conditioned Pluralistic Dancing Controlled by Multiple Dance Genres, A
* Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
* Bridged Transformer for Vision and Point Cloud 3D Object Detection
* Bridging Global Context Interactions for High-Fidelity Image Completion
* Bridging the Gap Between Automated and Human Facial Emotion Perception
* Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization
* Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
* Bridging Video-Text Retrieval with Multiple Choice Questions
* Brief Analysis of the Dense Extreme Inception Network for Edge Detection, A
* Brief Analysis of the Holistically-Nested Edge Detector, A
* Bring Evanescent Representations to Life in Lifelong Class Incremental Learning
* Bringing Old Films Back to Life
* BSC-Net: Background Suppression Algorithm for Stray Lights in Star Images
* BSRT: Improving Burst Super-Resolution with SWIN Transformer and Flow-Guided Deformable Alignment
* BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild
* Building Function Type Identification Using Mobile Signaling Data Based on a Machine Learning Method
* Burst Image Restoration and Enhancement
* Bus Crowdedness Sensing System Using Deep-Learning Based Object Detection, A
* Bus Headways Analysis for Anomaly Detection
* C-CAM: Causal CAM for Weakly Supervised Semantic Segmentation on Medical Image
* C2 AM: Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation
* C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection
* C2SLR: Consistency-enhanced Continuous Sign Language Recognition
* CAD: Co-Adapting Discriminative Features for Improved Few-Shot Classification
* CaDeX: Learning Canonical Deformation Coordinate Space for Dynamic Surface Representation via Neural Homeomorphism
* CADTransformer: Panoptic Symbol Spotting Transformer for CAD Drawings
* CAFE: Learning to Condense Dataset by Aligning Features
* Calibrating Deep Neural Networks by Pairwise Constraints
* Calibration Inter-Comparison of MODIS and VIIRS Reflective Solar Bands Using Lunar Observations
* Camera Pose Estimation using Implicit Distortion Models
* Camera-Conditioned Stable Feature Generation for Isolated Camera Supervised Person Re-IDentification
* CAMION: Cascade Multi-input Multi-output Network for Skeleton Extraction
* CamLiFlow: Bidirectional Camera-LiDAR Fusion for Joint Optical Flow and Scene Flow Estimation
* Camouflaged Object Detection via Context-Aware Cross-Level Fusion
* Can domain adaptation make object recognition work for everyone?
* Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective
* Can the Mathematical Correctness of Object Configurations Affect the Accuracy of Their Perception?
* Can we trust bounding box annotations for object detection?
* Can You Spot the Chameleon? Adversarially Camouflaging Images from Co-Salient Object Detection
* Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in Videos
* Canonical Voting: Towards Robust Oriented Bounding Box Detection in 3D Scenes
* CAPRI-Net: Learning Compact CAD Shapes with Adaptive Primitive Assembly
* Capturing and Inferring Dense Full-Body Human-Scene Contact
* Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video
* CAR-Net: A Deep Learning-Based Deformation Model for 3D/2D Coronary Artery Registration
* CarlaScenes: A synthetic dataset for odometry in autonomous driving
* Cartoon Image Processing: A Survey
* Cascade Transformers for End-to-End Person Search
* Cascaded Refinement Network for Point Cloud Completion With Self-Supervision
* Cascaded Siamese Self-supervised Audio to Video GAN
* case for using rotation invariant features in state of the art feature matchers, A
* CAT-Det: Contrastively Augmented Transformer for Multimodal 3D Object Detection
* Catching Both Gray and Black Swans: Open-set Supervised Anomaly Detection*
* Categorized Reflection Removal Dataset with Diverse Real-world Scenes, A
* Category Contrast for Unsupervised Domain Adaptation in Visual Tasks
* Category-Aware Transformer Network for Better Human-Object Interaction Detection
* Category-Level 6D Object Pose Estimation With Structure Encoder and Reasoning Attention
* Causal Transportability for Visual Recognition
* Causality Inspired Representation Learning for Domain Generalization
* CBASH: Combined Backbone and Advanced Selection Heads With Object Semantic Proposals for Weakly Supervised Object Detection
* CCIBA*: An Improved BA* Based Collaborative Coverage Path Planning Method for Multiple Unmanned Surface Mapping Vehicles
* CCNet: CNN model with channel attention and convolutional pooling mechanism for spatial image steganalysis
* CD-SDN: Unsupervised Sensitivity Disparity Networks for Hyper-Spectral Image Change Detection
* CD2-pFed: Cyclic Distillation-guided Channel Decoupling for Model Personalization in Federated Learning
* CDAD: A Common Daily Action Dataset with Collected Hard Negative Samples
* CDFKD-MFS: Collaborative Data-Free Knowledge Distillation via Multi-Level Feature Sharing
* CDGNet: Class Distribution Guided Network for Human Parsing
* Cell Edge User Capacity-Coverage Reliability Tradeoff for 5G-R Systems With Overlapped Linear Coverage
* Cell Selection-based Data Reduction Pipeline for Whole Slide Image Analysis of Acute Myeloid Leukemia
* CellTypeGraph: A New Geometric Computer Vision Benchmark
* CENet: Consolidation-and-Exploration Network for Continuous Domain Adaptation
* Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing
* Certified Patch Robustness via Smoothed Vision Transformers
* CFA: Constraint-based Finetuning Approach for Generalized Few-Shot Object Detection
* Challenging Benchmark of Anime Style Recognition, A
* Channel Balancing for Accurate Quantization of Winograd Convolutions
* Characterisation of Benthic Currents from Seabed Bathymetry: An Object-Based Image Analysis of Cold-Water Coral Mounds, A
* Characteristics of Snow Depth and Snow Phenology in the High Latitudes and High Altitudes of the Northern Hemisphere from 1988 to 2018
* Characterization of an Active Fault through a Multiparametric Investigation: The Trecastagni Fault and Its Relationship with the Dynamics of Mt. Etna Volcano (Sicily, Italy)
* Characterization of Extremely Fresh Biomass Burning Aerosol by Means of Lidar Observations
* Characterization of Long-Time Series Variation of Glacial Lakes in Southwestern Tibet: A Case Study in the Nyalam County
* Characterizing and Modeling Tropical Sandy Soils through VisNIR-SWIR, MIR Spectroscopy, and X-ray Fluorescence
* Characterizing pandemic waves: A latent class analysis of COVID-19 spread across US counties
* Characterizing Spatiotemporal Patterns of Winter Wheat Phenology from 1981 to 2016 in North China by Improving Phenology Estimation
* Characterizing Target-absent Human Attention
* Charging-Expense Minimization Through Assignment Rescheduling of Movable Charging Stations in Electric Vehicle Networks
* CHEX: CHannel EXploration for CNN Model Compression
* ChiTransformer: Towards Reliable Stereo from Cues
* CIPPSRNet: A Camera Internal Parameters Perception Network Based Contrastive Learning for Thermal Image Super-Resolution
* Citizen Science to Assess Light Pollution with Mobile Phones
* City-Scale Multi-Camera Vehicle Tracking based on Space-Time-Appearance Features
* CL3D: Camera-LiDAR 3D Object Detection With Point Feature Enhancement and Point-Guided Fusion
* Class Re-Activation Maps for Weakly-Supervised Semantic Segmentation
* Class Similarity Weighted Knowledge Distillation for Continual Semantic Segmentation
* Class-Aware Contrastive Semi-Supervised Learning
* Class-Balanced Pixel-Level Self-Labeling for Domain Adaptive Semantic Segmentation
* Class-Incremental Learning by Knowledge Distillation with Adaptive Feature Consolidation
* Class-Incremental Learning with Strong Pre-trained Models
* Class-wise Thresholding for Robust Out-of-Distribution Detection
* Classification of emotions using EEG activity associated with different areas of the brain
* Classification of Facial Expression In-the-Wild based on Ensemble of Multi-head Cross Attention Networks
* Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs
* Classifying Sparse Vegetation in a Proglacial Valley Using UAV Imagery and Random Forest Algorithm
* Clean Implicit 3D Structure from Noisy 2D STEM Images
* CLIMS: Cross Language Image Matching for Weakly Supervised Semantic Segmentation
* CLIP-Event: Connecting Text and Images with Event Structures
* CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation
* CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields
* Clipped Hyperbolic Classifiers Are Super-Hyperbolic Classifiers
* CLIPstyler: Image Style Transfer with a Single Text Condition
* Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification
* Closer Look at Blind Super-Resolution: Degradation Models, Baselines, and Performance Upper Bounds, A
* Closer Look at Few-shot Image Generation, A
* Closing the Generalization Gap of Cross-Silo Federated Medical Image Segmentation
* Cloth-Changing Person Re-identification from A Single Image with Gait Prediction and Regularization
* Clothes-Changing Person Re-identification with RGB Modality Only
* ClothFormer: Taming Video Virtual Try-on in All Module
* Cloud and Snow Identification Based on DeepLab V3+ and CRF Combined Model for GF-1 WFV Images
* Cloud Contaminated Multispectral Remote Sensing Image Enhancement Algorithm Based on MobileNet
* Clouds in the Vicinity of the Stratopause Observed with Lidars at Midlatitudes (40.5-41°N) in China
* CLRNet: Cross Layer Refinement Network for Lane Detection
* Clues of Lithosphere, Atmosphere and Ionosphere Variations Possibly Related to the Preparation of La Palma 19 September 2021 Volcano Eruption
* Cluster-Based Partition Method of Remote Sensing Data for Efficient Distributed Image Processing, A
* Cluster-guided Image Synthesis with Unconditional Models
* Cluster-to-adapt: Few Shot Domain Adaptation for Semantic Segmentation across Disjoint Labels
* ClusterGNN: Cluster-based Coarse-to-Fine Graph Neural Network for Efficient Feature Matching
* Clustering Arid Rangelands Based on NDVI Annual Patterns and Their Persistence
* Clustering Plotted Data by Image Segmentation
* Clustering-Based Optimization Method for the Driving Cycle Construction: A Case Study in Fuzhou and Putian, China, A
* CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
* CMT: Convolutional Neural Networks Meet Vision Transformers
* CNLL: A Semi-supervised Approach For Continual Noisy Label Learning
* CNN Ensemble Based on a Spectral Feature Refining Module for Hyperspectral Image Classification, A
* CNN Filter DB: An Empirical Investigation of Trained Convolutional Filters
* CNN-Enhanced Heterogeneous Graph Convolutional Network: Inferring Land Use from Land Cover with a Case Study of Park Segmentation
* Co-advise: Cross Inductive Bias Distillation
* Co-Attention Fusion Network for Multimodal Skin Cancer Diagnosis
* Co-domain Symmetry for Complex-Valued Deep Learning
* Co-segmentation inspired attention module for video-based computer vision tasks
* CO-SNE: Dimensionality Reduction and Visualization for Hyperbolic Data
* COAP: Compositional Articulated Occupancy of People
* Coarse-to-Fine Boundary Localization method for Naturalistic Driving Action Recognition, A
* Coarse-to-Fine Cascaded Networks with Smooth Predicting for Video Facial Expression Recognition
* Coarse-To-Fine Deep Video Coding with Hyperprior-Guided Mode Prediction
* Coarse-to-Fine Feature Mining for Video Semantic Segmentation
* Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation
* Coarse-to-Fine Reasoning for Visual Question Answering
* Coastal Upwelling in the Western Bay of Bengal: Role of Local and Remote Windstress
* Coastal Waveform Retracking for HY-2B Altimeter Data by Determining the Effective Trailing Edge and the Low Noise Leading Edge
* Code Optimization and Angle-Doppler Imaging for ST-CDM LFMCW MIMO Radar Systems
* CodedVTR: Codebook-based Sparse Voxel Transformer with Geometric Guidance
* CoDo: Contrastive Learning with Downstream Background Invariance for Detection
* Coherent Point Drift Revisited for Non-rigid Shape Matching and Registration
* Colar: Effective and Efficient Online Action Detection by Consulting Exemplars
* Collaborative Learning for Hand and Object Reconstruction with Attention-guided Graph Convolution
* Collaborative Learning with Unreliability Adaptation for Semi-Supervised Image Classification
* Collaborative Transformers for Grounded Situation Recognition
* Color Invariant Skin Segmentation
* Combined Effects of the ENSO and the QBO on the Ozone Valley over the Tibetan Plateau
* Combining 2D texture and 3D geometry features for Reliable iris presentation attack detection using light field focal stack
* Combining Deep Semantic Edge and Object Segmentation for Large-Scale Roof-Part Polygon Extraction from Ultrahigh-Resolution Aerial Imagery
* Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction
* Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data
* Communication Quality Prediction for Internet of Vehicle (IoV) Networks: An Elman Approach
* Communication-Efficient Federated Data Augmentation on Non-IID Data
* Compact Vehicle Driver Fatigue Recognition Technology Based on EEG Signal
* Comparative Analysis of Binhu and Cosmic-2 Radio Occultation Data
* Comparing Correspondences: Video Prediction with Correspondence-wise Losses
* Comparing deep learning models for low-light natural scene image enhancement and their impact on object detection and classification: Overview, empirical evaluation, and challenges
* Comparing Machine and Deep Learning Methods for the Phenology-Based Classification of Land Cover Types in the Amazon Biome Using Sentinel-1 Time Series
* Comparing the Observable Response Times of ACC and CACC Systems
* Comparison and Analysis of Stellar Occultation Simulation Results and SABER-Satellite-Measured Data in Near Space
* Comparison of CoModGANs, LaMa and GLIDE for Art Inpainting Completing M.C Escher's Print Gallery
* Comparison of Ground Point Filtering Algorithms for High-Density Point Clouds Collected by Terrestrial LiDAR
* Comparison of Lake Ice Extraction Methods Based on MODIS Images
* Comparison of Land Use Land Cover Classifiers Using Different Satellite Imagery and Machine Learning Techniques
* Comparison of Processing Schemes for Automotive MIMO SAR Imaging, A
* Comparison of S5P/TROPOMI Inferred NO2 Surface Concentrations with In Situ Measurements over Central Europe
* Comparison of Satellite Precipitation Products: IMERG and GSMaP with Rain Gauge Observations in Northern China
* Compensating for Local Ambiguity With Encoder-Decoder in Urban Scene Segmentation
* Complete and temporally consistent video outpainting
* Complete DC Trolleybus Grid Model With Bilateral Connections, Feeder Cables, and Bus Auxiliaries, A
* Complex Backdoor Detection by Symmetric Feature Differencing
* Complex Mountain Road Extraction in High-Resolution Remote Sensing Images via a Light Roadformer and a New Benchmark
* Complex Video Action Reasoning via Learnable Markov Logic Network
* Composite Travel Generative Adversarial Networks for Tabular and Sequential Population Synthesis
* Compositional Mixture Representations for Vision and Text
* Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning
* Compound Domain Generalization via Meta-Knowledge Encoding
* Comprehending and Ordering Semantics for Image Captioning
* Comprehensive Analysis Method for Reversible Data Hiding in Stream-Cipher-Encrypted Images, A
* Comprehensive Remote Sensing Technology for Monitoring Landslide Hazards and Disaster Chain in the Xishan Mining Area of Beijing
* Comprehensive Review of Modern Object Segmentation Approaches, A
* comprehensive review of video steganalysis, A
* Comprehensive Study of Image Classification Model Sensitivity to Foregrounds, Backgrounds, and Visual Attributes, A
* Compressing Models with Few Samples: Mimicking then Replacing
* Compressive Single-Photon 3D Cameras
* Computing Wasserstein-p Distance Between Images with Linear Cost
* Concept Activation Vectors for Generating User-Defined 3D Shapes
* Concrete Bridge Defects Identification and Localization Based on Classification Deep Convolutional Neural Networks and Transfer Learning
* Condensing CNNs with Partial Differential Equations
* Conditional Feature Embedding by Visual Clue Correspondence Graph for Person Re-Identification
* Conditional Feature Learning Based Transformer for Text-Based Person Search
* Conditional GAN with 3D discriminator for MRI generation of Alzheimer's disease progression
* Conditional mixture modeling and model-based clustering
* Conditional Prompt Learning for Vision-Language Models
* Conditioned and composed image retrieval combining and partially fine-tuning CLIP-based features
* ConDor: Self-Supervised Canonicalization of 3D Pose for Partial Shapes
* CoNeRF: Controllable Neural Radiance Fields
* Confidence based class weight and embedding discrepancy constraint network for partial domain adaptation
* Confidence Propagation Cluster: Unleash Full Potential of Object Detectors
* Conformer and Blind Noisy Students for Improved Image Quality Assessment
* Conjugate Adder Net (CAddNet) - a Space-Efficient Approximate CNN
* Connecting the Complementary-view Videos: Joint Camera Identification and Subject Association
* Conservative Approach for Unbiased Learning on Unknown Biases, A
* Consistency driven Sequential Transformers Attention Model for Partially Observable Scenes
* Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection
* Consistency-based Active Learning for Object Detection
* Consistent Explanations by Contrastive Learning
* Consortium Blockchain-Based Computation Offloading Using Mobile Edge Platoon Cloud in Internet of Vehicles
* Constellations: A novel dataset for studying iterative inference in humans and AI
* Constrained Few-shot Class-incremental Learning
* Constrained Optimization of FPGA Design for Spaceborne InSAR Processing
* Contactless automated lifting of latent fingerprints from difficult curved surfaces
* Contactless Blood Pressure Measurement via Remote Photoplethysmography with Synthetic Data Generation Using Generative Adversarial Network
* Content-Augmented Feature Pyramid Network with Light Linear Spatial Transformers for Object Detection
* Context Attention Network for Skeleton Extraction
* Context-Aware 3D Object Detection From a Single Image in Autonomous Driving
* Context-Aware Feature Learning for Noise Robust Person Search
* Context-Aware Sequence Alignment using 4D Skeletal Augmentation
* Context-Aware Video Reconstruction for Rolling Shutter Cameras
* Contextual Debiasing for Visual Recognition with Causal Mechanisms
* Contextual Instance Decoupling for Robust Multi-Person Pose Estimation
* Contextual Outpainting with Object-Level Contrastive Learning
* Contextual Similarity Distillation for Asymmetric Image Retrieval
* Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision
* ContIG: Self-supervised Multimodal Contrastive Learning for Medical Imaging with Genetics
* Continual Active Adaptation to Evolving Distributional Shifts
* Continual Hippocampus Segmentation with Transformers
* Continual Learning Based on OOD Detection and Task Masking
* Continual Learning for Visual Search with Backward Consistent Feature Embedding
* Continual Learning with Lifelong Vision Transformer
* Continual Learning with Transformers for Image Classification
* Continual Object Detection via Prototypical Task Correlation Guided Gating Mechanism
* Continual Predictive Learning from Videos
* Continual semi-supervised learning through contrastive interpolation consistency
* Continual Stereo Matching of Continuous Driving Scenes with Growing Architecture
* Continual Test-Time Domain Adaptation
* Continually Learning Self-Supervised Representations with Projected Functional Regularization
* Continuous and Unified Person Re-Identification
* Continuous Emotion Recognition using Visual-audio-linguistic Information: A Technical Report for ABAW3
* Continuous label distribution learning
* Continuous Scene Representations for Embodied AI
* Contour information regularized tensor ring completion for realistic image restoration
* Contour loss for instance segmentation via k-step distance transformation image
* Contour-Hugging Heatmaps for Landmark Detection
* Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition
* Contrastive Boundary Learning for Point Cloud Segmentation
* Contrastive Conditional Neural Processes
* Contrastive Dual Gating: Learning Sparse Features With Contrastive Learning
* Contrastive Learning for Space-time Correspondence via Self-cycle Consistency
* Contrastive Learning for Unsupervised Video Highlight Detection
* Contrastive Learning-based Robust Object Detection under Smoky Conditions
* Contrastive Regression for Domain Adaptation on Gaze Estimation
* Contrastive Regularization for Semi-Supervised Learning
* Contrastive Test-Time Adaptation
* ContrastMask: Contrastive Learning to Segment Every Thing
* Controllable Animation of Fluid Elements in Still Images
* Controllable Dynamic Multi-Task Architectures
* Controls on Alpine Lake Dynamics, Tien Shan, Central Asia
* ConvNet for the 2020s, A
* Convolution of Convolution: Let Kernels Spatially Collaborate
* Convolutional Neural Network for Large-Scale Greenhouse Extraction from Satellite Images Considering Spatial Features, A
* Convolutions for Spatial Interaction Modeling
* Cooling by Cyprus Lows of Surface and Epilimnion Water in Subtropical Lake Kinneret in Rainy Seasons
* Cooperated Spectral Low-Rankness Prior and Deep Spatial Prior for HSI Unsupervised Denoising
* Cooperative Exchange-Based Platooning Using Predicted Fuel-Optimal Operation of Heavy-Duty Vehicles
* Cooperative Merging Strategy for Connected and Automated Vehicles Based on Game Theory With Transferable Utility, A
* Cooperative Optimal Control of the Following Operation of High-Speed Trains
* Cooperative Spacing Sampled Control of Vehicle Platoon Considering Undirected Topology and Analog Fading Networks
* Coopernaut: End-to-End Driving with Cooperative Perception for Networked Vehicles
* CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs
* CoRe: Color Regression for Multicolor Fashion Garments
* CORE: Consistent Representation Learning for Face Forgery Detection
* Corner-based object detection method for reactivating box constraints
* Corrected rank residual constraint model for image denoising
* Correlation Verification for Image Retrieval
* Correlation-Aware Deep Tracking
* CorrGAN: Input Transformation Technique Against Natural Corruptions
* CoSSL: Co-Learning of Representation and Classifier for Imbalanced Semi-Supervised Learning
* COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval
* Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation
* Counting People by Estimating People Flows
* Coupled Iterative Refinement for 6D Multi-Object Pose Estimation
* Coupling Vision and Proprioception for Navigation of Legged Robots
* Coupling Vision and Proprioception for Navigation of Legged Robots
* Coupling Vision and Proprioception for Navigation of Legged Robots
* CPPF: Towards Robust Category-Level 9D Pose Estimation in the Wild
* CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow
* Crafting Better Contrastive Views for Siamese Representation Learning
* CREAM: Weakly Supervised Object Localization via Class RE-Activation Mapping
* CRIS: CLIP-Driven Referring Image Segmentation
* Critical Regularizations for Neural Surface Reconstruction in the Wild
* CroMM-VSR: Cross-Modal Memory Augmented Visual Speech Recognition
* CroMo: Cross-Modal Learning for Monocular Depth Estimation
* Crop Water Productivity Mapping and Benchmarking Using Remote Sensing and Google Earth Engine Cloud Computing
* Cross Domain Object Detection by Target-Perceived Dual Branch Distillation
* Cross Modal Retrieval with Querybank Normalisation
* Cross Transferring Activity Recognition to Word Level Sign Language Detection
* Cross-Architecture Self-supervised Video Representation Learning
* Cross-dataset Learning for Generalizable Land Use Scene Classification
* Cross-Domain Adaptive Teacher for Object Detection
* Cross-Domain Attention Network for Unsupervised Domain Adaptation Crowd Counting
* Cross-Domain Correlation Distillation for Unsupervised Domain Adaptation in Nighttime Semantic Segmentation
* Cross-domain Few-shot Learning with Task-specific Adapters
* Cross-Domain Gated Learning for Domain Generalization
* Cross-Image Relational Knowledge Distillation for Semantic Segmentation
* Cross-modal Background Suppression for Audio-Visual Event Localization
* Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation
* Cross-Modal Cross-Domain Dual Alignment Network for RGB-Infrared Person Re-Identification
* Cross-modal Image Synthesis within Dual-Energy X-ray Security Imagery
* Cross-modal Map Learning for Vision and Language Navigation
* Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
* Cross-modal Target Retrieval for Tracking by Natural Language
* Cross-Modal Transferable Adversarial Attacks from Images to Videos
* Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition
* Cross-patch Dense Contrastive Learning for Semi-supervised Segmentation of Cellular Nuclei in Histopathologic Images
* Cross-phenological-region crop mapping framework using Sentinel-2 time series Imagery: A new perspective for winter crops in China
* Cross-Resolution Distillation for Efficient 3D Medical Image Registration
* Cross-view Transformers for real-time Map-view Semantic Segmentation
* CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data
* Crossmodal Representation Learning for Zero-shot Action Recognition
* CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding
* Crowd Counting in the Frequency Domain
* CSG0: Continual Urban Scene Generation with Zero Forgetting
* CSR: Cascade Conditional Variational Auto Encoder with Socially-aware Regression for Pedestrian Trajectory Prediction
* CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows
* CTTE: Customized Travel Time Estimation via Mobile Crowdsensing
* CVF-SID: Cyclic multi-Variate Function for Self-Supervised Image Denoising by Disentangling Noise from Image
* CVNet: Contour Vibration Network for Building Extraction
* Cycle-Consistent Counterfactuals by Latent Transformations
* CycleMix: A Holistic Strategy for Medical Image Segmentation from Scribble Supervision
* Cyclical Pruning for Sparse Neural Networks
* CyCoSeg: A Cyclic Collaborative Framework for Automated Medical Image Segmentation
* D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions
* D2-Net: Dual Disentanglement Network for Brain Tumor Segmentation With Missing Modalities
* DA-AE: Disparity-Alleviation Auto-Encoder Towards Categorization of Heritage Images for Aggrandized 3D Reconstruction
* DA3: Dynamic Additive Attention Adaption for Memory-Efficient On-Device Multi-Domain Learning
* DAD-3DHeads: A Large-scale Dense, Accurate and Diverse Dataset for 3D Head Alignment from a Single Image
* DAFormer: Improving Network Architectures and Training Strategies for Domain-Adaptive Semantic Segmentation
* Daily Spatial Distribution of Apparent Temperature Comfort Zone in China Based on Heat Index
* DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection
* DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion
* Dancing under the stars: video denoising in starlight
* DArch: Dental Arch Prior-assisted 3D Tooth Instance Segmentation with Weak Annotations
* Dark Corner on Skin Lesion Image Dataset: Does it matter?
* DASO: Distribution-Aware Semantics-Oriented Pseudo-label for Imbalanced Semi-Supervised Learning
* Data Protection in Palmprint Recognition via Dynamic Random Invisible Watermark Embedding
* Data Rate Reduction for Video Streams in Teleoperated Driving
* Data-Driven Approach for Electric Bus Energy Consumption Estimation, A
* Data-Driven Optimization for Dynamic Shortest Path Problem Considering Traffic Safety
* Data-Free Network Compression via Parametric Non-uniform Mixed Precision Quantization
* DATA: Domain-Aware and Task-Aware Self-supervised Learning
* Dataset Distillation by Matching Training Trajectories
* Dataset Distillation by Matching Training Trajectories
* DAtRNet: Disentangling Fashion Attribute Embedding for Substitute Item Retrieval
* Day-to-Night Image Synthesis for Training Nighttime Neural ISPs
* DC-SSL: Addressing Mismatched Class Distribution in Semi-Supervised Learning
* DcTr: Noise-robust point cloud completion by dual-channel transformer with cross-attention
* De-rendering 3D Objects in the Wild
* DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers
* Debiased Learning from Naturally Imbalanced Pseudo-Labels
* Deblur-NeRF: Neural Radiance Fields from Blurry Images
* Deblurring via Stochastic Refinement
* Decision surface optimization in mapping exotic mangrove species (Sonneratia apetala) across latitudinal coastal areas of China
* Decomposition-Based Heuristic Method for Inventory Routing Problem, A
* DECORE: Deep Compression with Reinforcement Learning
* Decoupled Knowledge Distillation
* Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
* Decoupling and Recoupling Spatiotemporal Representation for RGB-D-based Motion Recognition
* Decoupling Makes Weakly Supervised Local Feature Better
* Decoupling multi-task causality for improved skin lesion segmentation and classification
* Decoupling Zero-Shot Semantic Segmentation
* DeeCap: Dynamic Early Exiting for Efficient Image Captioning
* Deep 3D-to-2D Watermarking: Embedding Messages in 3D Meshes and Extracting Them from 2D Renderings
* Deep Adaptively-Enhanced Hashing With Discriminative Similarity Guidance for Unsupervised Cross-Modal Retrieval
* Deep Anomaly Discovery from Unlabeled Videos via Normality Advantage and Self-Paced Refinement
* Deep autoregressive models with spectral attention
* Deep Color Consistent Network for Low-Light Image Enhancement
* Deep Constrained Least Squares for Blind Image Super-Resolution
* Deep Decomposition for Stochastic Normal-Abnormal Transport
* Deep density estimation based on multi-spectral remote sensing data for in-field crop yield forecasting
* Deep Depth from Focus with Differential Focus Volume
* Deep Equilibrium Optical Flow Estimation
* Deep Generalized Unfolding Networks for Image Restoration
* Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models
* Deep Hierarchical Semantic Segmentation
* Deep Hybrid Models for Out-of-Distribution Detection
* Deep Hyperspectral-Depth Reconstruction Using Single Color-Dot Projection
* Deep Image Inpainting With Enhanced Normalization and Contextual Attention
* Deep Image Interpolation: A Unified Unsupervised Framework for Pansharpening
* Deep Image Retrieval is not Robust to Label Noise
* Deep Image-based Illumination Harmonization
* Deep Learning Approach to Clustering Visual Arts, A
* Deep Learning Classification by ResNet-18 Based on the Real Spectral Dataset from Multispectral Remote Sensing Images
* Deep Learning Classifier for Advancing Video Monitoring of Atrial Fibrillation
* Deep learning for 3D vision
* Deep learning for deepfakes creation and detection: A survey
* Deep Learning for InSAR Phase Filtering: An Optimized Framework for Phase Unwrapping
* Deep learning in the grading of diabetic retinopathy: A review
* Deep Learning-Based Automatic Extraction of Cyanobacterial Blooms from Sentinel-2 MSI Satellite Data
* Deep Multi-Branch Aggregation Network for Real-Time Semantic Segmentation in Street Scenes
* Deep Neural Network with Walsh-Hadamard Transform Layer For Ember Detection during a Wildfire
* Deep Normalized Cross-Modal Hashing with Bi-Direction Relation Reasoning
* Deep orientation-aware functional maps: Tackling symmetry issues in Shape Matching
* Deep patch-wise supervision for presentation attack detection
* Deep Rectangling for Image Stitching: A Learning Baseline
* Deep RL-Based Algorithm for Coordinated Charging of Electric Vehicles, A
* Deep Safe Multi-view Clustering: Reducing the Risk of Clustering Performance Degradation Caused by View Increase
* Deep Saliency Prior for Reducing Visual Distraction
* Deep Scale-space Mining Network for Single Image Deraining
* Deep Shape-Aware Person Re-Identification for Overcoming Moderate Clothing Changes
* Deep Sparse Representation Based Image Restoration With Denoising Prior
* Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and Localization
* Deep Stereo Image Compression via Bi-directional Coding
* Deep Unfolded Prior-Aided RPCA Network for Cloud Removal, A
* Deep Unlearning via Randomized Conditionally Independent Hessians
* Deep vanishing point detection: Geometric priors make dataset variations vanish
* Deep Visual Geo-localization Benchmark
* Deep-FlexISP: A Three-Stage Framework for Night Photography Rendering
* Deep-Learning-Based Fast Optical Coherence Tomography (OCT) Image Denoising for Smart Laser Osteotomy
* DeepACO: A Robust Deep Learning-based Automatic Checkout System
* DeepCurrents: Learning Implicit Representations of Shapes with Boundaries
* DeepDPM: Deep Clustering With an Unknown Number of Clusters
* Deeper Dive Into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic Information, A
* Deeper Look into Aleatoric and Epistemic Uncertainty Disentanglement, A
* DeepFace-EMD: Re-ranking Using Patch-wise Earth Mover's Distance Improves Out-Of-Distribution Face Identification
* DeepFake Disrupter: The Detector of DeepFake Is My Friend
* DeepForest: Novel Deep Learning Models for Land Use and Land Cover Classification Using Multi-Temporal and -Modal Sentinel Data of the Amazon Basin
* DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection
* DeepLIIF: An Online Platform for Quantification of Clinical Pathology Slides
* deepPIC: Deep Perceptual Image Clustering For Identifying Bias In Vision Datasets
* DeepTrack: Lightweight Deep Learning for Vehicle Trajectory Prediction in Highways
* DEFEAT: Deep Hidden Feature Backdoor Attacks by Imperceptible Perturbation and Latent Representation Constraints
* Defending against attacks tailored to transfer learning via feature distancing
* Defensive Patches for Robust Recognition in the Physical World
* DeFLoc: Deep Learning Assisted Indoor Vehicle Localization Atop FM Fingerprint Map
* Deforestation Detection in the Amazon Using DeepLabv3+ Semantic Segmentation Model Variants
* Deformable ProtoPNet: An Interpretable Image Classifier Using Deformable Prototypes
* Deformable Sprites for Unsupervised Video Decomposition
* Deformable Video Transformer
* Deformation and Correspondence Aware Unsupervised Synthetic-to-Real Scene Flow Estimation for Point Clouds
* Degradation-agnostic Correspondence from Resolution-asymmetric Stereo
* Degree-of-linear-polarization-based Color Constancy
* DeltaCNN: End-to-End CNN Inference of Sparse Frame Differences in Videos
* Delving Deep into the Generalization of Vision Transformers under Distribution Shifts
* Delving Deeper Into Mask Utilization in Video Object Segmentation
* Delving into High-Quality Synthetic Face Occlusion Segmentation Datasets
* Delving into the Estimation Shift of Batch Normalization in a Network
* Democracy Does Matter: Comprehensive Feature Mining for Co-Salient Object Detection
* DEMVSNet: Denoising and depth inference for unstructured multi-view stereo on noised images
* Demystifying the Neural Tangent Kernel from a Practical Perspective: Can it be trusted for Neural Architecture Search without training?
* Denoising Pretraining for Semantic Segmentation
* Dense Depth Priors for Neural Radiance Fields from Sparse Input Views
* Dense Learning based Semi-Supervised Object Detection
* Dense Relational Image Captioning via Multi-Task Triple-Stream Networks
* DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
* Density-Guided Label Smoothing for Temporal Localization of Driving Actions
* Density-preserving Deep Point Cloud Compression
* Depth Estimation by Combining Binocular Stereo and Monocular Structured-Light
* Depth Repeated-Enhancement RGB Network for Rail Surface Defect Inspection
* Depth-Aware Generative Adversarial Network for Talking Head Video Generation
* Depth-Guided Progressive Network for Object Detection
* Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows
* Depth-supervised NeRF: Fewer Views and Faster Training for Free
* Depthwise Convolution For Compact Object Detector In nighttime Images
* Desert Soil Salinity Inversion Models Based on Field In Situ Spectroscopy in Southern Xinjiang, China
* DeSI: Deepfake Source Identifier for Social Media
* Design and Analysis of a New Deployable Docking Mechanism for Microsatellites
* Design of a focused light field fundus camera for retinal imaging
* DESTR: Object Detection with Split Transformer
* Detailed Avatar Recovery From Single Image
* Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution
* Detecting and Suppressing Marine Snow for Underwater Visual SLAM
* Detecting Camouflaged Object in Frequency Domain
* Detecting Changes in the Spatiotemporal Pattern of Bike Sharing: A Change-Point Topic Model
* Detecting Deepfakes with Self-Blended Images
* Detecting Driver Cognition Alertness State From Visual Activities in Normal and Emergency Scenarios
* Detecting Objects in Less Response Time for Processing Multimedia Events in Smart Cities
* Detecting Real-Time Deep-Fake Videos Using Active Illumination
* Detecting Undeclared-Leader-Follower Structure in Pedestrian Evacuation Using Transfer Entropy
* Detecting Vehicles on the Edge: Knowledge Distillation to Improve Performance in Heterogeneous Road Traffic
* Detecting, Tracking and Counting Motorcycle Rider Traffic Violations on Unconstrained Roads
* Detection and Counting of Corn Plants in the Presence of Weeds with Convolutional Neural Networks
* Detection and Tracking Meet Drones Challenge
* Detection of Bering Sea Slope Mesoscale Eddies Derived from Satellite Altimetry Data by an Attention Network
* Detection of Carbon Use Efficiency Extremes and Analysis of Their Forming Climatic Conditions on a Global Scale Using a Remote Sensing-Based Model
* Detection of Intrusion behavior in cloud applications using Pearson's chi-squared distribution and decision tree classifiers
* Detection of Localization Failures Using Markov Random Fields With Fully Connected Latent Variables for Safe LiDAR-Based Automated Driving
* Detection of Peanut Leaf Spot Disease Based on Leaf-, Plant-, and Field-Scale Hyperspectral Reflectance
* Detection of Ships Cruising in the Azimuth Direction Using Spotlight SAR Images with a Deep Learning Method
* Detection of Waste Plastics in the Environment: Application of Copernicus Earth Observation Data
* Detector-Free Weakly Supervised Group Activity Recognition
* DetectorDetective: Investigating the Effects of Adversarial Examples on Object Detectors
* Determination of foliar traits in an ecologically distinct conifer species in Maine using Sentinel-2 imagery and site variables: Assessing the effect of leaf trait expression and upscaling approach on prediction accuracy
* Deterministic Point Cloud Registration via Novel Transformation Decomposition
* DETReg: Unsupervised Pretraining with Region Priors for Object Detection
* Developing a New Parameterization Scheme of Temperature Lapse Rate for the Hydrological Simulation in a Glacierized Basin Based on Remote Sensing
* Developing a sub-meter phenological spectral feature for mapping poplars and willows in urban environment
* Development of Collision Avoidance System in Slippery Road Conditions
* DEVIL is in the Details: A Diagnostic Evaluation Benchmark for Video Inpainting, The
* Devil Is in the Details: Window-Based Attention for Image Compression, The
* Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation, The
* Devil is in the Margin: Margin-based Label Smoothing for Network Calibration, The
* Devil is in the Pose: Ambiguity-free 3D Rotation-invariant Learning via Pose-aware Convolution, The
* DevNet: Deviation Aware Network for Lane Detection
* DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis
* DGECN: A Depth-Guided Edge Convolutional Network for End-to-End 6D Pose Estimation
* Diagnose Like a Radiologist: Hybrid Neuro-Probabilistic Reasoning for Attribute-Based Medical Image Diagnosis
* Differentiable Dynamics for Articulated 3d Human Motion Reconstruction
* Differentiable Stereopsis: Meshes from multiple views using differentiable rendering
* Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift, A
* Differentially Private Federated Learning with Local Regularization and Sparsification
* DiffPoseNet: Direct Differentiable Camera Pose Estimation
* Diffusion Autoencoders: Toward a Meaningful and Decodable Representation
* Diffusion Kernel Attention Network for Brain Disorder Classification
* Diffusion Model with Detail Complement for Super-Resolution of Remote Sensing
* Diffusion Particle Filtering on the Special Orthogonal Group Using Lie Algebra Statistics
* DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation
* DIFNet: Boosting Visual Information Flow for Image Captioning
* DiGS: Divergence guided shape implicit neural representation for unoriented point clouds
* DiLiGenT102: A Photometric Stereo Benchmark Dataset with Controlled Shape and Material Variation
* Dimension Embeddings for Monocular 3D Object Detection
* DINE: Domain Adaptation from Single and Multiple Black-box Predictors
* DIP: Deep Inverse Patchmatch for High-Resolution Optical Flow
* DiRA: Discriminative, Restorative, and Adversarial Learning for Self-supervised Medical Image Analysis
* DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition
* Direct Voxel Grid Optimization: Super-fast Convergence for Radiance Fields Reconstruction
* Directional Self-supervised Learning for Heavy Image Augmentations
* DisARM: Displacement Aware Relation Module for 3D Detection
* Discovering Objects that Can Move
* Discrete Cosine Transform Network for Guided Depth Map Super-Resolution
* Discrete time convolution for fast event-based stereo
* Discretization-Free Particle-Based Taxi Dispatch Methods With Network Flow Decomposition
* Discriminability-enforcing loss to improve representation learning
* Discriminative training of spiking neural networks organised in columns for stream-based biometric authentication
* Disentangled Loss for Low-Bit Quantization-Aware Training
* Disentangled3D: Learning a 3D Generative Model with Disentangled Geometry and Appearance from Monocular Images
* Disentangling Normal Aging From Severity of Disease via Weak Supervision on Longitudinal MRI
* Disentangling the correlated continuous and discrete generative factors of data
* Disentangling visual and written concepts in CLIP
* Disentangling Visual Embeddings for Attributes and Objects
* Disparity-Based Multiscale Fusion Network for Transportation Detection
* DiSparse: Disentangled Sparsification for Multitask Model Compression
* DisRFC: a dissimilarity-based Random Forest Clustering approach
* Dist-PU: Positive-Unlabeled Learning from a Label Distribution Perspective
* Distillation Using Oracle Queries for Transformer-based Human-Object Interaction Detection
* Distilling Knowledge by Mimicking Features
* Distinguishing Unseen from Seen for Generalized Zero-shot Learning
* Distributed Adaptive Consensus Protocol for Connected Vehicle Platoon With Heterogeneous Time-Varying Delays and Switching Topologies
* Distributed Cooperative Surrounding Control for Mobile Robots With Uncertainties and Aperiodic Sampling
* Distributed Gradient Approach for System Optimal Dynamic Traffic Assignment, A
* Distributed Model Predictive Control for Vehicle Platoon With Mixed Disturbances and Model Uncertainties
* Distribution Consistent Neural Architecture Search
* Distribution-Aware Single-Stage Models for Multi-Person 3D Pose Estimation
* Ditto: Building Digital Twins of Articulated Objects from Interaction
* DIVeR: Real-time and Accurate Neural Radiance Fields with Deterministic Integration for Volume Rendering
* Divergence-Agnostic Unsupervised Domain Adaptation by Adversarial Attacks
* Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG Background Creation
* Diversity Matters: Fully Exploiting Depth Clues for Reliable Monocular 3D Object Detection
* Divide and Conquer: Compositional Experts for Generalized Novel Class Discovery
* DLFormer: Discrete Latent Transformer for Video Inpainting
* DLI-Net: Dual Local Interaction Network for Fine-Grained Sketch-Based Image Retrieval
* DMA-Net: DeepLab With Multi-Scale Attention for Pavement Crack Segmentation
* DMA-Net: Dual multi-instance attention network for X-ray image classification
* DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
* DNAS:A Decoupled Global Neural Architecture Search Method
* Do Explanations Explain? Model Knows Best
* Do learned representations respect causal relationships?
* Do What You Can, With What You Have: Scale-aware and High Quality Monocular Depth Estimation Without Real World Labels
* DO-GAN: A Double Oracle Framework for Generative Adversarial Networks
* Does Federated Dropout actually work?
* Does Interference Exist When Training a Once-For-All Network?
* Does Robustness on ImageNet Transfer to Downstream Tasks?
* Does text attract attention on e-commerce images: A novel saliency prediction dataset and method
* Domain Adaptable Normalization for Semi-Supervised Action Recognition in the Dark
* Domain Adaptation on Point Clouds via Geometry-Aware Implicits
* Domain Adversarial Disentanglement Network With Cross-Domain Synthesis for Generalized Face Anti-Spoofing
* Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing
* Domain-Agnostic Prior for Transfer Semantic Segmentation
* Doodle It Yourself: Class Incremental Learning by Drawing a Few Sketches
* DooDLeNet: Double DeepLab Enhanced Feature Fusion for Thermal-color Semantic Segmentation
* Doppelgänger Saliency: Towards More Ethical Person Re-Identification
* DoubleField: Bridging the Neural Surface and Radiance Fields for High-fidelity Human Reconstruction and Rendering
* Doubling down: sparse grounding with an additional, almost-matching caption for detection-oriented multimodal pretraining
* DPGEN: Differentially Private Generative Energy-Guided Network for Natural Image Synthesis
* DPICT: Deep Progressive Image Compression Using Trit-Planes
* DPODv2: Dense Correspondence-Based 6 DoF Pose Estimation
* DR.VIC: Decomposition and Reasoning for Video Individual Counting
* DRCR Net: Dense Residual Channel Re-calibration Network with Non-local Purification for Spectral Super Resolution
* Dreaming to Prune Image Deraining Networks
* Dress Code: High-Resolution Multi-category Virtual Try-On
* Dressing in the Wild by Watching Dance Videos
* DRHDR: A Dual branch Residual Network for Multi-Bracket High Dynamic Range Imaging
* Driver Distraction Detection Based on the True Driver's Focus of Attention
* Driver Distraction Detection Using Bidirectional Long Short-Term Network Based on Multiscale Entropy of EEG
* Driver Identification Through Heterogeneity Modeling in Car-Following Sequences
* Drivers of Groundwater Change in China and Future Projections
* Driving Behavior Identification and Real-World Fuel Consumption Estimation With Crowdsensing Data
* Drone-Based RGB-Infrared Cross-Modality Vehicle Detection Via Uncertainty-Aware Learning
* Drop the GAN: In Defense of Patches Nearest Neighbors as Single Image Generative Models
* DRT: A Lightweight Single Image Deraining Recursive Transformer
* DSNUNet: An Improved Forest Change Detection Network by Combining Sentinel-1 and Sentinel-2 Images
* DSRC-Enabled Train Safety Communication System at Unmanned Crossings
* DST: Dynamic Substitute Training for Data-free Black-box Attack
* DTA: Physical Camouflage Attacks using Differentiable Transformation Network
* DTFD-MIL: Double-Tier Feature Distillation Multiple Instance Learning for Histopathology Whole Slide Image Classification
* Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution
* Dual adversarial model: Exploring low-dimensional space features for point clouds generating and completing
* Dual attention interactive fine-grained classification network based on data augmentation
* Dual Cross-Attention Learning for Fine-Grained Visual Categorization and Object Re-Identification
* Dual Heterogeneous Complementary Networks for Single Image Deraining
* dual quantum image feature extraction method: PSQIFE, A
* Dual Task Learning by Leveraging Both Dense Correspondence and Mis-Correspondence for Robust Change Detection With Imperfect Matches
* Dual Temperature Helps Contrastive Learning Without Many Negative Samples: Towards Understanding and Simplifying MoCo
* Dual Weighting Label Assignment Scheme for Object Detection, A
* Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition
* Dual-Branch Collaborative Transformer for Virtual Try-On
* Dual-Domain Image Synthesis using Segmentation-Guided GAN
* Dual-Encoder-Condensed Convolution Method for High-Precision Indoor Positioning, A
* Dual-Generator Face Reenactment
* Dual-Key Multimodal Backdoors for Visual Question Answering
* Dual-path Image Inpainting with Auxiliary GAN Inversion
* Dual-Scale Single Image Dehazing via Neural Augmentation
* Dual-Shutter Optical Vibration Sensing
* Dynamic 3D Gaze from Afar: Deep Gaze Estimation from Temporal Eye-Head-Body Coordination
* Dynamic Convolution Self-Attention Network for Land-Cover Classification in VHR Remote-Sensing Images
* Dynamic Crowd Accident-Risk Assessment Based on Internal Energy and Information Entropy for Large-Scale Crowd Flow Considering COVID-19 Epidemic
* Dynamic dense CRF inference for video segmentation and semantic SLAM
* Dynamic Dual-Output Diffusion Models
* Dynamic Imaging Using Deep Bi-Linear Unsupervised Representation (DEBLUR)
* Dynamic Kernel Selection for Improved Generalization and Memory Efficiency in Meta-learning
* Dynamic MLP for Fine-Grained Image Classification by Leveraging Geographical and Temporal Information
* Dynamic Neural Networks: A Survey
* Dynamic Order Dispatching With Multiobjective Reward Learning
* Dynamic Prototype Convolution Network for Few-Shot Semantic Segmentation
* Dynamic Scene Graph Generation via Anticipatory Pre-training
* Dynamic Sparse R-CNN
* Dynamical Deep Generative Latent Modeling of 3D Skeletal Motion
* DynamicEarthNet: Daily Multi-Spectral Satellite Dataset for Semantic Change Segmentation
* Dynamics and Processes in Operations Control Centers in Urban Public Transport: Potentials for Improvement
* DyRep: Bootstrapping Training with Dynamic Re-parameterization
* DyTox: Transformers for Continual Learning with DYnamic TOken eXpansion
* E-CIR: Event-Enhanced Continuous Intensity Recovery
* E2(GO)MOTION: Motion Augmented Event Stream for Egocentric Action Recognition
* E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance Segmentation
* EAGAN: Event-based attention generative adversarial networks for optical flow and depth estimation
* EASE: Unsupervised Discriminant Subspace Learning for Transductive Few-Shot Learning
* ECCNet: Efficient chained centre network for real-time multi-category vehicle tracking and vehicle speed estimation
* Echocardiography Segmentation With Enforced Temporal Consistency
* Ecological Assessment of Terminal Lake Basins in Central Asia under Changing Landscape Patterns
* Eddy Induced Cross-Shelf Exchanges in the Black Sea
* Edge-Aware Extended Star-Tetrix Transforms for CFA-Sampled Raw Camera Image Compression
* Edge-Aware Graph Matching Network for Part-Based Semantic Segmentation
* Edge-aware motion based facial micro-expression generation with attention mechanism
* Edge-Based Video Compression Texture Synthesis Using Generative Adversarial Network
* Edge-enhanced Feature Distillation Network for Efficient Super-Resolution
* EdgeNets: Edge Varying Graph Neural Networks
* Editorial for the special issue on deep learning for precise and efficient object detection
* EDTER: Edge Detection with Transformer
* Effect of Controlled Tile Drainage on Growth and Grain Yield of Spring Barley as Detected by UAV Images, Yield Map and Soil Moisture Content, The
* Effect of Improving Annotation Quality on Object Detection Datasets: A Preliminary Study, The
* Effect of Radio Frequency Interference-Contaminated AMSR2 Signal Restoration on Soil Moisture Retrieval
* Effect of Vegetation Carryover and Climate Variability on the Seasonal Growth of Vegetation in the Upper and Middle Reaches of the Yellow River Basin
* Effective conditioned and composed image retrieval combining CLIP-based features
* Effective Framework of Multi-Class Product Counting and Recognition for Automated Retail Checkout, An
* Effective Intrusion Detection and Prevention for the Commercial Vehicle SAE J1939 CAN Bus
* effective LRTC model integrated with total a-order variation and boundary adjustment for multichannel visual data inpainting, An
* Effective Temporal Localization Method with Multi-View 3D Action Recognition for Untrimmed Naturalistic Driving Videos, An
* Effects of Anthropogenic Pressure on Rivers: A Case Study in the Metropolitan City of Reggio Calabria, The
* Effects of Rainfall on Over-the-Horizon Propagation in the Evaporation Duct over the South China Sea, The
* Efficient 6D object pose estimation based on attentive multi-scale contextual information
* Efficient Classification of Very Large Images with Tiny Objects
* Efficient Conditional Pre-training for Transfer Learning
* Efficient Deep Embedded Subspace Clustering
* Efficient Deterministic Search With Robust Loss Functions for Geometric Model Fitting
* Efficient Domain-Incremental Learning Approach to Drive in All Weather Conditions, An
* Efficient Framework of Reference Picture Resampling (RPR) for Video Coding, An
* Efficient Generative Adversarial Networks for Imbalanced Traffic Collision Datasets
* Efficient Geometry-aware 3D Generative Adversarial Networks
* Efficient Hybrid Model for Low-light Image Enhancement in Mobile Devices, An
* Efficient Image Super-Resolution with Collapsible Linear Blocks
* Efficient Information-Reinforced Lidar Deep Completion Network without RGB Guided, An
* Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation
* Efficient Large-scale Localization by Global Instance Recognition
* Efficient Maximal Coding Rate Reduction by Variational Forms
* Efficient Multi-Purpose Cross-Attention Based Image Alignment Block for Edge Devices
* Efficient Multi-view Stereo by Iterative Dynamic Cost Volume
* Efficient Multimodal Aggregation Network for Video-Text Retrieval, An
* Efficient Progressive High Dynamic Range Image Restoration via Attention and Alignment Network
* Efficient Remote Photoplethysmography with Temporal Derivative Modules and Time-Shift Invariant Loss
* Efficient tracking of team sport players with few game-specific annotations
* Efficient Training Approach for Very Large Scale Face Recognition, An
* Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer
* Efficient Two-stage Model Retraining for Machine Unlearning
* Efficient Video Grounding With Which-Where Reading Comprehension
* Efficient Video Instance Segmentation via Tracklet Query and Proposal
* EfficientNeRF: Efficient Neural Radiance Fields
* Ego4D: Around the World in 3,000 Hours of Egocentric Video
* Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization
* Egocentric Indoor Localization from Coplanar Two-Line Room Layouts
* Egocentric Prediction of Action Target in 3D
* Egocentric Scene Understanding via Multimodal Spatial Rectifier
* EI-CLIP: Entity-aware Interventional Contrastive Learning for E-commerce Cross-modal Retrieval
* Eigencontours: Novel Contour Descriptors Based on Low-Rank Approximation
* Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes
* ElasticFace: Elastic Margin Loss for Deep Face Recognition
* Electric Vehicle Trip Chain Information-Based Hierarchical Stochastic Energy Management With Multiple Uncertainties
* ElePose: Unsupervised 3D Human Pose Estimation by Predicting Camera Elevation and Learning Normalizing Flows on 2D Poses
* ELIC: Efficient Learned Image Compression with Unevenly Grouped Space-Channel Contextual Adaptive Coding
* ELSR: Efficient Line Segment Reconstruction with Planes and Points Guidance
* Embedding Arithmetic of Multimodal Queries for Image Retrieval
* Embracing Single Stride 3D Object Detector with Sparse Transformer
* Emerging Trends of Multi-Label Learning, The
* EMOCA: Emotion Driven Monocular Face Capture and Animation
* Emphasizing Complementary Samples for Non-literal Cross-modal Retrieval
* Empirical study of Data-Free Quantization's Tuning Robustness, An
* Empirical Study of End-to-End Temporal Action Detection, An
* Empirical Study of Training End-to-End Vision-and-Language Transformers, An
* EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching
* En-Compactness: Self-Distillation Embedding & Contrastive Generation for Generalized Zero-Shot Learning
* Enabling Equivariance for Arbitrary Lie Groups
* End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection
* End-to-End Curriculum Learning Approach for Autonomous Driving Scenarios, An
* End-to-end Generative Pretraining for Multimodal Video Captioning
* End-to-End High-Risk Tackle Detection System for Rugby
* End-to-End Human-Gaze-Target Detection with Transformers
* End-to-End Multi-Person Pose Estimation with Transformers
* End-to-End Object Separation for Threat Detection in Large-Scale X-Ray Security Images
* End-to-End Optimized 360° Image Compression
* End-to-End Reconstruction-Classification Learning for Face Forgery Detection
* End-to-End Referring Video Object Segmentation with Multimodal Transformers
* End-to-End Semi-Supervised Learning for Video Action Detection
* End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps
* Energy and Time Optimal Autopilot for Electric Vehicles Performing Ackerman Cornering
* Energy-based Latent Aligner for Incremental Learning
* Energy-driven reference selection for hierarchical light field compression
* Energy-efficient train control considering the traction system efficiency
* Enhanced Dual Filter for Floating Wind Lidar Motion Correction: The Impact of Wind and Initial Scan Phase Models
* enhanced image quality assessment by synergizing superpixels and visual saliency, An
* Enhanced Spatial-Temporal Salience for Cross-View Gait Recognition
* Enhanced Understanding of Groundwater Storage Changes under the Influence of River Basin Governance Using GRACE Data and Downscaling Model
* Enhancing Adversarial Robustness for Deep Metric Learning
* Enhancing Adversarial Training with Second-Order Statistics of Weights
* Enhancing Classifier Conservativeness and Robustness by Polynomiality
* Enhancing Face Recognition with Self-Supervised 3D Reconstruction
* Enriched Robust Multi-View Kernel Subspace Clustering
* Ensemble Approach for Facial Behavior Analysis in-the-wild Video, An
* Ensemble Learning and Slice Fusion Strategy for Three-Dimensional Nuclei Instance Segmentation, An
* Ensembling Off-the-shelf Models for GAN Training
* Entropy-based Active Learning for Object Detection with Progressive Diversity Constraint
* Entropy-based Stability-Plasticity for Lifelong Learning
* Envedit: Environment Editing for Vision-and-Language Navigation
* Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrained Devices
* Episodic Memory Question Answering
* Epistemic Uncertainty-Weighted Loss for Visual Bias Mitigation
* EPLL image restoration with a bounded asymmetrical Student's-t mixture model
* EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation
* Equalized Focal Loss for Dense Long-Tailed Object Detection
* Equivariance Allows Handling Multiple Nuisance Variables When Analyzing Pooled Neuroimaging Datasets
* Equivariant Point Cloud Analysis via Learning Orientations for Message Passing
* ES6D: A Computation Efficient and Symmetry-Aware 6D Pose Regression Framework
* Escaping Data Scarcity for High-Resolution Heterogeneous Face Hallucination
* ESCNet: Gaze Target Detection with the Understanding of 3D Scenes
* Estimates of Power Shortages and Affected Populations during the Initial Period of the Ukrainian-Russian Conflict
* Estimating Crop Seed Composition Using Machine Learning from Multisensory UAV Data
* Estimating Egocentric 3D Human Pose in the Wild with External Weak Supervision
* Estimating Example Difficulty using Variance of Gradients
* Estimating Fine-Grained Noise Model via Contrastive Learning
* Estimating Multiple Emotion Descriptors by Separating Description and Inference
* Estimating Regional Snow Line Elevation Using Public Webcam Images
* Estimating Structural Disparities for Face Models
* Estimating the Legacy Effect of Post-Cutting Shelterbelt on Crop Yield Using Google Earth and Sentinel-2 Data
* Estimation of Chlorophyll-A Concentration with Remotely Sensed Data for the Nine Plateau Lakes in Yunnan Province
* Estimation of Multiple Illuminant Colors Using Color Line Features
* Estimation of Vegetation Leaf-Area-Index Dynamics from Multiple Satellite Products through Deep-Learning Method
* ESTNet: Embedded Spatial-Temporal Network for Modeling Traffic Flow Dynamics
* ETHSeg: An Amodel Instance Segmentation Network and a Real-world Dataset for X-Ray Waste Inspection
* ETS-3D: An Efficient Two-Stage Framework for Stereo 3D Object Detection
* Ev-TTA: Test-Time Adaptation for Event-Based Object Recognition
* Evading the Simplicity Bias: Training a Diverse Set of Models Discovers Solutions with Superior OOD Generalization
* Evaluating Anthropogenic CO2 Bottom-Up Emission Inventories Using Satellite Observations from GOSAT and OCO-2
* Evaluating Long-Term Variability of the Arctic Stratospheric Polar Vortex Simulated by CMIP6 Models
* Evaluating the Effects of Climate Change and Human Activities on the Seasonal Trends and Spatial Heterogeneity of Soil Moisture
* Evaluating the Stability of Deep Image Quality Assessment with Respect to Image Scaling
* Evaluating the Understandability of Light Patterns and Pictograms for Autonomous Vehicle-to-Pedestrian Communication Functions
* Evaluation of IMERG Precipitation Products in the Southeast Costal Urban Region of China
* Evaluation of Real-time Precise Point Positioning with Ambiguity Resolution Based on Multi-GNSS OSB Products from CNES
* Evaluation of the Influence of Field Conditions on Aerial Multispectral Images and Vegetation Indices
* Evaluation of Wind and Solar Insolation Influence on Ocean Near-Surface Temperature from In Situ Observations and the Geostationary Himawari-8 Satellite
* Evaluation-oriented Knowledge Distillation for Deep Face Recognition
* Event Transformer. A sparse-aware solution for efficient event data processing
* Event-aided Direct Sparse Odometry
* Event-based Video Reconstruction via Potential-assisted Spiking Neural Network
* Everything at Once - Multi-modal Fusion Transformer for Video Retrieval
* EvUnroll: Neuromorphic Events based Rolling Shutter Image Correction
* Ex-Model: Continual Learning from a Stream of Trained Models
* Exact Feature Distribution Matching for Arbitrary Style Transfer and Domain Generalization
* Examination of Bias of Facial Analysis based BMI Prediction Models, An
* Exemplar-based Pattern Synthesis with Implicit Periodic Field Network
* Expanding Large Pre-trained Unimodal Models with Multimodal Information Injection for Image-Text Multimodal Classification
* Expanding Low-Density Latent Regions for Open-Set Object Detection
* Experimental Study of Accuracy of High-Rate GNSS in Context of Structural Health Monitoring
* Experiments on Deep Single-Image Portrait Relighting
* Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention
* Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation
* Exploiting Distortion Information for Multi-degraded Image Restoration
* Exploiting Explainable Metrics for Augmented SGD
* Exploiting Pseudo Labels in a Self-Supervised Learning Framework for Improved Monocular Depth Estimation
* Exploiting Rigidity Constraints for LiDAR Scene Flow Estimation
* Exploiting Temporal Relations on Radar Perception for Autonomous Driving
* Exploration of the spatiotemporal heterogeneity of metro ridership prompted by built environment: A multi-source fusion perspective
* Exploratory Adversarial Attacks on Graph Neural Networks for Semi-Supervised Node Classification
* Explore Spatio-Temporal Aggregation for Insubstantial Object Detection: Benchmark Dataset and Baseline
* Exploring and Evaluating Image Restoration Potential in Dynamic Scenes
* Exploring Denoised Cross-video Contrast for Weakly-supervised Temporal Action Localization
* Exploring Domain-Invariant Parameters for Source Free Domain Adaptation
* Exploring Dual-task Correlation for Pose Guided Person Image Generation
* Exploring Effective Data for Surrogate Training Towards Black-box Attack
* Exploring Endogenous Shift for Cross-domain Detection: A Large-scale Benchmark and Perturbation Suppression Network
* Exploring Ephemeral Features with Ground-Penetrating Radar: An Approach to Roman Military Camps
* Exploring Frequency Adversarial Attacks for Face Forgery Detection
* Exploring Geometric Consistency for Monocular 3D Object Detection
* Exploring Motion Information for Distractor Suppression in Visual Tracking
* Exploring Patch-wise Semantic Relation for Contrastive Learning in Image-to-Image Translation Tasks
* Exploring Robustness Connection between Artificial and Natural Adversarial Examples
* Exploring Set Similarity for Dense Self-supervised Representation Learning
* Exploring Spatial Correlation for Light Field Saliency Detection: Expansion From a Single View
* Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection
* Exploring the Equivalence of Siamese Self-Supervised Learning via A Unified Gradient Framework
* Exposure Correction Model to Enhance Image Quality
* Exposure Normalization and Compensation for Multiple-Exposure Correction
* Exposure Trajectory Recovery From Motion Blur
* Expressive Talking Head Generation with Granular Audio-Visual Control
* Extending Momentum Contrast With Cross Similarity Consistency Regularization
* External Attention Based TransUNet and Label Expansion Strategy for Crack Detection
* Extracting Triangular 3D Models, Materials, and Lighting From Images
* EyePAD++: A Distillation-based approach for joint Eye Authentication and Presentation Attack Detection using Periocular Images
* Face morphing attacks and face image quality: The effect of morphing and the unsupervised attack detection by quality
* Face Relighting with Geometrically Consistent Shadows
* Face2Exp: Combating Data Biases for Facial Expression Recognition
* FaceFormer: Speech-Driven 3D Facial Animation with Transformers
* FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset
* Facial Chirality: From Visual Self-Reflection to Robust Facial Feature Learning
* Facial Expression Classification using Fusion of Deep Neural Network in Video
* Factors Controlling a Synthetic Aperture Radar (SAR) Derived Root-Zone Soil Moisture Product over The Seward Peninsula of Alaska
* Factors Influencing Seasonal Changes in Inundation of the Daliyaboyi Oasis, Lower Keriya River Valley, Central Tarim Basin, China
* Failure Modes of Domain Generalization Algorithms
* Fair Contrastive Learning for Facial Attribute Classification
* Fairness-aware Adversarial Perturbation Towards Bias Mitigation for Deployed Deep Models
* Faithful Extreme Rescaling via Generative Prior Reciprocated Invertible Representations
* Fake Face Images Detection and Identification of Celebrities Based on Semantic Segmentation
* FAM: Visual Explanations for the Feature Representations from Deep Convolutional Networks
* FashionVLP: Vision Language Transformer for Fashion Retrieval with Feedback
* Fast Algorithm for Low-rank Tensor Completion in Delay-embedded Space
* Fast and Memory-Efficient Network Towards Efficient Image Super-Resolution
* Fast and Unsupervised Action Boundary Detection for Action Segmentation
* Fast bilateral complementary network for deep learning compressed sensing image reconstruction
* Fast building segmentation from satellite imagery and few local labels
* Fast Light-Weight Near-Field Photometric Stereo
* Fast Point Transformer
* Fast Registration Method for Optical and SAR Images Based on SRAWG Feature Description, A
* Fast robust fuzzy clustering based on bipartite graph for hyper-spectral image classification
* Fast, Accurate and Memory-Efficient Partial Permutation Synchronization
* Fast-n-Squeeze: towards real-time spectral reconstruction from RGB images
* FastDOG: Fast Discrete Optimization on GPU
* Faster, Lighter, Robuster: A Weakly-Supervised Crowd Analysis Enhancement Network and A Generic Feature Extraction Framework
* FCHP: Exploring the Discriminative Feature and Feature Correlation of Feature Maps for Hierarchical DNN Pruning and Compression
* Feasibility of Bi-Temporal Airborne Laser Scanning Data in Detecting Species-Specific Individual Tree Crown Growth of Boreal Forests
* Feature Erasing and Diffusion Network for Occluded Person Re-Identification
* Feature hallucination in hypersphere space for few-shot classification
* Feature Query Networks: Neural Surface Description for Camera Pose Refinement
* Feature Statistics Mixing Regularization for Generative Adversarial Networks
* Features of the Extreme Fire Season of 2021 in Yakutia (Eastern Siberia) and Heavy Air Pollution Caused by Biomass Burning
* FedCor: Correlation-Based Active Client Selection Strategy for Heterogeneous Federated Learning
* FedCorr: Multi-Stage Federated Learning for Label Noise Correction
* FedDC: Federated Learning with Non-IID Data via Local Drift Decoupling and Correction
* Federated Class-Incremental Learning
* Federated Learning with Position-Aware Neurons
* Federated Learning-based Driver Activity Recognition for Edge Devices
* Federated Remote Physiological Measurement with Imperfect Data
* FenceNet: Fine-grained Footwork Recognition in Fencing
* FENeRF: Face Editing in Neural Radiance Fields
* FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos
* Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection
* Few Shot Generative Model Adaption via Relaxed Spatial Structural Alignment
* Few-shot Backdoor Defense Using Shapley Estimation
* Few-Shot Class Incremental Learning Leveraging Self-Supervised Features
* Few-Shot Font Generation by Learning Fine-Grained Local Styles
* Few-Shot Head Swapping in the Wild
* Few-Shot Image Classification Along Sparse Graphs
* Few-Shot Image Classification Benchmarks are Too Far From Reality: Build Back Better with Semantic Task Sampling
* Few-Shot Incremental Learning for Label-to-Image Translation
* Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species
* Few-shot Learning with Noisy Labels
* Few-shot learning with unsupervised part discovery and part-aligned similarity
* Few-Shot Object Detection with Fully Cross-Transformer
* Few-Shot Supervised Prototype Alignment for Pedestrian Detection on Fisheye Images
* FIBA: Frequency-Injection based Backdoor Attack in Medical Image Analysis
* Field Reflectance Measurements at Night of Beach and Desert Sands within a Particulate BRDF Model
* Field-Data-Aided Comparison of Three 10 m Land Cover Products in Southeast Asia, A
* FIFO: Learning Fog-invariant Features for Foggy Scene Segmentation
* Finding Badly Drawn Bunnies
* Finding Fallen Objects Via Asynchronous Audio-Visual Integration
* Finding Good Configurations of Planar Primitives in Unorganized Point Clouds
* Fine-Grained Object Classification via Self-Supervised Pose Alignment
* Fine-Grained Predicates Learning for Scene Graph Generation
* Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization
* Fine-tuning Global Model via Data-Free Knowledge Distillation for Non-IID Federated Learning
* Fine-tuning Image Transformers using Learnable Memory
* FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment
* Fingerprinting Deep Neural Networks Globally via Universal Adversarial Perturbations
* Finite Aperture Stereo
* Finite-Time Tracking Control of Autonomous Underwater Vehicle Without Velocity Measurements
* Fire Together Wire Together: A Dynamic Pruning Approach with Self-Supervised Mask Prediction
* Fisher Information Guidance for Learned Time-of-Flight Imaging
* FisherMatch: Semi-Supervised Rotation Regression via Entropy-based Filtering
* Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction
* Flag Median and FlagIRLS, The
* FLAG: Flow-based 3D Avatar Generation from Sparse Observations
* FLAVA: A Foundational Language And Vision Alignment Model
* FlexIT: Towards Flexible Semantic Image Translation
* FLNet: A Near-shore Ship Detection Method Based on Image Enhancement Technology
* FLOAT: Factorized Learning of Object Attributes for Improved Multi-object Multi-part Scene Parsing
* Flood Hazard Analysis Based on Rainfall Fusion: A Case Study in Dazhou City, China
* Flood Vulnerability Assessment and Mapping: A Case Study for Australia's Hawkesbury-Nepean Catchment
* FMCNet: Feature-Level Modality Compensation for Visible-Infrared Person Re-Identification
* Focal and Global Knowledge Distillation for Detectors
* Focal Length and Object Pose Estimation via Render and Compare
* Focal Sparse Convolutional Networks for 3D Object Detection
* FocalClick: Towards Practical Interactive Image Segmentation
* FocusCut: Diving into a Focus View in Interactive Segmentation
* Focused Feature Differentiation Network for Image Quality Assessment
* FoggyStereo: Stereo Matching with Fog Volume Representation
* Forecasting Characteristic 3D Poses of Human Actions
* Forecasting from LiDAR via Future Object Detection
* Foreign Body Detection in Rail Transit Based on a Multi-Mode Feature-Enhanced Convolutional Neural Network
* Forest Carbon Flux Simulation Using Multi-Source Data and Incorporation of Remotely Sensed Model with Process-Based Model
* formal approach to good practices in Pseudo-Labeling for Unsupervised Domain Adaptive Re-Identification, A
* Forward Compatible Few-Shot Class-Incremental Learning
* Forward Compatible Training for Large-Scale Embedding Retrieval Systems
* Forward Propagation, Backward Regression, and Pose Association for Hand Tracking in the Wild
* Fourier Document Restoration for Robust Document Dewarping and Recognition
* Fourier Image Transformer
* Fourier PlenOctrees for Dynamic Radiance Field Rendering in Real-time
* Frame Averaging for Equivariant Shape Space Learning
* Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning
* Framework for Learning Ante-hoc Explainable Models via Concepts, A
* Framework Integrating DeeplabV3+, Transfer Learning, Active Learning, and Incremental Learning for Mapping Building Footprints, A
* FreeSOLO: Learning to Segment Objects without Annotations
* Frequency aware face hallucination generative adversarial network with semantic structural constraint
* Frequency-driven Imperceptible Adversarial Attack on Semantic Similarity
* From Distortion Manifold to Perceptual Quality: a Data Efficient Blind Image Quality Assessment Approach
* From Less to More: Progressive Generalized Zero-Shot Detection With Curriculum Learning
* From Less to More: Spectral Splitting and Aggregation Network for Hyperspectral Face Super-Resolution
* From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-Answering
* FS-NCSR: Increasing Diversity of the Super-Resolution Space via Frequency Separation and Noise-Conditioned Normalizing Flow
* FS6D: Few-Shot 6D Pose Estimation of Novel Objects
* full data augmentation pipeline for small object detection based on generative adversarial networks, A
* Full-Range Virtual Try-On with Recurrent Tri-Level Transform
* Fullest COLREGs Evaluation Using Fuzzy Logic for Collaborative Decision-Making Analysis of Autonomous Ships in Complex Situations
* Fully Distributed Model Predictive Control of Connected Automated Vehicles in Intersections: Theory and Vehicle Experiments
* Fully-Automated Spike Detection and Dipole Analysis of Epileptic MEG Using Deep Learning
* Functional Brain Network Classification Based on Deep Graph Hashing Learning
* Functional Parcellation of Human Brain Using Localized Topo-Connectivity Mapping
* Future Frame Prediction Network for Video Anomaly Detection
* Future Transformer for Long-term Action Anticipation
* Fuzzy Adaptive Protective Control for High-Speed Trains: An Outstretched Error Feedback Approach
* Fuzzy Logic Modeling of Land Degradation in a Loess Plateau Watershed, China
* Fuzzy Logic Strategy for Priority Control of Electric Vehicle Charging
* FvOR: Robust Joint Shape and Pose Optimization for Few-view Object Reconstruction
* FW-GAN: Underwater image enhancement using generative adversarial network with multi-scale fusion
* FWD: Real-time Novel View Synthesis with Forward Warping and Depth
* GAF-NAU: Gramian Angular Field encoded Neighborhood Attention U-Net for Pixel-Wise Hyperspectral Image Classification
* GAFL: Global adaptive filtering layer for computer vision
* Gain With No Pain: Exploring Intelligent Traffic Signal Control for Emergency Vehicles, A
* Gait Recognition in the Wild with Dense 3D Representations and A Benchmark
* Game-theoretic hypergraph matching with density enhancement
* Gamma-enhanced Spatial Attention Network for Efficient High Dynamic Range Imaging
* GAN-Supervised Dense Visual Alignment
* GANORCON: Are Generative Models Useful for Few-shot Segmentation?
* GANSeg: Learning to Segment by Unsupervised Hierarchical Image Generation
* Gap-Filling and Missing Information Recovery for Time Series of MODIS Data Using Deep Learning-Based Methods
* GASP, a generalized framework for agglomerative clustering of signed graphs and its application to Instance Segmentation
* GAT-CADNet: Graph Attention Network for Panoptic Symbol Spotting in CAD Drawings
* GaTector: A Unified Framework for Gaze Object Prediction
* Gated fusion network for SAO filter and inter frame prediction in Versatile Video Coding
* Gated Recurrent Unit-Based RNN for Remote Photoplethysmography Signal Segmentation
* Gated2Gated: Self-Supervised Depth Estimation from Gated Images
* GateHUB: Gated History Unit with Background Suppression for Online Action Detection
* Gaussian correction for adversarial learning of boundaries
* Gaussian Process Modeling of Approximate Inference Errors for Variational Autoencoders
* GazeOnce: Real-Time Multi-Person Gaze Estimation
* GCA-Net: Utilizing Gated Context Attention for Improving Image Forgery Localization and Detection
* GCFSR: a Generative and Controllable Face Super Resolution Method Without Facial and GAN Priors
* GCM: Efficient video recognition with glance and combine module
* GCN-based fast CU partition method of intra-mode VVC, A
* GCP: Graph Encoder With Content-Planning for Sentence Generation From Knowledge Bases
* GCR: Gradient Coreset based Replay Buffer Selection for Continual Learning
* gDNA: Towards Generative Detailed Neural Avatars
* GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection
* Gender and ethnicity recognition based on visual attention-driven deep architectures
* GenDR: A Generalized Differentiable Renderer
* General Facial Representation Learning in a Visual-Linguistic Manner
* General Framework for Decentralized Safe Optimal Control of Connected and Automated Vehicles in Multi-Lane Signal-Free Intersections, A
* General Incremental Learning with Domain-aware Categorical Representations
* General Self-Supervised Framework for Remote Sensing Image Classification, A
* Generalizable Cross-modality Medical Image Segmentation via Style Augmentation and Dual Normalization
* Generalizable Human Pose Triangulation
* Generalized Binary Search Network for Highly-Efficient Multi-View Stereo
* Generalized Category Discovery
* Generalized Classification of Satellite Image Time Series with Thermal Positional Encoding
* Generalized Few-shot Semantic Segmentation
* Generalizing Adversarial Explanations with Grad-CAM
* Generalizing Gaze Estimation with Rotation Consistency
* Generalizing Interactive Backpropagating Refinement for Dense Prediction Networks
* Generating 3D Bio-Printable Patches Using Wound Segmentation and Reconstruction to Treat Diabetic Foot Ulcers
* Generating Diverse 3D Reconstructions from a Single Occluded Face Image
* Generating Diverse and Natural 3D Human Motions from Text
* Generating High Fidelity Data from Low-density Regions using Diffusion Models
* Generating Representative Samples for Few-Shot Classification
* Generating Useful Accident-Prone Driving Scenarios via a Learned Traffic Prior
* Generation System and Method
* Generative Cooperative Learning for Unsupervised Video Anomaly Detection
* Generative Flows as a General Purpose Solution for Inverse Problems
* Generative Flows with Invertible Attentions
* Generative Probabilistic Novelty Detection with Isometric Adversarial Autoencoders
* GenISP: Neural ISP for Low-Light Machine Cognition
* GeoEngine: A Platform for Production-Ready Geospatial Research
* Geometric Anchor Correspondence Mining with Uncertainty Modeling for Universal Domain Adaptation
* Geometric and Textural Augmentation for Domain Gap Reduction
* Geometric Partitioning Mode with Inter and Intra Prediction for Beyond Versatile Video Coding
* Geometric Structure Preserving Warp for Natural Image Stitching
* Geometric Transformer for Fast and Robust Point Cloud Registration
* Geometry-Aware Guided Loss for Deep Crack Recognition
* GeoNeRF: Generalizing NeRF with Geometry Priors
* GIFS: Neural Implicit Function for General Shape Representation
* GigaMVS: A Benchmark for Ultra-Large-Scale Gigapixel-Level 3D Reconstruction
* GIQE: Generic Image Quality Enhancement via Nth Order Iterative Degradation
* GIRAFFE HD: A High-Resolution 3D-aware Generative Model
* Give Me Your Attention: Dot-Product Attention Considered Harmful for Adversarial Patch Robustness
* GLaMa: Joint Spatial and Frequency Loss for General Image Inpainting
* GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras
* Glass Segmentation using Intensity and Spectral Polarization Cues
* GLASS: Geometric Latent Augmentation for Shape Spaces
* GlideNet: Global, Local and Intrinsic based Dense Embedding NETwork for Multi-category Attributes Prediction
* Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation
* Global Convergence of MAML and Theory-Inspired Neural Architecture Search for Few-Shot Learning
* Global Matching with Overlapping Attention for Optical Flow Estimation
* Global Sensing and Measurements Reuse for Image Compressed Sensing
* Global Tracking Transformers
* Global Tracking via Ensemble of Local Trackers
* Global-Aware Registration of Less-Overlap RGB-D Scans
* Globetrotter: Connecting Languages by Connecting Images
* GLTC: A Metro Passenger Identification Method Across AFC Data and Sparse WiFi Data
* GMFlow: Learning Optical Flow via Global Matching
* Goal-driven Self-Attentive Recurrent Networks for Trajectory Prediction
* GOAL: Generating 4D Whole-Body Motion for Hand-Object Grasping
* Good, Better, Best: Textual Distractors Generation for Multiple-Choice Visual Question Answering via Reinforcement Learning
* GP22: A Car Styling Dataset for Automotive Designers
* GPCA: A Probabilistic Framework for Gaussian Process Embedded Channel Attention
* GPR Energy Attribute Slices Based on Multivariate Variational Mode Decomposition and Teager-Kaiser Energy Operator
* GPR Image Clutter Suppression Using Gaussian Curvature Decomposition in the PCA Domain
* GPU-Based Homotopy Continuation for Minimal Problems in Computer Vision
* GPV-Pose: Category-level Object Pose Estimation via Geometry-guided Point-wise Voting
* Gradient Boosting and Linear Regression for Estimating Coastal Bathymetry Based on Sentinel-2 Images
* Gradient Matters: Designing Binarized Neural Networks via Enhanced Information-Flow
* Gradient-SDF: A Semi-Implicit Surface Representation for 3D Reconstruction
* GradViT: Gradient Inversion of Vision Transformers
* GraFormer: Graph-oriented Transformer for 3D Pose Estimation
* GraftNet: Towards Domain Generalized Stereo Matching with a Broad-Spectrum and Task-Oriented Feature
* GrainSpace: A Large-scale Dataset for Fine-grained and Domain-adaptive Recognition of Cereal Grains
* GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation
* Graph Convolution RPCA With Adaptive Graph
* Graph Sampling Based Deep Metric Learning for Generalizable Person Re-Identification
* Graph-based Spatial Transformer with Memory Replay for Multi-future Pedestrian Trajectory Prediction
* Graph-Based Spatial-Temporal Convolutional Network for Vehicle Trajectory Prediction in Autonomous Driving
* Graph-context Attention Networks for Size-varied Deep Graph Matching
* Graph-embedded subspace support vector data description
* GraphWalks: Efficient Shape Agnostic Geodesic Shortest Path Estimation
* Grass band detection in soccer images for improved image registration
* Gravitationally Lensed Black Hole Emission Tomography
* Gravity Wave Parameters and Their Seasonal Variations Study near 120° E China Based on Na LIDAR Observations
* GreedyNASv2: Greedier Search with a Greedy Path Filter
* GridShift: A Faster Mode-seeking Algorithm for Image Segmentation and Object Tracking
* GripNet: Graph information propagation on supergraph for heterogeneous graphs
* Ground Penetrating Radar in Coastal Hazard Mitigation Studies Using Deep Convolutional Neural Networks
* Grounded Language-Image Pre-training
* Grounding Answers for Visual Questions Asked by Visually Impaired People
* Group Contextualization for Video Recognition
* Group R-CNN for Weakly Semi-supervised Object Detection with Points
* Group'n Route: An Edge Learning-Based Clustering and Efficient Routing Scheme Leveraging Social Strength for the Internet of Vehicles
* Group-Wise Hub Identification by Learning Common Graph Embeddings on Grassmannian Manifold
* Grouping-based Oversampling in Kernel Space for Imbalanced Data Classification
* GroupNet: Multiscale Hypergraph Neural Networks for Trajectory Prediction with Relational Reasoning
* GroupViT: Semantic Segmentation Emerges from Text Supervision
* Guest Editorial Artificial Intelligence and Deep Learning for Intelligent and Sustainable Traffic and Vehicle Management (VANETs)
* Guided Deep Metric Learning
* Guided Event Filtering: Synergy Between Intensity Images and Neuromorphic Events for High Performance Imaging
* Guided Hyperspectral Image Denoising with Realistic Data
* GuideFormer: Transformers for Image Guided Depth Completion
* Guiding Attention using Partial-Order Relationships for Image Captioning
* H-EMD: A Hierarchical Earth Mover's Distance Method for Instance Segmentation
* H-Net: Unsupervised Attention-based Stereo Depth Estimation Leveraging Epipolar Geometry
* H2FA R-CNN: Holistic and Hierarchical Feature Alignment for Cross-domain Weakly Supervised Object Detection
* H4D: Human 4D Modeling by Learning Neural Compositional Representation
* Habitat Prediction of Northwest Pacific Saury Based on Multi-Source Heterogeneous Remote Sensing Data Fusion
* Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale
* HairCLIP: Design Your Hair by Text and Reference Image
* HairMapper: Removing Hair from Portraits Using GANs
* Hallucinated Neural Radiance Fields in the Wild
* HandOccNet: Occlusion-Robust 3D Hand Mesh Estimation Network
* HapWheel: Bringing In-Car Controls to Driver's Fingertips by Embedding Ubiquitous Haptic Displays into a Steering Wheel
* HARA: A Hierarchical Approach for Robust Rotation Averaging
* Harmony: A Generic Unsupervised Approach for Disentangling Semantic Content from Parameterized Transformations
* HCSC: Hierarchical Contrastive Selective Coding
* HDNet: High-resolution Dual-domain Learning for Spectral Compressive Imaging
* HDR-NeRF: High Dynamic Range Neural Radiance Fields
* HeadNeRF: A Realtime NeRF-based Parametric Head Model
* HEAT: Holistic Edge Attention Transformer for Structured Reconstruction
* Heatmap Regression via Randomized Rounding
* Hephaestus: A large scale multitask dataset towards InSAR understanding
* HerosNet: Hyperspectral Explicable Reconstruction and Optimal Sampling Deep Network for Snapshot Compressive Imaging
* Heterogeneity of Increases in Net Primary Production under Intensified Human Activity and Climate Variability on the Loess Plateau of China
* Heterogeneous Visible Light and Radio Communication for Improving Safety Message Dissemination at Road Intersection
* HEVC's intra mode process expedited using Histogram of Oriented Gradients
* Hidden Markov Model Based Control Augmentation Design for a Class of Human-in-the-Loop Systems
* Hierarchical Bayesian LSTM for Head Trajectory Prediction on Omnidirectional Images
* Hierarchical Modular Network for Video Captioning
* Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction
* Hierarchical Optimal Maneuver Planning and Trajectory Control at On-Ramps With Multiple Mainstream Lanes
* Hierarchical Self-supervised Representation Learning for Movie Understanding
* Hierarchical Superpixel Segmentation for PolSAR Images Based on the Boruvka Algorithm
* High Quality Segmentation for Ultra High-Resolution Images
* High Voltage Driving Chiplet in Standard 0.18-µm CMOS for Micro-Pixelated LED Displays Integrated With LTPS TFTs, A
* High-Fidelity GAN Inversion for Image Attribute Editing
* High-Fidelity Human Avatars from a Single RGB Camera
* High-Frequency Trajectory Map Matching Algorithm Based on Road Network Topology
* High-Precision Stand Age Data Facilitate the Estimation of Rubber Plantation Biomass: A Case Study of Hainan Island, China
* High-resolution Face Swapping via Latent Semantics Disentanglement
* High-Resolution Image Harmonization via Collaborative Dual Transformations
* High-Resolution Image Synthesis with Latent Diffusion Models
* High-Resolution Inversion Method for the Snow Water Equivalent Based on the GF-3 Satellite and Optimized EQeau Model
* High-Resolution UAV Image Generation for Sorghum Panicle Detection
* High-Sensitivity MEMS Shear Probe for Autonomous Profiling Observation of Marine Turbulence
* Higher-Order Explanations of Graph Neural Networks via Relevant Walks
* Highly Efficient Model to Study the Semantics of Salient Object Detection, A
* Highly-efficient Incomplete Largescale Multiview Clustering with Consensus Bipartite Graph
* HiMODE: A Hybrid Monocular Omnidirectional Depth Estimation Model
* HINT: Hierarchical Neuron Concept Explainer
* Hire-MLP: Vision MLP via Hierarchical Rearrangement
* HiVT: Hierarchical Vector Transformer for Multi-Agent Motion Prediction
* HL-Net: Heterophily Learning Network for Scene Graph Generation
* HLRTF: Hierarchical Low-Rank Tensor Factorization for Inverse Problems in Multi-Dimensional Imaging
* HMIway-env: A Framework for Simulating Behaviors and Preferences to Support Human-AI Teaming in Driving
* HODEC: Towards Efficient High-Order DEcomposed Convolutional Neural Networks
* HODOR: High-level Object Descriptors for Object Re-Segmentation in Video Learned from Static Images
* HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction
* Holistic Approach to Measure Sample-level Adversarial Vulnerability and its Utility in Building Trustworthy Systems
* Holocurtains: Programming Light Curtains via Binary Holography
* Homography Loss for Monocular 3D Object Detection
* HOP: History-and-Order Aware Pretraining for Vision-and-Language Navigation
* Hot-started NAS for Task-specific Embedded Applications
* How Do Neural Networks Estimate Optical Flow? A Neuropsychology-Inspired Study
* How Do You Do It? Fine-Grained Action Understanding with Pseudo-Adverbs
* How Good Is Aesthetic Ability of a Fashion Model?
* How many Observations are Enough? Knowledge Distillation for Trajectory Forecasting
* How much does input data type impact final face model accuracy?
* How Much More Data Do I Need? Estimating Requirements for Downstream Tasks
* How to Query an Oracle? Efficient Strategies to Label Data
* How Well Do Sparse ImageNet Models Transfer?
* HP-Capsule: Unsupervised Face Part Discovery by Hierarchical Parsing Capsule Network
* HR-STAN: High-Resolution Spatio-Temporal Attention Network for 3D Human Motion Prediction
* HSC4D: Human-centered 4D Scene Capture in Large-scale Indoor-outdoor Space Using Wearable IMUs and LiDAR
* HSI-Guided Intrinsic Image Decomposition for Outdoor Scenes
* Human Hands as Probes for Interactive Object Understanding
* Human Instance Matting via Mutual Guidance and Multi-Instance Refinement
* Human Mesh Recovery from Multiple Shots
* Human Stools Classification for Gastrointestinal Health based on an Improved ResNet18 Model with Dual Attention Mechanism
* Human Trajectory Prediction with Momentary Observation
* Human-Aware Object Placement for Visual Environment Reconstruction
* Human-Lead-Platooning Cooperative Adaptive Cruise Control
* Human-Object Interaction Detection via Disentangled Transformer
* HumanNeRF: Efficiently Generated Human Radiance Field from Sparse Inputs
* HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular Video
* HVH: Learning a Hybrid Neural Volumetric Representation for Dynamic Hair Performance Capture
* Hybrid biometric template protection: Resolving the agony of choice between bloom filters and homomorphic encryption
* Hybrid Classification of Imbalanced Hyperspectral Images Using ADASYN and Enhanced Deep Subsampled Multi-Grained Cascaded Forest, A
* Hybrid Consistency Training with Prototype Adaptation for Few-Shot Learning
* hybrid deep and machine learning model for short-term traffic volume forecasting of adjacent intersections, A
* Hybrid Egocentric Activity Anticipation Framework via Memory-Augmented Recurrent and One-shot Representation Forecasting, A
* Hybrid Network of CNN and Transformer for Lightweight Image Super-Resolution, A
* Hybrid Quantum-Classical Algorithm for Robust Fitting, A
* Hybrid Relation Guided Set Matching for Few-shot Action Recognition
* Hybrid video coding scheme based on VVC and spatio-temporal attention convolution neural network
* HybridCR: Weakly-Supervised 3D Point Cloud Semantic Segmentation via Hybrid Contrastive Regularization
* Hydrological Connectivity Improves the Water-Related Environment in a Typical Arid Inland River Basin in Xinjiang, China
* Hydrological Drivers for the Spatial Distribution of Wetland Herbaceous Communities in Poyang Lake
* Hyperbolic Image Segmentation
* Hyperbolic Vision Transformers: Combining Improvements in Metric Learning
* HyperDet3D: Learning a Scene-conditioned 3D Object Detector
* Hypergraph-Induced Semantic Tuplet Loss for Deep Metric Learning
* HyperInverter: Improving StyleGAN Inversion via Hypernetwork
* HyperSegNAS: Bridging One-Shot Neural Architecture Search with 3D Medical Image Segmentation using HyperNet
* Hyperspectral Image Classification via Spectral Pooling and Hybrid Transformer
* Hyperspectral Image Classification with IFormer Network Feature Extraction
* Hyperspectral Reconnaissance: Joint Characterization of the Spectral Mixture Residual Delineates Geologic Unit Boundaries in the White Mountains, CA
* Hyperspherical Consistency Regularization
* HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing
* HyperTransformer: A Textural and Spectral Feature Fusion Transformer for Pansharpening
* I M Avatar: Implicit Morphable Head Avatars from Videos
* Ice hockey player identification via transformers and weakly supervised learning
* ICON: Implicit Clothed humans Obtained from Normals
* Id-Free Person Similarity Learning
* IDEA-Net: Dynamic 3D Point Cloud Interpolation via Deep Embedding Alignment
* Identifying a Leading Predictor of Arctic Stratospheric Ozone for April Precipitation in Eastern North America
* Identifying Ambiguous Similarity Conditions via Semantic Matching
* Identifying Potential Sites for Rainwater Harvesting Structures in Ghazi Tehsil, Khyber Pakhtunkhwa, Pakistan, Using Geospatial Approach
* Identity Preserving Loss for Learned Image Compression
* Identity-Unrelated Information Decoupling Model for Vehicle Re-Identification
* IDR: Self-Supervised Image Denoising via Iterative Data Refinement
* IFOR: Iterative Flow Minimization for Robotic Object Rearrangement
* IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation
* iFS-RCNN: An Incremental Few-shot Instance Segmenter
* Illumination Unification for Person Re-Identification
* Illumination-Resilient Lane Detection by Threshold Self-Adjustment Using Newton-Based Extremum Seeking
* Image Animation with Perturbed Masks
* Image Based Reconstruction of Liquids from 2D Surface Detections
* Image Dehazing Transformer with Transmission-Aware 3D Position Embedding
* Image Disentanglement Autoencoder for Steganography without Embedding
* Image inpainting via spatial projections
* Image manipulation detection by multiple tampering traces and edge artifact enhancement
* Image Multi-Inpainting via Progressive Generative Adversarial Networks
* Image Patch is a Wave: Phase-Aware Vision MLP, An
* Image Quality Assessment with Gradient Siamese Network
* Image Quality Assessment with Transformers and Multi-Metric Fusion Modules
* Image Segmentation Using Text and Image Prompts
* Image Understanding With Reinforcement Learning: Auto-Tuning Image Attributes and Model Parameters for Object Detection and Segmentation
* Image-Based Classification of Double-Barred Beach States Using a Convolutional Neural Network and Transfer Learning
* Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data
* ImageSig: A signature transform for ultra-lightweight image recognition
* IMDeception: Grouped Information Distilling Super-Resolution Network
* ImFace: A Nonlinear 3D Morphable Face Model with Implicit Neural Representations
* Imitation Learning-Enhanced Iterated Matching Algorithm for On-Demand Food Delivery, An
* Imitative Collaboration: A mirror-neuron inspired mixed reality collaboration method with remote hands and local replicas
* Impact of Fengyun-3E Microwave Temperature and Humidity Sounder Data on CMA Global Medium Range Weather Forecasts
* Impact of Future Sea-Level Rise on Low-Lying Subsiding Coasts: A Case Study of Tavoliere Delle Puglie (Southern Italy), The
* impact of ride-hailing services on the use of traditional taxis: Evidence from Chinese urban panel data, The
* Impacts of FY-4A AGRI Radiance Data Assimilation on the Forecast of the Super Typhoon In-Fa (2021)
* Implicit Feature Decoupling with Depthwise Quantization
* Implicit Motion Handling for Video Camouflaged Object Detection
* Implicit Sample Extension for Unsupervised Person Re-Identification
* Implicit Values of A Good Hand Shake: Handheld Multi-Frame Neural Depth Refinement, The
* ImplicitAtlas: Learning Deformable Shape Templates in Medical Imaging
* Importance is in your attention: Agent importance prediction for autonomous driving
* Imposing Consistency for Optical Flow Estimation
* Improved RANSAC Outlier Rejection Method for UAV-Derived Point Cloud, An
* Improved Source Model of the 2021 Mw 6.1 Yangbi Earthquake (Southwest China) Based on InSAR and BOI Datasets, An
* Improved Variance Reduction Methods for Riemannian Non-Convex Optimization
* Improving Adversarial Transferability via Neuron Attribution-based Attacks
* Improving Adversarially Robust Few-shot Image Classification with Generalizable Representations
* Improving Clear-Sky Solar Power Prediction over China by Assimilating Himawari-8 Aerosol Optical Depth with WRF-Chem-Solar
* Improving Deep Metric Learning by Divide and Conquer
* Improving Estimates and Change Detection of Forest Above-Ground Biomass Using Statistical Methods
* Improving GAN Equilibrium by Raising Spatial Awareness
* Improving Image Segmentation with Boundary Patch Refinement
* Improving Multi-Target Multi-Camera Tracking by Track Refinement and Completion
* Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations
* Improving Multiscale Object Detection With Off-Centered Semantics Refinement
* Improving neural implicit surfaces geometry with patch warping
* Improving RGB-D Salient Object Detection via Modality-Aware Decoder
* Improving Robustness Against Stealthy Weight Bit-Flip Attacks by Output Code Matching
* Improving Robustness of License Plates Automatic Recognition in Natural Scenes
* Improving Robustness to Texture Bias via Shape-focused Augmentation
* Improving Segmentation of the Inferior Alveolar Nerve through Deep Label Propagation
* Improving Subgraph Recognition with Variational Graph Information Bottleneck
* Improving the Modeling of Sea Surface Currents in the Persian Gulf and the Oman Sea Using Data Assimilation of Satellite Altimetry and Hydrographic Observations
* Improving the Performance of Automated Rooftop Extraction through Geospatial Stratified and Optimized Sampling
* Improving the Reconstruction of Vertical Temperature Profiles on Account of Oceanic Front Impacts
* Improving the Transferability of Targeted Adversarial Examples through Object-Based Diverse Input
* Improving Video Model Transfer with Dynamic Representation Learning
* Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning
* In-Vehicle Warning Information Provision Strategy for V2V-Based Proactive Traffic Safety Management, An
* Incoherent Interference Detection and Mitigation for Millimeter-Wave FMCW Radars
* Incorporating Kinematic Wave Theory Into a Deep Learning Method for High-Resolution Traffic Speed Estimation
* Incorporating Semi-Supervised and Positive-Unlabeled Learning for Boosting Full Reference Image Quality Assessment
* Increasing Negative Impacts of Climatic Change and Anthropogenic Activities on Vegetation Variation on the Qinghai-Tibet Plateau during 1982-2019
* Incremental Cross-view Mutual Distillation for Self-supervised Medical CT Synthesis
* Incremental learning for transductive support vector machine
* Incremental Learning in Semantic Segmentation from Image Labels
* Incremental Meta-Learning via Episodic Replay Distillation for Few-Shot Image Recognition
* Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding
* Industrial Style Transfer with Large-scale Geometric Warping and Content Preservation
* Inertia-Guided Flow Completion and Style Fusion for Video Inpainting
* Influence of the Accuracy of Chlorophyll-Retrieval Algorithms on the Estimation of Solar Radiation Absorbed in the Barents Sea
* InfoGCN: Representation Learning for Human Skeleton-based Action Recognition
* InfoNeRF: Ray Entropy Minimization for Few-Shot Neural Volume Rendering
* Information Elevation Network for Online Action Detection and Anticipation
* Information-Theoretic Odometry Learning
* Infrared Invisible Clothing: Hiding from Infrared Detectors at Multiple Angles in Real World
* Infrared-Visible Cross-Modal Person Re-Identification via Dual-Attention Collaborative Learning
* Injecting Semantic Concepts into End-to-End Image Captioning
* InOut: Diverse Image Outpainting via GAN Inversion
* Input-level Inductive Biases for 3D Reconstruction
* INS-Conv: Incremental Sparse Convolution for Online 3D Segmentation
* InsetGAN for Full-Body Image Generation
* InstaFormer: Instance-Aware Image-to-Image Translation with Transformer
* Instance Segmentation with Mask-supervised Polygonal Boundary Transformers
* Instance-Aware Dynamic Neural Network Quantization
* Instance-Aware Semantic Segmentation of Road Furniture in Mobile Laser Scanning Data
* Instance-Dependent Label-Noise Learning with Manifold-Regularized Transition Matrix Estimation
* Instance-Level Knowledge Transfer for Data-Driven Driver Model Adaptation With Homogeneous Domains
* Instance-Level Relative Saliency Ranking With Graph Reasoning
* Instance-wise Occlusion and Depth Orders in Natural Scenes
* Integrated Longitudinal and Lateral Vehicle Stability Control for Extreme Conditions With Safety Dynamic Requirements Analysis
* Integrated Radar and Communications Waveform Design Based on Multi-Symbol OFDM
* Integrating a UAV-Derived DEM in Object-Based Image Analysis Increases Habitat Classification Accuracy on Coral Reefs
* Integrating deep learning and traditional image enhancement techniques for underwater image enhancement
* Integrating Language Guidance into Vision-based Deep Metric Learning
* Integrating Pose and Mask Predictions for Multi-person in Videos
* Integrating Remote Sensing and Spatiotemporal Analysis to Characterize Artificial Vegetation Restoration Suitability in Desert Areas: A Case Study of Mu Us Sandy Land
* Integration of Hyperspectral and Magnetic Data for Geological Characterization of the Niaqornarssuit Ultramafic Complex in West-Greenland
* Integration of Sentinel-1A, ALOS-2 and GF-1 Datasets for Identifying Landslides in the Three Parallel Rivers Region, China
* Integrative Few-Shot Learning for Classification and Segmentation
* Intelligent 3D Objects Classification for Vehicular Ad Hoc Network Based on Lidar and Deep Learning Approaches
* Intelligent Caching Strategy Considering Time-Space Characteristics in Vehicular Named Data Networks, An
* Intelligent Control Scheme for Optimum Efficiency and Reduced Emission Operation of Marine Transportation System, An
* Intelligent Driver Drowsiness Detection for Traffic Safety Based on Multi CNN Deep Model and Facial Subsampling
* Intelligent Virtual Resource Allocation of QoS-Guaranteed Slices in B5G-Enabled VANETs for Intelligent Transportation Systems
* Intention-Aware Vehicle Trajectory Prediction Based on Spatial-Temporal Dynamic Attention Network for Internet of Vehicles
* IntentVizor: Towards Generic Query Guided Interactive Video Summarization
* Interact before Align: Leveraging Cross-Modal Knowledge for Domain Adaptive Action Recognition
* Interacting Attention Graph for Single Image Two-Hand Reconstruction
* Interaction Classification with Key Actor Detection in Multi-Person Sports Videos
* Interactive Disentanglement: Learning Concepts by Interacting with their Prototype Representations
* Interactive Image Synthesis with Panoptic Layout Generation
* Interactive Multi-Class Tiny-Object Detection
* Interactive Segmentation and Visualization for Tiny Objects in Multi-megapixel Images
* Interactive Trajectory Prediction Using a Driving Risk Map-Integrated Deep Learning Method for Surrounding Vehicles on Highways
* Interactiveness Field in Human-Object Interactions
* Interactron: Embodied Adaptive Object Detection
* Interannual and Monthly Variability of Typical Inland Lakes on the Tibetan Plateau Located in Three Different Climatic Zones
* Interferometric Orbit Determination System for Geosynchronous SAR Missions: Experimental Proof of Concept
* Internal Solitary Waves in the White Sea: Hot-Spots, Structure, and Kinematics from Multi-Sensor Observations
* Interpretable part-whole hierarchies and conceptual-semantic relationships in neural networks
* Intersection Management Protocol for Mixed Autonomous and Human-Operated Vehicles
* Interspace Pruning: Using Adaptive Filter Representations to Improve Training of Sparse CNNs
* Intraoperative Glioma Grading Using Neural Architecture Search and Multi-Modal Imaging
* IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Shot Network Quantization
* Intrinsic Calibration of Multi-Beam LiDARs for Agricultural Robots
* Intrinsic Image Decomposition Using Paradigms
* Intrinsic image decomposition using physics-based cues and CNNs
* Invariant Grounding for Video Question Answering
* Investigating intrinsic degradation factors by multi-branch aggregation for real-world underwater image enhancement
* Investigating Neural Architectures by Synthetic Dataset Design
* Investigating the Impact of Multi-LiDAR Placement on Object Detection for Autonomous Driving
* Investigating Top-k White-Box and Transferable Black-box Attack
* Investigating Tradeoffs in Real-World Video Super-Resolution
* investigation into lidar scan angle impacts on stand attribute predictions in different forest environments, An
* Investigation of Absorption Bands around 3.3 mu-m in CRISM Data
* iPLAN: Interactive and Procedural Layout Planning
* IRISformer: Dense Vision Transformers for Single-Image Inverse Rendering in Indoor Scenes
* IRON: Inverse Rendering by Optimizing Neural SDFs and Materials from Photometric Images
* Is it Really Easy to Detect Sybil Attacks in C-ITS Environments: A Position Paper
* Is Mapping Necessary for Realistic PointGoal Navigation?
* Is Neuron Coverage Needed to Make Person Detection More Robust?
* Is synthetic voice detection research going into the right direction?
* ISDNet: Integrating Shallow and Deep Networks for Efficient Ultra-high Resolution Segmentation
* ISNAS-DIP: Image-Specific Neural Architecture Search for Deep Image Prior
* ISNet: Shape Matters for Infrared Small Target Detection
* It is Okay to Not Be Okay: Overcoming Emotional Bias in Affective Image Captioning by Contrastive Data Collection
* It's About Time: Analog Clock Reading in the Wild
* It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher
* It's Time for Artistic Correspondence in Music and Video
* Iterative Corresponding Geometry: Fusing Region and Depth for Highly Efficient 3D Tracking of Textureless Objects
* Iterative Deep Homography Estimation
* Iterative Knowledge Exchange Between Deep Learning and Space-Time Spectral Clustering for Unsupervised Segmentation in Videos
* Iterative Quantum Approach for Transformation Estimation from Point Sets, An
* IterMVS: Iterative Probability Estimation for Efficient Multi-View Stereo
* IterNet++: An improved model for retinal image segmentation by curvelet enhancing, guided filtering, offline hard-sample mining, and test-time augmenting
* Ithaca365: Dataset and Driving Perception under Repeated and Challenging Weather Conditions
* ITSA: An Information-Theoretic Approach to Automatic Shortcut Avoidance and Domain Generalization in Stereo Matching Networks
* JIFF: Jointly-aligned Implicit Face Function for High Quality Single View Clothed Human Reconstruction
* JoinABLe: Learning Bottom-up Assembly of Parametric CAD Joints
* Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition, A
* joint deep learning network of point clouds and multiple views for roadside object classification from lidar point clouds, A
* Joint Distribution Matters: Deep Brownian Distance Covariance for Few-Shot Classification
* Joint Forecasting of Panoptic Segmentations with Difference Attention
* Joint Forecasting of Panoptic Segmentations with Difference Attention
* Joint Framework for Single Image Reconstruction and Super-Resolution With an Event Camera
* Joint Global and Local Hierarchical Priors for Learned Image Compression
* Joint Hand Motion and Interaction Hotspots Prediction from Egocentric Videos
* Joint Optimal Quantization and Aggregation of Federated Learning Scheme in VANETs
* Joint Progressive and Coarse-to-Fine Registration of Brain MRI via Deformation Field Integration and Non-Rigid Feature Fusion
* Joint Video Summarization and Moment Localization by Cross-Task Sample Transfer
* JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection
* K-Lane: Lidar Lane Dataset and Benchmark for Urban Roads and Highways
* K-Shot Contrastive Learning of Visual Features With Multiple Instance Augmentations
* Kernelized Few-shot Object Detection with Efficient Integral Aggregation
* Key Point-Based Driver Activity Recognition
* Keypoint Transformer: Solving Joint Identification in Challenging Hands and Object Interactions for Accurate 3D Pose Estimation
* Keypoint-based Global Association Network for Lane Detection, A
* KeyTr: Keypoint Transporter for 3D Reconstruction of Deformable Objects in Videos
* KG-SP: Knowledge Guided Simple Primitives for Open World Compositional Zero-Shot Learning
* Killing Two Birds with One Stone: Efficient and Robust Training of Face Recognition CNNs by Partial FC
* KNN Local Attention for Image Restoration
* Knowledge Distillation as Efficient Pre-training: Faster Convergence, Higher Data-efficiency, and Better Transferability
* Knowledge Distillation via the Target-aware Transformer
* Knowledge Distillation with the Reused Teacher Classifier
* Knowledge distillation: A good teacher is patient and consistent
* Knowledge Mining with Scene Text for Fine-Grained Recognition
* Knowledge-Driven Generative Adversarial Network for Text-to-Image Synthesis
* Knowledge-Driven Self-Supervised Representation Learning for Facial Action Unit Recognition
* Kubric: A scalable dataset generator
* L-Verse: Bidirectional Generation Between Image and Text
* L2G: A Simple Local-to-Global Knowledge Transfer Framework for Weakly Supervised Semantic Segmentation
* Label Matching Semi-Supervised Object Detection
* Label Relation Graphs Enhanced Hierarchical Residual Network for Hierarchical Multi-Granularity Classification
* Label, Verify, Correct: A Simple Few Shot Object Detection Method
* Label-Only Model Inversion Attacks via Boundary Repulsion
* LAE-Net: A locally-adaptive embedding network for low-light image enhancement
* Lagrange Motion Analysis and View Embeddings for Improved Gait Recognition
* LAKe-Net: Topology-Aware Point Cloud Completion by Localizing Aligned Keypoints
* LAN: Lightweight Attention-based Network for RAW-to-RGB Smartphone Image Processing
* landmark-free approach for automatic, dense and robust correspondence of 3D faces, A
* Landsat Data Based Prediction of Loblolly Pine Plantation Attributes in Western Gulf Region, USA
* Landscape Ecological Risk Assessment and Impact Factor Analysis of the Qinghai-Tibetan Plateau
* Landslide Detection and Mapping Based on SBAS-InSAR and PS-InSAR: A Case Study in Gongjue County, Tibet, China
* Landslide Susceptibility Modeling Using Remote Sensing Data and Random SubSpace-Based Functional Tree Classifier
* Lane Detection with Versatile AtrousFormer and Local Semantic Guidance
* Lane-Based Large-Scale UAS Traffic Management
* Language as Queries for Referring Video Object Segmentation
* Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
* Laplacian Pyramid Generative Adversarial Network for Infrared and Visible Image Fusion
* LAR-SR: A Local Autoregressive Model for Image Super-Resolution
* Large Loss Matters in Weakly Supervised Multi-Label Classification
* Large-scale Comprehensive Dataset and Copy-overlap Aware Evaluation Protocol for Segment-level Video Copy Detection, A
* Large-Scale Deep Learning Based Binary and Semantic Change Detection in Ultra High Resolution Remote Sensing Imagery: From Benchmark Datasets to Urban Application
* Large-Scale Pre-training for Person Re-identification with Noisy Labels
* Large-scale road extraction from high-resolution remote sensing images based on a weakly-supervised structural and orientational consistency constraint network
* Large-scale Video Panoptic Segmentation in the Wild: A Benchmark
* LARGE: Latent-Based Regression through GAN Semantics
* LAS-AT: Adversarial Training with Learnable Attack Strategy
* Laser Ranging Bathymetry Using a Photon-Number-Resolving Detector
* LASER: LAtent SpacE Rendering for 2D Visual Localization
* Latest Altimetry-Based Sea Ice Freeboard and Volume Inter-Annual Variability in the Antarctic over 2003-2020
* LaTr: Layout-Aware Transformer for Scene-Text VQA
* LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
* Layer-wised Model Aggregation for Personalized Federated Learning
* Layered Depth Refinement with Mask Guidance
* LC-FDNet: Learned Lossless Image Compression with Frequency Decomposition Network
* LD-ConGR: A Large RGB-D Video Dataset for Long-Distance Continuous Gesture Recognition
* Learn from Others and Be Yourself in Heterogeneous Federated Learning
* Learnable Depth-Sensitive Attention for Deep RGB-D Saliency Detection with Multi-modal Fusion Architecture Search
* Learnable Irrelevant Modality Dropout for Multimodal Action Recognition on Modality-Specific Annotated Videos
* Learnable Lookup Table for Neural Network Quantization
* Learned Compression of High Dimensional Image Datasets
* Learned Low Bitrate Video Compression with Space-Time Super-Resolution
* Learned Queries for Efficient Local Attention
* Learning 3D Object Shape and Layout without 3D Supervision
* Learning a Structured Latent Space for Unsupervised Point Cloud Completion
* Learning ABCs: Approximate Bijective Correspondence for isolating factors of variation with weak supervision
* Learning Accurate, Speedy, Lightweight CNNs via Instance-Specific Multi-Teacher Knowledge Distillation for Distracted Driver Posture Identification
* Learning Adaptive Warping for RealWorld Rolling Shutter Correction
* Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers
* Learning Affordance Grounding from Exocentric Images
* Learning Asymmetric and Local Features in Multi-Dimensional Data Through Wavelets With Recursive Partitioning
* Learning based Multi-modality Image and Video Compression
* Learning Bayesian Sparse Networks with Full Experience Replay for Continual Learning
* Learning Brain Dynamics of Evolving Manifold Functional MRI Data Using Geometric-Attention Neural Network
* Learning Canonical F-Correlation Projection for Compact Multiview Representation
* Learning Co-segmentation by Segment Swapping for Retrieval and Discovery
* Learning Deep Implicit Functions for 3D Shapes with Dynamic Code Clouds
* Learning Deformable Image Registration From Optimization: Perspective, Modules, Bilevel Training and Beyond
* Learning Degradation-Invariant Representation for Robust Real-World Person Re-Identification
* Learning Distinctive Margin toward Active Domain Adaptation
* Learning Fair Classifiers with Partially Annotated Group Labels
* Learning from All Vehicles
* Learning From Imbalanced Data With Deep Density Hybrid Sampling
* Learning from Pixel-Level Noisy Label: A New Perspective for Light Field Saliency Detection
* Learning from Temporal Gradient for Semi-supervised Action Recognition
* Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency
* Learning Generalized Feature for Temporal Action Detection: Application for Natural Driving Action Recognition Challenge
* Learning Graph Regularisation for Guided Super-Resolution
* Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation
* Learning Interaction-Aware Guidance for Trajectory Optimization in Dense Traffic Scenarios
* Learning Invisible Markers for Hidden Codes in Offline-to-online Photography
* Learning Local Displacements for Point Cloud Completion
* Learning Local-Global Contextual Adaptation for Multi-Person Pose Estimation
* Learning Memory-Augmented Unidirectional Metrics for Cross-modality Person Re-identification
* Learning Modal-Invariant and Temporal-Memory for Video-based Visible-Infrared Person Re-Identification
* Learning Motion-Dependent Appearance for High-Fidelity Rendering of Dynamic Humans from a Single Camera
* Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation
* Learning Multiple Adverse Weather Removal via Two-stage Knowledge Learning and Multi-contrastive Regularization: Toward a Unified Model
* Learning Multiple Dense Prediction Tasks from Partially Annotated Data
* Learning Neural Light Fields with Ray-Space Embedding
* Learning Non-target Knowledge for Few-shot Semantic Segmentation
* Learning Object Context for Novel-view Scene Layout Generation
* Learning of Global Objective for Network Flow in Multi-Object Tracking
* Learning Optical Flow with Kernel Patch Attention
* Learning Optimal K-space Acquisition and Reconstruction using Physics-Informed Neural Networks
* Learning Part Segmentation through Unsupervised Domain Adaptation from Synthetic Vehicles
* Learning Pixel Trajectories with Multiscale Contrastive Random Walks
* Learning Pixel-Level Distinctions for Video Highlight Detection
* Learning Program Representations for Food Images and Cooking Recipes
* Learning Robust Image-Based Rendering on Sparse Scene Geometry via Depth Completion
* Learning Second Order Local Anomaly for General Face Forgery Detection
* Learning Semantic Associations for Mirror Detection
* Learning Semantic Segmentation of Large-Scale Point Clouds With Random Sampling
* Learning Soft Estimator of Keypoint Scale and Orientation with Probabilistic Covariant Loss
* Learning Spatially Variant Linear Representation Models for Joint Filtering
* Learning spectral transform for 3D human motion prediction
* Learning Spherical Convolution for 360° Recognition
* Learning sRGB-to-Raw-RGB De-rendering with Content-Aware Metadata
* Learning Structured Gaussians to Approximate Deep Ensembles
* Learning to Affiliate: Mutual Centralized Learning for Few-shot Classification
* Learning to Align Sequential Actions in the Wild
* Learning to Answer Questions in Dynamic Audio-Visual Scenarios
* Learning to Anticipate Future with Dynamic Context Removal
* Learning to Ask Informative Sub-Questions for Visual Question Answering
* Learning to Collaborate in Decentralized Learning of Personalized Models
* Learning to Deblur using Light Field Generated and Real Defocus Images
* Learning to Detect Mobile Objects from LiDAR Scans Without Labels
* Learning to Detect Scene Landmarks for Camera Localization
* Learning to Estimate Robust 3D Human Mesh from In-the-Wild Crowded Scenes
* Learning to Find Good Models in RANSAC
* Learning to Forget for Meta-Learning via Task-and-Layer-Wise Attenuation
* Learning to generate line drawings that convey geometry and semantics
* Learning to Imagine: Diversify Memory for Incremental Learning using Unlabeled Data
* Learning to Learn across Diverse Data Biases in Deep Face Recognition
* Learning to Learn and Remember Super Long Multi-Domain Task Sequence
* Learning to Learn by Jointly Optimizing Neural Architecture and Weights
* Learning to Listen: Modeling Non-Deterministic Dyadic Facial Motion
* Learning to Memorize Feature Hallucination for One-Shot Image Generation
* Learning to Prompt for Continual Learning
* Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model
* Learning To Recognize Procedural Activities with Distant Supervision
* Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization
* Learning to Restore 3D Face from In-the-Wild Degraded Images
* Learning to See Through Obstructions With Layered Decomposition
* Learning to Solve Hard Minimal Problems
* Learning to Zoom Inside Camera Imaging Pipeline
* Learning Trajectory-Aware Transformer for Video Super-Resolution
* Learning Transferable and Discriminative Representations for 2D Image-Based 3D Model Retrieval
* Learning Transferable Human-Object Interaction Detector with Natural Language Supervision
* Learning Versatile Convolution Filters for Efficient Visual Recognition
* Learning Video Representations of Human Motion from Synthetic Data
* Learning What Not to Segment: A New Perspective on Few-Shot Segmentation
* Learning Where to Learn in Cross-View Self-Supervised Learning
* Learning With Multiclass AUC: Theory and Algorithms
* Learning with Neighbor Consistency for Noisy Labels
* Learning with Twin Noisy Labels for Visible-Infrared Person Re-Identification
* Learning-Based Resource Allocation for Backscatter-Aided Vehicular Networks
* Learning-by-Novel-View-Synthesis for Full-Face Appearance-Based 3D Gaze Estimation
* Lepard: Learning partial point cloud matching in rigid and deformable scenes
* Less is More: Generating Grounded Navigation Instructions from Landmarks
* Less is More: Proxy Datasets in NAS approaches
* Leveling Down in Computer Vision: Pareto Inefficiencies in Fair Deep Classifiers
* Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy
* Leveraging Adversarial Examples to Quantify Membership Information Leakage
* Leveraging Equivariant Features for Absolute Pose Regression
* Leveraging Geometric Structure for Label-Efficient Semi-Supervised Scene Segmentation
* Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection
* Leveraging Self-Supervision for Cross-Domain Crowd Counting
* Leveraging Unlabeled Data for Sketch-based Understanding
* LGT-Net: Indoor Panoramic Room Layout Estimation with Geometry-Aware Transformer Network
* LHPHGCNN: Lightweight Hierarchical Parallel Heterogeneous Group Convolutional Neural Networks for Point Cloud Scene Prediction
* Lidar Positioning for Indoor Precision Navigation
* LiDAR Snowfall Simulation for Robust 3D Object Detection
* LiDARCap: Long-range Markerless 3D Human Motion Capture with LiDAR Point Clouds
* Lifelong Graph Learning
* Lifelong Unsupervised Domain Adaptive Person Re-identification with Coordinated Anti-forgetting and Adaptation
* LIFT: Learning 4D LiDAR Image Fusion Transformer for 3D Object Detection
* Light field extraction from a conventional camera
* Light Field FDL-HCGH Feature in Scale-Disparity Space, A
* Light Field Neural Rendering
* Light field occlusion removal network via foreground location and background recovery
* Lightweight Network for High Dynamic Range Imaging, A
* Likert Scoring with Grade Decoupling for Long-term Action Assessment
* Linear Combination Approximation of Feature for Channel Pruning
* Linear RGB-D SLAM for Structured Environments
* LISA: Learning Implicit Shape and Appearance of Hands
* Listening for Sirens: Locating and Classifying Acoustic Alarms in City Scenes
* LiT: Zero-Shot Transfer with Locked-Image text Tuning
* Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation
* Lite Vision Transformer with Enhanced Self-Attention
* Lite-MDETR: A Lightweight Multi-Modal Detector
* Lithosphere Ionosphere Coupling Associated with Seismic Swarm in the Balkan Peninsula from ROB-TEC and GPS
* LMGP: Lifted Multicut Meets Geometry Projections for Multi-Camera Multi-Object Tracking
* LMSD-YOLO: A Lightweight YOLO Algorithm for Multi-Scale SAR Ship Detection
* Local Attention Pyramid for Scene Image Generation
* Local Learning Matters: Rethinking Data Heterogeneity in Federated Learning
* Local Texture Estimator for Implicit Representation Function
* Local to Global Feature Learning for Salient Object Detection
* Local-Adaptive Face Recognition via Graph-based Meta-Clustering and Regularized Adaptation
* Local-Global Graph Pooling via Mutual Information Maximization for Video-Paragraph Retrieval
* Locality preserving binary face representations using auto-encoders
* Locality-Aware Inter-and Intra-Video Reconstruction for Self-Supervised Correspondence Learning
* Localization Distillation for Dense Object Detection
* Localization of Craniomaxillofacial Landmarks on CBCT Images Using 3D Mask R-CNN and Local Dependency Learning
* Localized Adversarial Domain Generalization
* Locating Urban Trees near Electric Wires using Google Street View Photos: A New Dataset and A Semi-Supervised Learning Approach in the Wild
* Location-Free Human Pose Estimation
* Logging Pattern Detection by Multispectral Remote Sensing Imagery in North Subtropical Plantation Forests
* LOLNeRF: Learn from One Look
* Long-Short Temporal Contrastive Learning of Video Transformers
* Long-tail Recognition via Compositional Knowledge Transfer
* Long-Tailed Recognition via Weight Balancing
* Long-tailed Visual Recognition via Gaussian Clouded Logit Adjustment
* Long-term Action Forecasting Using Multi-headed Attention-based Variational Recurrent Neural Networks
* Long-Term Performance Evaluation of the Latest Multi-Source Weighted-Ensemble Precipitation (MSWEP) over the Highlands of Indo-Pak (1981-2009)
* Long-Term Spatiotemporal Characteristics and Impact Factors of Land Surface Temperature of Inhabited Islands with Different Urbanization Levels
* Long-term Video Frame Interpolation via Feature Propagation
* Long-term Visual Map Sparsification with Heterogeneous GNN
* LongReMix: Robust learning with high confidence samples in a noisy label environment
* Look Back and Forth: Video Super-Resolution with Explicit Temporal Difference Modeling
* Look Closer to Supervise Better: One-Shot Font Generation via Component-Based Discriminator
* Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
* Look Outside the Room: Synthesizing A Consistent Long-Term 3D Scene Video from A Single Image
* Lost in Compression: the Impact of Lossy Image Compression on Variable Size Object Detection within Infrared Imagery
* Low Memory Footprint Quantized Neural Network for Depth Completion of Very Sparse Time-of-Flight Depth Maps, A
* Low-Altitude Remote Sensing Inspection Method on Rural Living Environments Based on a Modified YOLOv5s-ViT, A
* Low-cost & Realtime Motion Capture System, A
* Low-rank 2D local discriminant graph embedding for robust image feature extraction
* Low-Resource Adaptation for Personalized Co-Speech Gesture Generation
* LSVC: A Learning-based Stereo Video Compression Framework
* LTP: Lane-based Trajectory Prediction for Autonomous Driving
* Lw-Count: An Effective Lightweight Encoding-Decoding Crowd Counting Network
* Lymph Node Metastasis Prediction From Whole Slide Images With Transformer-Guided Multiinstance Learning and Knowledge Transfer
* M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation
* M2I: From Factored Marginal Trajectory Prediction to Interactive Prediction
* M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers
* M3T: three-dimensional Medical image classifier using Multi-plane and Multi-slice Transformer
* M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining
* MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions
* MAEANet: Multiscale Attention and Edge-Aware Siamese Network for Building Change Detection in High-Resolution Remote Sensing Images
* Maintaining Reasoning Consistency in Compositional Visual Question Answering
* Majority Can Help the Minority: Context-rich Minority Oversampling for Long-tailed Classification, The
* Make It Move: Controllable Image-to-Video Generation with Text Descriptions
* Manifold Learning Benefits GANs
* MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment
* ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation
* Many-to-many Splatting for Efficient Video Frame Interpolation
* MAPLE-Edge: A Runtime Latency Predictor for Edge Devices
* MAPLE: Microprocessor A Priori for Latency Estimation
* Mapping African wetlands for 2020 using multiple spectral, geo-ecological features and Google Earth Engine
* Margin embedding net for robust margin collaborative representation-based classification
* Marginal Contrastive Correspondence for Guided Image Generation
* Marginalizing Sample Consensus
* Mask Transfiner for High-Quality Instance Segmentation
* Mask-guided Spectral-wise Transformer for Efficient Hyperspectral Image Reconstruction
* Masked Autoencoders Are Scalable Vision Learners
* Masked face recognition: Human versus machine
* Masked Feature Prediction for Self-Supervised Visual Pre-Training
* Masked-attention Mask Transformer for Universal Image Segmentation
* MaskGIT: Masked Generative Image Transformer
* Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network
* MAT: Mask-Aware Transformer for Large Hole Image Inpainting
* Matching Feature Sets for Few-Shot Image Classification
* Material Swapping for 3D Scenes using a Learnt Material Similarity Measure
* MatteFormer: Transformer-Based Image Matting via Prior-Tokens
* MAXIM: Multi-Axis MLP for Image Processing
* Maximum Consensus by Weighted Influences of Monotone Boolean Functions
* Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation
* MDAN: Multi-level Dependent Attention Network for Visual Emotion Analysis
* Measurement-Based Wideband Space-Time Channel Models for 77GHz Automotive Radar in Underground Parking Lots
* Measuring Compositional Consistency for Video Question Answering
* Medial Spectral Coordinates for 3D Shape Analysis
* Medusa: Universal Feature Learning via Attentional Multitasking
* medXGAN: Visual Explanations for Medical Classifiers through a Generative Latent Space
* Mega-NeRF: Scalable Construction of Large-Scale NeRFs for Virtual Fly- Throughs
* Memory-augmented Deep Conditional Unfolding Network for Pansharpening
* Memory-Augmented Non-Local Attention for Video Super-Resolution
* Memory-Based Ant Colony System Approach for Multi-Source Data Associated Dynamic Electric Vehicle Dispatch Optimization
* MeMOT: Multi-Object Tracking with Memory
* MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
* MERLOT RESERVE: Neural Script Knowledge through Vision and Language and Sound
* Merry Go Round: Rotate a Frame and Fool a DNN
* Meta Agent Teaming Active Learning for Pose Estimation
* Meta Balanced Network for Fair Face Recognition
* Meta Convolutional Neural Networks for Single Domain Generalization
* Meta Distribution Alignment for Generalizable Person Re-Identification
* Meta-attention for ViT-backed Continual Learning
* Meta-Generating Deep Attentive Metric for Few-Shot Classification
* Meta-Wrapper: Differentiable Wrapping Operator for User Interest Selection in CTR Prediction
* MetaFormer is Actually What You Need for Vision
* MetaFSCIL: A Meta-Learning Approach for Few-Shot Class Incremental Learning
* MetaPose: Fast 3D Pose from Multiple Views without 3D Supervision
* Method of Potentially Promising Network for Crack Detection With Enhanced Convolution and Dynamic Feature Fusion, A
* Methodology for Lidar Monitoring of Biomass Burning Smoke in Connection with the Land Cover
* Methodology for National Scale Coastal Landcover Mapping in New Zealand, A
* MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation
* Microscopic Model of Vehicle CO2 Emissions Based on Deep Learning: A Spatiotemporal Analysis of Taxicabs in Wuhan, China, A
* MIL-Derived Transformer for Weakly Supervised Point Cloud Segmentation, An
* Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning
* Mining Data Impressions From Deep Models as Substitute for the Unavailable Training Data
* Mining Multi-View Information: A Strong Self-Supervised Framework for Depth-based 3D Hand Pose and Mesh Estimation
* MiniViT: Compressing Vision Transformers with Weight Multiplexing
* MinNet: Minutia Patch Embedding Network for Automated Latent Fingerprint Recognition
* Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields
* MISF:Multi-level Interactive Siamese Filtering for High-Fidelity Image Inpainting
* Mitigating Paucity of Data in Sinusoid Characterization Using Generative Synthetic Noise
* Mix and Localize: Localizing Sound Sources in Mixtures
* MixAugment & Mixup: Augmentation Methods for Facial Expression Recognition
* Mixed Differential Privacy in Computer Vision
* Mixed Feature Prediction on Boundary Learning for Point Cloud Semantic Segmentation
* Mixed Methods Approach for Fuel Characterisation in Gorse (Ulex europaeus L.) Scrub from High-Density UAV Laser Scanning Point Clouds and Semantic Segmentation of UAV Imagery, A
* Mixed-attention-based regional soft partition network for vehicle reidentification
* MixFormer: End-to-End Tracking With Iterative Mixed Attention
* MixFormer: Mixing Features across Windows and Dimensions
* MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video
* MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing
* MLSLT: Towards Multilingual Sign Language Translation
* MM-TTA: Multi-Modal Test-Time Adaptation for 3D Semantic Segmentation
* MNSRNet: Multimodal Transformer Network for 3D Surface Super-Resolution
* Mobile-Former: Bridging MobileNet and Transformer
* MobRecon: Mobile-Friendly Hand Mesh Reconstruction from Monocular Image
* Modality-Agnostic Learning for Radar-Lidar Fusion in Vehicle Detection
* Model Level Ensemble for Facial Action Unit Recognition at the 3rd ABAW Challenge
* Modeling 3D Layout For Group Re-Identification
* Modeling Driver Responses to Automation Failures With Active Inference
* Modeling Image Composition for Complex Scene Generation
* Modeling Indirect Illumination for Inverse Rendering
* Modeling Missing Annotations for Incremental Learning in Object Detection
* Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation
* Modeling Small-Granularity Expressway Traffic Volumes With Quantum Walks
* Modeling sRGB Camera Noise with Normalizing Flows
* Modelling and performance analysis of Balise under dynamic energy harvesting in high-speed railway
* Modelling multiple quantiles together with the mean based on SA-ConvLSTM for taxi pick-up prediction
* Modified Multi-Direction Iterative Algorithm for Separable Nonlinear Models With Missing Data
* Modular Action Concept Grounding in Semantic Video Prediction
* Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings, A
* Modulated Contrast for Versatile Image Synthesis
* Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters
* MogFace: Towards a Deeper Appreciation on Face Detection
* Momentum Contrastive Pruning
* Monitoring Damage Caused by Pantana phyllostachysae Chao to Moso Bamboo Forests Using Sentinel-1 and Sentinel-2 Images
* Monitoring Mesoscale to Submesoscale Processes in Large Lakes with Sentinel-1 SAR Imagery: The Case of Lake Geneva
* Monitoring Non-Linear Ground Motion above Underground Gas Storage Using GNSS and PSInSAR Based on Sentinel-1 Data
* Monitoring of Hydrological Resources in Surface Water Change by Satellite Altimetry
* Monitoring of Wheat Height Based on Multi-GNSS Reflected Signals
* MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer
* MonoGround: Detecting Monocular 3D Objects from the Ground
* MonoJSG: Joint Semantic and Geometric Cost Volume for Monocular 3D Object Detection
* MonoScene: Monocular 3D Semantic Scene Completion
* MonoTrack: Shuttle trajectory reconstruction from monocular badminton video
* MOOD 2020: A Public Benchmark for Out-of-Distribution Detection and Localization on Medical Images
* More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
* MORPH-DSLAM: Model Order Reduction for Physics-Based Deformable SLAM
* Motion Aware Double Attention Network for Dynamic Scene Deblurring
* Motion detection in moving camera videos using background modeling and FlowNet
* Motion Planning for Connected Automated Vehicles at Occluded Intersections With Infrastructure Sensors
* Motion-Adjustable Neural Implicit Video Representation
* Motion-aware Contrastive Video Representation Learning via Foreground-background Merging
* Motion-from-Blur: 3D Shape and Motion Estimation of Motion-blurred Objects in Videos
* Motion-modulated Temporal Fragment Alignment Network For Few-Shot Action Recognition
* MotionAug: Augmentation with Physical Correction for Human Motion Prediction
* Motron: Multimodal Probabilistic Human Motion Forecasting
* Moving Objects Tracking Based on Geometric Model-Free Approach With Particle Filter Using Automotive LiDAR
* Moving Window Regression: A Novel Approach to Ordinal Regression
* MPAF: Model Poisoning Attacks to Federated Learning based on Fake Clients
* MPC: Multi-view Probabilistic Clustering
* MPViT: Multi-Path Vision Transformer for Dense Prediction
* Mr.BiQ: Post-Training Non-Uniform Quantization based on Minimizing the Reconstruction Error
* MS-Pansharpening Algorithm Based on Dual Constraint Guided Filtering
* MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection
* MS2DG-Net: Progressive Correspondence Learning via Multiple Sparse Semantics Dynamic Graph
* MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning
* MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens
* MsKAT: Multi-Scale Knowledge-Aware Transformer for Vehicle Re-Identification
* MSPR-Net: A Multi-Scale Features Based Point Cloud Registration Network
* MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction
* MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
* MSTRIQ: No Reference Image Quality Assessment Based on Swin Transformer with Multi-Stage Fusion
* MuIT: An End-to-End Multitask Learning Transformer
* MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering
* Multi stain graph fusion for multimodal integration in pathology
* Multi-attribute balanced sampling for disentangled GAN controls
* Multi-Bracket High Dynamic Range Imaging with Event Cameras
* Multi-Camera Multi-Vehicle Tracking with Domain Generalization and Contextual Constraints
* Multi-Camera Multiple 3D Object Tracking on the Move for Autonomous Vehicles
* Multi-Camera Trajectory Forecasting With Trajectory Tensors
* Multi-Camera Vehicle Tracking Based on Occlusion-aware and Inter-vehicle Information
* Multi-Camera Vehicle Tracking System for AI City Challenge 2022
* multi-channel geometric algebra residual network for traffic data prediction, A
* Multi-Class Cell Detection Using Modified Self-Attention
* Multi-class Token Transformer for Weakly Supervised Semantic Segmentation
* Multi-Dimensional Vision Transformer Compression via Dependency Guided Gaussian Process Search
* Multi-Dimensional, Nuanced and Subjective - Measuring the Perception of Facial Expressions
* Multi-encoder Network for Parameter Reduction of a Kernel-based Interpolation Architecture
* Multi-Frame Self-Supervised Depth with Transformers
* Multi-grained Spatio-Temporal Features Perceived Network for Event-based Lip-Reading
* Multi-Granularity Alignment Domain Adaptation for Object Detection
* Multi-granularity Retrieval System for Natural Language-based Vehicle Retrieval, A
* Multi-Graph Convolutional-Recurrent Neural Network (MGC-RNN) for Short-Term Forecasting of Transit Passenger Flow
* Multi-Head Distillation for Continual Unsupervised Domain Adaptation in Semantic Segmentation
* Multi-instance Point Cloud Registration by Efficient Correspondence Clustering
* Multi-label Classification with Partial Annotations using Class-aware Selective Loss
* Multi-label Iterated Learning for Image Classification with Label Ambiguity
* Multi-Layer Modeling of Dense Vegetation from Aerial LiDAR Scans
* Multi-level Domain Adaptation for Lane Detection
* Multi-level Feature Learning for Contrastive Multi-view Clustering
* Multi-Level Representation Learning with Semantic Alignment for Referring Video Object Segmentation
* Multi-marginal Contrastive Learning for Multilabel Subcellular Protein Localization
* Multi-modal 3D Human Pose Estimation with 2D Weak Supervision in Autonomous Driving
* Multi-modal Aerial View Object Classification Challenge Results: PBVS 2022
* Multi-modal Alignment using Representation Codebook
* Multi-Modal Dynamic Graph Transformer for Visual Grounding
* Multi-modal Extreme Classification
* Multi-Modal Feature Fusion Network with Adaptive Center Point Detector for Building Instance Extraction
* Multi-Modal Fusion Network Guided by Feature Co-Occurrence for Urban Region Function Recognition, A
* multi-modal universe of fast-fashion: the Visuelle 2.0 benchmark, The
* Multi-Object Tracking Meets Moving UAV
* Multi-Objective Diverse Human Motion Prediction with Knowledge Distillation
* Multi-Person Extreme Motion Prediction
* Multi-Robot Active Mapping via Neural Bipartite Graph Matching
* Multi-Scale Analysis for Coherent Change Detection: A Method for Extracting Typical Changed Area
* Multi-scale attention guided network for end-to-end face alignment and recognition
* Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation
* Multi-Scale Memory-Based Video Deblurring
* Multi-scale multi-hierarchy attention convolutional neural network for fetal brain extraction
* Multi-Sensor Remote Sensing of Intertidal Flat Habitats for Migratory Shorebird Conservation
* Multi-Source Time Series Remote Sensing Feature Selection and Urban Forest Extraction Based on Improved Artificial Bee Colony
* Multi-Source Uncertainty Mining for Deep Unsupervised Saliency Detection
* Multi-Stage Optimisation Approach to Design Relocation Strategies in One-Way Car-Sharing Systems With Stackable Cars, A
* Multi-task Learning for Human Affect Prediction with Auditory-Visual Synchronized Representation
* Multi-Task Learning for Video Surveillance with Limited Data
* Multi-Temporal Network for Improving Semantic Segmentation of Large-Scale Landsat Imagery, A
* Multi-Trip Multi-Trailer Drop-and-Pull Container Drayage Problem
* Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis
* Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry
* Multi-View Mesh Reconstruction with Neural Deferred Shading
* Multi-View Transformer for 3D Visual Grounding
* Multibranch Unsupervised Domain Adaptation Network for Cross Multidomain Orchard Area Segmentation
* Multidimensional Belief Quantification for Label-Efficient Meta-Learning
* Multidimensional Prototype Refactor Enhanced Network for Few-Shot Action Recognition
* Multidirectional Shift Rasterization (MDSR) Algorithm for Effective Identification of Ground in Dense Point Clouds
* Multidomain Suppression of Ambient Light in Visible Light Communication Transceivers
* Multiframe Joint Enhancement for Early Interlaced Videos
* Multimodal Colored Point Cloud to Image Alignment
* Multimodal Dynamics: Dynamical Fusion for Trustworthy Multimodal Classification
* Multimodal Material Segmentation
* Multimodal Shape Completion via Implicit Maximum Likelihood Estimation
* Multimodal Token Fusion for Vision Transformers
* Multimodal Transformer for Nursing Activity Recognition
* Multiobjective Platooning of Connected and Automated Vehicles Using Distributed Economic Model Predictive Control
* Multiple Characteristics of Precipitation Inferred from Wind Profiler Radar Doppler Spectra
* Multiple Degradation and Reconstruction Network for Single Image Denoising via Knowledge Distillation
* Multiple Object Detection and Tracking in the Thermal Spectrum
* Multiplicative Long Short-Term Memory with Improved Mayfly Optimization for LULC Classification
* Multiregional Coverage Path Planning for Multiple Energy Constrained UAVs
* Multiscale and Multitask Deep Learning Framework for Automatic Building Extraction, A
* Multiscale Normalization Attention Network for Water Body Extraction from Remote Sensing Imagery
* Multiscale Superpixel Guided Discriminative Forest for Hyperspectral Anomaly Detection
* Multispectral interaction convolutional neural network for pedestrian detection
* Multistage Curvature-Guided Network for Progressive Single Image Reflection Removal
* Multitaper-Mel Spectrograms for Keyword Spotting
* Multitask Hypergraph Convolutional Networks: A Heterogeneous Traffic Prediction Framework
* Multitype Highway Mobility Analytics for Efficient Learning Model Design: A Case of Station Traffic Prediction
* Multiview Depth-based Motion Capture Benchmark Dataset for Human Motion Denoising and Enhancement Research, A
* Multiview Transformers for Video Recognition
* MUM: Mix Image Tiles and UnMix Feature Tiles for Semi-Supervised Object Detection
* MUSE-VAE: Multi-Scale VAE for Environment-Aware Long Term Trajectory Prediction
* MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D Queries
* Mutual Information-driven Pan-sharpening
* Mutual Quantization for Cross-Modal Search with Noisy Labels
* MV-TAL: Mulit-view Temporal Action Localization in Naturalistic Driving
* MViTv2: Improved Multiscale Vision Transformers for Classification and Detection
* MVS2D: Efficient Multiview Stereo via Attention-Driven 2D Convolutions
* NAFSSR: Stereo Image Super-Resolution Using NAFNet
* NAN: Noise-Aware NeRFs for Burst-Denoising
* Natural Language-Based Vehicle Retrieval with Explicit Cross-Modal Representation Learning
* Negative-Aware Attention Framework for Image-Text Matching
* NeRF in the Dark: High Dynamic Range View Synthesis from Noisy Raw Images
* NeRF-Editing: Geometry Editing of Neural Radiance Fields
* Nerfels: Renderable Neural Codes for Improved Camera Pose Estimation
* NeRFReN: Neural Radiance Fields with Reflections
* NeRFusion: Fusing Radiance Fields for Large-Scale Scene Reconstruction
* Nested Collaborative Learning for Long-Tailed Visual Recognition
* Nested Hyperbolic Spaces for Dimensionality Reduction and Hyperbolic NN Design
* Network Amplification with Efficient MACs Allocation
* Network Rebalance and Operational Efficiency of Sharing Transportation System: Multi-Objective Optimization and Model Predictive Control Approaches
* Neural 3D Scene Reconstruction with the Manhattan-world Assumption
* Neural 3D Video Synthesis from Multi-view Video
* Neural Architecture Search with Representation Mutual Information
* Neural Collaborative Graph Machines for Table Structure Recognition
* Neural Compression-Based Feature Learning for Video Restoration
* Neural Convolutional Surfaces
* Neural Data-Dependent Transform for Learned Image Compression
* Neural Emotion Director: Speech-preserving semantic control of facial expressions in in-the-wild videos
* Neural Face Identification in a 2D Wireframe Projection of a Manifold Object
* Neural Face Video Compression using Multiple Views
* Neural Fields as Learnable Kernels for 3D Reconstruction
* Neural Global Shutter: Learn to Restore Video from a Rolling Shutter Camera with Global Reset Feature
* Neural graph embeddings as explicit low-rank matrix factorization for link prediction
* Neural Head Avatars from Monocular RGB Videos
* Neural Image Recolorization for Creative Domains
* Neural Inertial Localization
* Neural Mean Discrepancy for Efficient Out-of-Distribution Detection
* Neural Mesh Simplification
* Neural MoCon: Neural Motion Control for Physically Plausible Human Motion Capture
* Neural Network-based In-Loop Filter for CLIC 2022
* Neural Point Light Fields
* Neural Points: Point Cloud Representation with Neural Fields for Arbitrary Upsampling
* Neural Prior for Trajectory Estimation
* Neural Rays for Occlusion-aware Image-based Rendering
* Neural Recognition of Dashed Curves with Gestalt Law of Continuity
* Neural Reflectance for Shape Recovery with Shadow Handling
* Neural RGB-D Surface Reconstruction
* Neural Shape Mating: Self-Supervised Object Assembly with Adversarial Shape Priors
* Neural Template: Topology-aware Reconstruction and Disentangled Generation of 3D Meshes
* Neural Texture Extraction and Distribution for Controllable Person Image Synthesis
* Neural Volumetric Object Selection
* Neural Window Fully-connected CRFs for Monocular Depth Estimation
* Neural-network Enhanced Video Coding Framework beyond VVC, A
* NeuralAnnot: Neural Annotator for 3D Human Mesh Training Sets
* NeuralHDHair: Automatic High-fidelity Hair Modeling from a Single Image Using Implicit Neural Representations
* NeuralHOFusion: Neural Volumetric Rendering under Human-object Interactions
* Neurally-Guided Shape Parser: Grammar-based Labeling of 3D Shape Regions with Approximate Inference, The
* NeurMiPs: Neural Mixture of Planar Experts for View Synthesis
* new algorithm for support vector regression with automatic selection of hyperparameters, A
* New Approach for Nitrogen Status Monitoring in Potato Plants by Combining RGB Images and SPAD Measurements, A
* New Data Processing System for Generating Sea Ice Surface Roughness Products from the Multi-Angle Imaging SpectroRadiometer (MISR) Imagery, A
* New Dataset and Transformer for Stereoscopic Video Super-Resolution, A
* New Era for Geo-Parsing to Obtain Actual Locations: A Novel Toponym Correction Method Based on Remote Sensing Images
* New Modeling Approach for Predicting Vehicle-Based Safety Threats, A
* New Non-central Model for Fisheye Calibration, A
* New Optimal Subset Selection Method of Partial Ambiguity Resolution for Precise Point Positioning, A
* New Ship Detection Algorithm in Optical Remote Sensing Images Based on Improved R3Det, A
* New Strategy for Forest Height Estimation Using Airborne X-Band PolInSAR Data, A
* New Unsupervised Deep Learning Algorithm for Fine-Grained Detection of Driver Distraction, A
* New VVC Chroma Prediction Modes Based on Coloring with Inter-Channel Correlation
* NFormer: Robust Person Re-identification with Neighbor Transformer
* NICE-SLAM: Neural Implicit Scalable Encoding for SLAM
* NICGSlowDown: Evaluating the Efficiency Robustness of Neural Image Caption Generation Models
* NightLab: A Dual-level Architecture with Hardness Detection for Segmentation at Night
* Nighttime Image Dehazing Based on Variational Decomposition Model
* NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning
* NL-FFC: Non-Local Fast Fourier Convolution for Image Super Resolution
* NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks
* No Pain, Big Gain: Classify Dynamic Point Cloud Sequences with Static Models by Fitting Feature-level Space-time Surfaces
* No-Reference Image Quality Assessment by Hallucinating Pristine Features
* no-reference perceptual image quality assessment database for learned image codecs, A
* No-Reference Point Cloud Quality Assessment via Domain Adaptation
* NOC-REK: Novel Object Captioning with Retrieved Vocabulary from External Knowledge
* Node Representation Learning in Graph via Node-to-Neighbourhood Mutual Information Maximization
* Node-aligned Graph Convolutional Network for Whole-slide Image Representation and Classification
* NODEO: A Neural Ordinary Differential Equation Based Optimization Framework for Deformable Image Registration
* Noise Distribution Adaptive Self-Supervised Image Denoising using Tweedie Distribution and Score Matching
* Noise Is Also Useful: Negative Correlation-Steered Latent Contrastive Learning
* Noise Parameter Estimation Two-Stage Network for Single Infrared Dim Small Target Image Destriping
* Noise-robust oversampling for imbalanced data classification
* Noise2NoiseFlow: Realistic Camera Noise Modeling without Clean Images
* Noisy Boundaries: Lemon or Lemonade for Semi-supervised Instance Segmentation?
* NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition
* Non-Contact In-Plane Movement Estimation of Floating Covers Using Finite Element Formulation on Field-Scale DEM
* Non-generative Generalized Zero-shot Learning via Task-correlated Disentanglement and Controllable Samples Synthesis
* Non-instinct detection of cellphone usage from lane-keeping performance based on eXtreme gradient boosting and optimal sliding windows
* Non-isotropy Regularization for Proxy-based Deep Metric Learning
* Non-Iterative Recovery from Nonlinear Observations using Generative Models
* Non-linear Motion Estimation for Video Frame Interpolation using Space-time Convolutions
* Non-parametric Depth Distribution Modelling based Depth Inference for Multi-view Stereo
* Non-Probability Sampling Network for Stochastic Human Trajectory Prediction
* Non-Sinusoidal micro-Doppler Estimation Based on Dual-Branch Network
* Nonlocal feature learning based on a variational graph auto-encoder network for small area change detection using SAR imagery
* Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation
* Nonuniformly Dehaze Network for Visible Remote Sensing Images
* Norm Must Go On: Dynamic Unsupervised Domain Adaptation by Normalization, The
* Not All Labels Are Equal: Rationalizing The Labeling Costs for Training Object Detection
* Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds
* Not All Relations are Equal: Mining Informative Labels for Scene Graph Generation
* Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer
* Not Just Selection, but Exploration: Online Class-Incremental Continual Learning via Dual View Consistency
* Notice of Retraction: E2V-SDE: From Asynchronous Events to Fast and Continuous Video Reconstruction via Neural Stochastic Differential Equations
* Novel Algorithm Modelling for UWB Localization Accuracy in Remote Sensing, A
* Novel Class Discovery in Semantic Segmentation
* Novel Direct Trajectory Planning Approach Based on Generative Adversarial Networks and Rapidly-Exploring Random Tree, A
* Novel Freeze-Thaw State Detection Algorithm Based on L-Band Passive Microwave Remote Sensing, A
* novel hierarchical light field coding scheme based on hybrid stacked multiplicative layers and Fourier disparity layers for glasses-free 3D displays, A
* Novel Hybrid Attention-Driven Multistream Hierarchical Graph Embedding Network for Remote Sensing Object Detection, A
* Novel Lidar Signal-Denoising Algorithm Based on Sparrow Search Algorithm for Optimal Variational Modal Decomposition, A
* Novel Machine Learning-Based Scheme for Spectrum Sharing in Virtualized 5G Networks, A
* Novel Neuron-like Procedure of Weak Signal Detection against the Non-Stationary Noise Background with Application to Underwater Sound
* Novel Occlusion-Aware Vote Cost for Light Field Depth Estimation, A
* Novel Robust Inertial and Ultra-Short Baseline Integrated Navigation Strategy Under the Influence of Motion Effect, A
* Novel Slow-Growing Gross Error Detection Method for GNSS/Accelerometer Integrated Deformation Monitoring Based on State Domain Consistency Theory, A
* NPBG++: Accelerating Neural Point-Based Graphics
* NTIRE 2022 Burst Super-Resolution Challenge
* NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results
* NTIRE 2022 Challenge on High Dynamic Range Imaging: Methods and Results
* NTIRE 2022 Challenge on Learning the Super-Resolution Space
* NTIRE 2022 Challenge on Night Photography Rendering
* NTIRE 2022 Challenge on Perceptual Image Quality Assessment
* NTIRE 2022 Challenge on Stereo Image Super-Resolution: Methods and Results
* NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video: Dataset, Methods and Results
* NTIRE 2022 Image Inpainting Challenge: Report
* NTIRE 2022 Spectral Demosaicing Challenge and Data Set
* NTIRE 2022 Spectral Recovery Challenge and Data Set
* OakInk: A Large-scale Knowledge Repository for Understanding Hand-Object Interaction
* Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges
* Object Localization under Single Coarse Point Supervision
* Object Point Set Inductive Tracker for Multi-Object Tracking and Segmentation, An
* Object Prior Embedded Network for Query-Agnostic Image Retrieval
* Object-aware Video-language Pre-training for Retrieval
* Object-Oriented Canopy Gap Extraction from UAV Images Based on Edge Enhancement
* Object-Region Video Transformers
* Object-Relation Reasoning Graph for Action Recognition
* ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer
* ObjectFormer for Image Manipulation Detection and Localization
* Oblique Projection-Based Beamforming Method for Coherent Signals Receiving, An
* Observation Density Based Method for Independent Baseline Searching in GNSS Network Solution, An
* Observer-Based Double Closed-Loop Control for Mixed Vehicle Groups: A Macro and Micro Perspective
* OccAM's Laser: Occlusion-based Attribution Maps for 3D Object Detectors on LiDAR Data
* Occluded Human Mesh Recovery
* Occlusion and Deformation Handling Visual Tracking for UAV via Attention-Based Mask Generative Network
* Occlusion-Aware Cost Constructor for Light Field Depth Estimation
* Occlusion-robust Face Alignment using A Viewpoint-invariant Hierarchical Network Architecture
* OcclusionFusion: Occlusion-aware Motion Estimation for Real-time Dynamic 3D Reconstruction
* OCSampler: Compressing Videos to One Clip with Single-step Sampling
* Oil Spill Detection by CP SAR Based on the Power Entropy Decomposition
* OMG: Observe Multiple Granularities for Natural Language-Based Vehicle Retrieval
* Omni-DETR: Omni-Supervised Object Detection with Transformers
* omni-scale global-local aware network for shadow extraction in remote sensing imagery, An
* OmniFusion: 360 Monocular Depth Estimation via Geometry-Aware Fusion
* Omnivore: A Single Model for Many Visual Modalities
* On Adversarial Robustness of Trajectory Prediction for Autonomous Vehicles
* On Aliased Resizing and Surprising Subtleties in GAN Evaluation
* On Generalizing Beyond Domains in Cross-Domain Continual Learning
* On Guiding Visual Attention with Language Specification
* On Improving Cross-dataset Generalization of Deepfake Detectors
* On Learning Contrastive Representations for Learning with Noisy Labels
* On Reliable Awareness System for Autonomous River Vessels
* On the Choice of Data for Efficient Training and Validation of End-to-End Driving Models
* On the Correlation Among Edge, Pose and Parsing
* On the Effect of Atmospheric Turbulence in the Feature Space of Deep Face Recognition
* On the Exploitation of Deepfake Model Recognition
* On the Impacts of Historical and Future Climate Changes to the Sustainability of the Main Sardinian Forests
* On the Importance of Asymmetry for Siamese Representation Learning
* On the Instability of Relative Pose Estimation and RANSAC's Role
* On the Integration of Self-Attention and Convolution
* On the Potential of Flaming Hotspot Detection at Night via Multiband Visible/Near-Infrared Imaging
* On the Problem of the Sea Ice Detection by Orbital Microwave Doppler Radar at the Nadir Sounding
* On the Road to Online Adaptation for Semantic Image Segmentation
* On the Sensitivity of a Ground-Based Tropospheric Lidar to Aitken Mode Particles in the Upper Troposphere
* On the Treatment of Optimization Problems With L1 Penalty Terms via Multiobjective Continuation
* On the Use of Sentinel-2 NDVI Time Series and Google Earth Engine to Detect Land-Use/Land-Cover Changes in Fire-Affected Areas
* On-Orbit Radiometric Calibration of Hyperspectral Sensors on Board Micro-Nano Satellite Constellation Based on RadCalNet Data
* On-Sensor Binarized Fully Convolutional Neural Network for Localisation and Coarse Segmentation
* ONCE-3DLanes: Building Monocular 3D Lane Detection
* Once-for-All Budgeted Pruning Framework for ConvNets Considering Input Resolution, An
* One Loss for Quantization: Deep Hashing with Discrete Wasserstein Distributional Matching
* One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones
* One-bit Active Query with Contrastive Pairs
* One-Stage Object Referring with Gaze Estimation
* OneFlow: One-Class Flow for Anomaly Detection Based on a Minimal Volume Region
* OnePose: One-Shot Object Pose Estimation without CAD Models
* Online and offline streaming feature selection methods with bat algorithm for redundancy analysis
* Online change-point detection with kernels
* Online Continual Learning on a Contaminated Data Stream with Blurry Task Boundaries
* Online Convolutional Reparameterization
* Online Learning of Reusable Abstract Models for Object Goal Navigation
* Online Meta Adaptation for Variable-Rate Learned Image Compression
* Online Unsupervised Domain Adaptation for Person Re-identification
* OoD-Bench: Quantifying and Understanding Two Dimensions of Out-of-Distribution Generalization
* OPAD: An Optimized Policy-based Active Learning Framework for Document Content Analysis
* Open Challenges in Deep Stereo: the Booster Dataset
* Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources
* Open-Set Domain Adaptation Under Few Source-Domain Labeled Samples
* Open-Set Text Recognition via Character-Context Decoupling
* Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling
* Open-Vocabulary One-Stage Detection with Hierarchical Visual-Language Knowledge Distillation
* Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity
* Opening up Open World Tracking
* OpenSentinelMap: A Large-Scale Land Use Dataset using OpenStreetMap and Sentinel-2 Imagery
* OpenTAL: Towards Open Set Temporal Action Localization
* Optical Flow Estimation for Spiking Camera
* Optimal Correction Cost for Object Detection Evaluation
* Optimal Dynamic Supply of Parking Permits Under Uncertainties: A Stochastic Control Approach
* Optimal LED Spectral Multiplexing for NIR2RGB Translation
* Optimal Parameter Inflation to Enhance the Availability of Single-Frequency GBAS for Intelligent Air Transportation
* Optimal positioning of terrestrial LiDAR scanner stations in complex 3D environments with a multiobjective optimization method based on GPU simulations
* Optimal selection of wavelet transform parameters for spatio-temporal analysis based on non-stationary NDVI MODIS time series in Mediterranean region
* Optimising rPPG Signal Extraction by Exploiting Facial Surface Orientation
* Optimization of Remote Sensing Image Segmentation by a Customized Parallel Sine Cosine Algorithm Based on the Taguchi Method
* Optimized Dual Fire Attention Network and Medium-Scale Fire Classification Benchmark
* Optimized Software Tools to Generate Large Spatio-Temporal Data Using the Datacubes Concept: Application to Crop Classification in Cap Bon, Tunisia
* Optimizing Elimination Templates by Greedy Parameter Search
* Optimizing Management Practices to Reduce Sediment Connectivity between Forest Roads and Streams in a Mountainous Watershed
* Optimizing Nitrogen Management with Deep Reinforcement Learning and Crop Simulations
* Optimizing the Land Use and Land Cover Pattern to Increase Its Contribution to Carbon Neutrality
* Optimizing Video Prediction via Video Frame Interpolation
* Oriented RepPoints for Aerial Object Detection
* Oriented Ship Detection Based on Intersecting Circle and Deformable RoI in Remote Sensing Images
* Origin-Destination Demands-Based Multipath-Band Approach to Time-Varying Arterial Coordination, An
* OrphicX: A Causality-Inspired Latent Variable Model for Interpreting Graph Neural Networks
* OSKDet: Orientation-sensitive Keypoint Localization for Rotated Object Detection
* OSOP: A Multi-Stage One Shot Object Pose Estimation Framework
* OSSGAN: Open-Set Semi-Supervised Image Generation
* OSSO: Obtaining Skeletal Shape from Outside
* Out-Of-Distribution Detection In Unsupervised Continual Learning
* Out-of-distribution Generalization with Causal Invariant Transformations
* OutfitGAN: Learning Compatible Items for Generative Fashion Outfits
* OutfitTransformer: Outfit Representations for Fashion Recommendation
* Outlier removal and feature point pairs optimization for piecewise linear transformation in the co-registration of very high-resolution optical remote sensing imagery
* OVE6D: Object Viewpoint Encoding for Depth-based 6D Object Pose Estimation
* Overarching Sustainable Energy Management of PV Integrated EV Parking Lots in Reconfigurable Microgrids Using Generative Adversarial Networks
* Overcoming Catastrophic Forgetting in Incremental Object Detection via Elastic Response Distillation
* Overview of Ecosystem Changes in Tibetan and Other Alpine Regions from Earth Observation, An
* OW-DETR: Open-world Detection Transformer
* OW-TAL: Learning Unknown Human Activities for Open-World Temporal Action Localization
* P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior
* P3IV: Probabilistic Procedure Planning from Instructional Videos with Weak Supervision
* PageNet: Towards End-to-End Weakly Supervised Page-Level Handwritten Chinese Text Recognition
* PaintInStyle: One-Shot Discovery of Interpretable Directions by Painting
* PAND: Precise Action Recognition on Naturalistic Driving
* Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation
* Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers
* Panoptic, Instance and Semantic Relations: A Relational Context Encoder to Enhance Panoptic Segmentation
* Panoptic-PHNet: Towards Real-Time and High-Precision LiDAR Panoptic Segmentation via Clustering Pseudo Heatmap
* PanopticDepth: A Unified Framework for Depth-aware Panoptic Segmentation
* Parallel Generative Adversarial Network for Third-person to First-person Image Generation
* Parallel Simulation of Crowd Multi-Cell Occupancy and Velocity Variety
* Parameter-free Online Test-time Adaptation
* Parametric Scattering Networks
* Paramixer: Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention
* Part-based Pseudo Label Refinement for Unsupervised Person Re-identification
* PartGlot: Learning Shape Part Segmentation from Language Reference Games
* Partial Class Activation Attention for Semantic Segmentation
* partial cooperative control vehicle-to-vehicle trajectory planning algorithm with potential field constraints of arc-shaped road's boundary and vehicles' relative position, A
* Partially Does It: Towards Scene-Level FG-SBIR with Partial Input
* Particulate Matter Concentrations over South Korea: Impact of Meteorology and Other Pollutants
* Pass Receiver Prediction in Soccer using Video and Players' Trajectories
* Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
* PAT: Pseudo-Adversarial Training For Detecting Adversarial Videos
* Patch Slimming for Efficient Vision Transformers
* Patch-Based Uncalibrated Photometric Stereo Under Natural Illumination
* Patch-level Representation Learning for Self-supervised Vision Transformers
* Patch-wise Contrastive Style Learning for Instagram Filter Removal
* PatchFormer: An Efficient Point Transformer with Patch Attention
* PatchNet: A Simple Face Anti-Spoofing Framework via Fine-Grained Patch Recognition
* Path Planning of Spacecraft Cluster Orbit Reconstruction Based on ALPIO
* Patterns, Dynamics, and Drivers of Soil Available Nitrogen and Phosphorus in Alpine Grasslands across the QingZang Plateau
* PCA-Based Knowledge Distillation Towards Lightweight and Content-Style Balanced Photorealistic Style Transfer Models
* PCCN-RE: Point cloud colourisation network based on relevance embedding
* PCL: Proxy-based Contrastive Learning for Domain Generalization
* PEA: Improving the Performance of ReLU Networks for Free by Using Progressive Ensemble Activations
* Pedestrian next to the Lamppost Adaptive Object Graphs for Better Instantaneous Mapping, The
* Per-Clip Video Object Segmentation
* Perception Prioritized Training of Diffusion Models
* Perception-Based Pseudo-Motion Response for 360-Degree Video Streaming
* Perceptual in-Loop Filter for Image and Video Compression
* PerDet: Machine-Learning-Based UAV GPS Spoofing Detection Using Perception Data
* Performance Analysis of Separating Function Estimation Test for Impropriety of Complex Signals
* Performance Evaluation and Noise Mitigation of the FY-3E Microwave Humidity Sounder
* Performance Evaluation of Multi-Epoch Double-Differenced Pseudorange Observation Method Using GNSS Ground Stations
* Performance Prediction for Semantic Segmentation by a Self-Supervised Image Reconstruction Decoder
* Performance-Aware Mutual Knowledge Distillation for Improving Neural Architecture Search
* Perfusion assessment via local remote photoplethysmography (rPPG)
* Permafrost Early Deformation Signals before the Norilsk Oil Tank Collapse in Russia
* Persistent-Transient Duality in Human Behavior Modeling
* Person Re-identification Method Based on Color Attack and Joint Defence
* Personalized Image Aesthetics Assessment with Rich Attributes
* PersonGONE: Image Inpainting for Automated Checkout Solution
* Perturbed and Strict Mean Teachers for Semi-supervised Semantic Segmentation
* Phase Unwrapping using a Joint CNN and SQD-LSTM Network
* Phenological Changes and Driving Forces of Lake Ice in Central Asia from 2002 to 2020
* phi-SfT: Shape-from-Template with a Physics-Based Deformation Model
* PhoCaL: A Multi-Modal Dataset for Category-Level Object Pose Estimation with Photometrically Challenging Objects
* PhoneDepth: A Dataset for Monocular Depth Estimation on Mobile Devices
* Photometric Visual Gyroscope for Full-View Spherical Camera
* Photorealistic Monocular 3D Reconstruction of Humans Wearing Clothing
* PhotoScene: Photorealistic Material and Lighting Transfer for Indoor Scenes
* PhyIR: Physics-based Inverse Rendering for Panoramic Indoor Images
* PhysFormer: Facial Video-based Physiological Measurement with Temporal Difference Transformer
* Physical Inertial Poser (PIP): Physics-aware Real-time Human Motion Tracking from Sparse Inertial Sensors
* Physical Simulation Layer for Accurate 3D Modeling
* Physically Disentangled Intra- and Inter-domain Adaptation for Varicolored Haze Removal
* Physically-guided Disentangled Implicit Rendering for 3D Face Modeling
* Physics Based Image Deshadowing Using Local Linear Model
* Physics-Based Noise Modeling for Extreme Low-Light Photography
* PIE-Net: Photometric Invariant Edge Guided Network for Intrinsic Image Decomposition
* PILC: Practical Image Lossless Compression with an End-to-end GPU Oriented Neural Framework
* Pin the Memory: Learning to Generalize Semantic Segmentation
* PINA: Learning a Personalized Implicit Neural Avatar from a Single RGB-D Video Sequence
* Pix2NeRF: Unsupervised Conditional pi-GAN for Single Image to Neural Radiance Fields Translation
* Pixel screening based intermediate correction for blind deblurring
* Pixel-Level Segmentation-Synthesis Framework for Dynamic Texture Video Compression, A
* Pixel-wise supervision for presentation attack detection on identity document cards
* PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures
* PLAD: Learning to Infer Shape Programs with Pseudo-Labels and Approximate Distributions
* PlanarRecon: Realtime 3D Plane Detection and Reconstruction from Posed Monocular Videos
* PlaneMVS: 3D Plane Reconstruction from Multi-View Stereo
* Playable Environments: Video Manipulation in Space and Time
* Plenoxels: Radiance Fields without Neural Networks
* Plithogenic multi-criteria decision making approach on airspace planning scheme evaluation based on ATC-flight real-time simulation
* Plug-and-Play Regularization Using Linear Solvers
* PNP: Robust Learning from Noisy Labels by Probabilistic Noise Prediction
* PO-ELIC: Perception-Oriented Efficient Learned Image Coding
* POCO: Point Convolution for Surface Reconstruction
* PODD: A Dual-Task Detection for Greenhouse Extraction Based on Deep Learning
* Point Cloud Color Constancy
* Point cloud completion by dynamic transformer with adaptive neighbourhood feature fusion
* Point Cloud Pre-training with Natural 3D Structures
* Point Density-Aware Voxels for LiDAR 3D Object Detection
* Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling
* Point-Level Region Contrast for Object Detection Pre-Training
* Point-NeRF: Point-based Neural Radiance Fields
* Point-to-Surface Upscaling Algorithms for Snow Depth Ground Observations
* Point-to-Voxel Knowledge Distillation for LiDAR Semantic Segmentation
* Point2Cyl: Reverse Engineering 3D Objects from Point Clouds to Extrusion Cylinders
* Point2Roof: End-to-end 3D building roof modeling from airborne LiDAR point clouds
* Point2Seq: Detecting 3D Objects as Sequences
* PointCaps: Raw point cloud processing using capsule networks with Euclidean distance routing
* PointCLIP: Point Cloud Understanding by CLIP
* Pointly-Supervised Instance Segmentation
* PointMotionNet: Point-Wise Motion Learning for Large-Scale LiDAR Point Clouds Sequences
* PointOT: Interpretable Geometry-Inspired Point Cloud Generative Model via Optimal Transport
* Poisons that are learned faster are more effective
* PokeBNN: A Binary Pursuit of Lightweight Accuracy
* Polarity Sampling: Quality and Diversity Control of Pre-Trained Generative Networks via Singular Values
* Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps
* PolyWorld: Polygonal Building Extraction with Graph Neural Networks in Satellite Images
* PONI: Potential Functions for ObjectGoal Navigation with Interaction-free Learning
* Pooling Revisited: Your Receptive Field is Suboptimal
* Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian
* Portrait Eyeglasses and Shadow Removal by Leveraging 3D Synthetic Data
* Pose Estimation for Two-View Panoramas based on Keypoint Matching: A Comparative Study and Critical Analysis
* Pose Tutor: An Explainable System for Pose Correction in the Wild
* Pose-based Contrastive Learning for Domain Agnostic Activity Representations
* PoseKernelLifter: Metric Lifting of 3D Human Pose using Sound
* PoseTrack21: A Dataset for Person Search, Multi-Object Tracking and Multi-Person Pose Tracking
* PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision
* Positronium Lifetime Image Reconstruction for TOF PET
* Post-Flood Analysis for Damage and Restoration Assessment Using Drone Imagery
* Posture Calibration Based Cross-View and Hard-Sensitive Metric Learning for UAV-Based Vehicle Re-Identification
* Power Pylon Reconstruction from Airborne LiDAR Data Based on Component Segmentation and Model Matching
* PPDL: Predicate Probability Distribution based Loss for Unbiased Scene Graph Generation
* PPW Curves: a C2 Interpolating Spline with Hyperbolic Blending of Rational Bézier Curves
* Practical Evaluation of Adversarial Robustness via Adaptive Auto Attack
* Practical Learned Lossless JPEG Recompression with Multi-Level Cross-Channel Entropy Model in the DCT Domain
* Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation
* Pre-Predictive Congestion-Based Data Allocation for Sixth Generation Cooperative Intelligent Transportation Systems
* Precise Positioning Method of Moving Laser Point Cloud in Shield Tunnel Based on Bolt Hole Extraction
* Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model
* Predicting and Mapping Potential Fire Severity for Risk Analysis at Regional Level Using Google Earth Engine
* Predicting Mind-Wandering with Facial Videos in Online Lectures
* Predicting Risky Driving in a Connected Vehicle Environment
* Prediction of Radar Echo Space-Time Sequence Based on Improving TrajGRU Deep-Learning Model
* Prediction of Sea Surface Temperature by Combining Interdimensional and Self-Attention with Neural Networks
* Presentation and Short Discussion of rVAD-fast, a Fast Voice Activity Detector, A
* Preserving Location-Privacy in Vehicular Networks via Reinforcement Learning
* Pretrain, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction
* Primitive3D: 3D Object Dataset Synthesis from Randomly Assembled Primitives
* Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy, The
* Prior Knowledge-Aware Fusion Network for Prediction of Macrovascular Invasion in Hepatocellular Carcinoma
* Privacy Leakage of Adversarial Training Models in Federated Learning Systems
* Privacy Preserving Partial Localization
* Privacy-friendly Synthetic Data for the Development of Face Morphing Attack Detectors
* Privacy-Preserving Collaborative Estimation for Networked Vehicles With Application to Collaborative Road Profile Estimation
* Privacy-preserving Online AutoML for Domain-Specific Face Detection
* Proactive Image Manipulation Detection
* Probabilistic Compositional Embeddings for Multimodal Image Retrieval
* Probabilistic Graphical Model Based on Neural-symbolic Reasoning for Visual Relationship Detection, A
* Probabilistic Normal Epipolar Constraint for Frame-To-Frame Rotation Optimization under Uncertain Feature Positions, The
* Probabilistic Representations for Video Contrastive Learning
* Probabilistic Risk Metric for Highway Driving Leveraging Multi-Modal Trajectory Predictions
* Probabilistic Tracking of Annual Cropland Changes over Large, Complex Agricultural Landscapes Using Google Earth Engine
* Probabilistic Warp Consistency for Weakly-Supervised Semantic Correspondences
* Probing Representation Forgetting in Supervised and Unsupervised Continual Learning
* Programmatic Concept Learning for Human Motion Description and Synthesis
* Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
* Progressive End-to-End Object Detection in Crowded Scenes
* Progressive Minimal Path Method with Embedded CNN
* Progressive multi-scale fusion network for RGB-D salient object detection
* Progressive Tandem Learning for Pattern Recognition With Deep Spiking Neural Networks
* Progressive Training of A Two-Stage Framework for Video Restoration
* Progressively Generating Better Initial Guesses Towards Next Stages for High-Quality Human Motion Prediction
* Projective Manifold Gradient Layer for Deep Rotation Regression
* Prompt Distribution Learning
* Prompt-RSVQA: Prompting visual context to a language model for Remote Sensing Visual Question Answering
* Propagation Regularizer for Semi-supervised Learning with Extremely Scarce Labeled Samples
* Proper Reuse of Image Classification Features Improves Object Detection
* Proposal-based Paradigm for Self-supervised Sound Source Localization in Videos, A
* Proposal-free Lidar Panoptic Segmentation with Pillar-level Affinity
* ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via Exploiting CLIP Cues
* Protecting Celebrities from DeepFake with Identity Consistency Transformer
* Protecting Facial Privacy: Generating Adversarial Identity Masks via Style-robust Makeup Transfer
* Proto2Proto: Can you recognize the car, the way I do?
* Pruning rPPG Networks: Toward Small Dense Network with Limited Number of Training Samples
* Pseudo-label Generation and Various Data Augmentation for Semi-Supervised Hyperspectral Object Detection
* Pseudo-label Generation for Agricultural Robotics Applications
* Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
* Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving
* PseudoProp: Robust Pseudo-Label Generation for Semi-Supervised Object Detection in Autonomous Driving Systems
* PSGAN++: Robust Detail-Preserving Makeup Transfer and Removal
* PSMNet: Position-aware Stereo Merging Network for Room Layout Estimation
* PSNet: Fast Data Structuring for Hierarchical Deep Learning on Point Cloud
* PSTR: End-to-End One-Step Person Search With Transformers
* PTNet3D: A 3D High-Resolution Longitudinal Infant Brain MRI Synthesizer Based on Transformers
* PTR-CNN for in-loop filtering in video coding
* PTTR: Relational 3D Point Cloud Object Tracking with Transformer
* PubTables-1M: Towards comprehensive table extraction from unstructured documents
* PUMP: Pyramidal and Uniqueness Matching Priors for Unsupervised Learning of Local Descriptors
* Pushing the Envelope of Gradient Boosting Forests via Globally-Optimized Oblique Trees
* Pushing the Limits of Simple Pipelines for Few-Shot Learning: External Data and Fine-Tuning Make a Difference
* Pushing the Performance Limit of Scene Text Recognizer without Human Annotation
* Putting People in their Place: Monocular Regression of 3D People in Depth
* PVNAS: 3D Neural Architecture Search With Point-Voxel Convolution
* PyMiceTracking: An Open-Source Toolbox For Real-Time Behavioral Neuroscience Experiments
* Pyramid Adversarial Training Improves ViT Performance
* Pyramid Architecture for Multi-Scale Processing in Point Cloud Segmentation
* Pyramid Geometric Consistency Learning For Semantic Segmentation
* Pyramid Grafting Network for One-Stage High Resolution Saliency Detection
* Pyramidal Attention for Saliency Detection
* PyTorch-OOD: A Library for Out-of-Distribution Detection based on PyTorch
* QLP: Deep Q-Learning for Pruning Deep Neural Networks
* QPNet: Lane-changing trajectory planning combining quadratic programming and neural network under the convex optimization framework
* QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation
* Quantify the Road Link Performance and Capacity Using Deep Learning Models
* Quantifying Plant Species alpha-Diversity Using Normalized Difference Vegetation Index and Climate Data in Alpine Grasslands
* Quantifying Societal Bias Amplification in Image Captioning
* Quantifying the Influence of Climate Change and Anthropogenic Activities on the Net Primary Productivity of China's Grasslands
* Quantitative Impact of the Arable Land Protection Policy on the Landscape of Farmland Abandonment in Guangdong Province, The
* Quantization-aware Deep Optics for Diffractive Snapshot Hyperspectral Imaging
* Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for Free
* Query and Attention Augmentation for Knowledge-Based Explainable Reasoning
* Query efficient black-box adversarial attack on deep neural networks
* Query-guided networks for few-shot fine-grained classification and person search
* QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection
* R(Det)2: Randomized Decision Routing for Object Detection
* Radar and Communication Spectral Coexistence on Moving Platform with Interference Suppression
* Radar Detection Method of Plasma-Sheath-Covered Target Based on the Improved Keystone Algorithm, A
* Radar-Based Human Activity Recognition Under the Limited Measurement Data Support Using Domain Translation
* RADU: Ray-Aligned Depth Update Convolutions for ToF Data Denoising
* RAGO: Recurrent Graph Optimizer For Multiple Rotation Averaging
* Railway Lidar Point Cloud Reconstruction Based on Target Detection and Trajectory Filtering, A
* Raindrop Size Distribution Prediction by an Improved Long Short-Term Memory Network
* Raising context awareness in motion forecasting
* RAMA: A Rapid Multicut Algorithm on GPU
* Random Forest Model for Drought: Monitoring and Validation for Grassland Drought Based on Multi-Source Remote Sensing Data, A
* Rank in Style: A Ranking-based Approach to Find Interpretable Directions
* Ranking Distance Calibration for Cross-Domain Few-Shot Learning
* Ranking-Based Siamese Visual Tracking
* Raw High-Definition Radar for Multi-Task Learning
* Ray Priors through Reprojection: Improving Neural Radiance Fields for Novel View Extrapolation
* Ray3D: ray-based 3D human pose estimation for monocular absolute 3D localization
* RayMVSNet: Learning Ray-based 1D Implicit Fields for Accurate Multi-View Stereo
* RBFNN-Based Adaptive Event-Triggered Control for Heterogeneous Vehicle Platoon Consensus
* RBGNet: Ray-based Grouping for 3D Object Detection
* RCL: Recurrent Continuous Localization for Temporal Action Detection
* RCP: Recurrent Closest Point for Point Cloud
* RDONet: Rate-Distortion Optimized Learned Image Compression with Variable Depth
* Re-Balancing Strategy for Class-Imbalanced Classification Based on Instance Difficulty, A
* Reading During Fully Automated Driving: A Study of the Effect of Peripheral Visual and Haptic Information on Situation Awareness and Mental Workload
* Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation
* Real-Time Detection of Winter Jujubes Based on Improved YOLOX-Nano Network
* Real-time Hyper-Dimensional Reconfiguration at the Edge using Hardware Accelerators
* Real-time Hyperspectral Imaging in Hardware via Trained Metasurface Encoders
* Real-time identification of eye fixations and saccades using radial basis function networks and Markov chains
* Real-time Object Detection for Streaming Perception
* Real-Time Self-Supervised Monocular Depth Estimation Without GPU
* Real-Time Vehicle Sound Detection System Based on Depthwise Separable Convolution Neural Network and Spectrogram Augmentation
* Real-Time, Accurate, and Consistent Video Semantic Segmentation via Unsupervised Adaptation and Cross-Unit Deployment on Mobile Device
* Reasoning with Multi-Structure Commonsense Knowledge in Visual Dialog
* Rebalancing and Charging Scheduling With Price Incentives for Car Sharing Systems
* Recall@k Surrogate Loss with Large Batches and Similarity Mixup
* RecDis-SNN: Rectifying Membrane Potential Distribution for Directly Training Spiking Neural Networks
* Recent Seasonal Spatiotemporal Variations in Alpine Glacier Surface Elevation in the Pamir
* Recognition of Freely Selected Keypoints on Human Limbs
* Recognition of Sago Palm Trees Based on Transfer Learning
* Reconfigurable Convolution-in-Pixel CMOS Image Sensor Architecture, A
* Reconstruct from Top View: A 3D Lane Detection Approach based on Geometry Structure Prior
* Reconstructing Surfaces for Sparse Point Clouds with On-Surface Priors
* Reconstruction of Monthly Surface Nutrient Concentrations in the Yellow and Bohai Seas from 2003-2019 Using Machine Learning
* Reconstruction of Rainfall Field Using Earth-Space Links Network: A Compressed Sensing Approach
* Recurrent Dynamic Embedding for Video Object Segmentation
* Recurrent Glimpse-based Decoder for Detection with Transformer
* Recurrent Models for Lane Change Prediction and Situation Assessment
* Recurrent Multi-Frame Deraining: Combining Physics Guidance and Adversarial Learning
* Recurrent Variational Network: A Deep Learning Inverse Problem Solver applied to the task of Accelerated MRI Reconstruction
* Recurring the Transformer for Video Action Recognition
* Reduce Information Loss in Transformers for Pluralistic Image Inpainting
* Ref-NeRF: Structured View-Dependent Appearance for Neural Radiance Fields
* Reference-Based Video Super-Resolution Using Multi-Camera Video Triplets
* Reflash Dropout in Image Super-Resolution
* Reflection and Rotation Symmetry Detection via Equivariant Learning
* Regarding the quality of disparity estimation from distorted light fields
* Region-Aware Face Swapping
* Region-Based Deep Learning Approach to Automated Retail Checkout, A
* Region-based two-stage MRI bone tissue segmentation of the knee joint
* Regional Monitoring of Fall Armyworm (FAW) Using Early Warning Systems
* Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation
* RegionCLIP: Region-based Language-Image Pretraining
* Registering Explicit to Implicit: Towards High-Fidelity Garment Mesh Reconstruction from Single Images
* RegNeRF: Regularizing Neural Radiance Fields for View Synthesis from Sparse Inputs
* Regression or Classification? Reflection on BP prediction from PPG data using Deep Neural Networks in the scope of practical applications
* REGTR: End-to-end Point Cloud Correspondences with Transformers
* Reinforced Structured State-Evolution for Vision-Language Navigation
* Relationship between Topological Structure and Ecosystem Services of Forest Grass Ecospatial Network in China
* Relative Pose from a Calibrated and an Uncalibrated Smartphone Image
* Reliability of Forensic Body-Shape Identification, The
* Relieving Long-tailed Instance Segmentation via Pairwise Class Balance
* RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition
* Remember Intentions: Retrospective-Memory-based Trajectory Prediction
* Remote Estimation of Continuous Blood Pressure by a Convolutional Neural Network Trained on Spatial Patterns of Facial Pulse Waves
* Remote Heart Rate Estimation by Signal Quality Attention Network
* Remote Pulse Estimation in the Presence of Face Masks
* Remote Sensing Analysis of Geologic Hazards
* Remote Sensing Image Scene Classification via Self-Supervised Learning and Knowledge Distillation
* Remote Sensing Image-Change Detection with Pre-Generation of Depthwise-Separable Change-Salient Maps
* Remote Sensing of Forest Burnt Area, Burn Severity, and Post-Fire Recovery: A Review
* Remote Sensing on Alfalfa as an Approach to Optimize Production Outcomes: A Review of Evidence and Directions for Future Assessments
* Remote Sensing Scene Graph and Knowledge Graph Matching with Parallel Walking Algorithm
* Rendering Nighttime Image Via Cascaded Color and Brightness Compensation
* RenderSR: A Lightweight Super-Resolution Model for Mobile Gaming Upscaling
* RendNet: Unified 2D/3D Recognizer with Latent Space Rendering
* RePaint: Inpainting using Denoising Diffusion Probabilistic Models
* Replacing Labeled Real-image Datasets with Auto-generated Contours
* RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality
* RepNet: Efficient On-Device Learning via Feature Reprogramming
* Represent, Compare, and Learn: A Similarity-Aware Framework for Class-Agnostic Counting
* Representation Compensation Networks for Continual Semantic Segmentation
* Representing 3D Shapes with Probabilistic Directed Distance Fields
* Research on Co-Channel Interference Cancellation for Underwater Acoustic MIMO Communications
* Research on Cross-Regional Debris Flow Susceptibility Mapping Based on Transfer Learning, A
* Research on the Measurement Accuracy of Shipborne Rayleigh Scattering Lidar
* Residual Feature Pyramid Network for Enhancement of Vascular Patterns
* Residual Local Feature Network for Efficient Super-Resolution
* ResNeSt: Split-Attention Networks
* ResSFL: A Resistance Transfer Framework for Defending Model Inversion Attack in Split Federated Learning
* RestoreFormer: High-Quality Blind Face Restoration from Undegraded Key-Value Pairs
* RestoreX-AI: A Contrastive Approach towards Guiding Image Restoration via Explainable AI Systems
* Restormer: Efficient Transformer for High-Resolution Image Restoration
* ReSTR: Convolution-free Referring Image Segmentation Using Transformers
* ResViT: Residual Vision Transformers for Multimodal Medical Image Synthesis
* Rethinking Adversarial Examples in Wargames
* Rethinking Architecture Design for Tackling Data Heterogeneity in Federated Learning
* Rethinking Bayesian Deep Learning Methods for Semi-Supervised Volumetric Medical Image Segmentation
* Rethinking Controllable Variational Autoencoders
* Rethinking Deep Face Restoration
* Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation
* Rethinking Efficient Lane Detection via Curve Modeling
* Rethinking Illumination for Person Re-Identification: A Unified View
* Rethinking Image Cropping: Exploring Diverse Compositions from Global Views
* Rethinking Minimal Sufficient Representation in Contrastive Learning
* Rethinking Reconstruction Autoencoder-Based Out-of-Distribution Detection
* Rethinking referring relationships from a perspective of mask-level relational reasoning
* Rethinking Semantic Segmentation: A Prototype View
* Rethinking Spatial Invariance of Convolutional Networks for Object Counting
* Rethinking Supervised Depth Estimation for 360° Panoramic Imagery
* Rethinking the Augmentation Module in Contrastive Learning: Learning Hierarchical Augmentation Invariance with Expanded Views
* Rethinking Visual Geo-localization for Large-Scale Applications
* Retinal image enhancement with artifact reduction and structure retention
* RETRACTION: Transformer-induced graph reasoning for multimodal semantic segmentation in remote sensing
* Retrieval Augmented Classification for Long-Tail Visual Recognition
* Retrieval of carbon content and biomass from hyperspectral imagery over cultivated areas
* Retrieval of Chlorophyll-a Concentrations Using Sentinel-2 MSI Imagery in Lake Chagan Based on Assessments with Machine Learning Models
* Retrieval of Stratospheric Ozone Profiles from Limb Scattering Measurements of the Backward Limb Spectrometer on Chinese Space Laboratory Tiangong-2: Preliminary Results
* Retrieval-based Spatially Adaptive Normalization for Semantic Image Synthesis
* Retrieving Water Quality Parameters from Noisy-Label Data Based on Instance Selection
* Reusing the Task-specific Classifier as a Discriminator: Discriminator-free Adversarial Domain Adaptation
* Revealing Occlusions with 4D Neural Fields
* Reversible Vision Transformers
* Review of Automatic Processing of Topography and Surface Feature Identification LiDAR Data Using Machine Learning Techniques
* Review of Marine Gravity Field Recovery from Satellite Altimetry, A
* Review of Ship Collision Avoidance Guidance Algorithms Using Remote Sensing and Game Control
* Review of Spectral Indices for Mangrove Remote Sensing, A
* Revisiting AP Loss for Dense Object Detection: Adaptive Ranking Pair Selection
* Revisiting Document Image Dewarping by Grid Regularization
* Revisiting Domain Generalized Stereo Matching Networks from a Feature Consistency Perspective
* Revisiting Learnable Affines for Batch Norm in Few-Shot Transfer Learning
* Revisiting Modality-Specific Feature Compensation for Visible-Infrared Person Re-Identification
* Revisiting Near/Remote Sensing with Geospatial Attention
* Revisiting Random Channel Pruning for Neural Network Compression
* Revisiting Skeleton-based Action Recognition
* Revisiting Temporal Alignment for Video Restoration
* Revisiting the Receptive Field of Conv-GRU in DROID-SLAM
* Revisiting the Transferability of Supervised Pretraining: An MLP Perspective
* Revisiting Vicinal Risk Minimization for Partially Supervised Multi-Label Classification Under Data Scarcity
* Revisiting Weakly Supervised Pre-Training of Visual Perception Models
* REX: Reasoning-aware and Grounded Explanation
* RFID Technology Study for Traffic Signage Inventory Management Application
* RFNet: Unsupervised Network for Mutually Reinforcing Multi-modal Image Registration and Fusion
* RGB-Depth Fusion GAN for Indoor Depth Completion
* RGB-ICP Method to Calculate Ground Three-Dimensional Deformation Based on Point Cloud from Airborne LiDAR
* RGB-Multispectral Matching: Dataset, Learning Methodology, Evaluation
* Rice Yield Prediction and Model Interpretation Based on Satellite and Climatic Indicators Using a Transformer Method
* RIDDLE: Lidar Data Compression with Range Image Deep Delta Encoding
* RigidFlow: Self-Supervised Scene Flow Learning on Point Clouds by Local Rigidity Prior
* RigNeRF: Fully Controllable Neural 3D Portraits
* RIM-Net: Recursive Implicit Fields for Unsupervised Learning of Hierarchical Shape Structures
* Ring and Radius Sampling Based Phasor Field Diffraction Algorithm for Non-Line-of-Sight Reconstruction
* RIO: Rotation-equivariance supervised learning of robust inertial odometry
* Risk Assessment Methodologies for Autonomous Driving: A Survey
* Risk-Averse Equilibria for Vehicle Navigation in Stochastic Congestion Games
* RM-Depth: Unsupervised Learning of Recurrent Monocular Depth in Dynamic Scenes
* RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization
* Road Network Extraction from SAR Images with the Support of Angular Texture Signature and POIs
* Road-Curvature-Range-Dependent Path Following Controller Design for Autonomous Ground Vehicles Subject to Stochastic Delays
* Road-Model-Based Road Boundary Extraction for High Definition Map via LIDAR
* RoadSaW: A Large-Scale Dataset for Camera-Based Road Surface and Wetness Estimation
* Roadside Decision-Making Methodology Based on Deep Reinforcement Learning to Simultaneously Improve the Safety and Efficiency of Merging Zone, A
* Robust Analog Beamforming for Periodic Broadcast V2V Communication
* Robust and Accurate 3D Self-Portraits in Seconds
* Robust and Accurate Superquadric Recovery: a Probabilistic Approach
* robust and efficient method for skeleton-based human action recognition and its application for cross-dataset evaluation, A
* Robust Combination of Distributed Gradients Under Adversarial Perturbations
* Robust Contrastive Learning against Noisy Views
* Robust Cross-Modal Representation Learning with Progressive Self-Distillation
* Robust Cuboid Modeling from Noisy and Incomplete 3D Point Clouds Using Gaussian Mixture Model
* Robust Egocentric Photo-realistic Facial Expression Transfer for Virtual Reality
* Robust Equivariant Imaging: A fully unsupervised framework for learning to image from noisy and partial measurements
* Robust Federated Learning with Noisy and Heterogeneous Clients
* Robust fine-tuning of zero-shot models
* Robust Image Forgery Detection over Online Social Network Shared Images
* Robust Invertible Image Steganography
* Robust linear unmixing with enhanced constraint of classification for hyperspectral remote sensing imagery
* robust non-blind deblurring method using deep denoiser prior, A
* Robust Non-Fragile Fault Tolerant Control for Ensuring the Safety of the Intended Functionality of Cooperative Adaptive Cruise Control
* Robust Optimization as Data Augmentation for Large-scale Graphs
* Robust outlier detection by de-biasing VAE likelihoods
* Robust Physical-World Attacks on Face Recognition
* Robust Region Feature Synthesizer for Zero-Shot Object Detection
* Robust Star Identification Algorithm Based on a Masked Distance Map, A
* Robust Structured Declarative Classifiers for 3D Point Clouds: Defending Adversarial Attacks with Implicit Gradients
* Robust Table Detection and Structure Recognition from Heterogeneous Document Images
* Robust Traffic Speed Inference With Ensemble Learning
* Robust Traffic-Aware City-Scale Multi-Camera Vehicle Tracking Of Vehicles, A
* Robustness and Adaptation to Hidden Factors of Variation
* Robustness of Deep Learning-Based Specific Emitter Identification under Adversarial Attacks
* ROCA: Robust CAD Model Retrieval and Alignment from a Single Image
* RODD: A Self-Supervised Approach for Robust Out-of-Distribution Detection
* Role of Shape for Domain Generalization on Sparsely-Textured Images, The
* Rope3D: The Roadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task
* Rotation invariant Gabor convolutional neural network for image classification
* Rotationally Equivariant 3D Object Detection
* RSCFed: Random Sampling Consensus Federated Semi-supervised Learning
* RSD-GAN: Regularized Sobolev Defense GAN Against Speech-to-Text Adversarial Attacks
* RSTT: Real-time Spatial Temporal Transformer for Space-Time Video Super-Resolution
* RTrPPG: An Ultra Light 3DCNN for Real-Time Remote Photoplethysmography
* RU-Net: Regularized Unrolling Network for Scene Graph Generation
* RV-GAN: Recurrent GAN for Unconditional Video Generation
* S2F2: Single-Stage Flow Forecasting for Future Multiple Trajectories Prediction
* SaberNet: Self-attention based effective relation network for few-shot learning
* Safe Self-Refinement for Transformer-based Domain Adaptation
* Safe-Student for Safe Deep Semi-Supervised Learning with Unseen-Class Unlabeled Data
* Safety Assured Online Guidance With Airborne Separation for Urban Air Mobility Operations in Uncertain Environments
* Salient-to-Broad Transition for Video Person Re-identification
* Salt Stockpile Inventory Management Using LiDAR Volumetric Measurements
* Salvage of Supervision in Weakly Supervised Object Detection
* Sam's Net: A Self-Augmented Multistage Deep-Learning Network for End-to-End Reconstruction of Limited Angle CT
* SAM: Self-Supervised Learning of Pixel-Wise Anatomical Embeddings in Radiological Images
* Sample Selection Approach with Number of False Predictions for Learning with Noisy Labels
* Sampling and Re-Weighting: Towards Diverse Frame Aware Unsupervised Video Person Re-Identification
* sampling-based approach for efficient clustering in large datasets, A
* SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation
* SaR: Self-adaptive Refinement on Pseudo Labels for Multiclass-Imbalanced Semi-supervised Learning
* SASIC: Stereo Image Compression with Latent Shifts and Stereo Attention
* Sat-NeRF: Learning Multi-View Satellite Photogrammetry With Transient Objects and Shadow Modeling Using RPC Cameras
* Satellite Soil Moisture Data Reconstruction in the Temporal and Spatial Domains: Latent Error Assessments and Performances for Tracing Rainstorms and Droughts
* SC2-PCR: A Second Order Spatial Compatibility for Efficient and Robust Point Cloud Registration
* Scalable Combinatorial Solver for Elastic Geometrically Consistent 3D Shape Matching, A
* Scalable Intra Coding Optimization for Video Coding
* Scalable Penalized Regression for Noise Detection in Learning with Noisy Labels
* Scale robust point matching-Net: End-to-end scale point matching using Lie group
* Scale-Consistent Fusion: From Heterogeneous Local Sampling to Global Immersive Rendering
* Scale-Equivalent Distillation for Semi-Supervised Object Detection
* ScaleNet: A Shallow Architecture for Scale Estimation
* Scaling Up Vision-Language Pretraining for Image Captioning
* Scaling Up Your Kernels to 31X31: Revisiting Large Kernel Design in CNNs
* Scaling Vision Transformers
* Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning
* Scanline Homographies for Rolling-Shutter Plane Absolute Pose
* ScanpathNet: A Recurrent Mixture Density Network for Scanpath Prediction
* ScanQA: 3D Question Answering for Spatial Scene Understanding
* Scenario Parameter Generation Method and Scenario Representativeness Metric for Scenario-Based Assessment of Automated Vehicles
* Scenario Understanding and Motion Prediction for Autonomous Vehicles: Review and Comparison
* Scene Consistency Representation Learning for Video Scene Segmentation
* Scene Graph Expansion for Semantics-Guided Image Outpainting
* Scene Representation in Bird's-Eye View from Surrounding Cameras with Transformers
* Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations
* SceneSqueezer: Learning to Compress Scene for Camera Relocalization
* SCENIC: A JAX Library for Computer Vision Research and Beyond
* ScePT: Scene-consistent, Policy-based Trajectory Predictions for Planning
* Screen Content Video Quality Assessment Model Using Hybrid Spatiotemporal Features
* Scribble-Supervised LiDAR Semantic Segmentation
* ScribbleNet: Efficient interactive annotation of urban city scenes for semantic segmentation
* SCS-Co: Self-Consistent Style Contrastive Learning for Image Harmonization
* SCVRL: Shuffled Contrastive Video Representation Learning
* Sea Situational Awareness (SeaSAw) Dataset
* Searching for Efficient Neural Architectures for On-Device ML on Edge TPUs
* Searching for Energy-Efficient Hybrid Adder-Convolution Neural Networks
* Searching the Deployable Convolution Neural Networks for GPUs
* Seasonal and Diurnal Variation Characteristics of Soil Moisture at Different Depths from Observational Sites over the Tibetan Plateau, The
* Seasonal and Microphysical Characteristics of Fog at a Northern Airport in Alberta, Canada
* Secure and Intelligent Framework for Vehicle Health Monitoring Exploiting Big-Data Analytics, A
* Secure and Lightweight Drones-Access Protocol for Smart City Surveillance, A
* Secure Transmission in Cellular V2X Communications Using Deep Q-Learning
* SEEG: Semantic Energized Co-speech Gesture Generation
* Seek-and-Hide: Adversarial Steganography via Deep Reinforcement Learning
* SeeTheSeams: Localized Detection of Seam Carving based Image Forgery in Satellite Imagery
* SeeThroughNet: Resurrection of Auxiliary Loss by Preserving Class Probability Information
* Segment and Complete: Defending Object Detectors against Adversarial Patch Attacks with Robust Patch Detection
* Segment, Magnify and Reiterate: Detecting Camouflaged Objects the Hard Way
* Segment-Fusion: Hierarchical Context Fusion for Robust 3D Semantic Segmentation
* Segmenting across places: The need for fair transfer learning with satellite imagery
* Segmenting Objects From Relational Visual Data
* Selection of Lunar South Pole Landing Site Based on Constructing and Analyzing Fuzzy Cognitive Maps
* Selection of the Speed Command Distance for Improved Performance of a Rule-Based VSL and Lane Change Control
* Selective-Supervised Contrastive Learning with Noisy Labels
* Self Supervised Scanpath Prediction Framework for Painting Images
* Self-Attention with Convolution and Deconvolution for Efficient Eye Gaze Estimation from a Full Face Image
* Self-augmented Unpaired Image Dehazing via Density and Depth Decomposition
* Self-Calibrated Efficient Transformer for Lightweight Super-Resolution
* Self-Distillation from the Last Mini-Batch for Consistency Regularization
* Self-regularized prototypical network for few-shot semantic segmentation
* Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural Representation
* Self-Supervised Bulk Motion Artifact Removal in Optical Coherence Tomography Angiography
* Self-supervised Correlation Mining Network for Person Image Generation
* Self-supervised Deep Image Restoration via Adaptive Stochastic Gradient Langevin Dynamics
* Self-Supervised Dense Consistency Regularization for Image-to-Image Translation
* Self-Supervised Descriptor for Image Copy Detection, A
* Self-Supervised Equivariant Learning for Oriented Keypoint Detection
* Self-Supervised Global-Local Structure Modeling for Point Cloud Domain Adaptation with Reliable Voted Pseudo Labels
* Self-Supervised Image Representation Learning with Geometric Set Consistency
* Self-supervised Image-specific Prototype Exploration for Weakly Supervised Semantic Segmentation
* Self-Supervised Keypoint Discovery in Behavioral Videos
* Self-supervised Learning for Sonar Image Classification
* Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection
* Self-Supervised Learning of Object Parts for Semantic Segmentation
* Self-Supervised Learning of Pose-Informed Latents
* Self-Supervised Learning to Guide Scientifically Relevant Categorization of Martian Terrain Images
* Self-Supervised Material and Texture Representation Learning for Remote Sensing Tasks
* Self-Supervised Models are Continual Learners
* Self-supervised Neural Articulated Shape and Appearance Models
* Self-supervised object detection from audio-visual correspondence
* Self-Supervised Pre-Training of Swin Transformers for 3D Medical Image Analysis
* Self-Supervised Predictive Convolutional Attentive Block for Anomaly Detection
* Self-supervised Spatial Reasoning on Multi-View Line Drawings
* Self-Supervised Super-Resolution for Multi-Exposure Push-Frame Satellites
* Self-Supervised Transformers for Unsupervised Object Discovery using Normalized Cut
* Self-Supervised Variable Rate Image Compression using Visual Attention
* Self-Supervised Video Representation Learning Using Improved Instance-Wise Contrastive Learning and Deep Clustering
* Self-supervised Video Representation Learning with Cascade Positive Retrieval
* Self-supervised Video Transformer
* Self-supervised Vision Transformers for Land-cover Segmentation and Classification
* Self-Supervised Voxel-Level Representation Rediscovers Subcellular Structures in Volume Electron Microscopy
* Self-supervision and meta-learning for one-shot unsupervised cross-domain detection
* Self-supervision versus synthetic datasets: Which is the lesser evil in the context of video denoising?
* Self-Sustaining Representation Expansion for Non-Exemplar Class-Incremental Learning
* Self-Taught Metric Learning without Labels
* Self-Training Strategy Based on Finite Element Method for Adaptive Bioluminescence Tomography Reconstruction
* Self-Updatable Database System Based on Human Motion Assessment Framework
* SelfD: Self-Learning Large-Scale Driving Policies From the Web
* SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video
* SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation
* Semantic Cameras for 360-Degree Environment Perception in Automated Urban Driving
* Semantic Contrastive Embedding for Generalized Zero-Shot Learning
* Semantic guided knowledge graph for large-scale zero-shot learning
* Semantic Guided Long Range Stereo Depth Estimation for Safer Autonomous Vehicle Applications
* Semantic Pose Verification for Outdoor Visual Localization with Self-supervised Contrastive Learning
* Semantic Segmentation by Early Region Proxy
* Semantic Segmentation for Thermal Images: A Comparative Survey
* Semantic Segmentation Guided Coarse-to-Fine Detection of Individual Trees from MLS Point Clouds Based on Treetop Points Extraction and Radius Expansion
* Semantic-aligned Fusion Transformer for One-shot Object Detection
* Semantic-Aware Auto-Encoders for Self-supervised Representation Learning
* Semantic-Aware Domain Generalized Segmentation
* Semantic-Shape Adaptive Feature Modulation for Semantic Image Synthesis
* Semantically Grounded Visual Embeddings for Zero-Shot Learning
* Semantics-and-Primitives-Guided Indoor 3D Reconstruction from Point Clouds
* SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing
* Semi-Supervised Few-Shot Learning from A Dependency-Discriminant Perspective
* Semi-Supervised Few-shot Learning via Multi-Factor Clustering
* Semi-Supervised Hyperspectral Object Detection Challenge Results: PBVS 2022
* Semi-Supervised Learning of Semantic Correspondence with Pseudo-Labels
* Semi-Supervised Object Detection via Multi-instance Alignment with Global Class Prototypes
* Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels
* Semi-supervised Semantic Segmentation with Error Localization Network
* Semi-Supervised Training to Improve Player and Ball Detection in Soccer
* Semi-supervised Video Paragraph Grounding with Contrastive Encoder
* Semi-Supervised Video Semantic Segmentation with Inter-Frame Feature Reconstruction
* Semi-Supervised Wide-Angle Portraits Correction by Multi-Scale Transformer
* Semi-Weakly-Supervised Learning of Complex Actions from Instructional Task Videos
* Semiconductor Defect Detection by Hybrid Classical-Quantum Deep Learning
* Sensitivity Analysis of 1,3-Butadiene Monitoring Based on Space-Based Detection in the Infrared Band
* Sensor Fusion for Aircraft Detection at Airport Ramps Using Conditional Random Fields
* Sequential Pattern Mining Based Approach to Adaptively Detect Anomalous Paths in Floating Vehicle Trajectories, A
* Sequential Voting with Relational Box Fields for Active Object Detection
* SERNet: Squeeze and Excitation Residual Network for Semantic Segmentation of High-Resolution Remote Sensing Images
* Set-Supervised Action Learning in Procedural Task Videos via Pairwise Order Consistency
* SG-SRNs: Superpixel-Guided Scene Representation Networks
* SGTR: End-to-end Scene Graph Generation with Transformer
* Shadow Compensation from UAV Images Based on Texture-Preserving Local Color Transfer
* Shadow detection via multi-scale feature fusion and unsupervised domain adaptation
* Shadows can be Dangerous: Stealthy and Effective Physical-world Adversarial Attack by Natural Phenomenon
* Shape Enhanced Keypoints Learning with Geometric Prior for 6D Object Pose Tracking
* Shape from Polarization for Complex Scenes in the Wild
* Shape from Thermal Radiation: Passive Ranging Using Multi-spectral LWIR Measurements
* Shape-invariant 3D Adversarial Point Clouds
* ShapeFormer: Transformer-based Shape Completion via Sparse Representation
* Shapley-NAS: Discovering Operation Contribution for Neural Architecture Search
* SharpContour: A Contour-based Boundary Refinement Approach for Efficient and Accurate Instance Segmentation
* Shift Pooling PSPNet: Rethinking PSPNet for Building Extraction in Remote Sensing Images from Entire Local Feature Pooling
* SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain Adaptation
* Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding
* Shortwave Infrared Multi-Angle Polarization Imager (MAPI) Onboard Fengyun-3 Precipitation Satellite for Enhanced Cloud Characterization
* Should I take a walk? Estimating Energy Expenditure from Video Data
* Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning
* Show, Deconfound and Tell: Image Captioning with Causal Inference
* Shunted Self-Attention via Multi-Scale Token Aggregation
* Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning
* Siamese Temporal Convolutional Networks for Driver Identification Using Driver Steering Behavior Analysis
* Side-Channel Security Analysis of Connected Vehicle Communications Using Hidden Markov Models
* SIGMA: Semantic-complete Graph Matching for Domain Adaptive Object Detection
* Sign Language Video Retrieval with Free-Form Textual Queries
* Signature Detection, Restoration, and Verification: A Novel Chinese Document Signature Forgery Detection Benchmark
* Signing at Scale: Learning to Co-Articulate Signs for Large-Scale Photo-Realistic Sign Language Production
* Sim VQA: Exploring Simulated Environments for Visual Question Answering
* SIM-MFR: Spatial interactions mechanisms based multi-feature representation for background modeling
* SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization
* SIMBAR: Single Image-Based Scene Relighting For Effective Data Augmentation For Automated Driving Vision Tasks
* SimMatch: Semi-supervised Learning with Similarity Matching
* SimMIM: a Simple Framework for Masked Image Modeling
* Simple and Efficient Architectures for Semantic Segmentation
* Simple Band Ratio Library (BRL) Algorithm for Retrieval of Hourly Aerosol Optical Depth Using FY-4A AGRI Geostationary Satellite Data, A
* Simple but Effective: CLIP Embeddings for Embodied AI
* Simple Data Mixing Prior for Improving Self-Supervised Learning, A
* Simple Episodic Linear Probe Improves Visual Recognition in the Wild, A
* Simple Multi-dataset Detection
* Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation, A
* Simple Procedure to Preprocess and Ingest Level-2 Ocean Color Data into Google Earth Engine, A
* Simple Spectral Failure Mode for Graph Convolutional Networks, A
* Simple Visual-Textual Baseline for Pedestrian Attribute Recognition, A
* SimT: Handling Open-set Noise for Domain Adaptive Semantic Segmentation
* Simulated Adversarial Testing of Face Recognition Models
* Simulated Quantization, Real Power Savings
* Simulation Calculation of Element Number Density in the Earth's Atmosphere Based on X-ray Occultation Sounding
* Simulation Performance and Case Study of Extreme Events in Northwest China Using the BCC-CSM2 Model
* SimVP: Simpler yet Better Video Prediction
* Single image reflection removal through multi-scale gradient refinement
* Single image super-resolution based on directional variance attention network
* Single image super-resolution based on progressive fusion of orientation-aware features
* Single-Domain Generalized Object Detection in Urban Scene via Cyclic-Disentangled Self-Distillation
* Single-Photon Structured Light
* Single-Shot End-to-end Road Graph Extraction
* Single-Stage 3D Geometry-Preserving Depth Estimation Model Training on Dataset Mixtures with Uncalibrated Stereo Data
* Single-Stage is Enough: Multi-Person Absolute 3D Pose Estimation
* SIOD: Single Instance Annotated Per Category Per Image for Object Detection
* SISL:Self-Supervised Image Signature Learning for Splicing Detection & Localization
* Sketch3T: Test-Time Training for Zero-Shot SBIR
* SketchEdit: Mask-Free Local Image Manipulation with Partial Sketches
* Sketching without Worrying: Noise-Tolerant Sketch-Based Image Retrieval
* Skills to Drive: Successor Features for Autonomous Highway Pilot
* SkinningNet: Two-Stream Graph Convolutional Neural Network for Skinning Prediction of Synthetic Characters
* SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos
* Slimmable Domain Adaptation
* Slimmable Video Codec
* Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation
* Smartadapt: Multi-branch Object Detection Framework for Videos on Mobiles
* SmartPortraits: Depth Powered Handheld Smartphone Dataset of Human Portraits for State Estimation, Reconstruction and Synthesis
* SMDAF: A novel keypoint based method for copy-move forgery detection
* SMM-Conv: Scalar Matrix Multiplication with Zero Packing for Accelerated Convolution
* Smooth Maximum Unit: Smooth Activation Function for Deep Networks using Smoothing Maximum Technique
* Smooth-Swap: A Simple Enhancement for Face-Swapping with Smoothness
* SMPL-A: Modeling Person-Specific Deformable Anatomy
* Snowvision: Segmenting, Identifying, and Discovering Stamped Curve Patterns from Fragments of Pottery
* SNR-Aware Low-light Image Enhancement
* SNUG: Self-Supervised Neural Dynamic Garments
* SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in Soccer Videos
* SoccerTrack: A Dataset and Tracking Algorithm for Soccer with Fish-eye and Drone Videos
* Soft-ranked Index Fusion Framework with Saliency Weighting for Image Quality Assessment, A
* SoftCollage: A Differentiable Probabilistic Tree Generator for Image Collage
* SoftGroup for 3D Instance Segmentation on Point Clouds
* SOLO: A Simple Framework for Instance Segmentation
* SOMSI: Spherical Novel View Synthesis with Soft Occlusion Multi-Sphere Images
* Sound and Visual Representation Learning with Multiple Pretraining Tasks
* Sound-Guided Semantic Image Manipulation
* Source Data-Absent Unsupervised Domain Adaptation Through Hypothesis Transfer and Labeling Transfer
* Source-Free Domain Adaptation via Distribution Estimation
* Source-Free Object Detection by Learning to Overlook Domain Style
* Source-Free Open Compound Domain Adaptation in Semantic Segmentation
* Source-Independent Waveform Inversion Method for Ground Penetrating Radar Based on Envelope Objective Function
* SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Color Editing
* SpaceNet 8: The Detection of Flooded Roads and Buildings
* Spacing Loss for Discovering Novel Categories
* SPAct: Self-supervised Privacy Preservation for Action Recognition
* SPAMs: Structured Implicit Parametric Models
* Sparse and Complete Latent Organization for Geospatial Semantic Segmentation
* Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion
* Sparse Instance Activation for Real-Time Instance Segmentation
* Sparse Local Patch Transformer for Robust Face Alignment and Landmarks Inherent Relation Learning
* Sparse Non-local CRF
* Sparse Object-level Supervision for Instance Segmentation with Pixel Embeddings
* Sparse point-voxel aggregation network for efficient point cloud semantic segmentation
* Sparse to Dense Dynamic 3D Facial Expression Generation
* Sparsity-Based Two-Dimensional DOA Estimation for Co-Prime Planar Array via Enhanced Matrix Completion
* SPAS: Smart Pothole-Avoidance Strategy for Autonomous Vehicles
* Spatial Commonsense Graph for Object Localisation in Partial Scenes
* Spatial Patterns of Errors in GPM IMERG Summer Precipitation Estimates and Their Connections to Geographical Features in Complex Topographical Area
* Spatial Relationship and Evolution of World Cultural Heritage Sites and Neighbouring Towns, The
* Spatial Representativeness of Eddy Covariance Measurements in a Coniferous Plantation Mixed with Cropland in Southeastern China
* Spatial Sampling and Grouping Information Entropy Strategy Based on Kernel Fuzzy C-Means Clustering Method for Hyperspectral Band Selection
* Spatial-frequency HEVC multiple description video coding with adaptive perceptual redundancy allocation
* Spatial-Temporal Based Multihead Self-Attention for Remote Sensing Image Change Detection
* Spatial-Temporal Parallel Transformer for Arm-Hand Dynamic Estimation
* Spatial-Temporal Space Hand-in-Hand: Spatial-Temporal Video Super-Resolution via Cycle-Projected Mutual Learning
* Spatially gap free analysis of aerosol type grids in China: First retrieval via satellite remote sensing and big data analytics
* Spatially-Adaptive Multilayer Selection for GAN Inversion and Editing
* Spatio-temporal convolutional emotional attention network for spotting macro- and micro-expression intervals in long video sequences
* Spatio-Temporal Dynamics and Driving Forces of Multi-Scale CO2 Emissions by Integrating DMSP-OLS and NPP-VIIRS Data: A Case Study in Beijing-Tianjin-Hebei, China
* Spatio-Temporal Feature Encoding for Traffic Accident Detection in VANET Environment
* Spatio-Temporal Gating-Adjacency GCN for Human Motion Prediction
* Spatio-temporal Relation Modeling for Few-shot Action Recognition
* Spatiotemporal Prediction of Monthly Sea Subsurface Temperature Fields Using a 3D U-Net-Based Model
* Spatiotemporal Variation in Vegetation Growth Status and Its Response to Climate in the Three-River Headwaters Region, China
* Spectral Unsupervised Domain Adaptation for Visual Recognition
* Speech Driven Tongue Animation
* Speed up Object Detection on Gigapixel-level Images with Patch Arrangement
* SPHARM-Net: Spherical Harmonics-Based Convolution for Cortical Parcellation
* SphereSR: 360° Image Super-Resolution with Arbitrary Projection via Continuous Spherical Image Representation
* SphericGAN: Semi-Supervised Hyper-Spherical Generative Adversarial Networks for Fine-Grained Image Synthesis
* SpiderNet: Hybrid Differentiable-Evolutionary Architecture Search via Train-Free Metrics
* Spiking Transformers for Event-based Single Object Tracking
* SPIN: Simplifying Polar Invariance for Neural networks Application to vision-based irradiance forecasting
* Splicing ViT Features for Semantic Appearance Transfer
* Split Hierarchical Variational Compression
* SplitNets: Designing Neural Architectures for Efficient Distributed Computing on Head-Mounted Systems
* Sports Field Registration via Keypoints-aware Label Condition
* SqueezeNeRF: Further factorized FastNeRF for memory-efficient inference
* SS3D: Sparsely-Supervised 3D Object Detection from Point Cloud
* SSR-GNNs: Stroke-based Sketch Representation with Graph Neural Networks
* ST++: Make Self-trainingWork Better for Semi-supervised Semantic Segmentation
* ST-InNet: Deep Spatio-Temporal Inception Networks for Traffic Flow Prediction in Smart Cities
* ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation
* Stability-driven Contact Reconstruction From Monocular Color Images
* Stable Lightweight and Adaptive Feature Enhanced Convolution Neural Network for Efficient Railway Transit Object Detection, A
* Stable Long-Term Recurrent Video Super-Resolution
* Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation
* Stand Structural Characteristics Derived from Combined TLS and Landsat Data Support Predictions of Mushroom Yields in Mediterranean Forest
* Stand-Alone Inter-Frame Attention in Video Models
* Stargazer: A Transformer-based Driver Action Detection System for Intelligent Transportation
* State Estimation and Motion Prediction of Vehicles and Vulnerable Road Users for Cooperative Autonomous Driving: A Survey
* Statistical Analysis of the Spatiotemporal Distribution of Lower Atmospheric Ducts over the Seas Adjacent to China, Based on the ECMWF Reanalysis Dataset
* Statistical Characteristics of Warm Season Raindrop Size Distribution in the Beibu Gulf, South China
* STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes
* Stepwise Domain Adaptation (SDA) for Object Detection in Autonomous Vehicles Using an Adaptive CenterNet
* Stereo Depth from Events Cameras: Concentrate and Focus on the Future
* Stereo Magnification with Multi-Layer Images
* Stereoscopic Universal Perturbations across Different Architectures and Datasets
* STGM: Vehicle Trajectory Prediction Based on Generative Model for Spatial-Temporal Features
* Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved Neural Network Calibration, A
* Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models
* Stochastic Optimal Sizing of Plug-in Electric Vehicle Parking Lots in Reconfigurable Power Distribution Systems
* Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion
* Stochastic Variance Reduced Ensemble Adversarial Attack for Boosting the Adversarial Transferability
* Strain Detection based on Breath and Motion Features Obtained by a Force Sensor for Smart Toilet Systems
* Stratified Transformer for 3D Point Cloud Segmentation
* Strengthening the Transferability of Adversarial Examples Using Advanced Looking Ahead and Self-CutMix
* String Stable and Collision-Safe Model Predictive Platoon Control
* STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction
* Structural and Statistical Texture Knowledge Distillation for Semantic Segmentation
* Structure Tensor-Based Infrared Small Target Detection Method for a Double Linear Array Detector
* Structure-Aware Flow Generation for Human Body Reshaping
* Structure-Aware Motion Transfer with Deformable Anchor Model
* Structure-Preserving Image Super-Resolution
* Structured Cooperative Reinforcement Learning With Time-Varying Composite Action Space
* Structured Dictionary Perspective on Implicit Neural Representations, A
* Structured Local Radiance Fields for Human Avatar Modeling
* Structured Sparse R-CNN for Direct Scene Graph Generation
* Study of Subjective and Objective Quality Assessment of Night-Time Videos
* study on the distribution of social biases in self-supervised learning visual models, A
* Study on the local and global performance of homogeneous platoon with a bidirectional ring interconnection
* Style Neophile: Constantly Seeking Novel Styles for Domain Generalization
* Style Transformer for Image Inversion and Editing
* Style-aware Discriminator for Controllable Image Translation, A
* Style-Based Global Appearance Flow for Virtual Try-On
* Style-ERD: Responsive and Coherent Online Motion Style Transfer
* Style-Structure Disentangled Features and Normalizing Flows for Diverse Icon Colorization
* Styleformer: Transformer based Generative Adversarial Networks with Style Vector
* StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2
* StyleMesh: Style Transfer for Indoor 3D Scene Reconstructions
* StyleSDF: High-Resolution 3D-Consistent Image and Geometry Generation
* StyleSwin: Transformer-Based GAN for High-Resolution Image Generation
* StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis
* StylizedNeRF: Consistent 3D Scene Stylization as Stylized NeRF via 2D-3D Mutual Learning
* StyTr2: Image Style Transfer with Transformers
* Sub-word Level Lip Reading With Visual Attention
* Subspace Adversarial Training
* SubTTD: DOA Estimation via Sub-Nyquist Tensor Train Decomposition
* Sugarcane Biomass Prediction with Multi-Mode Remote Sensing Data Using Deep Archetypal Analysis and Integrated Learning
* Super-Fibonacci Spirals: Fast, Low-Discrepancy Sampling of SO(3)
* Super-Resolution based Video Coding Scheme
* Surface Reconstruction from Point Clouds by Learning Predictive Context Priors
* Surface Representation for Point Clouds
* Surface Subsidence Monitoring Induced by Underground Coal Mining by Combining DInSAR and UAV Photogrammetry
* Surface-Aligned Neural Radiance Fields for Controllable 3D Human Synthesis
* SurfEmb: Dense and Continuous Correspondence Distributions for Object Pose Estimation with Learnt Surface Embeddings
* Surpassing the Human Accuracy: Detecting Gallbladder Cancer from USG Images with Curriculum Learning
* SurRF: Unsupervised Multi-View Stereopsis by Learning Surface Radiance Field
* Survey and Evaluation of Neural 3D Shape Classification Approaches
* Survey of GNSS Spoofing and Anti-Spoofing Technology, A
* survey on bias in visual datasets, A
* survey on methods, datasets and implementations for scene text spotting, A
* SVIP: Sequence VerIfication for Procedures in Videos
* SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering
* SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization
* Swin Transformer V2: Scaling Up Capacity and Resolution
* SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning
* SwiniPASSR: Swin Transformer based Parallax Attention Network for Stereo Image Super-Resolution
* SwinIQA: Learned Swin Distance for Compressed Image Quality Assessment
* SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition
* Sylph: A Hypernetwork Framework for Incremental Few-shot Object Detection
* SymDNN: Simple & Effective Adversarial Robustness for Embedded Systems
* Symmetric Network with Spatial Relationship Modeling for Natural Language-based Vehicle Retrieval
* Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation
* Symmetry-aware Neural Architecture for Embodied Visual Exploration
* Syntax-Aware Network for Handwritten Mathematical Expression Recognition
* Synthetic Aperture Imaging with Events and Frames
* Synthetic Generation of Face Videos with Plethysmograph Physiology
* Systemic distortion analysis with deep distortion directed image quality assessment models
* T-BFA: Targeted Bit-Flip Adversarial Weight Attack
* TableFormer: Table Structure Understanding with Transformers
* Talking Face Generation with Multilingual TTS
* TARDet: Two-stage Anchor-free Rotating Object Detector in Aerial Images
* Target Detection Method of UAV Aerial Imagery Based on Improved YOLOv5
* Target Oriented Perceptual Adversarial Fusion Network for Underwater Image Enhancement
* Target-aware and spatial-spectral discriminant feature joint correlation filters for hyperspectral video object tracking
* Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection
* Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection
* Targeted Supervised Contrastive Learning for Long-Tailed Recognition
* Task Adaptive Parameter Sharing for Multi-Task Learning
* Task Decoupled Framework for Reference-based Super-Resolution
* Task Discrepancy Maximization for Fine-grained Few-Shot Classification
* Task-Adaptive Negative Envision for Few-Shot Open-Set Recognition
* Task-specific Inconsistency Alignment for Domain Adaptive Object Detection
* Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data
* TC-Net: Detecting Noisy Labels Via Transform Consistency
* TCTrack: Temporal Contexts for Aerial Tracking
* TDT: Teaching Detectors to Track without Fully Annotated Videos
* TeachAugment: Data Augmentation Optimization Using Teacher Knowledge
* Technical Challenges for Multi-Temporal and Multi-Sensor Image Processing Surveyed by UAV for Mapping and Monitoring in Precision Agriculture
* TelecomNet: Tag-Based Weakly-Supervised Modally Cooperative Hashing Network for Image Retrieval
* Temperature/Emissivity Separation of Typical Grassland of Northwestern China Based on Hyper-CAM and Its Potential for Grassland Drought Monitoring
* Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions
* Temporal Alignment Networks for Long-term Video
* Temporal Analysis of Ground Movement at a Metal Mine in China
* Temporal and Spatial Changes and GLOF Susceptibility Assessment of Glacial Lakes in Nepal from 2000 to 2020
* Temporal Complementarity-Guided Reinforcement Learning for Image-to-Video Person Re-Identification
* Temporal Context Matters: Enhancing Single Image Prediction with Disease Progression Representations
* Temporal Driver Action Localization using Action Classification Methods
* Temporal Feature Alignment and Mutual Information Maximization for Video-Based Human Pose Estimation
* Temporal sparse adversarial attack on sequence-based gait recognition
* Temporal Weighting Appearance-Aligned Network for Nighttime Video Retrieval
* Temporally Efficient Vision Transformer for Video Instance Segmentation
* TemporalUV: Capturing Loose Clothing with Temporally Coherent UV Coordinates
* Tencent-MVSE: A Large-Scale Benchmark Dataset for Multi-Modal Video Similarity Evaluation
* Test of Determining Geopotential Difference between Two Sites at Wuhan Based on Optical Clocks' Frequency Comparisons
* Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution, A
* Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding
* Text Spotting Transformers
* Text to Image Generation with Semantic-Spatial Aware GAN
* Text-to-Image Synthesis based on Object-Guided Joint-Decoding Transformer
* Text2Mesh: Text-Driven Neural Stylization for Meshes
* Text2Pos: Text-to-Point-Cloud Cross-Modal Localization
* Texture-based Error Analysis for Image Super-Resolution
* Thermal Image Super-Resolution Challenge Results - PBVS 2022
* Thin-Plate Spline Motion Model for Image Animation
* Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
* Think Twice Before Detecting GAN-generated Fake Images from their Spectral Domain Imprints
* Thinking Inside Uncertainty: Interest Moment Perception for Diverse Temporal Grounding
* Three Rivers Source Region Alpine Grassland Ecosystem Was a Weak Carbon Sink Based on BEPS Model Analysis, The
* Three Stream Graph Attention Network using Dynamic Patch Selection for the classification of micro-expressions
* Three-Dimensional Deep-Tissue Functional and Molecular Imaging by Integrated Photoacoustic, Ultrasound, and Angiographic Tomography (PAUSAT)
* Threshold Matters in WSSS: Manipulating the Activation for the Robust and Accurate Segmentation Model Against Thresholds
* Throughput Maximization for RIS-UAV Relaying Communications
* TikTok for good: Creating a diverse emotion expression database
* Time Lens++: Event-based Frame Interpolation with Parametric Nonlinear Flow and Multi-scale Fusion
* Time Series Analysis-Based Long-Term Onboard Radiometric Calibration Coefficient Correction and Validation for the HY-1C Satellite Calibration Spectrometer
* Time-Continuous Audiovisual Fusion with Recurrence vs Attention for In-The-Wild Affect Recognition
* Time-Multiplexed Coded Aperture and Coded Focal Stack: Comparative Study on Snapshot Compressive Light Field Imaging
* Time3D: End-to-End Joint Monocular 3D Object Detection and Tracking for Autonomous Driving
* Timely and Low-Cost Remote Sensing Practices for the Assessment of Landslide Activity in the Service of Hazard Management
* TimeReplayer: Unlocking the Potential of Event Cameras for Video Interpolation
* TinyOps: ImageNet Scale Deep Learning on Microcontrollers
* TMVNet: Using Transformers for Multi-view Voxel-based 3D Reconstruction
* TO-FLOW: Efficient Continuous Normalizing Flows with Temporal Optimization adjoint with Moving Speed
* tooth surface design method combining semantic guidance, confidence, and structural coherence, A
* TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
* Topographic Correction of the SELENE MI Images with the LOLA DEM around Shackleton Crater
* Topologically-Aware Deformation Fields for Single-View 3D Reconstruction
* Topology and Language of Relationships in the Visual Genome Dataset, The
* Topology Preserving Local Road Network Estimation from Single Onboard Camera Image
* Topology-Preserving Shape Reconstruction and Registration via Neural Diffeomorphic Flow
* TorMentor: Deterministic dynamic-path, data augmentations with fractals
* Total Variation Optimization Layers for Computer Vision
* Toward Fast, Flexible, and Robust Low-Light Image Enhancement
* Toward Practical Monocular Indoor Depth Estimation
* Toward Real-World Super-Resolution via Adaptive Downsampling Models
* Towards a Deeper Understanding of Skeleton-based Gait Recognition
* Towards a Unified Quadrature Framework for Large-Scale Kernel Machines
* Towards Accurate Facial Landmark Detection via Cascaded Transformers
* Towards An End-to-End Framework for Flow-Guided Video Inpainting
* Towards assessing agricultural land suitability with causal machine learning
* Towards Better Plasticity-Stability Trade-off in Incremental Learning: A Simple Linear Connector
* Towards Better Understanding Attribution Methods
* Towards Bidirectional Arbitrary Image Rescaling: Joint Optimization and Cycle Idempotence
* Towards Comprehensive Testing on the Robustness of Cooperative Multi-agent Reinforcement Learning
* Towards Data-Free Model Stealing in a Hard Label Setting
* Towards Detailed Characteristic-Preserving Virtual Try-On
* Towards Discovering the Effectiveness of Moderately Confident Samples for Semi-Supervised Learning
* Towards Discriminative Representation: Multi-view Trajectory Contrastive Learning for Online Multi-object Tracking
* Towards Diverse and Natural Scene-aware 3D Human Motion Synthesis
* Towards Driving-Oriented Metric for Lane Detection Models
* Towards Efficient and Scalable Sharpness-Aware Minimization
* Towards Efficient Data Free Blackbox Adversarial Attack
* Towards efficient feature sharing in MIMO architectures
* Towards End-to-End Unified Scene Text Detection and Layout Analysis
* Towards Exemplar-Free Continual Learning in Vision Transformers: an Account of Attention, Functional and Weight Regularization
* Towards Explaining Image-Based Distribution Shifts
* Towards Fewer Annotations: Active Learning via Region Impurity and Prediction Uncertainty for Domain Adaptive Semantic Segmentation
* Towards General Purpose Vision Systems: An End-to-End Task-Agnostic Vision-Language Architecture
* Towards Implicit Text-Guided 3D Shape Generation
* Towards Knowledge-Aware Video Captioning via Transitive Visual Relationship Detection
* Towards Language-Free Training for Text-to-Image Generation
* Towards Layer-wise Image Vectorization
* Towards Low-Cost and Efficient Malaria Detection
* Towards Multi-domain Single Image Dehazing via Test-time Training
* Towards Multimodal Depth Estimation from Light Fields
* Towards Noiseless Object Contours for Weakly Supervised Semantic Segmentation
* Towards Open-Set Object Detection and Discovery
* Towards Practical Certifiable Patch Defense with Vision Transformer
* Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks
* Towards Principled Disentanglement for Domain Generalization
* Towards prior gap and representation gap for long-tailed recognition
* Towards Real-Time Monocular Depth Estimation for Robotics: A Survey
* Towards real-world navigation with deep differentiable planners
* Towards Real-world Shadow Removal with a Shadow Simulation Method and a Two-stage Framework
* Towards Robust Adaptive Object Detection under Noisy Annotations
* Towards Robust and Adaptive Motion Forecasting: A Causal Representation Perspective
* Towards Robust and Reproducible Active Learning using Neural Networks
* Towards Robust Rain Removal Against Adversarial Attacks: A Comprehensive Benchmark Analysis and Beyond
* Towards Robust Semantic Segmentation of Accident Scenes via Multi-Source Mixed Sampling and Meta-Learning
* Towards Robust Vision Transformer
* Towards Semi-Supervised Deep Facial Expression Recognition with An Adaptive Confidence Margin
* Towards spatially continuous mapping of soil organic carbon in croplands using multitemporal Sentinel-2 remote sensing
* Towards Total Recall in Industrial Anomaly Detection
* Towards Understanding Adversarial Robustness of Optical Flow Networks
* Towards understanding the character of quality sampling in deep learning face recognition
* Towards Unsupervised Domain Generalization
* Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer
* Tracked-Vehicle Retrieval by Natural Language Descriptions With Domain Adaptive Knowledge
* TrackFormer: Multi-Object Tracking with Transformers
* Tracking Dependent Extended Targets Using Multi-Output Spatiotemporal Gaussian Processes
* Tracking People by Predicting 3D Appearance, Location and Pose
* Traffic Control in a Mixed Autonomy Scenario at Urban Intersections: An Optimal Control Approach
* Traffic Data Recovery From Corrupted and Incomplete Observations via Spatial-Temporal TRPCA
* Traffic Signal Self-Organizing Control With Road Capacity Constraints
* Traffic-GGNN: Predicting Traffic Flow via Attentional Spatial-Temporal Gated Graph Neural Networks
* Traffic-Informed Multi-Camera Sensing (TIMS) System Based on Vehicle Re-Identification
* Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
* Training High-Performance Low-Latency Spiking Neural Networks by Differentiation on Spike Representation
* Training Object Detectors from Scratch: An Empirical Study in the Era of Vision Transformer
* Training Quantised Neural Networks with STE Variants: The Additive Noise Annealing Algorithm
* Training-free Transformer Architecture Search
* Trajectory Jerking Suppression for Mixed Traffic Flow at a Signalized Intersection: A Trajectory Prediction Based Deep Reinforcement Learning Method
* Trajectory Optimization for High-Speed Trains via a Mixed Integer Linear Programming Approach
* Trajectory Optimization for Physics-Based Reconstruction of 3d Human Pose from Monocular Video
* Trajectory Planning for an Autonomous Vehicle in Spatially Constrained Environments
* Trans4Trans: Efficient Transformer for Transparent Object and Semantic Scene Segmentation in Real-World Navigation Assistance
* TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing
* Transfer Learning from Synthetic In-vitro Soybean Pods Dataset for In-situ Segmentation of On-branch Soybean Pods
* Transferability analysis of adversarial attacks on gender classification to face recognition: Fixed and variable attack perturbation
* Transferability Estimation using Bhattacharyya Class Separability
* Transferability Metrics for Selecting Source Model Ensembles
* Transferring Unconditional to Conditional GANs with Hyper-Modulation
* Transform-Retrieve-Generate: Natural Language-Centric Outside-Knowledge Visual Question Answering
* Transformaly: Two (Feature Spaces) Are Better Than One
* TransforMatcher: Match-to-Match Attention for Semantic Correspondence
* Transformed domain convolutional neural network for Alzheimer's disease diagnosis using structural MRI
* Transformer Based Line Segment Classifier with Image Context for Real-Time Vanishing Point Detection in Manhattan World
* Transformer Decoders with Multi-Modal Regularization for Cross-Modal Food Retrieval
* Transformer for Single Image Super-Resolution
* Transformer Tracking with Cyclic Shifting Window Attention
* Transformer-based Multimodal Information Fusion for Facial Expression Analysis
* Transformer-empowered Multi-scale Contextual Matching and Aggregation for Multi-contrast MRI Super-resolution
* Transforming Model Prediction for Tracking
* Transforming Temporal Embeddings to Keypoint Heatmaps for Detection of Tiny Vehicles in Wide Area Motion Imagery (WAMI) Sequences
* TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers
* TransGeo: Transformer Is All You Need for Cross-view Image Geo-localization
* Transmission Control Over Satellite Network for Marine Environmental Monitoring System
* TransMix: Attend to Mix for Vision Transformers
* TransMVSNet: Global Context-Aware Multi-View Stereo Network with Transformers
* TransRAC: Encoding Multi-scale Temporal Correlation with Transformers for Repetitive Action Counting
* TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognition
* TransVPR: Transformer-Based Place Recognition with Multi-Level Attention Aggregation
* TransWeather: Transformer-based Restoration of Images Degraded by Adverse Weather Conditions
* Tree Detection and Species Classification in a Mixed Species Forest Using Unoccupied Aircraft System (UAS) RGB and Multispectral Imagery
* Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation
* TripletTrack: 3D Object Tracking using Triplet Embeddings and LSTM
* Tropospheric Second-Order Horizontal Gradient Modeling for GNSS PPP
* True Black-Box Explanation in Facial Analysis
* Trust Your IMU: Consequences of Ignoring the IMU Drift
* Trustworthy Long-Tailed Classification
* TubeDETR: Spatio-Temporal Video Grounding with Transformers
* TubeFormer-DeepLab: Video Mask Transformer
* TubeR: Tubelet Transformer for Video Action Detection
* Tunnel Facility Based Vehicle Localization in Highway Tunnel Using 3D LIDAR
* TV regularisation sparse light field reconstruction model based on guided-filtering, A
* TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing
* TWIST: Two-Way Inter-label Self-Training for Semi-supervised 3D Instance Segmentation
* Two Coupled Rejection Metrics Can Tell Adversarial Examples Apart
* Two Dimensions of Worst-case Training and Their Integrated Effect for Out-of-domain Generalization, The
* Two-stage partial image-text clustering (TPIT-C)
* Two-Stage Shake-Shake Network for Long-Tailed Recognition of SAR Aerial View Objects, A
* Two-Stage Supervised Discrete Hashing for Cross-Modal Retrieval
* Two-Step Machine Learning Approach for Crop Disease Detection Using GAN and UAV Technology, A
* Two-Step Model for Predicting Travel Demand in Expanding Subways, A
* UAV Remote Sensing Prediction Method of Winter Wheat Yield Based on the Fused Features of Crop and Soil
* UAVformer: A Composite Transformer Network for Urban Scene Segmentation of UAV Images
* UBnormal: New Benchmark for Supervised Open-Set Video Anomaly Detection
* UBoCo: Unsupervised Boundary Contrastive Learning for Generic Event Boundary Detection
* UCC: Uncertainty guided Cross-head Cotraining for Semi-Supervised Semantic Segmentation
* UDA-COPE: Unsupervised Domain Adaptation for Category-level Object Pose Estimation
* Uformer: A General U-Shaped Transformer for Image Restoration
* UIGR: Unified Interactive Garment Retrieval
* UKPGAN: A General Self-Supervised Keypoint Detector
* UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection
* Unbiased Area Estimation Using Copernicus High Resolution Layers and Reference Data
* Unbiased Subclass Regularization for Semi-Supervised Semantic Segmentation
* Unbiased Teacher v2: Semi-supervised Object Detection for Anchor-free and Anchor-based Detectors
* Uncertainty-Aware Adaptation for Self-Supervised 3D Human Pose Estimation
* Uncertainty-Aware and Multigranularity Consistent Constrained Model for Semi-Supervised Hashing
* Uncertainty-Aware Deep Multi-View Photometric Stereo
* Uncertainty-Guided Probabilistic Transformer for Complex Action Recognition
* Underground Cavity Detection through Group Dispersion of a GPR Signal
* Understanding 3D Object Articulation in Internet Videos
* Understanding and Increasing Efficiency of Frank-Wolfe Adversarial Training
* Understanding Intensity-Duration-Frequency (IDF) Curves Using IMERG Sub-Hourly Precipitation against Dense Gauge Networks
* Understanding the Role of Weather Data for Earth Surface Forecasting using a ConvLSTM-based Model
* Understanding Uncertainty Maps in Vision with Statistical Testing
* Underwater Image Enhancement With Lightweight Cascaded Network
* Underwater Light Field Retention: Neural Rendering for Underwater Imaging
* Undoing the Damage of Label Shift for Cross-domain Semantic Segmentation
* Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks
* Uni6D: A Unified CNN Framework without Projection Breakdown for 6D Pose Estimation
* UNICON: Combating Label Noise Through Uniform Selection and Contrastive Learning
* UniCoRN: A Unified Conditional Image Repainting Network
* Unified Contrastive Learning in Image-Text-Label Space
* Unified Framework for Implicit Sinkhorn Differentiation, A
* Unified Model for Line Projections in Catadioptric Cameras with Rotationally Symmetric Mirrors, A
* unified model for the sparse optimal scoring problem, A
* Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression
* Unified Query-based Paradigm for Point Cloud Understanding, A
* Unified Transformer Tracker for Object Tracking
* Uniform Priors for Data-Efficient Learning
* Uniform Subdivision of Omnidirectional Camera Space for Efficient Spherical Stereo Matching
* Unifying Deep ConvNet and Semantic Edge Features for Loop Closure Detection
* Unifying Motion Deblurring and Frame Interpolation with Events
* Unifying Panoptic Segmentation for Autonomous Driving
* Unimodal-Concentrated Loss: Fully Adaptive Label Distribution Learning for Ordinal Regression
* UNIST: Unpaired Neural Implicit Shape Translation Network
* Universal Framework of Spatiotemporal Bias Block for Long-Term Traffic Forecasting, A
* Universal Landslide Detection Method in Optical Remote Sensing Images Based on Improved YOLOX, A
* Universal Photometric Stereo Network using Global Lighting Contexts
* Universal Real-Time Adaptive Signal Compression for High-Frame-Rate Optoacoustic Tomography
* UniVIP: A Unified Framework for Self-Supervised Visual Pre-training
* Unknown-Aware Object Detection: Learning What You Don't Know from Videos in the Wild
* Unleashing Potential of Unsupervised Pre-Training with Intra-Identity Regularization for Person Re-Identification
* Unmanned Ground Vehicle Platooning Under Cyber Attacks: A Human-Robot Interaction Framework
* Unmanned-Aerial-Vehicle Routing Problem With Mobile Charging Stations for Assisting Search and Rescue Missions in Postdisaster Scenarios
* Unpaired Cartoon Image Synthesis via Gated Cycle Mapping
* Unpaired Deep Image Deraining Using Dual Contrastive Learning
* Unpaired Face Restoration via Learnable Cross-Quality Shift
* Unpaired Faces to Cartoons: Improving XGAN
* Unpaired Real-World Super-Resolution with Pseudo Controllable Restoration
* Unreasonable Effectiveness of CLIP Features for Image Captioning: An Experimental Analysis, The
* Unrolled Network for Light Field Display
* Unscented Kalman Filter With General Complex-Valued Signals
* Unseen Classes at a Later Time? No Problem
* Unstructured Object Matching using Co-Salient Region Segmentation
* Unsupervised Action Segmentation by Joint Representation Learning and Online Clustering
* Unsupervised Anomaly Detection from Time-of-Flight Depth Images
* Unsupervised Change Detection Based on Image Reconstruction Loss
* Unsupervised Continual Learning for Gradually Varying Domains
* Unsupervised Decomposition and Correction Network for Low-Light Image Enhancement
* Unsupervised Deraining: Where Contrastive Learning Meets Self-similarity
* Unsupervised domain adaptation and super resolution on drone images for autonomous dry herbage biomass estimation
* Unsupervised Domain Adaptation for Cardiac Segmentation: Towards Structure Mutual Information Maximization
* Unsupervised Domain Adaptation for Nighttime Aerial Tracking
* Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation with Transformer
* Unsupervised Domain Generalization by Learning a Bridge Across Domains
* unsupervised fusion network for boosting denoising performance, An
* Unsupervised Hierarchical Semantic Segmentation with Multiview Cosegmentation and Clustering Transformers
* Unsupervised Homography Estimation with Coplanarity-Aware GAN
* Unsupervised Image-to-Image Translation with Generative Prior
* Unsupervised Learning Approach for Road Anomaly Segmentation Using RGB-D Sensor for Advanced Driver Assistance System, An
* Unsupervised Learning of Accurate Siamese Tracking
* Unsupervised Learning of Debiased Representations with Pseudo-Attributes
* Unsupervised Learning of Depth Estimation and Camera Pose With Multi-Scale GANs
* Unsupervised Multi-View Gaze Representation Learning
* Unsupervised Pre-training for Temporal Action Localization Tasks
* Unsupervised Representation Learning for Binary Networks by Joint Classifier Learning
* Unsupervised Salient Object Detection with Spectral Cluster Voting
* Unsupervised Sparse, Nonnegative, Low Rank Dictionary Learning for Detection of Driver Cell Phone Usage
* Unsupervised Vision-and-Language Pretraining via Retrieval-based Multi-Granular Alignment
* Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships
* Unsupervised Visual Representation Learning by Online Constrained K-Means
* UnweaveNet: Unweaving Activity Stories
* Update Compression for Deep Neural Networks on the Edge
* Updating Inventory, Deformation, and Development Characteristics of Landslides in Hunza Valley, NW Karakoram, Pakistan by SBAS-InSAR
* Upright-Net: Learning Upright Orientation for 3D Point Cloud
* Urban Building Classification (UBC) - A Dataset for Individual Building Detection and Classification from Satellite Imagery
* Urban Flood Risk Assessment in Zhengzhou, China, Based on a D-Number-Improved Analytic Hierarchy Process and a Self-Organizing Map Algorithm
* Urban Radiance Fields
* URetinex-Net: Retinex-based Deep Unfolding Network for Low-light Image Enhancement
* Use All The Labels: A Hierarchical Multi-Label Contrastive Learning Framework
* Use of Remote Sensing Techniques to Estimate Plant Diversity within Ecological Networks: A Worked Example
* User-Guided Variable Rate Learned Image Compression
* Using 3D Topological Connectivity for Ghost Particle Reduction in Flow Reconstruction
* Using a Moving Antenna to Improve GNSS/INS Integration Performance Under Low-Dynamic Scenarios
* Using GEOBIA and Vegetation Indices to Assess Small Urban Green Areas in Two Climatic Regions
* Using Pure Pollen Species When Training a CNN to Segment Pollen Mixtures
* Using UAV to Identify the Optimal Vegetation Index for Yield Prediction of Oil Seed Rape (Brassica napus L.) at the Flowering Stage
* Using Vision Transformers for Spatial-Context-Aware Rain and Road Surface Condition Detection on Freeways
* UTC: A Unified Transformer with Inter-Task Contrastive Learning for Visual Dialog
* V-Doc: Visual questions answers with Documents
* V2C: Visual Voice Cloning
* Valence and Arousal Estimation based on Multimodal Temporal-Aware Features for Videos in the Wild
* VALHALLA: Visual multimodal-conditioned generation CVPR22
* Valid Inequality and Variable Fixation for Unrestricted Block Relocation Problems
* Variability of Chl a Concentration of Priority Marine Regions of the Northwest of Mexico
* Variable Few Shot Class Incremental and Open World Learning
* Variable Speed Limit Control Based on Variable Cell Transmission Model in the Connecting Traffic Environment, A
* Variational Autoencoders for Generating Hyperspectral Imaging Honey Adulteration Data
* variational Bayesian method for similarity learning in non-rigid image registration, A
* vCLIMB: A Novel Video Class Incremental Learning Benchmark
* Vector Quantized Diffusion Model for Text-to-Image Synthesis
* Vegetation Growth Status and Topographic Effects in Frozen Soil Regions on the Qinghai-Tibet Plateau
* Vehicle and Pedestrian Detection Algorithm Based on Lightweight YOLOv3-Promote and Semi-Precision Acceleration
* Vehicle trajectory prediction works, but not everywhere
* Vehicle Trajectory Reconstruction at Signalized Intersections Under Connected and Automated Vehicle Environment
* Vehicle-Consensus Information Exchange Scheme for Traffic Management in Vehicular Ad-Hoc Networks, A
* Vehicle-Road Environment Perception Under Low-Visibility Condition Based on Polarization Features via Deep Learning
* Versatile Multi-Modal Pre-Training for Human-Centric Perception
* Versatile Multi-View Framework for LiDAR-based 3D Object Detection with Guidance from Panoptic Segmentation, A
* VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution
* VG-VAE: A Venatus Geometry Point-Cloud Variational Auto-Encoder
* VGSE: Visually-Grounded Semantic Embeddings for Zero-Shot Learning
* Vicinal Counting Networks
* Video Captioning Using Global-Local Representation
* Video Demoiréing with Relation-Based Temporal Consistency
* Video Frame Interpolation Transformer
* Video Frame Interpolation with Transformer
* Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation
* Video Saliency Forecasting Transformer
* Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training
* Video Summarization Overview
* Video Swin Transformer
* Video-based Frame-level Facial Analysis of Affective Behavior on Mobile Devices using EfficientNets
* Video-based multimodal spontaneous emotion recognition using facial expressions and physiological signals
* Video-Text Representation Learning via Differentiable Weak Temporal Alignment
* VideoDG: Generalizing Temporal Relations in Videos to Novel Domains
* VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution
* ViM: Out-Of-Distribution with Virtual-Logit Matching
* Virtual Correspondence: Humans as a Cue for Extreme-View Geometry
* Virtual Elastic Objects
* Virtual Laser Scanning Approach to Assessing Impact of Geometric Inaccuracy on 3D Plant Traits
* VIsCUIT: Visual Auditor for Bias in CNN Image Classifier
* Visible-Thermal UAV Tracking: A Large-Scale Benchmark and New Baseline
* Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space
* Vision Transformer with Deformable Attention
* Vision-Enhanced and Consensus-Aware Transformer for Image Captioning
* Vision-Language Pre-Training for Boosting Scene Text Detectors
* Vision-Language Pre-Training with Triple Contrastive Learning
* VISOLO: Grid-Based Space-Time Aggregation for Efficient Online Video Instance Segmentation
* VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention
* ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval
* VISTA: Vision Transformer enhanced by U-Net and Image Colorfulness Frame Filtration for Automatic Retail Checkout
* Visual Abductive Reasoning
* Visual Acoustic Matching
* Visual Domain Bridge: A source-free domain adaptation for cross-domain few-shot learning
* Visual Goal-Directed Meta-Imitation Learning
* Visual Vibration Tomography: Estimating Interior Material Properties from Monocular Video
* VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning
* VisualHow: Multimodal Problem Solving
* ViTOL: Vision Transformer for Weakly Supervised Object Localization
* VL-ADAPTER: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
* VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers
* VMD-WT-Based Method for Extracting On-the-Fly GNSS Tide Level and Its Realization
* Volumetric Bundle Adjustment for Online Photorealistic Scene Capture
* Volumetric Fetal Flow Imaging With Magnetic Resonance Imaging
* Vox2Cortex: Fast Explicit Reconstruction of Cortical Surfaces from 3D MRI Scans with Geometric Deep Neural Networks
* Voxel Field Fusion for 3D Object Detection
* Voxel Graph CNN for Object Classification with Event Cameras, A
* Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point Clouds
* VP-CAST: Velocity and Position-Based Broadcast Suppression for VANETs
* VRDFormer: End-to-End Video Visual Relation Detection with Transformers
* VSOIQE: A Novel Viewport-Based Stitched 360° Omnidirectional Image Quality Evaluator
* WALT: Watch And Learn 2D amodal representation from Time-lapse imagery
* Wanderings of Odysseus in 3D Scenes, The
* WarpingGAN: Warping Multiple Uniform Priors for Adversarial 3D Point Cloud Generation
* Watch and Act: Dual Interacting Agents for Automatic Generation of Possession Statistics in Soccer
* Watch It Move: Unsupervised Discovery of 3D Joints for Re-Posing of Articulated Objects
* Water Quality and Water Hyacinth Monitoring with the Sentinel-2A/B Satellites in Lake Tana (Ethiopia)
* Water Quality Retrieval from ZY1-02D Hyperspectral Imagery in Urban Water Bodies and Comparison with Sentinel-2
* Water Surface Mapping from Sentinel-1 Imagery Based on Attention-UNet3+: A Case Study of Poyang Lake Region
* Wavelet Knowledge Distillation: Towards Efficient Image-to-Image Translation
* Wavelet-based multi-level generative adversarial networks for face aging
* Weakly But Deeply Supervised Occlusion-Reasoned Parametric Road Layouts
* Weakly Paired Associative Learning for Sound and Image Representations via Bimodal Associative Memory
* Weakly Supervised High-Fidelity Clothing Model Generation
* Weakly Supervised Object Localization as Domain Adaption
* Weakly Supervised Rotation-Invariant Aerial Object Detection Network
* Weakly Supervised Segmentation on Outdoor 4D point clouds with Temporal Matching and Spatial Graph Propagation
* Weakly Supervised Semantic Segmentation by Pixel-to-Prototype Contrast
* Weakly Supervised Semantic Segmentation using Out-of-Distribution Data
* Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation
* Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning
* Weakly-Supervised Action Detection Guided by Audio Narration
* Weakly-supervised Action Transition Learning for Stochastic Human Motion Prediction
* Weakly-Supervised Generation and Grounding of Visual Descriptions with Conditional Generative Models
* Weakly-supervised Metric Learning with Cross-Module Communications for the Classification of Anterior Chamber Angle Images
* Weakly-Supervised Online Action Segmentation in Multi-View Instructional Videos
* Weakly-Supervised Salient Object Detection on Light Fields
* Wearable ImageNet: Synthesizing Tileable Textures via Dataset Distillation
* WebQA: Multihop and Multimodal QA
* Weight-Dependent Gates for Network Pruning
* What do navigation agents learn about their environment?
* What Makes Transfer Learning Work for Medical Images: Feature Reuse & Other Factors
* What Matters For Meta-Learning Vision Regression Tasks?
* What Should Be Equivariant In Self-Supervised Learning
* What to look at and where: Semantic and Spatial Refined Transformer for detecting human-object interactions
* What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
* What's in your hands? 3D Reconstruction of Generic Objects in Hands
* When Does Contrastive Visual Representation Learning Work?
* When NAS Meets Trees: An Efficient Algorithm for Neural Architecture Search
* When to Prune? A Policy towards Early Structural Pruning
* Where did I leave my keys?: Episodic-Memory-Based Question Answering on Egocentric Videos
* Which images to label for few-shot medical landmark detection?
* Which Model to Transfer? Finding the Needle in the Growing Haystack
* Whose Hands are These? Hand Detection and Hand-Body Association in the Wild
* Whose Track Is It Anyway? Improving Robustness to Tracking Errors with Affinity-based Trajectory Prediction
* Why Discard if You can Recycle?: A Recycling Max Pooling Module for 3D Point Cloud Analysis
* Why Object Detectors Fail: Investigating the Influence of the Dataset
* Why They Escape: Mining Prioritized Fuzzy Decision Rule in Crowd Evacuation
* Widar3.0: Zero-Effort Cross-Domain Gesture Recognition With Wi-Fi
* WildNet: Learning Domain Generalized Semantic Segmentation from the Wild
* Wind Field Retrieval with Rain Correction from Dual-Polarized Sentinel-1 SAR Imagery Collected during Tropical Cyclones
* Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality
* Winter Wheat Lodging Area Extraction Using Deep Learning with GaoFen-2 Satellite Imagery
* WITS: Weakly-supervised individual tooth segmentation model trained on box-level labels
* Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross- Modal Denoising Networks
* X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval
* X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
* XGBoost-Based Lane Change Prediction on Time Series Data Using Feature Engineering for Autopilot Vehicles, A
* XMP-Font: Self-Supervised Cross-Modality Pre-training for Few-Shot Font Generation
* XYDeblur: Divide and Conquer for Single Image Deblurring
* XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding
* YOLO-Pose: Enhancing YOLO for Multi Person Pose Estimation Using Object Keypoint Similarity Loss
* YouMVOS: An Actor-centric Multi-shot Video Object Segmentation Dataset
* Z-Domain Entropy Adaptable Flex for Semi-supervised Action Recognition in the Dark
* ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation
* Zero Experience Required: Plug & Play Modular Transfer Learning for Semantic Visual Navigation
* Zero-Query Transfer Attacks on Context-Aware Object Detectors
* Zero-shot Learning Using Multimodal Descriptions
* Zero-Shot Text-Guided Object Generation with Dream Fields
* ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
* ZeroWaste Dataset: Towards Deformable Object Segmentation in Cluttered Scenes
* Zoom In and Out: A Mixed-scale Triplet Network for Camouflaged Object Detection
* Zoom-to-Inpaint: Image Inpainting with High-Frequency Details
* ZZ-Net: A Universal Rotation Equivariant Architecture for 2D Point Clouds
3779 for 2210

Last update:17-Dec-25 16:26:55
Use price@usc.edu for comments.