Update Dates 2203

2203
* 3D Building Reconstruction from Monocular Remote Sensing Images
* 3D Human Pose Estimation with Spatial and Temporal Transformers
* 3D Human Texture Estimation from a Single Image with Transformers
* 3D Local Convolutional Neural Networks for Gait Recognition
* 3D pose estimation and future motion prediction from 2D images
* 3D Pyramid Pooling Network for Abdominal MRI Series Classification
* 3D Shape Generation and Completion through Point-Voxel Diffusion
* 3D-CSTM: A 3D continuous spatio-temporal mapping method
* 3D-FRONT: 3D Furnished Rooms with layOuts and semaNTics
* 3DeepCT: Learning Volumetric Scattering Tomography of Clouds
* 3DIAS: 3D Shape Reconstruction with Implicit Algebraic Surfaces
* 3DStyleNet: Creating 3D Shapes with Geometric and Texture Style Variations
* 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds
* 4D Cloud Scattering Tomography
* 4D-Net for Learned Multi-Modal Alignment
* 4DComplete: Non-Rigid Motion Estimation Beyond the Observable Surface
* A-Muze-Net: Music Generation by Composing the Harmony Based on the Generated Melody
* A-SDF: Learning Disentangled Signed Distance Functions for Articulated Shape Representation
* AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network
* ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning
* Accelerating Atmospheric Turbulence Simulation via Learned Phase-to-Space Transform
* accumulation cost of relaxed fixed time accumulation mode, The
* Accurate 3D Reconstruction of Dynamic Objects by Spatial-Temporal Multiplexing and Motion-Induced Error Elimination
* Accurate Matching of Invariant Features Derived from Irregular Curves
* ACDC: The Adverse Conditions Dataset with Correspondences for Semantic Driving Scene Understanding
* ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot
* Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search
* Achieving Real-Time Path Planning in Unknown Environments Through Deep Neural Networks
* Act the Part: Learning Interaction Strategies for Articulated Object Part Discovery
* Action Transformer: A self-attention model for short-time pose-based human action recognition
* Action-Conditioned 3D Human Motion Synthesis with Transformer VAE
* Active Domain Adaptation via Clustering Uncertainty-weighted Embeddings
* Active Gradual Domain Adaptation: Dataset and Approach
* Active Learning for Deep Object Detection via Probabilistic Modeling
* Active Learning for Lane Detection: A Knowledge Distillation Approach
* Active Universal Domain Adaptation
* AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis
* AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer
* AdaConfigure: Reinforcement Learning-Based Adaptive Configuration for Video Analytics Services
* AdaFit: Rethinking Learning-based Normal Estimation on Point Clouds
* AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
* AdaPool: A Diurnal-Adaptive Fleet Management Framework Using Model-Free Deep Reinforcement Learning and Change Point Detection
* Adaptive Adversarial Network for Source-free Domain Adaptation
* Adaptive Binarization for Vehicle State Images Based on Contrast Preserving Decolorization and Major Cluster Estimation
* Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection
* Adaptive confidence thresholding for monocular depth estimation
* Adaptive Contourlet Fusion Clustering for SAR Image Change Detection
* Adaptive Convolutions with Per-pixel Dynamic Filter Atom
* Adaptive Curriculum Learning
* Adaptive Event-Triggered Platoon Control Under Unreliable Communication Links
* Adaptive Feature Fusion and Spatio-Temporal Background Modeling in KDE Framework for Object Detection and Shadow Removal
* Adaptive Feature Weights Based Double-Layer Multi-Objective Method for SAR Image Segmentation
* Adaptive Focus for Efficient Video Recognition
* Adaptive Gabor convolutional networks
* Adaptive Graph Convolution for Point Cloud Analysis
* Adaptive Hierarchical Graph Reasoning with Semantic Coherence for Video-and-Language Inference
* Adaptive Label Noise Cleaning with Meta-Supervision for Deep Face Recognition
* Adaptive region-aware feature enhancement for object detection
* Adaptive Speech Intelligibility Enhancement for Far-and-Near-end Noise Environments Based on Self-attention StarGAN
* Adaptive Square-Root Unscented Kalman Filter Phase Unwrapping with Modified Phase Gradient Estimation
* Adaptive Surface Normal Constraint for Depth Estimation
* Adaptive Surface Reconstruction with Multiscale Convolutional Kernels
* Adaptive Unfolding Total Variation Network for Low-Light Image Enhancement
* AdaSGN: Adapting Joint Number and Model Size for Efficient Skeleton-Based Action Recognition
* AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach
* Adjusting solar-induced fluorescence to nadir-viewing provides a better proxy for GPP
* Admix: Enhancing the Transferability of Adversarial Attacks
* ADNet: Leveraging Error-Bias Towards Normal Direction in Face Alignment
* ADR-MVSNet: A cascade network for 3D point cloud reconstruction with pixel occlusion
* Advances in human action, activity and gesture recognition
* Advances in Lightning Monitoring and Location Technology Research in China
* AdvDrop: Adversarial Attack to DNNs by Dropping Information
* Adversarial Attack on Deep Cross-Modal Hamming Retrieval
* Adversarial Attacks are Reversible with Natural Supervision
* Adversarial Attacks on Deepfake Detectors: A Practical Analysis
* Adversarial Attacks On Multi-Agent Communication
* Adversarial Example Detection Using Latent Neighborhood Graph
* Adversarial Graph Convolutional Network for Cross-Modal Retrieval
* Adversarial Joint-Learning Recurrent Neural Network for Incomplete Time Series Classification
* Adversarial learning and decomposition-based domain generalization for face anti-spoofing
* Adversarial Reinforcement Learning With Object-Scene Relational Graph for Video Captioning
* Adversarial Robustness for Unsupervised Domain Adaptation
* Adversarial Unsupervised Domain Adaptation with Conditional and Label Shift: Infer, Align and Iterate
* Adversarial VQA: A New Benchmark for Evaluating the Robustness of VQA Models
* AdvRush: Searching for Adversarially Robust Neural Architectures
* AESOP: Abstract Encoding of Stories, Objects, and Pictures
* AFE-RCNN: Adaptive Feature Enhancement RCNN for 3D Object Detection
* Affect Estimation in 3D Space Using Multi-Task Active Learning for Regression
* Affect in Multimedia: Benchmarking Violent Scenes Detection
* Affective Audio Annotation of Public Speeches with Convolutional Clustering Neural Network
* Affective Impression: Sentiment-Awareness POI Suggestion via Embedding in Heterogeneous LBSNs
* Affinity Propagation Based on Structural Similarity Index and Local Outlier Factor for Hyperspectral Image Clustering
* Age of Information Aware UAV Deployment for Intelligent Transportation Systems
* AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent Forecasting
* Aggregation with Feature Detection
* AGKD-BML: Defense Against Adversarial Attack by Attention Guided Knowledge Distillation and Bi-directional Metric Learning
* Aha! Adaptive History-driven Attack for Decision-based Black-box Models
* AI Choreographer: Music Conditioned 3D Dance Generation with AIST++
* AI for the Media Industry: Application Potential and Automation Levels
* AINet: Association Implantation for Superpixel Segmentation
* Airbert: In-Domain Pretraining for Vision-and-Language Navigation
* Airborne HySpex Hyperspectral Versus Multitemporal Sentinel-2 Images for Mountain Plant Communities Mapping
* Airborne imaging spectroscopy for assessing land-use effect on soil quality in drylands
* Airborne Validation of ICESat-2 ATLAS Data over Crevassed Surfaces and Other Complex Glacial Environments: Results from Experiments of Laser Altimeter and Kinematic GPS Data Collection from a Helicopter over a Surging Arctic Glacier (Negribreen, Svalbard)
* ALADIN: All Layer Adaptive Instance Normalization for Fine-grained Style Similarity
* Alien Pulse Rejection in Concurrent Firing LIDAR
* Aligning Latent and Image Spaces to Connect the Unconnectable
* Aligning Subtitles in Sign Language Videos
* ALiSa: Acrostic Linguistic Steganography Based on BERT and Gibbs Sampling
* ALL Snow Removed: Single Image Desnowing Algorithm Using Hierarchical Dual-tree Complex Wavelet Representation and Contradict Channel Loss
* All-in-One: Emotion, Sentiment and Intensity Prediction Using a Multi-Task Ensemble Framework
* Always Be Dreaming: A New Approach for Data-Free Class-Incremental Learning
* Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain
* Analysis of Ecological Blockage Pattern in Beijing Important Ecological Function Area, China
* analysis of heuristic metrics for classifier ensemble pruning based on ordered aggregation, An
* Analysis of MP4 Videos in 5G Using SDN
* Analysis of the Periodic Component of Vertical Land Motion in the Po Delta (Northern Italy) by GNSS and Hydrological Data
* Angular-spatial analysis of factors affecting the performance of light field reconstruction
* Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies
* Animation Transformer: Visual Correspondence via Segment Matching, The
* Annealing Genetic GAN for Imbalanced Web Data Learning
* Anonymizing Egocentric Videos
* anti-phishing model based on similarity measurement, An
* Anticipative Video Transformer
* Appearance-Based Loop Closure Detection via Locality-Driven Accurate Motion Field Learning
* Application of TLS Method in Digitization of Bridge Infrastructures: A Path to BrIM Development
* Applying Machine Learning and Time-Series Analysis on Sentinel-1A SAR/InSAR for Characterizing Arctic Tundra Hydro-Ecological Conditions
* approach to boundary detection for 3D point clouds based on DBSCAN clustering, An
* ARAPReg: An As-Rigid-As Possible Regularization Loss for Learning Deformable Shape Generators
* Arbitrary Style Transfer with Adaptive Channel Network
* ARCH++: Animation-Ready Clothed Human Reconstruction Revisited
* Architecture Disentanglement for Deep Neural Networks
* Are we Missing Confidence in Pseudo-LiDAR Methods for Monocular 3D Object Detection?
* Artificial Fingerprinting for Generative Models: Rooting Deepfake Attribution in Training Data
* AS-Net: Class-Aware Assistance and Suppression Network for Few-Shot Learning
* ASCNet: Self-supervised Video Representation Learning with Appearance-Speed Consistency
* Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query
* ASMR: Learning Attribute-Based Person Search with Adaptive Semantic Margin Regularizer
* Assessing the Ability to Quantify Bathymetric Change over Time Using Solely Satellite-Based Measurements
* Assessing the Accuracy and Potential for Improvement of the National Land Cover Database's Tree Canopy Cover Dataset in Urban Areas of the Conterminous United States
* Assessing the Environmental Suitability for Transhumance in Support of Conflict Prevention in the Sahel
* Assessing the Performance of Irrigation Systems in Large Scale Urban Parks: Application to the Case of Valdebebas, Madrid (Spain)
* Assessing Variations in Water Use Efficiency and Linkages with Land-Use Changes Using Three Different Data Sources: A Case Study of the Yellow River, China
* Assessment of Contemporary Antarctic GIA Models Using High-Precision GPS Time Series
* Assignment-Space-based Multi-Object Tracking and Segmentation
* ASTER and GF-5 Satellite Data for Mapping Hydrothermal Alteration Minerals in the Longtoushan Pb-Zn Deposit, SW China
* Asymmetric Bilateral Motion Estimation for Video Frame Interpolation
* Asymmetric Loss For Multi-Label Classification
* Asynchronous Kalman Filter for Hybrid Event Cameras, An
* Att2ResNet: A deep attention-based approach for melanoma skin cancer classification
* Attack as the Best Defense: Nullifying Image-to-image Translation GANs via Limit-aware Adversarial Attack
* Attack-Guided Perceptual Data Generation for Real-world Re-Identification
* Attention is not Enough: Mitigating the Distribution Discrepancy in Asynchronous Multimodal Sequence Fusion
* Attention-based Multi-Reference Learning for Image Super-Resolution
* Attentional Pyramid Pooling of Salient Visual Residuals for Place Recognition
* Attentive and Contrastive Learning for Joint Depth and Motion Field Estimation
* Attentive occlusion-adaptive deep network for facial landmark detection
* Attributions of Evapotranspiration and Gross Primary Production Changes in Semi-Arid Region: A Case Study in the Water Source Area of the Xiong'an New Area in North China
* Audio-Visual Floorplan Reconstruction
* Audio2Gestures: Generating Diverse Gestures from Speech Audio with Conditional Variational Autoencoders
* Augmented Lagrangian Adversarial Attacks
* Augmenting Depth Estimation with Geospatial Context
* Authenticated Key Agreement Scheme With User Anonymity and Untraceability for 5G-Enabled Softwarized Industrial Cyber-Physical Systems
* Auto Graph Encoder-Decoder for Neural Network Pruning
* Auto-tuning of price prediction models for high-frequency trading via reinforcement learning
* Auto-FSL: Searching the Attribute Consistent Network for Few-Shot Learning
* Auto-Parsing Network for Image Captioning and Visual Question Answering
* AutoFormer: Searching Transformers for Visual Recognition
* Automated assessment for Alzheimer's disease diagnosis from MRI images: Meta-heuristic assisted deep learning model
* Automated Extraction of Ground Fissures Due to Coal Mining Subsidence Based on UAV Photogrammetry
* Automated search space and search strategy selection for AutoML
* Automatic Generation of Urban Road 3D Models for Pedestrian Studies from LiDAR Data
* Automatic Procedure for Forest Fire Fuel Mapping Using Hyperspectral (PRISMA) Imagery: A Semi-Supervised Classification Approach, An
* Automatic Recognition Methods Supporting Pain Assessment: A Survey
* AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection
* AutoSpace: Neural Architecture Search with Less Human Interference
* Auxiliary Tasks and Exploration Enable ObjectGoal Navigation
* AVSeeker: An Active Video Retrieval Engine at VBS2022
* BabelCalib: A Universal Approach to Calibrating Central Cameras
* Backdoor Attack against 3D Point Cloud Classifiers, A
* BADet: Boundary-Aware 3D Object Detection from Point Clouds
* Baking Neural Radiance Fields for Real-Time View Synthesis
* BAM: Block attention mechanism for OCT image classification
* Banana Fusarium Wilt Disease Detection by Supervised and Unsupervised Methods from UAV-Based Multispectral Imagery
* BAPA-Net: Boundary Adaptation and Prototype Alignment for Cross-domain Semantic Segmentation
* BARF: Bundle-Adjusting Neural Radiance Fields
* Batch Normalization Increases Adversarial Vulnerability and Decreases Adversarial Transferability: A Non-Robust Feature Perspective
* Bayesian Deep Basis Fitting for Depth Completion with Uncertainty
* Bayesian Route Choice Inference to Address Missed Bluetooth Detections
* Bayesian Triplet Loss: Uncertainty Quantification in Image Retrieval
* Benchmark Platform for Ultra-Fine-Grained Visual Categorization Beyond Human Performance
* Benchmarking Ultra-High-Definition Image Super-resolution
* Benefit of Distraction: Denoising Camera-Based Physiological Measurements using Inverse Attention, The
* Better Aggregation in Test-Time Augmentation
* BEV-Net: Assessing Social Distancing Compliance by Joint People Localization and Geometric Reasoning
* Beyond Mutual Information: Generative Adversarial Network for Domain Adaptation Using Information Bottleneck Constraint
* Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering
* Beyond Road Extraction: A Dataset for Map Update using Aerial Images
* Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations
* Bi-attention Modal Separation Network for Multimodal Video Fusion
* Bias Loss for Mobile Neural Networks
* Bias-Eliminated Semantic Refinement for Any-Shot Learning
* BiaSwap: Removing Dataset Bias with Bias-Tailored Swapping Augmentation
* bibliometric analysis of off-line handwritten document analysis literature (1990-2020), A
* Bifold and Semantic Reasoning for Pedestrian Behavior Prediction
* Big Self-Supervised Models Advance Medical Image Classification
* BiMaL: Bijective Maximum Likelihood Approach to Domain Adaptation in Semantic Scene Segmentation
* Binocular Feature Fusion and Spatial Attention Mechanism Based Gaze Tracking
* Binocular Mutual Learning for Improving Few-shot Classification
* BioFors: A Large Biomedical Image Forensics Dataset
* Biogeochemical Model Optimization by Using Satellite-Derived Phytoplankton Functional Type Data and BGC-Argo Observations in the Northern South China Sea
* Bit-Mixer: Mixed-precision networks with runtime bit-width selection
* Black-box Detection of Backdoor Attacks with Limited Information and Data
* blind contour-aware quality model for sonar images, A
* Blind Image Deblurring via Superpixel Segmentation Prior
* Blind Remote Sensing Image Deblurring Using Local Binary Pattern Prior
* BlockCopy: High-Resolution Video Processing with Block-Sparse Feature Propagation and Online Policies
* BlockPlanner: City Block Generation with Vectorized Graph Representation
* BN-NAS: Neural Architecture Search with Batch Normalization
* Body-Face Joint Detection via Embedding and Head Hook
* Boom With a View: The Satellite-Imaging Industry is Exploding. Here's how to take Advantage of it, A
* Boosting Monocular Depth Estimation with Lightweight 3D Point Fusion
* Boosting the Generalization Capability in Cross-Domain Few-shot Learning via Noise-enhanced Supervised Autoencoder
* Boosting Weakly Supervised Object Detection via Learning Bounding Box Adjusters
* Bootstrap Your Own Correspondences
* BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search
* Boundary-sensitive Pre-training for Temporal Localization in Videos
* Bounds of Improvements Toward Real-Time Forecast of Multi-Scenario Train Delays, The
* Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds
* Brain tumor segmentation based on the dual-path network of multi-modal MRI images
* Breast Thermography as an Adjunct Tool to Monitor the Chemotherapy Response in a Triple Negative BIRADS V Cancer Patient: A Case Study
* Bridging the Gap between Label- and Reference-based Synthesis in Multi-attribute Image-to-Image Translation
* Bridging Unsupervised and Supervised Depth from Focus via All-in-Focus Supervision
* Bringing Events into Video Deblurring with Non-consecutively Blurry Frames
* Broad Study on the Transferability of Visual Representations with Contrastive Learning, A
* Broaden Your Views for Self-Supervised Video Learning
* Building-GAN: Graph-Conditioned Architectural Volumetric Design Generation
* BuildingNet: Learning to Label 3D Buildings
* BV-Person: A Large-scale Dataset for Bird-view Person Re-identification
* C2N: Practical Generative Noise Modeling for Real-World Denoising
* C3-SemiSeg: Contrastive Semi-supervised Segmentation via Cross-set Learning and Dynamic Class-balancing
* CAG-QIL: Context-Aware Actionness Grouping via Q Imitation Learning for Online Temporal Action Localization
* Calculation of the Rub'al Khali Sand Dune Volume for Estimating Potential Sand Sources
* Calibrated Adversarial Refinement for Stochastic Semantic Segmentation
* Calibrated and Partially Calibrated Semi-Generalized Homographies
* Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images
* Camera Distortion-aware 3D Human Pose Estimation in Video with Optimization-based Meta-Learning
* Can laboratory parameters be an alternative to CT and RT-PCR in the diagnosis of COVID-19? A machine learning approach
* Can Scale-Consistent Monocular Depth Be Learned in a Self-Supervised Scale-Invariant Manner?
* Can Shape Structure Features Improve Model Robustness under Diverse Adversarial Settings?
* Can Visually Impaired Use Gestures to Interact With Computers? A Cognitive Load Perspective
* CANet: A Context-Aware Network for Shadow Removal
* CANet: Co-attention network for RGB-D semantic segmentation
* CanvasVAE: Learning to Generate Vector Graphic Documents
* Capsule-Encoder-Decoder: A Method for Generalizable Building Extraction from Remote Sensing Images
* CAPTRA: CAtegory-level Pose Tracking for Rigid and Articulated Objects from Point Clouds
* Cascade Image Matting with Deformable Graph Refinement
* cascaded nested network for 3T brain MR image segmentation guided by 7T labeling, A
* CaT: Weakly Supervised Object Detection with Category Transfer
* Category-Sensitive Incremental Learning for Image-Based 3D Shape Reconstruction
* Causal Attention for Unbiased Visual Recognition
* CCT-Net: Category-Invariant Cross-Domain Transfer for Medical Single-to-Multiple Disease Diagnosis
* CDC: Color-Based Diffusion Model with Caption Embedding in VBS 2022
* CDeRSNet: Towards High Performance Object Detection in Vietnamese Document Images
* CDNet: Centripetal Direction Network for Nuclear Instance Segmentation
* CDS: Cross-Domain Self-supervised Pre-training
* Centennial Total Solar Irradiance Variation
* Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation, The
* CERN: Compact facial expression recognition net
* Challenges with Regard to Unmanned Aerial Systems (UASs) Measurement of River Surface Velocity Using Doppler Radar
* Change Analysis on the Spatio-Temporal Patterns of Main Crop Planting in the Middle Yangtze Plain
* Change is Everywhere: Single-Temporal Supervised Object Change Detection in Remote Sensing Imagery
* Changes in the Eruptive Style of Stromboli Volcano before the 2019 Paroxysmal Phase Discovered through SOM Clustering of Seismo-Acoustic Features Compared with Camera Images and GBInSAR Data
* Channel Augmented Joint Learning for Visible-Infrared Recognition
* Channel splitting attention network for low-light image enhancement
* Channel-wise Knowledge Distillation for Dense Prediction
* Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition
* Characteristics and Evaluation of Future Droughts across China through the CMIP6 Multi-Model Ensemble, The
* Characteristics of Regions with High-Density Initiation of Flashes in Mesoscale Convective Systems
* Characterizing Garden Greenspace in a Medieval European City: Added Values of Spatial Resolution and Multi-Temporal Stereo Imagery
* Characterizing ordinal network of time series based on complexity-entropy curve
* Cherry-Picking Gradients: Learning Low-Rank Embeddings of Visual Data via Differentiable Cross-Approximation
* Circle Representation for Medical Object Detection
* Class Semantics-based Attention for Action Detection
* Class-Incremental Learning for Action Recognition in Videos
* Classification of Video Game Player Experience Using Consumer-Grade Electroencephalography
* Classroom Attention Estimation Method Based on Mining Facial Landmarks of Students
* CLEAR: Clean-up Sample-Targeted Backdoor in Neural Networks
* Click to Move: Controlling Video Generation with Sparse Motion
* CLN: Cross-Domain Learning Network for 2D Image-Based 3D Shape Retrieval
* Closer Look at Rotation-invariant Deep Point Cloud Analysis, A
* Clothing Status Awareness for Long-Term Person Re-Identification
* Cloud detection with boundary nets
* Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks
* Cluster-Promoting Quantization with Bit-Drop for Minimizing Network Quantization Loss
* Clustering by Maximizing Mutual Information Across Views
* Clustering with multi-layered perceptron
* Clutter Edges Detection Algorithms for Structured Clutter Covariance Matrices
* CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification
* CMC2R: Cross-modal collaborative contextual representation for RGBT tracking
* CMSNet: Deep Color and Monochrome Stereo
* CNN-RNN and Data Augmentation Using Deep Convolutional Generative Adversarial Network for Environmental Sound Classification
* CO Fluxes in Western Europe during 2017-2020 Winter Seasons Inverted by WRF-Chem/Data Assimilation Research Testbed with MOPITT Observations
* Co-LDL: A Co-Training-Based Label Distribution Learning Method for Tackling Label Noise
* Co-saliency-regularized correlation filter for object tracking
* Co-Scale Conv-Attentional Image Transformers
* Co2L: Contrastive Continual Learning
* Coarsely-labeled Data for Better Few-shot Transfer
* Coastal Bathymetry Estimation from Sentinel-2 Satellite Imagery: Comparing Deep Learning and Physics-Based Approaches
* CodeNeRF: Disentangled Neural Radiance Fields for Object Categories
* CODEs: Chamfer Out-of-Distribution Examples against Overconfidence Issue
* Cognitive Workload Assessment of Prosthetic Devices: A Review of Literature and Meta-Analysis
* Collaborative and Adversarial Learning of Focused and Dispersive Representations for Semi-supervised Polyp Segmentation
* Collaborative boundary-aware context encoding networks for error map prediction
* Collaborative Learning with Disentangled Features for Zero-shot Domain Adaptation
* Collaborative Optimization and Aggregation for Decentralized Domain Generalization and Adaptation
* Collaborative Unsupervised Visual Representation Learning from Decentralized Data
* Collaging Class-specific GANs for Semantic Image Synthesis
* Color the Word: Leveraging Web Images for Machine Translation of Untranslatable Words
* CoMatch: Semi-supervised Learning with Contrastive Graph Regularization
* Combined Convolutional Neural Network for Urban Land-Use Classification with GIS Data, A
* Combining embedding-based and symbol-based methods for entity alignment
* Combining Knowledge and Multi-modal Fusion for Meme Classification
* Combining Object-Based Machine Learning with Long-Term Time-Series Analysis for Informal Settlement Identification
* Combining Spectral and Textural Information from UAV RGB Images for Leaf Area Index Monitoring in Kiwifruit Orchard
* COMISR: Compression-Informed Video Super-Resolution
* Common Objects in 3D: Large-Scale Learning and Evaluation of Real-life 3D Category Reconstruction
* Comparative Evaluation for Tracking the Capability of Solar Cell Malfunction Caused by Soil Debris between UAV Video versus Photo-Mosaic
* Comparative Study of the Landfall Precipitation by Tropical Cyclones ARB 01 (2002) and Luban (2018) near the Arabian Peninsula, A
* Comparing Sentinel-2 and WorldView-3 Imagery for Coastal Bottom Habitat Mapping in Atlantic Canada
* Comparison of Aerial and Ground 3D Point Clouds for Canopy Size Assessment in Precision Viticulture
* Comparison of the Performances of Unmanned-Aerial-Vehicle (UAV) and Terrestrial Laser Scanning for Forest Plot Canopy Cover Estimation in Pinus massoniana Forests, A
* Complementary Fusion Strategy for RGB-D Face Recognition, A
* Complementary Patch for Weakly Supervised Semantic Segmentation
* Composable Augmentation Encoding for Video Representation Learning
* Comprehensive and Context-Sensitive Neonatal Pain Assessment Using Computer Vision, A
* Compressed dual-channel neural network with application to image-based smoke detection
* Compressing Visual-linguistic Model via Knowledge Distillation
* Compressive Sensing-Based Image Encryption and Authentication in Edge-Clouds
* Computer Vision Approach for Estimating Lifting Load Contributors to Injury Risk, A
* Concept Generalization in Visual Representation Learning
* Condensing a Sequence to One Informative Frame for Video Recognition
* Conditional Context-Aware Feature Alignment for Domain Adaptive Detection Transformer
* Conditional DETR for Fast Training Convergence
* Conditional Diffusion for Interactive Segmentation
* Conditional Variational Capsule Network for Open Set Recognition
* CondLaneNet: a Top-to-down Lane Detection Framework Based on Conditional Convolution
* Confidence Calibration for Domain Generalization under Covariate Shift
* Confidence-based Iterative Solver of Depths and Surface Normals for Deep Multi-view Stereo, A
* Conformer: Local Features Coupling Global Representations for Visual Recognition
* Consensus-Based Optimization for 3D Human Pose Estimation in Camera Coordinates
* Conservative Finite Element Modeling of EEG and MEG on Unstructured Grids
* Consistency-Aware Graph Network for Human Interaction Understanding
* Consistency-Sensitivity Guided Ensemble Black-Box Adversarial Attacks in Low-Dimensional Spaces
* Construction Progress and Aviation Flight Test of BDSBAS
* Contact-Aware Retargeting of Skinned Motion
* Context Decoupling Augmentation for Weakly Supervised Semantic Segmentation
* Context Reasoning Attention Network for Image Super-Resolution
* Context-aware Scene Graph Generation with Seq2Seq Transformers
* Context-Aware Taxi Dispatching at City-Scale Using Deep Reinforcement Learning
* Context-Sensitive Temporal Feature Learning for Gait Recognition
* Contextually Plausible and Diverse 3D Human Motion Prediction
* Continual Learning for Image-Based Camera Localization
* Continual Learning on Noisy Data Streams via Self-Purified Replay
* Continual Neural Mapping: Learning An Implicit Scene Representation from Sequential Observations
* Continual Prototype Evolution: Learning Online from Non-Stationary Data Streams
* Continuous Copy-Paste for One-stage Multi-object Tracking and Segmentation
* Continuous space ant colony algorithm for automatic selection of orthophoto mosaic seamline network
* Continuous, High-Resolution Mapping of Coastal Seafloor Sediment Distribution
* Contour-enhanced attention CNN for CT-based COVID-19 segmentation
* Contrast and Classify: Training Robust VQA Models
* Contrast and Order Representations for Video Self-Supervised Learning
* contrast enhancement framework under uncontrolled environments based on just noticeable difference, A
* Contrasting Contrastive Self-Supervised Representation Learning Pipelines
* Contrastive Adaptation Network for Single- and Multi-Source Domain Adaptation
* Contrastive Attention Maps for Self-supervised Co-localization
* Contrastive attention network with dense field estimation for face completion
* Contrastive Coding for Active Learning under Class Distribution Mismatch
* Contrastive Learning for Label Efficient Semantic Segmentation
* Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency
* Contrastive Multimodal Fusion with TupleInfoNCE
* Contribution of Climate Change and Grazing on Carbon Dynamics in Central Asian Pasturelands
* Convex Optimization Approach For NLOS Error Mitigation in TOA-Based Localization, A
* Convolution by Multiplication: Accelerated Two-Stream Fourier Domain Convolutional Neural Network for Facial Expression Recognition
* Convolutional analysis operator learning for multifocus image fusion
* COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation
* Cooperative Perception for 3D Object Detection in Driving Scenarios Using Infrastructure Sensors
* Coordinated Path-Following Control of Fixed-Wing Unmanned Aerial Vehicles
* Correlating Extremes in Wind Divergence with Extremes in Rain over the Tropical Atlantic
* Correlation-Guided Ensemble Clustering for Hyperspectral Band Selection
* Cortical Surface Shape Analysis Based on Alexandrov Polyhedra
* Cost-Effective Method for Reconstructing City-Building 3D Models from Sparse Lidar Point Clouds, A
* COTR: Correspondence Transformer for Matching Across Images
* Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification
* Covariance Attention for Semantic Segmentation
* Covert Wireless Communication With Noise Uncertainty in Space-Air-Ground Integrated Vehicular Networks
* COVID-MTL: Multitask learning with Shift3D and random-weighted loss for COVID-19 diagnosis and severity assessment
* COVID-opt-aiNet: A clinical decision support system for COVID-19 detection
* CPC-GSCT: Visual quality assessment for coloured point cloud based on geometric segmentation and colour transformation
* cPCA++: An efficient method for contrastive feature learning
* CPF: Learning a Contact Potential Field to Model the Hand-Object Interaction
* CPFN: Cascaded Primitive Fitting Networks for High-Resolution Point Clouds
* CR-Fill: Generative Image Inpainting with Auxiliary Contextual Reconstruction
* CR-GAN: Automatic craniofacial reconstruction for personal identification
* CrackFormer: Transformer Network for Fine-Grained Crack Detection
* Crop Detection Using Time Series of Sentinel-2 and Sentinel-1 and Existing Land Parcel Information Systems
* Cross-Camera Convolutional Color Constancy
* Cross-category Video Highlight Detection via Set-based Learning
* Cross-Descriptor Visual Localization and Mapping
* Cross-Domain Person Re-Identification Using Heterogeneous Convolutional Network
* Cross-Encoder for Unsupervised Gaze Representation Learning
* Cross-Modal Dynamic Networks for Video Moment Retrieval With Text Query
* Cross-Modality Compensation Convolutional Neural Networks for RGB-D Action Recognition
* Cross-Modality Person Re-Identification via Modality Confusion and Center Aggregation
* Cross-Patch Graph Convolutional Network for Image Denoising
* Cross-Sentence Temporal and Semantic Relations in Video Activity Localisation
* CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations
* CrossDet: Crossline Representation for Object Detection
* CrossNorm and SelfNorm for Generalization under Distribution Shifts
* Crossover Learning for Fast Online Video Instance Segmentation
* CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
* Crowd Counting With Partial Annotations in an Image
* CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization
* CryoDRGN2: Ab initio neural reconstruction of 3D protein structures from real cryo-EM images
* CSG-Stump: A Learning Friendly CSG-Like Representation for Interpretable Shape Parsing
* CTRL-C: Camera calibration TRansformer with Line-Classification
* Curious Representation Learning for Embodied Intelligence
* Current Crustal Vertical Deformation Features of the Sichuan-Yunnan Region Constrained by Fusing the Leveling Data with the GNSS Data, The
* Curvature Generation in Curved Spaces for Few-Shot Learning
* CvT: Introducing Convolutions to Vision Transformers
* D-LSTM: Short-Term Road Traffic Speed Prediction Model Based on GPS Positioning Data
* D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations
* DAE-GAN: Dynamic Aspect-aware GAN for Text-to-Image Synthesis
* DAHP: Deep Attention-Guided Hashing With Pairwise Labels
* DAM: Discrepancy Alignment Metric for Face Recognition
* Dance with Self-Attention: A New Look of Conditional Random Fields on Anomaly Detection in Videos
* Dark Flash Normal Camera, A
* Data Fusion in Earth Observation and the Role of Citizen as a Sensor: A Scoping Review of Applications, Methods and Future Trends
* Data Processing of Gravity Base Network in Plateau Area: The Case of Qinghai Province, China
* Data-aware relation learning-based graph convolution neural network for facial action unit recognition
* Data-Driven Fault Diagnosis for Traction Systems in High-Speed Trains: A Survey, Challenges, and Perspectives
* Data-Driven Modeling for Transferable Sea State Estimation Between Marine Systems
* Data-free Universal Adversarial Perturbation and Black-box Attack
* DataCAP: A Satellite Datacube and Crowdsourced Street-Level Images for the Monitoring of the Common Agricultural Policy
* DC-ShadowNet: Single-Image Hard and Soft Shadow Removal Using Unsupervised Domain-Classifier Guided Network
* DCT-SNN: Using DCT to Distribute Spatial Information over Time for Low-Latency Spiking Neural Networks
* DDoS Mitigation Based on Space-Time Flow Regularities in IoV: A Feature Adaption Reinforcement Learning Approach
* DE-GAN: Domain Embedded GAN for High Quality Face Image Inpainting
* De-rendering Stylized Texts
* DECA: Deep viewpoint-Equivariant human pose estimation using Capsule Autoencoders
* DecentLaM: Decentralized Momentum SGD for Large-batch Deep Training
* Decentralized Ride-Sharing and Vehicle-Pooling Based on Fair Cost-Sharing Mechanisms
* Deep 3D Mask Volume for View Synthesis of Dynamic Scenes
* Deep Blind Video Super-resolution
* Deep CNN, Body Pose, and Body-Object Interaction Features for Drivers' Activity Monitoring
* Deep Co-Image-Label Hashing for Multi-Label Image Retrieval
* Deep co-supervision and attention fusion strategy for automatic COVID-19 lung infection segmentation on CT images
* Deep Co-Training with Task Decomposition for Semi-Supervised Domain Adaptation
* Deep Coarse-to-Fine Dense Light Field Reconstruction With Flexible Sampling and Geometry-Aware Fusion
* Deep Collaborative Multi-Task Network: A Human Decision Process Inspired Model for Hierarchical Image Classification
* Deep Correlated Joint Network for 2-D Image-Based 3-D Model Retrieval
* Deep Demosaicing for Polarimetric Filter Array Cameras
* Deep Edge-Aware Interactive Colorization against Color-Bleeding Effects
* Deep Halftoning with Reversible Binary Pattern
* Deep Hough Voting for Robust Global Registration
* Deep Hybrid Self-Prior for Full 3D Mesh Generation
* Deep Illumination-Aware Dehazing With Low-Light and Detail Enhancement
* Deep Implicit Surface Point Prediction Networks
* Deep Interactive Image Matting With Feature Propagation
* Deep Interpretable Classification and Weakly-Supervised Segmentation of Histology Images via Max-Min Uncertainty
* Deep Learning Based Just Noticeable Difference and Perceptual Quality Prediction Models for Compressed Video
* Deep Learning in Neuroimaging: Promises and challenges
* Deep Matching Prior: Test-Time Optimization for Dense Correspondence
* Deep Metric Learning for Open World Semantic Segmentation
* Deep open-set recognition for silicon wafer production monitoring
* Deep Permutation Equivariant Structure from Motion
* Deep Ranking Exemplar-Based Dynamic Scene Deblurring
* Deep reinforcement learning with credit assignment for combinatorial optimization
* Deep Reinforcement Learning-Based Resource Management Game in Vehicular Edge Computing, A
* Deep Reinforcement Learning-Based Traffic Light Scheduling Framework for SDN-Enabled Smart Transportation System
* Deep Relational Metric Learning
* Deep Reparametrization of Multi-Frame Super-Resolution and Denoising
* Deep Structured Instance Graph for Distilling Object Detectors
* Deep survival analysis with longitudinal X-rays for COVID-19
* Deep Symmetric Network for Underexposed Image Enhancement with Recurrent Attentional Learning
* Deep Transport Network for Unsupervised Video Object Segmentation
* Deep Unrolled Recovery in Sparse Biological Imaging: Achieving fast, accurate results
* Deep Virtual Markers for Articulated 3D Shapes
* Deep-Learning-Based Multispectral Image Reconstruction from Single Natural Color RGB Image: Enhancing UAV-Based Phenotyping
* DeepCAD: A Deep Generative Network for Computer-Aided Design Models
* DeepGaze IIE: Calibrated prediction in and out-of-domain for state-of-the-art saliency modeling
* DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras
* DeepNC: Deep Generative Network Completion
* DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based Optimization
* DeepPRO: Deep Partial Point Cloud Registration of Objects
* DeePSD: Automatic Deep Skinning And Pose Space Deformation For 3D Garment Animation
* Defending against Universal Adversarial Patches by Clipping Feature Norms
* Defocus Map Estimation and Deblurring from a Single Dual-Pixel Image
* deformable CNN-based triplet model for fine-grained sketch-based image retrieval, A
* Deformations monitoring in complicated-surface areas by adaptive distributed Scatterer InSAR combined with land cover: Taking the Jiaju landslide in Danba, China as an example
* DeFRCN: Decoupled Faster R-CNN for Few-Shot Object Detection
* Delineation of Geomorphological Woodland Key Habitats Using Airborne Laser Scanning
* Dense Deep Unfolding Network with 3D-CNN Prior for Snapshot Compressive Imaging
* Dense Interaction Learning for Video-based Person Re-identification
* Densely Guided Knowledge Distillation using Multiple Teacher Assistants
* Densely Semantic Enhancement for Domain Adaptive Region-Free Detectors
* DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension
* DenseTNT: End-to-end Trajectory Prediction from Dense Goal Sets
* DepecheMood++: A Bilingual Emotion Lexicon Built Through Simple Yet Powerful Techniques
* Deployment Optimization for Shared e-Mobility Systems With Multi-Agent Deep Neural Search
* Depth Selection for Deep ReLU Nets in Feature Extraction and Generalization
* DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation
* DepthTrack: Unveiling the Power of RGBD Tracking
* Depthwise-Separable Residual Capsule for Robust Keyword Spotting
* Describing and Localizing Multiple Changes with Transformers
* Designing a Practical Degradation Model for Deep Blind Image Super-Resolution
* Destination Prediction Based on Virtual POI Docks in Dockless Bike-Sharing System
* Detail Me More: Improving GAN's photo-realism of complex scenes
* Detailed Mapping of Lava and Ash Deposits at Indonesian Volcanoes by Means of VHR PlanetScope Change Detection
* DetCo: Unsupervised Contrastive Learning for Object Detection
* Detecting Compressed Deepfake Videos in Social Networks Using Frame-Temporality Two-Stream Convolutional Network
* Detecting Human-Object Relationships in Videos
* Detecting Invisible People
* Detecting owner-member relationship in fisheye camera system with graph convolution network
* Detecting Persuasive Atypicality by Modeling Contextual Compatibility
* Detection and Continual Learning of Novel Face Presentation Attacks
* Detection and diagnosis of COVID-19 infection in lungs images using deep learning techniques
* Detection and rectification of arbitrary shaped scene texts by using text keypoints and links
* Detection Method for Collapsed Buildings Combining Post-Earthquake High-Resolution Optical and Synthetic Aperture Radar Images, A
* Detector-Free Weakly Supervised Grounding by Separation
* Deterioration Mapping of RC Bridge Elements Based on Automated Analysis of GPR Images
* Developing a generic framework for anomaly detection
* Development and Cross-Cultural Evaluation of a Scoring Algorithm for the Biometric Attachment Test: Overcoming the Challenges of Multimodal Fusion with Small Data
* Development of a Landscape-Based Multi-Metric Index to Assess Wetland Health of the Poyang Lake
* Development of a Multi-Index Method Based on Landsat Reflectance Data to Map Open Water in a Complex Environment
* Development of a Phenology-Based Method for Identifying Sugarcane Plantation Areas in China Using High-Resolution Satellite Datasets
* Development of Semantic Maps of Vegetation Cover from UAV Images to Support Planning and Management in Fine-Grained Fire-Prone Landscapes
* Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection, The
* Diagonal Attention and Style-based GAN for Content-Style Disentanglement in Image Generation and Translation
* DiagViB-6: A Diagnostic Benchmark Suite for Vision Models in the Presence of Shortcut and Generalization Opportunities
* Differentiable Convolution Search for Point Cloud Processing
* Differentiable Dynamic Wirings for Neural Networks
* Differentiable Surface Rendering via Non-Differentiable Sampling
* Differential Transform for Video-Based Plenoptic Point Cloud Coding
* DIG: A Data-Driven Impact-Based Grouping Method for Video Rebuffering Optimization
* Digging into Uncertainty in Self-supervised Multi-view Stereo
* Digital Video Manipulation Detection Technique Based on Compression Algorithms
* Direct Differentiable Augmentation Search
* Direct photogrammetry with multispectral imagery for UAV-based snow depth estimation
* Direct Reconstruction of Linear Parametric Images From Dynamic PET Using Nonlocal Deep Image Prior
* Disaster Prediction Knowledge Graph Based on Multi-Source Spatio-Temporal Information
* DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision
* Discover the Unknown Biased Attribute of an Image Classifier
* Discovering 3D Parts from Image Collections
* Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection
* Discriminative deep attributes for generalized zero-shot learning
* Discriminative Part CNN for Pedestrian Detection
* Discriminative Region-based Multi-Label Zero-Shot Learning
* Discriminative Transfer Learning for Driving Pattern Recognition in Unlabeled Scenes
* Disentangled High Quality Salient Object Detection
* Disentangled Lifespan Face Synthesis
* Disentangled Representation for Age-Invariant Face Recognition: A Mutual Information Minimization Perspective
* Dissecting Image Crops
* Distance-aware Quantization
* Distillation-guided Image Inpainting
* Distilling Global and Local Logits with Densely Connected Relations
* Distilling Holistic Knowledge with Graph Neural Networks
* Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces
* Distilling Virtual Examples for Long-tailed Recognition
* Distinctiveness oriented Positional Equilibrium for Point Cloud Registration
* Distributed MPC for Large Freeway Networks Using Alternating Optimization
* Distribution Cognisant Loss for Cross-Database Facial Age Estimation With Sensitivity Analysis
* Distributional Robustness Loss for Long-tail Learning
* DisUnknown: Distilling Unknown Factors for Disentanglement Learning
* DivAug: Plug-in Automated Data Augmentation with Explicit Diversity Maximization
* Diverse Image Style Transfer via Invertible Cross-Space Mapping
* diveXplore 6.0: ITEC's Interactive Video Exploration System at VBS 2022
* Divide and Conquer for Single-frame Temporal Action Localization
* Divide and Contrast: Self-supervised Learning from Uncurated Data
* Divide-and-Assemble: Learning Block-wise Memory for Unsupervised Anomaly Detection
* DKDFN: Domain Knowledge-Guided deep collaborative fusion network for multimodal unitemporal remote sensing land cover classification
* DMRA: Depth-Induced Multi-Scale Recurrent Attention Network for RGB-D Saliency Detection
* DnD: Dense Depth Estimation in Crowded Dynamic Indoor Scenes
* DNN-Based Channel Model for Network Planning in Train Control Systems, A
* Do Different Deep Metric Learning Losses Lead to Similar Learned Features?
* Do Image Classifiers Generalize Across Time?
* DocFormer: End-to-End Transformer for Document Understanding
* DOLG: Single-Stage Image Retrieval with Deep Orthogonal Fusion of Local and Global Features
* Domain Adaptive Semantic Segmentation with Self-Supervised Depth Estimation
* Domain Adaptive Video Segmentation via Temporal Consistency Regularization
* Domain Generalization via Gradient Surgery
* Domain-Aware Universal Style Transfer
* Domain-Invariant Disentangled Network for Generalizable Object Detection
* Dominant Factors and Spatial Heterogeneity of Land Surface Temperatures in Urban Areas: A Case Study in Fuzhou, China
* Double Compression Detection in HEVC-Coded Video with the Same Coding Parameters Using Picture Partitioning Information
* Double Granularity Relation Network with Self-criticism for Occluded Person Re-identification
* DRB-GAN: A Dynamic ResBlock Generative Adversarial Network for Artistic Style Transfer
* Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing
* DRINet: A Dual-Representation Iterative Learning Network for Point Cloud Segmentation
* DRIVE: Deep Reinforced Accident Anticipation with Visual Explanation
* DRÆM: A discriminatively trained reconstruction embedding for surface anomaly detection
* DTMNet: A Discrete Tchebichef Moments-based Deep Neural Network for Multi-focus Image Fusion
* Dual Bipartite Graph Learning: A General Approach for Domain Adaptive Object Detection
* Dual Contrastive Loss and Attention for GANs
* Dual Path Learning for Domain Adaptation of Semantic Segmentation
* Dual Projection Generative Adversarial Networks for Conditional Image Generation
* Dual Transfer Learning for Event-based End-task Prediction via Pluggable Event to Image Translation
* Dual-Camera Super-Resolution with Aligned Attention Modules
* Dual-Domain Generative Adversarial Network for Digital Image Operation Anti-Forensics
* Dual-Master/Single-Slave Haptic Teleoperation System for Semiautonomous Bilateral Control of Hexapod Robot Subject to Deformable Rough Terrain
* DualPoseNet: Category-level 6D Object Pose and Size Estimation Using Dual Pose Network with Refined Learning of Pose Consistency
* Dust Radiative Effect Characteristics during a Typical Springtime Dust Storm with Persistent Floating Dust in the Tarim Basin, Northwest China
* DWKS: A Local Descriptor of Deformations Between Meshes and Point Clouds
* Dynamic and Scalable User-Centric Route Planning Algorithm Based on Polychromatic Sets Theory, A
* Dynamic Attentive Graph Learning for Image Restoration
* Dynamic Context-Sensitive Filtering Network for Video Salient Object Detection
* Dynamic Cross Feature Fusion for Remote Sensing Pansharpening
* Dynamic CT Reconstruction from Limited Views with Implicit Neural Representations and Parametric Motion Fields
* Dynamic DETR: End-to-End Object Detection with Dynamic Attention
* Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation
* Dynamic Dual Gating Neural Networks
* Dynamic Frame Selection Framework for Fast Video Recognition, A
* Dynamic High-Pass Filtering and Multi-Spectral Attention for Image Super-Resolution
* Dynamic manifold Boltzmann optimization based on self-supervised learning for human motion estimation
* Dynamic Network Quantization for Efficient Video Inference
* Dynamic Orthogonal Projection Constrained Discriminative Tracking
* Dynamic Perception Framework for Fine-Grained Recognition
* Dynamic Surface Function Networks for Clothed Human Bodies
* Dynamic Training Data Dropout for Robust Deep Face Recognition
* Dynamic View Synthesis from Dynamic Monocular Video
* Dynamical Pose Estimation
* e-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language Tasks
* Early Melanoma Diagnosis With Sequential Dermoscopic Images
* Eastern Arctic Sea Ice Sensing: First Results from the RADARSAT Constellation Mission Data
* EC-DARTS: Inducing Equalized and Consistent Optimization into DARTS
* ECACL: A Holistic Framework for Semi-Supervised Domain Adaptation
* ECAS-ML: Edge Computing Assisted Adaptation Scheme with Machine Learning for HTTP Adaptive Streaming
* ECFFNet: Effective and Consistent Feature Fusion Network for RGB-T Salient Object Detection
* ECS-Net: Improving Weakly Supervised Semantic Segmentation by Using Connections Between Class Activation Maps
* Edge and anomaly detection of brain magnetic resonance images in a distributed environment
* Editing Conditional Radiance Fields
* Editorial for the Special Issue: Ground Deformation Patterns Detection by InSAR and GNSS Techniques
* Editorial Special Issue on AI Innovations in Intelligent Transportation Systems
* EEG Correlates of Driving Performance
* EEG Emotion Recognition Based on Dynamically Organized Graph Neural Network
* EEG-Based Auditory Attention Detection via Frequency and Channel Neural Attention
* Effect of Aerosol Vertical Distribution on the Modeling of Solar Radiation
* Effective extraction of ventricles and myocardium objects from cardiac magnetic resonance images with a multi-task learning U-Net
* Effectively Leveraging Attributes for Visual Similarity
* Effects and Combination of Tailored Browser-Based and Mobile Cognitive Software Training
* Effects of Land Use/Cover on Regional Habitat Quality under Different Geomorphic Types Based on InVEST Model
* Efficient Action Recognition via Dynamic Knowledge Propagation
* Efficient and Anonymous Authentication With Succinct Multi-Subscription Credential in SAGVN
* Efficient and Differentiable Shadow Computation for Inverse Problems
* Efficient and Model-Based Infrared and Visible Image Fusion via Algorithm Unrolling
* Efficient COLREG-Compliant Collision Avoidance in Multi-Ship Encounter Situations
* Efficient Depth Fusion Transformer for Aerial Image Semantic Segmentation
* Efficient Global MOT Under Minimum-Cost Circulation Framework
* Efficient Large Scale Inlier Voting for Geometric Vision Problems
* Efficient Lightweight Surface Reconstruction Method from Rock-Mass Point Clouds
* efficient multiclass classifier for classification of Alzheimer's disease/mild cognitive impairment/Normal subjects, An
* Efficient Resource Allocation for Multi-Beam Satellite-Terrestrial Vehicular Networks: A Multi-Agent Actor-Critic Method With Attention Mechanism
* Efficient Search and Browsing of Large-Scale Video Collections with Vibro
* Efficient Solution to Non-Minimal Case Essential Matrix Estimation, An
* Efficient Tensor Robust PCA Under Hybrid Model of Tucker and Tensor Train
* Efficient Unsupervised Dimension Reduction for Streaming Multiview Data
* Efficient Video Compression via Content-Adaptive Super-Resolution
* Efficient Visual Pretraining with Contrastive Detection
* Egocentric Pose Estimation from Human Vision Span
* EgoRenderer: Rendering Human Avatars from Egocentric Camera Images
* EigenGAN: Layer-Wise Eigen-Learning for GANs
* EILPR: Toward End-to-End Irregular License Plate Recognition Based on Automatic Perspective Alignment
* Elaborative Rehearsal for Zero-shot Action Recognition
* Elastica Geodesic Approach with Convexity Shape Prior, An
* Electromagnetic Signal Classification Based on Class Exemplar Selection and Multi-Objective Linear Programming
* ELF-VC: Efficient Learned Flexible-Rate Video Coding
* ELLIPSDF: Joint Object Pose and Shape Optimization with a Bi-level Ellipsoid and Signed Distance Function Description
* ELSD: Efficient Line Segment Detector and Descriptor
* Else-Net: Elastic Semantic Network for Continual Action Recognition from Skeleton Data
* Embed Me If You Can: A Geometric Perceptron
* Embedding Novel Views in a Single JPEG Image
* Emerging Properties in Self-Supervised Vision Transformers
* Emotion Recognition and EEG Analysis Using ADMM-Based Sparse Group Lasso
* Emotional Conversation Generation Orientated Syntactically Constrained Bidirectional-Asynchronous Framework
* Empirical Study of the Collapsing Problem in Semi-Supervised 2D Human Pose Estimation, An
* Empirical Study of Training Self-Supervised Vision Transformers, An
* Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation
* End-to-End Dense Video Captioning with Parallel Decoding
* End-to-End Detection and Pose Estimation of Two Interacting Hands
* End-to-end Piece-wise Unwarping of Document Images
* End-to-end robust joint unsupervised image alignment and clustering
* End-to-End Semi-Supervised Object Detection with Soft Teacher
* End-to-End Trainable Trident Person Search Network Using Adaptive Gradient Propagation
* End-to-End Transformer Model for 3D Object Detection, An
* End-to-End Unsupervised Document Image Blind Denoising
* End-to-End Urban Driving by Imitating a Reinforcement Learning Coach
* End-to-End Video Instance Segmentation via Spatial-Temporal Graph Neural Networks
* Energy-Based Open-World Uncertainty Modeling for Confidence Calibration
* Enhanced Boundary Learning for Glass-like Object Segmentation
* Enhanced Chlorophyll-a in the Coastal Waters near the Eastern Guangdong during the Downwelling Favorable Wind Period
* Enhanced Feature Alignment for Unsupervised Domain Adaptation of Semantic Segmentation
* enhanced relation-aware global-local attention network for escaping human detection in indoor smoke scenarios, An
* Enhanced Surveillance Video Compression With Dual Reference Frames Generation
* Enhancement algorithm for high visibility of underwater images
* Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization
* Enriching Local and Global Contexts for Temporal Action Localization
* Ensemble Attention Distillation for Privacy-Preserving Federated Learning
* Ensemble Learning With Manifold-Based Data Splitting for Noisy Label Correction
* Ensemble Stego Selection for Enhancing Image Steganography
* Entropy Maximization and Meta Classification for Out-of-Distribution Detection in Semantic Segmentation
* Entropy regularization for unsupervised clustering with adaptive neighbors
* Env-QA: A Video Question Answering Benchmark for Comprehensive Understanding of Dynamic Environments
* Episodic Transformer for Vision-and-Language Navigation
* EPP-MVSNet: Epipolar-assembling based Depth Prediction for Multi-view Stereo
* Equivariant Imaging: Learning Beyond the Range Space
* Error Overbounding Method Based on a Gaussian Mixture Model with Uncertainty Estimation for a Dual-Frequency Ground-Based Augmentation System, An
* Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation
* Estimating Egocentric 3D Human Pose in Global Space
* Estimating Long-Term Average Carbon Emissions from Fires in Non-Forest Ecosystems in the Temperate Belt
* Estimating Next Day's Forest Fire Risk via a Complete Machine Learning Methodology
* Estimating Wildlife Density as a Function of Environmental Heterogeneity Using Unmarked Data
* Estimation of Above-Ground Biomass of Winter Wheat Based on Consumer-Grade Multi-Spectral UAV
* Estimation of Aerosol Extinction Coefficient Using Camera Images and Application in Mass Extinction Efficiency Retrieval
* Estimation of Aerosol Optical Depth at 30 m Resolution Using Landsat Imagery and Machine Learning
* Evaluation of Forward Models for GNSS Radio Occultation Data Processing and Assimilation
* Evaluation of GOCI Remote Sensing Reflectance Spectral Quality Based on a Quality Assurance Score System in the Bohai Sea
* Evaluation of GPM IMERG Performance Using Gauge Data over Indonesian Maritime Continent at Different Time Scales
* Evaluation of Sentinel-2/MSI Atmospheric Correction Algorithms over Two Contrasted French Coastal Waters
* Evaluation of the Performance of Multi-Source Satellite Products in Simulating Observed Precipitation over the Tensift Basin in Morocco
* Event Stream Super-Resolution via Spatiotemporal Constraint Learning
* Event-based Video Reconstruction Using Transformer
* Event-Intensity Stereo: Estimating Depth by the Best of Both Worlds
* Event-Triggered Predictive Control for Automatic Train Regulation and Passenger Flow in Metro Rail Systems
* EventHands: Real-Time Neural 3D Hand Pose Estimation from an Event Stream
* EventHPE: Event-based 3D Human Pose and Shape Estimation
* Evidential Deep Learning for Open Set Action Recognition
* EvIntSR-Net: Event Guided Multiple Latent Frames Reconstruction and Super-resolution
* Evolving Search Space for Neural Architecture Search
* Excavating the Potential Capacity of Self-Supervised Monocular Depth Estimation
* Experimental HBIM Processing: Innovative Tool for 3D Model Reconstruction of Morpho-Typological Phases for the Cultural Heritage, An
* Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation
* Explainable Artificial Intelligence for Magnetic Resonance Imaging Aging Brainprints: Grounds and challenges
* Explainable Person Re-Identification with Attribute-guided Metric Distillation
* Explainable Video Entailment with Grounded Visual Evidence
* Explaining in Style: Training a GAN to explain a classifier in StyleSpace
* Explaining Local, Global, And Higher-Order Interactions In Deep Learning
* Explanations for Occluded Images
* Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation
* Exploiting appearance transfer and multi-scale context for efficient person image generation
* Exploiting Evolutionary Algorithms to Model Nonverbal Reactions to Conversational Interruptions in User-Agent Interactions
* Exploiting Explanations for Model Inversion Attacks
* Exploiting Multi-Object Relationships for Detecting Adversarial Attacks in Complex Scenes
* Exploiting Raw Images for Real-Scene Super-Resolution
* Exploiting sample correlation for crowd counting with multi-expert network
* Exploiting Scene Graphs for Human-Object Interaction Detection
* Exploiting Web Images for Fine-Grained Visual Recognition via Dynamic Loss Correction and Global Sample Selection
* Exploration and Estimation for Model Compression
* Exploring Classification Equilibrium in Long-Tailed Object Detection
* Exploring Cross-Image Pixel Contrast for Semantic Segmentation
* Exploring Dense Context for Salient Object Detection
* Exploring Event-Driven Dynamic Context for Accident Scene Segmentation
* Exploring Geometry-aware Contrast and Clustering Harmonization for Self-supervised 3D Object Detection
* Exploring Implicit and Explicit Relations with the Dual Relation-Aware Network for Image Captioning
* Exploring Inter-Channel Correlation for Diversity-preserved Knowledge Distillation
* Exploring Long Tail Visual Relationship Recognition with Large Vocabulary
* Exploring Relational Context for Multi-Task Dense Prediction
* Exploring Robustness of Unsupervised Domain Adaptation in Semantic Segmentation
* Exploring semantic segmentation of related subclasses from a superset of classes
* Exploring Simple 3D Multi-Object Tracking for Autonomous Driving
* Exploring Structural Sparsity in CNN via Selective Penalty
* Exploring Temporal Coherence for More General Video Face Forgery Detection
* Exploring Visual Engagement Signals for Representation Learning
* Exquisitor at the Video Browser Showdown 2022
* Extending Neural P-frame Codecs for B-frame Coding
* Extensions of Karger's Algorithm: Why They Fail in Theory and How They Are Useful in Practice
* Extensive Parameters as a Tool to Monitoring the Volcanic Activity: The Case Study of Vulcano Island (Italy), The
* Extracting Disaster-Related Location Information through Social Media to Assist Remote Sensing for Disaster Analysis: The Case of the Flood Disaster in the Yangtze River Basin in China in 2020
* Extreme Structure from Motion for Indoor Panoramas without Visual Overlaps
* Extreme-Quality Computational Imaging via Degradation Framework
* F-Drop&Match: GANs with a Dead Zone in the High-Frequency Domain
* Face Image Retrieval with Attribute Manipulation
* Face photo-sketch synthesis via full-scale identity supervision
* Face recognition with Raspberry Pi using deep neural networks
* Facial Expression Recognition Using a Temporal Ensemble of Multi-Level Convolutional Neural Networks
* Facial Expressions of Comprehension (FEC)
* FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning
* Factorizing Perception and Policy for Interactive Instruction Following
* FactorNet: Holistic Actor, Object, and Scene Factorization for Action Recognition in Videos
* FairNAS: Rethinking Evaluation Fairness of Weight Sharing Neural Architecture Search
* Fake it till you make it: face analysis in the wild using synthetic data alone
* Fall Detection Using Multimodal Data
* FaPN: Feature-aligned Pyramid Network for Dense Image Prediction
* FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation
* FashionMirror: Co-attention Feature-remapping Virtual Try-on with Sequential Template Poses
* Fast and Automated FMT/XCT Reconstruction Strategy Based on Standardized Imaging Space, A
* Fast and Efficient DNN Deployment via Deep Gaussian Transfer Learning
* Fast and Scalable Polyatomic Frank-Wolfe Algorithm for the LASSO, A
* Fast Convergence of DETR with Spatially Modulated Co-Attention
* Fast CU Depth Decision Algorithm for AVS3
* Fast Expansion-Bins-Determination for Multiple Histograms Modification Based Reversible Data Hiding
* Fast Light-field Disparity Estimation with Multi-disparity-scale Cost Aggregation
* Fast Single Image Dehazing Using Morphological Reconstruction and Saturation Compensation
* Fast Universal Low Rank Representation
* Fast Video Moment Retrieval
* Faster Multi-Object Segmentation using Parallel Quadratic Pseudo-Boolean Optimization
* FastNeRF: High-Fidelity Neural Rendering at 200FPS
* FATNN: Fast and Accurate Ternary Neural Networks
* FcaNet: Frequency Channel Attention Networks
* FCOS: A Simple and Strong Anchor-Free Object Detector
* Feature back-projection guided residual refinement for real-time stereo matching network
* Feature Importance-aware Transferable Adversarial Attacks
* Feature Interactive Representation for Point Cloud Registration
* Feature-Induced Label Distribution for Learning with Noisy Labels
* Feature- versus deep learning-based approaches for the automated detection of brain tumor with magnetic resonance images: A comparative study
* Federated Intrusion Detection in Blockchain-Based Smart Transportation Systems
* Federated Learning for Non-IID Data via Unified Feature Learning and Optimization Objective Alignment
* Femto-Satellite Localization Method Based on TDOA and AOA Using Two CubeSats, A
* Few-Shot and Continual Learning with Attentive Independent Mechanisms
* Few-shot Image Classification: Just Use a Library of Pre-trained Feature Extractors and a Simple Classifier
* Few-Shot Semantic Segmentation with Cyclic Memory Network
* Few-Shot Visual Relationship Co-Localization
* FFT-OT: A Fast Algorithm for Optimal Transportation
* Field Convolutions for Surface CNNs
* Field of Junctions: Extracting Boundary Structure at Low SNR
* Field-Guide-Inspired Zero-Shot Learning
* FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras
* Finding Representative Interpretations on Convolutional Neural Networks
* Fine-grained Semantics-aware Representation Enhancement for Self-supervised Monocular Depth Estimation
* First Form, Then Function: 3D Reconstruction of Cucumber Plants (Cucumis sativus L.) Allows Early Detection of Stress Effects through Leaf Dimensions
* First Impressions: A Survey on Vision-Based Apparent Personality Trait Analysis
* Fixing Defect of Photometric Loss for Self-Supervised Monocular Depth Estimation
* FLAR: A Unified Prototype Framework for Few-sample Lifelong Active Recognition
* FlatNet: Towards Photorealistic Scene Reconstruction From Lensless Measurements
* Flexible Multi-Temporal and Multi-Modal Framework for Sentinel-1 and Sentinel-2 Analysis Ready Data, A
* Flood Monitoring Using Enhanced Resolution Passive Microwave Data: A Test Case over Bangladesh
* FloorPlanCAD: A Large-Scale CAD Drawing Dataset for Panoptic Symbol Spotting
* Flow-Guided Video Inpainting with Scene Templates
* FloW: A Dataset and Benchmark for Floating Waste Detection in Inland Waters
* FMODetect: Robust Detection of Fast Moving Objects
* Focal Frequency Loss for Image Reconstruction and Synthesis
* Focal learning on stranger for imbalanced image segmentation
* Focus on the Positives: Self-Supervised Learning for Biodiversity Monitoring
* Fog Simulation on Real LiDAR Point Clouds for 3D Object Detection in Adverse Weather
* Fooling LiDAR Perception via Adversarial Trajectory Perturbation
* Foreground Activation Maps for Weakly Supervised Object Localization
* Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization
* Forensic Analysis of JPEG-Domain Enhanced Images via Coefficient Likelihood Modeling
* Fourier Space Losses for Efficient Perceptual Image Super-Resolution
* FOVEA: Foveated Image Magnification for Autonomous Navigation
* Free-form Description Guided 3D Visual Graph Network for Object Grounding in Point Cloud
* FREE: Feature Refinement for Generalized Zero-Shot Learning
* Frequency Domain Image Translation: More Photo-realistic, Better Identity-preserving
* Frequency Selection for Platoon Communications in Secondary Spectrum Using Radio Environment Maps
* Frequency-Aware Spatiotemporal Transformers for Video Inpainting Detection
* Frequency-domain blind quality assessment of blurred and blocking-artefact images using Gaussian Process Regression model
* FRIDA: Generative feature replay for incremental domain adaptation
* From Contexts to Locality: Ultra-high Resolution Image Segmentation via Locality-aware Contextual Correlation
* From Continuity to Editability: Inverting GANs with Consecutive Images
* From Culture to Clothing: Discovering the World Events Behind A Century of Fashion Images
* From General to Specific: Informative Scene Graph Generation via Balance Adjustment
* From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting
* From Individual to Whole: Reducing Intra-class Variance by Feature Aggregation
* From Regression Based on Dynamic Filter Network to Pansharpening by Pixel-Dependent Spatial-Detail Injection
* From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network
* Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
* Full-Body Motion from a Single Head-Mounted Device: Generating SMPL Poses from Partial Observations
* Full-Duplex Strategy for Video Object Segmentation
* Full-Velocity Radar Returns by Radar-Camera Fusion
* Functional Analysis for Habitat Mapping in a Special Area of Conservation Using Sentinel-2 Time-Series Data
* Functional Correspondence Problem, The
* FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting
* Fusion Moves for Graph Matching
* Fusion of convolutional neural networks based on Dempster-Shafer theory for automatic pneumonia detection from chest X-ray images
* Fusion of Drone-Based RGB and Multi-Spectral Imagery for Shallow Water Bathymetry Inversion
* fusion representation for face learning by low-rank constrain and high-frequency texture components, A
* Fuzzy Repair of Urban Building Facade Point Cloud Based on Distribution Regularity, The
* FW-SMOTE: A feature-weighted oversampling approach for imbalanced classification
* G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-Guided Feature Imitation
* Gait Recognition in the Wild: A Benchmark
* Gait Recognition via Effective Global-Local Feature Representation and Local Temporal Aggregation
* GaitSlice: A gait recognition model based on spatio-temporal slice features
* Game-Theoretic Modeling of Traffic in Unsignalized Intersection Network for Autonomous Vehicle Control Verification and Validation
* GAN Inversion for Out-of-Range Images with Geometric Transformations
* GAN-Control: Explicitly Controllable GANs
* GAN-MVAE: A discriminative latent feature generation framework for generalized zero-shot learning
* GANcraft: Unsupervised 3D Neural Rendering of Minecraft Worlds
* Gapless-REMA100: A gapless 100-m reference elevation model of Antarctica with voids filled by multi-source DEMs
* GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape Completion
* Gated3D: Monocular 3D Object Detection From Temporal Illumination Cues
* Gaussian Fusion: Accurate 3D Reconstruction via Geometry-Guided Displacement Interpolation
* Gaze Estimation via the Joint Modeling of Multiple Cues
* GCB-Net: Graph Convolutional Broad Network and Its Application in Emotion Recognition
* GDP: Stabilized Neural Network Pruning via Gates with Differentiable Polarization
* General Recurrent Tracking Framework without Real Data, A
* Generalizable Mixed-Precision Quantization via Attribution Rank Preservation
* Generalizable No-Reference Image Quality Assessment via Deep Meta-Learning
* Generalize then Adapt: Source-Free Domain Adaptive Semantic Segmentation
* Generalized and Incremental Few-Shot Learning by Explicit Learning and Calibration without Forgetting
* Generalized Large Margin kNN for Partial Label Learning
* Generalized Shuffled Linear Regression
* Generalized Source-free Domain Adaptation
* Generalizing Gaze Estimation with Outlier-guided Collaborative Adaptation
* Generating Attribution Maps with Disentangled Masked Backpropagation
* Generating Masks from Boxes by Mining Spatio-Temporal Consistencies in Videos
* Generating Smooth Pose Sequences for Diverse Human Motion Prediction
* Generating Terrain Data for Geomorphological Analysis by Integrating Topographical Features and Conditional Generative Adversarial Networks
* Generative Adversarial Imitation Learning Approach for Realistic Aircraft Taxi-Speed Modeling, A
* Generative Adversarial Registration for Improved Conditional Deformable Templates
* Generative Compositional Augmentations for Scene Graph Prediction
* Generative Landmarks Guided Eyeglasses Removal 3D Face Reconstruction
* Generative Layout Modeling using Constraint Graphs
* Generative Model for Generic Light Field Reconstruction, A
* Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers
* Generic Event Boundary Detection: A Benchmark for Event Segmentation
* Genetic-based adaptive momentum estimation for predicting mortality risk factors for COVID-19 patients using deep learning
* Geodesic Translation Model for Spherical Video Compression, A
* Geographical Detection of Urban Thermal Environment Based on the Local Climate Zones: A Case Study in Wuhan, China
* Geography-Aware Self-Supervised Learning
* Geometric Deep Neural Network using Rigid and Non-Rigid Transformations for Human Action Recognition
* Geometric Granularity Aware Pixel-to-Mesh
* Geometric Unsupervised Domain Adaptation for Semantic Segmentation
* Geometry Uncertainty Projection Network for Monocular 3D Object Detection
* Geometry-Aware Self-Training for Unsupervised Domain Adaptation on Object Point Clouds
* Geometry-based Distance Decomposition for Monocular 3D Object Detection
* Geometry-Free View Synthesis: Transformers and no 3D Priors
* GeomNet: A Neural Network Based on Riemannian Geometries of SPD Matrix Space and Cholesky Space for 3D Skeleton-Based Interaction Recognition
* GeoRec: Geometry-enhanced semantic 3D reconstruction of RGB-D indoor scenes
* Geostatistical Resampling of LiDAR-Derived DEM in Wide Resolution Range for Modelling in SWAT: A Case Study of Zglowiaczka River (Poland)
* GEVE: A generative adversarial network for extremely dark image/video enhancement
* GistNet: a Geometric Structure Transfer Network for Long-Tailed Recognition
* Glimpse-Attend-and-Explore: Self-Attention for Active Visual Exploration
* GLiT: Neural Architecture Search for Global and Local Image Transformer
* Global Assessment of Night Lights as an Indicator for Shipping Activity in Anchorage Areas, A
* Global Conversion Factor Model for Mapping Zenith Total Delay onto Precipitable Water, A
* Global models for time series forecasting: A Simulation study
* Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNs
* Globally Optimal and Efficient Manhattan Frame Estimation by Delimiting Rotation Search Space
* Globally Optimal Vertical Direction Estimation in Atlanta World
* GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-efficient Medical Image Recognition
* GMLight: Lighting Estimation via Geometric Distribution Approximation
* GNAS-U2Net: A New Optic Cup and Optic Disc Segmentation Architecture With Genetic Neural Architecture Search
* GNeRF: GAN-based Neural Radiance Field without Posed Camera
* Goal oriented image quality assessment
* Going deeper with Image Transformers
* GP-S3Net: Graph-based Panoptic Sparse Semantic Segmentation Network
* GPR1200: A Benchmark for General-Purpose Content-Based Image Retrieval
* Gradient Distribution Alignment Certificates Better Adversarial Domain Adaptation
* Gradient Normalization for Generative Adversarial Networks
* Grafit: Learning fine-grained image representations with coarse labels
* Granularity of Digital Elevation Model and Optimal Level of Detail in Small-Scale Cartographic Relief Presentation
* Graph Constrained Data Representation Learning for Human Motion Segmentation
* Graph Contrastive Clustering
* graph convolutional neural network model with Fisher vector encoding and channel-wise spatial-temporal aggregation for skeleton-based action recognition, A
* Graph label prediction based on local structure characteristics representation
* Graph Neural Networks Based Multi-granularity Feature Representation Learning for Fine-Grained Visual Categorization
* Graph Signal Processing Approach to QSAR/QSPR Model Learning of Compounds
* Graph-BAS3Net: Boundary-Aware Semi-Supervised Segmentation Network with Bilateral Graph Convolution
* Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images
* Graph-based Asynchronous Event Processing for Rapid Object Recognition
* Graph-Based Embedding Smoothing Network for Few-Shot Scene Classification of Remote Sensing Images
* Graph-Based Intrusion Detection System for Controller Area Networks
* Graph-Based Region and Boundary Aggregation for Biomedical Image Segmentation
* Graph-Based Surgical Instrument Adaptive Segmentation via Domain-Common Knowledge
* Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs
* GraphFPN: Graph Feature Pyramid Network for Object Detection
* GraphSAGE-Based Traffic Speed Forecasting for Segment Network With Sparse Data
* Graspness Discovery in Clutters for Fast and Accurate Grasp Detection
* Gravity-Aware Monocular 3D Human-Object Reconstruction
* Grayscale Enhancement Colorization Network for Visible-Infrared Person Re-Identification
* Greedy Gradient Ensemble for Robust Visual Question Answering
* GRF: Learning a General Radiance Field for 3D Representation and Rendering
* GridToPix: Training Embodied Agents with Minimal Supervision
* Ground-Penetrating Radar and Photogrammetric Investigation on Prehistoric Tumuli at Parabita (Lecce, Italy) Performed with an Unconventional Use of the Position Markers
* Ground-Truth or DAER: Selective Re-Query of Secondary Information
* Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship Detection
* Group-aware Contrastive Regression for Action Quality Assessment
* Group-Free 3D Object Detection via Transformers
* Group-wise Inhibition based Feature Regularization for Robust Classification
* GroupFormer: Group Activity Recognition with Clustered Spatial-Temporal Transformer
* GTT-Net: Learned Generalized Trajectory Triangulation
* Guest Editorial Special Issue on Space-Air-Ground Integrated Networks for Intelligent Transportation Systems
* Guided neighborhood affine subspace embedding for feature matching
* Guided Point Contrastive Learning for Semi-supervised Point Cloud Semantic Segmentation
* GyroFlow: Gyroscope-Guided Unsupervised Optical Flow Learning
* H2O: A Benchmark for Visual Human-human Object Handover Analysis
* H2O: Two Hands Manipulating Objects for First Person Interaction Recognition
* H3D-Net: Few-Shot High-Fidelity 3D Head Reconstruction
* Ha Long: Cam Pha Cities Evolution Analysis Utilizing Remote Sensing Data
* HAA500: Human-Centric Atomic Action Dataset with Curated Videos
* HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering
* Hamiltonian Monte Carlo Method for Probabilistic Adversarial Attack and Learning, A
* Hand Image Understanding via Deep Multi-Task Learning
* Hand-Object Contact Consistency Reasoning for Human Grasps Generation
* HandFoldingNet: A 3D Hand Pose Estimation Network Using Multiscale-Feature Guided Folding of a 2D Hand Skeleton
* Handwriting Transformers
* Harmonization of Multi-Mission High-Resolution Time Series: Application to BELAIR
* Harnessing the Conditioning Sensorium for Improved Image Translation
* HDR Video Reconstruction: A Coarse-to-fine Network and A Real-world Benchmark Dataset
* Head Pose Estimation Based on Multivariate Label Distribution
* HeadGAN: One-shot Neural Head Synthesis and Editing
* Heterogeneous Attentions for Solving Pickup and Delivery Problem via Deep Reinforcement Learning
* Heterogeneous Graph Attention Network for Unsupervised Multiple-Target Domain Adaptation
* Heterogeneous Relational Complement for Vehicle Re-identification
* HF-TPE: High-Fidelity Thumbnail-Preserving Encryption
* Hierarchical Aggregation for 3D Instance Segmentation
* Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling
* Hierarchical Disentangled Representation Learning for Outdoor Illumination Estimation and Editing
* Hierarchical domain adaptation with local feature patterns
* Hierarchical Gaussian Markov Random Field for Image Denoising
* Hierarchical Graph Attention Network for Few-shot Visual-Semantic Learning
* Hierarchical Kinematic Probability Distributions for 3D Human Shape and Pose Estimation from Images in the Wild
* Hierarchical Memory Matching Network for Video Object Segmentation
* Hierarchical Object-to-Zone Graph for Object Navigation
* Hierarchical Transformation-Discriminating Generative Model for Few Shot Anomaly Detection, A
* Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction, A
* HiFT: Hierarchical Feature Transformer for Aerial Tracking
* High Quality Disparity Remapping with Two-Stage Warping
* High-Fidelity Pluralistic Image Completion with Transformers
* High-Order Correlation Preserved Incomplete Multi-View Subspace Clustering
* High-Performance Discriminative Tracking with Transformers
* high-performance insulators location scheme based on YOLOv4 deep learning network with GDIoU loss function, A
* High-quality reversible data hiding scheme using sorting and enhanced pairwise PEE
* High-Rate One-Hourly Updated Ultra-Rapid Multi-GNSS Satellite Clock Offsets Estimation and Its Application in Real-Time Precise Point Positioning
* High-Resolution Optical Flow from 1D Attention and Correlation
* High-Speed Magnetic Surveying for Unexploded Ordnance Using UAV Systems
* HighlightMe: Detecting Highlights from Human-Centric Videos
* HiNet: Deep Image Hiding by Invertible Network
* HIRE-SNN: Harnessing the Inherent Robustness of Energy-Efficient Deep Spiking Neural Networks by Training with Crafted Input Noise
* Histogram-Based Intrusion Detection and Filtering Framework for Secure and Safe In-Vehicle Networks
* HiT: Hierarchical Transformer with Momentum Contrast for Video-Text Retrieval
* Holistic Pose Graph: Modeling Geometric Structure among Objects in a Scene using Graph Inference for 3D Object Prediction
* Homogeneous Architecture Augmentation for Neural Predictor
* How Can We Understand the Past from Now On? Three-Dimensional Modelling and Landscape Reconstruction of the Shuanghuaishu Site in the Central Plains of China
* How Fast You Will Drive? Predicting Speed of Customized Paths By Deep Neural Network
* How Shift Equivariance Impacts Metric Learning for Instance Segmentation
* How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild
* How to Train Neural Networks for Flare Removal
* HPNet: Deep Primitive Segmentation Using Hybrid Representations
* HRegNet: A Hierarchical Network for Large-scale Outdoor LiDAR Point Cloud Registration
* Human Activity Recognition with IMU and Vital Signs Feature Fusion
* Human Detection and Segmentation via Multi-view Consensus
* Human object interaction detection using two-direction spatial enhancement and exclusive object prior
* Human Pose Regression with Residual Log-likelihood Estimation
* Human Trajectory Prediction via Counterfactual Analysis
* Human-Inspired Haptic-Enabled Learning From Prehensile Move Demonstrations
* HuMoR: 3D Human Motion Model for Robust Pose Estimation
* Hybrid Compression Framework for Color Attributes of Static 3D Point Clouds, A
* Hybrid Frequency-Spatial Domain Model for Sparse Image Reconstruction in Scanning Transmission Electron Microscopy, A
* Hybrid N-Inception-LSTM-Based Aircraft Coordinate Prediction Method for Secure Air Traffic
* Hybrid Neural Fusion for Full-frame Video Stabilization
* Hybrid Video Anomaly Detection Framework via Memory-Augmented Flow Reconstruction and Flow-Guided Frame Prediction, A
* Hypercorrelation Squeeze for Few-Shot Segmentation
* Hypergraph matching via game-theoretic hypergraph clustering
* Hypergraph Neural Networks for Hypergraph Matching
* Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding
* Hyperspectral Image Denoising with Realistic Data
* Hyperspherical class prototypes for adversarial robustness
* Hysteretic mapping and corridor semantic modeling using mobile LiDAR systems
* HyText: A Scene-Text Extraction Method for Video Retrieval
* I2UV-HandNet: Image-to-UV Prediction Network for Accurate and High-fidelity 3D Hand Mesh Modeling
* IBC Reference Block Enhancement Model Based on GAN for Screen Content Video Coding, An
* ICE: Inter-Instance Contrastive Encoding for Unsupervised Person Re-identification
* ICON: Learning Regular Maps Through Inverse Consistency
* ID-Reveal: Identity-aware DeepFake Video Detection
* IDARTS: Interactive Differentiable Architecture Search
* Identification of Low Impedance Points Along Railway Tracks From a Railroad Inspection Vehicle
* Identification of the Potential Critical Slip Surface for Fractured Rock Slope Using the Floyd Algorithm
* Identifying players in broadcast videos using graph convolutional network
* Identifying Reservoirs and Estimating Evaporation Losses in a Large Arid Inland Basin in Northwestern China
* Identity-Quantity Harmonic Multi-Object Tracking
* IDM: An Intermediate Domain Module for Domain Adaptive Person Re-ID
* IICNet: A Generic Framework for Reversible Image Conversion
* IID-Net: Image Inpainting Detection Network via Neural Architecture Search and Attention
* ILMICA - Interactive Learning Model of Image Collage Assessment: A Transfer Learning Approach for Aesthetic Principles
* ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models
* Image Correction and In Situ Spectral Calibration for Low-Cost, Smartphone Hyperspectral Imaging
* Image Harmonization with Transformer
* Image Inpainting via Conditional Texture and Structure Dual Generation
* Image Inpainting With Local and Global Refinement
* Image Manipulation Detection by Multi-View Multi-Scale Supervision
* Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
* Image Shape Manipulation from a Single Augmented Training Sample
* Image Synthesis from Layout with Locality-Aware Mask Adaption
* Image Synthesis via Semantic Composition
* Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis
* iMAP: Implicit Mapping and Positioning in Real-Time
* imGHUM: Implicit Generative Models of 3D Human Shape and Articulated Pose
* Impact of Aliasing on Generalization in Deep Convolutional Networks
* Impact of Vertical Profiles of Aerosols on the Photolysis Rates in the Lower Troposphere from the Synergy of Photometer and Ceilometer Measurements in Raciborz, Poland, for the Period 2015-2020
* Improve Deep Unsupervised Hashing via Structural and Intrinsic Similarity Learning
* Improve Unsupervised Pretraining for Few-label Transfer
* Improved Algorithm for the Retrieval of the Antarctic Sea Ice Freeboard and Thickness from ICESat-2 Altimeter Data, An
* Improved Empirical Mode Decomposition of Electroencephalogram Signals for Depression Detection, An
* Improved iteratively reweighted least squares algorithms for sparse recovery problem
* Improved k-NN Mapping of Forest Attributes in Northern Canada Using Spaceborne L-Band SAR, Multispectral and LiDAR Data
* Improved time series clustering based on new geometric frameworks
* improved tongue image segmentation algorithm based on Deeplabv3+ framework, An
* Improved Traffic Flow Efficiency During Yellow Interval at Signalized Intersections Using a Smart Countdown System
* Improved U-Net Remote Sensing Classification Algorithm Based on Multi-Feature Fusion Perception
* Improvement in design and training of feature pyramid network for contour refinement
* Improving 3D Object Detection with Channel-wise Transformer
* Improving Contrastive Learning by Visualizing Feature Transformation
* Improving De-raining Generalization via Neural Reorganization
* Improving Generalization of Batch Whitening by Convolutional Unit Optimization
* Improving Low-Precision Network Quantization via Bin Regularization
* Improving Neural Network Efficiency via Post-training Quantization with Adaptive Floating-Point
* Improving robustness against common corruptions with frequency biased models
* Improving Robustness of Facial Landmark Detection by Defending against Adversarial Attacks
* Improving Synchronization in High-Speed Railway and Air Intermodality: Integrated Train Timetable Rescheduling and Passenger Flow Forecasting
* Improving the performance of automotive vision-based applications under rainy conditions
* Improving visual multi-object tracking algorithm via integrating GM-PHD and correlation filter
* In Defense of Scene Graphs for Image Captioning
* In-Flight Relative Radiometric Calibration of a Wide Field of View Directional Polarimetric Camera Based on the Rayleigh Scattering over Ocean
* In-Place Scene Labelling and Understanding with Implicit Scene Representation
* In-the-Wild Single Camera 3D Reconstruction Through Moving Water Surfaces
* iNAS: Integral NAS for Device-Aware Salient Object Detection
* Inconsistency-Aware Uncertainty Estimation for Semi-Supervised Medical Image Segmentation
* Incorporating Convolution Designs into Visual Transformers
* Incorporating Dynamicity of Transportation Network With Multi-Weight Traffic Graph Convolutional Network for Traffic Forecasting
* Incorporating Learnable Membrane Time Constant to Enhance Learning of Spiking Neural Networks
* Incorporation of Net Radiation Model Considering Complex Terrain in Evapotranspiration Determination with Sentinel-2 Data
* Indie Games Popularity Prediction by Considering Multimodal Features
* Individual Tree Crown Delineation Method Based on Multi-Criteria Graph Using Geometric and Spectral Information: Application to Several Temperate Forest Sites
* Individualized real-time prediction of working memory performance by classifying electroencephalography signals
* Indoor Scene Generation from a Collection of Semantic-Segmented Depth Images
* Inference of Black Hole Fluid-Dynamics from Sparse Interferometric Measurements
* Inferring high-resolution traffic accident risk maps based on satellite imagery and GPS trajectories
* Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image
* Infinite-dimensional feature aggregation via a factorized bilinear model
* Influence Selection for Active Learning
* Influence-Balanced Loss for Imbalanced Visual Classification
* Influences of 1DVAR Background Covariances and Observation Operators on Retrieving Tropical Cyclone Thermal Structures
* Information-theoretic regularization for Multi-source Domain Adaptation
* InfraGAN: A GAN architecture to transfer visible images to infrared domain
* Injury prediction algorithm for rear-seat occupants in advanced automatic crash notification systems
* InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images
* Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks
* Instance Similarity Learning for Unsupervised Feature Representation
* Instance-Aware Scene Layout Forecasting
* Instance-level Image Retrieval using Reranking Transformers
* Instance-wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation
* InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring
* Instances as Queries
* Integer-arithmetic-only Certified Robustness for Quantized Neural Networks
* Integrated Schedule and Trajectory Optimization for Connected Automated Vehicles in a Conflict Zone
* Integrating Domain Knowledge Into Deep Networks for Lung Ultrasound With Applications to COVID-19
* Integrating Remote Sensing and Meteorological Data to Predict Wheat Stripe Rust
* Integration of GNSS Precise Point Positioning and Reduced Inertial Sensor System for Lane-Level Car Navigation
* Integration of Sentinel-3 and MODIS Vegetation Indices with ERA-5 Agro-Meteorological Indicators for Operational Crop Yield Forecasting
* Intelligent Traffic Accident Prediction Model for Internet of Vehicles With Deep Learning Approach
* Inter-Domain Adaptation Label for Data Augmentation in Vehicle Re-Identification
* Interacting Two-Hand 3D Pose and Shape Reconstruction from Single Color Image
* Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations
* Interaction via Bi-directional Graph of Semantic Region Affinity for Scene Parsing
* Interactive image segmentation based on the appearance model and orientation energy
* Interactive Model Predictive Control for Robot Navigation in Dense Crowds
* Interactive Prototype Learning for Egocentric Action Recognition
* Interannual Transfer Learning Approach for Crop Classification in the Hetao Irrigation District, China, An
* Intercomparison of Satellite Derived Arctic Sea Ice Motion Products, An
* InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANs
* Internal Video Inpainting by Implicit Long-range Propagation
* Interpolation-Aware Padding for 3D Sparse Convolutional Neural Networks
* Interpretability of deep neural networks used for the diagnosis of Alzheimer's disease
* Interpretable Image Recognition by Constructing Transparent Embedding Space
* Interpretable Visual Reasoning via Induced Symbolic Space
* Interpretation Approach of Ascending-Descending SAR Data for Landslide Identification, An
* Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents
* Interpreting Attributions and Interactions of Adversarial Attacks
* Interpreting Image Classifiers by Generating Discrete Masks
* Interseismic Fault Coupling and Slip Rate Deficit on the Central and Southern Segments of the Tanlu Fault Zone Based on Anhui CORS Measurements
* IntraTomo: Self-supervised Learning-based Tomography via Sinogram Synthesis and Prediction
* Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer
* Introduction to conformal predictors
* Inverse Method for Drop Size Distribution Retrieval from Polarimetric Radar at Attenuating Frequency, An
* Inverting a Rolling Shutter Camera: Bring Rolling Shutter Images to High Framerate Global Shutter Video
* Investigation into Keystroke Dynamics and Heart Rate Variability as Indicators of Stress, An
* Invisible Backdoor Attack with Sample-Specific Triggers
* Ionospheric Nighttime Enhancements at Low Latitudes Challenge Performance of the Global Ionospheric Maps
* iPOKE: Poking a Still Image for Controlled Stochastic Video Synthesis
* Iris R-CNN: Accurate iris segmentation and localization in non-cooperative environment with visible illumination
* Is Pseudo-Lidar needed for Monocular 3D Object detection?
* ISAR Resolution Enhancement Method Exploiting Generative Adversarial Network
* ISD: Self-Supervised Learning by Iterative Similarity Distillation
* ISNet: Integrate Image-Level and Semantic-Level Context for Semantic Segmentation
* Isolated spoken word recognition using packed-MFCC on padded-voice signal for unscripted languages
* Iterative Correction Phase of Light Field for Novel View Reconstruction, An
* Iterative label cleaning for transductive and semi-supervised few-shot learning
* IVIST: Interactive Video Search Tool in VBS 2022
* JEM++: Improved Techniques for Training JEM
* Joint Adaptive Dual Graph and Feature Selection for Domain Adaptation
* Joint Audio-Visual Deepfake Detection
* Joint Estimation of Azimuth and Distance for Far-Field Multi Targets Based on Graph Signal Processing
* Joint Expression Synthesis and Representation Learning for Facial Expression Recognition
* Joint Face Image Restoration and Frontalization for Recognition
* Joint image denoising with gradient direction and edge-preserving regularization
* Joint Inductive and Transductive Learning for Video Object Segmentation
* Joint Multi-Dimensional Model for Global and Time-Series Annotations
* Joint Re-Detection and Re-Identification for Multi-Object Tracking
* Joint Representation Learning and Novel Category Discovery on Single- and Multi-Modal Data
* Joint Topology-preserving and Feature-refinement Network for Curvilinear Structure Segmentation
* Joint Transmit and Reflective Beamformer Design for Secure Estimation in IRS-Aided WSNs
* Joint Visual and Audio Learning for Video Highlight Detection
* Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text Recognition
* Just a Few Points are All You Need for Multi-view Stereo: A Novel Semi-supervised Learning Method for Multi-view Stereo
* Just Ask: Learning to Answer Questions from Millions of Narrated Videos
* Just One Moment: Structural Vulnerability of Deep Action Recognition against One Frame Attack
* (Just) A Spoonful of Refinements Helps the Registration Error Go Down
* JVCSR: Video Compressive Sensing Reconstruction with Joint In-Loop Reference Enhancement and Out-Loop Super-Resolution
* Keep CALM and Improve Visual Feature Attribution
* Kernel correlation filter tracking strategy based on adaptive fusion response map
* Kernel Methods in Hyperbolic Spaces
* Kernel Ridge Regression Hybrid Method for Wheat Yield Prediction with Satellite-Derived Predictors
* Keypoint Communities
* KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs
* Knowledge Mining and Transferring for Domain Adaptive Object Detection
* Knowledge-Enriched Distributional Model Inversion Attacks
* KoDF: A Large-scale Korean DeepFake Detection Dataset
* L-Sign: Large-Vocabulary Sign Gestures Recognition System
* Labels4Free: Unsupervised Segmentation using StyleGAN
* LabOR: Labeling Only if Required for Domain Adaptive Semantic Segmentation
* LaLaLoc: Latent Layout Localisation in Dynamic, Unvisited Environments
* Land Surface Phenology Retrieval through Spectral and Angular Harmonization of Landsat-8, Sentinel-2 and Gaofen-1 Data
* Landscape-Based Habitat Suitability Model (LHS Model) for Oriental Migratory Locust Area Extraction at Large Scales: A Case Study along the Middle and Lower Reaches of the Yellow River, A
* Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism
* LapsCore: Language-guided Person Search via Color Reasoning
* Large Scale Interactive Motion Forecasting for Autonomous Driving: The Waymo Open Motion Dataset
* Large Scale Multi-Illuminant (LSMI) Dataset for Developing White Balance Algorithm under Mixed Illumination
* Large-Scale Frontal Vehicle Image Dataset for Fine-Grained Vehicle Categorization, A
* Large-scale Robust Deep AUC Maximization: A New Surrogate Loss and Empirical Studies on Medical Image Classification
* Latent Transformations via NeuralODEs for GAN-based Image Editing
* Latent Transformer for Disentangled Face Editing in Images and Videos, A
* LatentCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions
* LayoutTransformer: Layout Generation and Completion with Self-attention
* Lazily Aggregated Quantized Gradient Innovation for Communication-Efficient Federated Learning
* Lazy Approach to Long-Horizon Gradient-Based Meta-Learning, A
* LD-GAN: Learning perturbations for adversarial defense based on GAN structure
* Learn to Cluster Faces via Pairwise Classification
* Learn to Match: Automatic Matching Network Design for Visual Tracking
* Learn-to-Race: A Multimodal Control Environment for Autonomous Racing
* Learnable Boundary Guided Adversarial Training
* Learned Spatial Representations for Few-shot Talking-Head Synthesis
* Learning 3D Semantic Scene Graphs with Instance Embeddings
* Learning A Single Network for Scale-Arbitrary Super-Resolution
* Learning a Sketch Tensor Space for Image Inpainting of Man-made Scenes
* Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization
* Learning an Augmented RGB Representation with Cross-Modal Knowledge Distillation for Action Detection
* Learning Anchored Unsigned Distance Functions with Gradient Direction Alignment for Single-view Garment Reconstruction
* Learning Attribute-driven Disentangled Representations for Interactive Fashion Retrieval
* Learning Better Visual Data Similarities via New Grouplet Non-Euclidean Embedding
* Learning Bias-Invariant Representation by Cross-Sample Mutual Information Minimization
* Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences
* Learning Canonical 3D Object Representation for Fine-Grained Recognition
* Learning Canonical View Representation for 3D Shape Recognition with Arbitrary Views
* Learning Causal Representation for Training Cross-Domain Pose Estimator via Generative Interventions
* Learning Clustering for Motion Segmentation
* Learning Compatible Embeddings
* Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment
* Learning Cross-Modal Contrastive Features for Video Domain Adaptation
* Learning Deep Local Features with Multiple Dynamic Attentions for Large-Scale Image Retrieval
* Learning Dual Priors for JPEG Compression Artifacts Removal
* Learning Dynamic Interpolation for Extremely Sparse Light Fields with Wide Baselines
* Learning Efficient Photometric Feature Transform for Multi-view Stereo
* Learning Facial Representations from the Cycle-consistency of Face
* Learning Fast Sample Re-weighting Without Reward Data
* Learning Feature Channel Weighting for Real-Time Visual Tracking
* Learning Frequency-aware Dynamic Network for Efficient Super-Resolution
* Learning from Noisy Data with Robust Representation Learning
* Learning From Web Recipe-Image Pairs for Food Recognition: Problem, Baselines and Performance
* Learning Generalized Transformation Equivariant Representations Via AutoEncoding Transformations
* Learning Generative Models of Textured 3D Meshes from Real-World Images
* Learning Hierarchical Graph Neural Networks for Image Clustering
* Learning High-Fidelity Face Texture Completion without Complete Face Texture
* Learning Icosahedral Spherical Probability Map Based on Bingham Mixture Model for Vanishing Point Estimation
* Learning image aesthetic subjectivity from attribute-aware relational reasoning network
* Learning Image Representation via Attribute-Aware Attention Networks for Fashion Classification
* Learning Image-Adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-Time
* Learning Indoor Inverse Rendering with 3D Spatially-Varying Lighting
* Learning Inner-Group Relations on Point Clouds
* Learning Instance-level Spatial-Temporal Patterns for Person Re-identification
* Learning interlaced sparse Sinkhorn matching network for video super-resolution
* Learning Latent Architectural Distribution in Differentiable Neural Architecture Search via Variational Information Maximization
* Learning Meta-class Memory for Few-Shot Semantic Segmentation
* Learning Motion Priors for 4D Human Body Capture in 3D Scenes
* Learning Motion-Appearance Co-Attention for Zero-Shot Video Object Segmentation
* Learning Multi-Scene Absolute Pose Regression with Transformers
* Learning Multiple Pixelwise Tasks Based on Loss Scale Balancing
* Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering
* Learning of Visual Relations: The Devil is in the Tails
* Learning Pain from Action Unit Combinations: A Weakly Supervised Approach via Multiple Instance Learning
* Learning Privacy-preserving Optics for Human Pose Estimation
* Learning Rare Category Classifiers on a Tight Labeling Budget
* Learning RAW-to-sRGB Mappings with Inaccurately Aligned Supervision
* Learning Realistic Human Reposing using Cyclic Self-Supervision with 3D Shape, Pose, and Appearance Consistency
* Learning Residual Color for Novel View Synthesis
* Learning Scene Dynamics from Point Cloud Sequences
* Learning Self-Consistency for Deepfake Detection
* Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition
* Learning Signed Distance Field for Multi-view Surface Reconstruction
* Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation
* Learning Spatio-Temporal Transformer for Visual Tracking
* Learning specialized activation functions with the Piecewise Linear Unit
* Learning Target Candidate Association to Keep Track of What Not to Track
* Learning Temporal Dynamics from Cycles in Narrated Video
* Learning to Adversarially Blur Visual Object Tracking
* Learning to Better Segment Objects from Unseen Classes with Unlabeled Videos
* Learning to Bundle-adjust: A Graph Network Approach to Faster Optimization of Bundle Adjustment for Vehicular SLAM
* Learning to Classify Weather Conditions from Single Images Without Labels
* Learning to combine the modalities of language and video for temporal moment localization
* Learning to compress videos without computing motion
* Learning to Cut by Watching Movies
* Learning to Detect Instance-Level Salient Objects Using Complementary Image Labels
* Learning to Discover Reflection Symmetry via Polar Matching Convolution
* Learning to Diversify for Single Domain Generalization
* Learning to drive from a world on rails
* Learning to Estimate Hidden Motions with Global Motion Aggregation
* Learning to Generate Scene Graph from Natural Language Supervision
* Learning to Hallucinate Examples from Extrinsic and Intrinsic Supervision
* Learning to Know Where to See: A Visibility-Aware Approach for Occluded Person Re-identification
* Learning to Match Features with Seeded Graph Matching Network
* Learning to Recognize Human Actions From Noisy Skeleton Data Via Noise Adaptation
* Learning to rectify for robust learning with noisy labels
* Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data
* Learning to Regress Bodies from Images using Differentiable Semantic Rendering
* Learning to Remove Refractive Distortions from Underwater Images
* Learning to Resize Images for Computer Vision Tasks
* Learning to Stylize Novel Views
* Learning to Track Objects from Unlabeled Videos
* Learning to Track with Object Permanence
* Learning Unsupervised Metaformer for Anomaly Detection
* Learning Video Moment Retrieval Without a Single Annotated Video
* Learning with Memory-based Virtual Classes for Deep Metric Learning
* Learning with Noisy Labels for Robust Point Cloud Segmentation
* Learning with Noisy Labels via Sparse Regularization
* Learning With Privileged Multimodal Knowledge for Unimodal Segmentation
* Learning with Privileged Tasks
* Learning-Based Rate Control for Video-Based Point Cloud Compression
* Let's See Clearly: Contaminant Artifact Removal for Moving Cameras
* Leveraging Auxiliary Tasks with Affinity Learning for Weakly Supervised Semantic Segmentation
* Leveraging Selective Prediction for Reliable Image Geolocation
* Leveraging the Deep Learning Paradigm for Continuous Affect Estimation from Facial Expressions
* LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference
* LFI-CAM: Learning Feature Importance for Better Visual Explanation
* LiDAR Voxel-Size Optimization for Canopy Gap Estimation
* Lifelong Infinite Mixture Model Based on Knowledge-Driven Dirichlet Process
* LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D Detector
* Light Field Saliency Detection with Dual Local Graph Learning and Reciprocative Guidance
* Light Source Guided Single-Image Flare Removal from Unpaired Data
* Light Stage on Every Desk, A
* Light-Field Microscopy for the Optical Imaging of Neuronal Activity: When model-based methods meet data-driven approaches
* lightweight capsule network architecture for detection of COVID-19 from lung CT scans, A
* Lightweight Image Super-Resolution With Expectation-Maximization Attention Mechanism
* Lightweight Multi-person Total Motion Capture Using Sparse Multi-view Cameras
* Lightweight Tensor Deep Computation Model With Its Application in Intelligent Transportation Systems
* Lightweight Wavelet-Based Network for JPEG Artifacts Removal
* Likelihood-Based Diverse Sampling for Trajectory Forecasting
* Linguistically Routing Capsule Network for Out-of-distribution Visual Question Answering
* Lipschitz Continuity Guided Knowledge Distillation
* LIRA: Learnable, Imperceptible and Robust Backdoor Attacks
* Liveness-Enforcing Supervisor Tolerant to Sensor-Reading Modification Attacks, A
* Liver tumor segmentation from computed tomography images using multiscale residual dilated encoder-decoder network
* LLQA: Lifelog Question Answering Dataset
* Local and Global Perception Generative Adversarial Network for Facial Expression Synthesis
* Local Temperature Scaling for Probability Calibration
* Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization
* Localized multiple kernel learning using graph modularity
* Localized Simple Multiple Kernel K-means
* LocalTrans: A Multiscale Local Transformer Network for Cross-Resolution Homography Estimation
* Location-aware Single Image Reflection Removal
* LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision
* Loess Landslide Detection Using Object Detection Algorithms in Northwest China
* LoFGAN: Fusing Local Representations for Few-shot Image Generation
* LOKI: Long Term and Key Intentions for Trajectory Prediction
* Long Short View Feature Decomposition via Contrastive Video Representation Learning
* Long-Periodic Analysis of Boresight Misalignment of Ziyuan3-01 Three-Line Camera
* Long-Range Feature Dependencies Capturing for Low-Resolution Image Classification
* Long-Range Multi-Object Tracking at Traffic Intersections on Low-Power Devices
* Long-Term Temporally Consistent Unpaired Video Translation from Simulated Surgical 3D Data
* Long-Term Visual Localization Revisited
* Looking here or there? Gaze Following in 360-Degree Images
* LookOut: Diverse Multi-Future Prediction and Planning for Self-Driving
* LoOp: Looking for Optimal Hard Negative Embeddings for Deep Metric Learning
* Loss function search for person re-identification
* Low Curvature Activations Reduce Overfitting in Adversarial Training
* Low-Rank High-Order Tensor Completion With Applications in Visual Data
* Low-Rank Tensor Completion by Approximating the Tensor Average Rank
* Low-Rank Tucker-2 Model for Multi-Subject fMRI Data Decomposition With Spatial Sparsity Constraint
* Low-Shot Validation: Active Importance Sampling for Estimating Classifier Performance on Rare Categories
* LR-SVM+: Learning Using Privileged Information with Noisy Labels
* LSD-StructureNet: Modeling Levels of Structural Detail in 3D Part Hierarchies
* LSG-CPD: Coherent Point Drift with Local Surface Geometry for Point Cloud Registration
* Lucas-Kanade Reloaded: End-to-End Super-Resolution from Raw Image Bursts
* Lunar Mare Fecunditatis: A Science-Rich Region and a Concept Mission for Long-Distance Exploration
* M3D-VTON: A Monocular-to-3D Virtual Try-On Network
* MAAS: Multi-modal Assignation for Active Speaker Detection
* machine learning approach for identifying and delineating agricultural fields and their multi-temporal dynamics using three decades of Landsat data, A
* Machine Teaching Framework for Scalable Recognition, A
* Making Few-Shot Object Detection Simpler and Less Frustrating
* Making Higher Order MOT Scalable: An Efficient Approximate Solver for Lifted Disjoint Paths
* Malignancy detection on mammograms by integrating modified convolutional neural network classifier and texture features
* MAMask: Multi-feature aggregation instance segmentation with pyramid attention mechanism
* Manifold Alignment for Semantically Aligned Style Transfer
* Manifold Matching via Deep Metric Learning for Generative Modeling
* Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization, The
* Map Matching and Lane Detection Based on Markovian Behavior, GIS, and IMU Data
* Mapping corn and soybean phenometrics at field scales over the United States Corn Belt by fusing time series of Landsat 8 and Sentinel-2 data with VIIRS data
* Mapping Forest Aboveground Biomass Using Multisource Remotely Sensed Data
* Mask encoding: A general instance mask representation for object segmentation
* Matching in the Dark: A Dataset for Matching Image Pairs of Low-light Scenes
* Maximizing the latency fairness in UAV-assisted MEC system
* MBA-VO: Motion Blur Aware Visual Odometry
* MC-Calib: A generic and robust calibration toolbox for multi-camera systems
* mDALU: Multi-Source Domain Adaptation and Label Unification with Partial Datasets
* MDETR: Modulated Detection for End-to-End Multi-Modal Understanding
* Me-Momentum: Extracting Hard Confident Examples from Noisily Labeled Data
* ME-PCN: Point Completion Conditioned on Mask Emptiness
* Mean Shift for Self-Supervised Learning
* Medical image segmentation using deep learning: A survey
* MEDIRL: Predicting the Visual Attention of Drivers via Maximum Entropy Deep Inverse Reinforcement Learning
* Melody Generation from Lyrics Using Three Branch Conditional LSTM-GAN
* Membership Inference Attacks are Easier on Difficult Problems
* Memory-Augmented Dynamic Neural Relational Inference
* MES-P: An Emotional Tonal Speech Dataset in Mandarin with Distal and Proximal Labels
* Mesh Graphormer
* MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement
* Meta captioning: A meta learning based remote sensing image captioning framework
* Meta Gradient Adversarial Attack
* Meta Learning on a Sequence of Imbalanced Domains with Difficulty Awareness
* Meta Navigator: Search for a Good Adaptation Policy for Few-shot Learning
* Meta Pairwise Relationship Distillation for Unsupervised Person Re-identification
* Meta PID Attention Network for Flexible and Efficient Real-World Noisy Image Denoising
* Meta-Aggregator: Learning to Aggregate for 1-bit Graph Neural Networks
* Meta-Attack: Class-agnostic and Model-agnostic Physical Adversarial Attack
* Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning
* Meta-Learning with Task-Adaptive Loss Function for Few-Shot Learning
* method for detecting and classifying the tumor regions in brain MRI images using vector index filtering and ANFIS classification process, A
* Methods of Sandy Land Detection in a Sparse-Vegetation Scene Based on the Fusion of HJ-2A Hyperspectral and GF-3 SAR Data
* MEViT: Motion Enhanced Video Transformer for Video Classification
* MF-GAN: Multi-conditional Fusion Generative Adversarial Network for Text-to-Image Synthesis
* MFAUNet: Multiscale feature attentive U-Net for cardiac MRI structural segmentation
* MFFNet: Single facial depth map refinement using multi-level feature fusion
* MFNet: Multi-filter Directive Network for Weakly Supervised Salient Object Detection
* MG-GAN: A Multi-Generator Model Preventing Out-of-Distribution Samples in Pedestrian Trajectory Prediction
* MGMP: Multimodal Graph Message Propagation Network for Event Detection
* MGNet: Monocular Geometric Scene Understanding for Autonomous Driving
* MGSampler: An Explainable Sampling Strategy for Video Action Recognition
* Micro and Macro Facial Expression Recognition Using Advanced Local Motion Patterns
* Micro-Motion Classification of Flying Bird and Rotor Drones via Data Augmentation and Modified Multi-Scale CNN
* MicroNet: Improving Image Recognition with Extremely Low FLOPs
* MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis
* Minimal Adversarial Examples for Deep Learning on 3D Point Clouds
* Minimal Cases for Computing the Generalized Relative Pose using Affine Correspondences
* Minimal Solutions for Panoramic Stitching Given Gravity Prior
* Mining Contextual Information Beyond Image for Semantic Segmentation
* Mining Cross-Domain Structure Affinity for Refined Building Segmentation in Weakly Supervised Constraints
* Mining Latent Classes for Few-shot Segmentation
* Mining Minority-Class Examples with Uncertainty Estimates
* Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields
* Mitigating Intensity Bias in Shadow Detection via Feature Decomposition and Reweighting
* Mixed SIGNals: Sign Language Production via a Mixture of Motion Primitives
* Mixed Structure with 3D Multi-Shortcut-Link Networks for Hyperspectral Image Classification
* Mixing up contrastive learning: Self-supervised representation learning for time series
* MixMix: All You Need for Data-Free Compression Are Feature and Data Mixing
* MixMo: Mixing Multiple Inputs for Multiple Outputs via Deep Subnetworks
* Mixture-based Feature Space Learning for Few-shot Image Classification
* MLVSNet: Multi-level Voting Siamese Network for 3D Visual Tracking
* Modeling and Control Using Stochastic Distribution Control Theory for Intersection Traffic Flow
* Modeling Drivers' Strategy When Overtaking Cyclists in the Presence of Oncoming Traffic
* Modelling and Assessment of Single-Frequency PPP Time Transfer with BDS-3 B1I and B1C Observations
* Modelling Neighbor Relation in Joint Space-Time Graph for Video Correspondence Learning
* Modelling people's perceived scene complexity of real-world environments using street-view panoramas and open geodata
* Modelling the road network capacity considering residual queues and connected automated vehicles
* Modern Dryland Source-to-Sink System Segments and Coupling Relationships from Digital Elevation Model Analysis: A Case Study from the Mongolian Altai
* Modulated Graph Convolutional Network for 3D Human Pose Estimation
* Modulated Periodic Activations for Generalizable Local Functional Representations
* Monitoring Irrigation Events and Crop Dynamics Using Sentinel-1 and Sentinel-2 Time Series
* Monitoring of Radial Deformations of a Gravity Dam Using Sentinel-1 Persistent Scatterer Interferometry
* Monitoring the Spatio-Temporal Dynamics of Shale Oil/Gas Development with Landsat Time Series: Case Studies in the USA
* Monocular Depth Perception on Microcontrollers for Edge Applications
* Monocular, One-stage, Regression of Multiple 3D People
* MonoIndoor: Towards Good Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments
* MonteFloor: Extending MCTS for Reconstructing Accurate Large-Scale Floor Plans
* Morphable Detector for Object Detection on Demand
* MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection
* Motion Adaptive Pose Estimation from Compressed Videos
* Motion Basis Learning for Unsupervised Deep Homography Estimation with Subspace Projection
* Motion Compensation Method for Shipborne HFSWR by Using Dual Reference RF Signals Generated Onshore, A
* Motion Deblurring with Real Events
* Motion Guided Attention Fusion to Recognize Interactions from Videos
* Motion Guided Region Message Passing for Video Captioning
* Motion Prediction using Trajectory Cues
* Motion-Augmented Self-Training for Video Recognition at Smaller Scale
* Motion-Aware Dynamic Architecture for Efficient Frame Interpolation
* Motion-Focused Contrastive Learning of Video Representations
* MOTSynth: How Can Synthetic Data Help Pedestrian Detection and Tracking?
* Move2Hear: Active Audio-Visual Source Separation
* MoViDNN: A Mobile Platform for Evaluating Video Quality Enhancement with Deep Neural Networks
* MP2020: Visual quality assessment database for macro photography images
* mRMR-based hybrid convolutional neural network model for classification of Alzheimer's disease on brain magnetic resonance images
* MSR-GCN: Multi-Scale Residual Graph Convolution Networks for Human Motion Prediction
* MT-ORL: Multi-Task Occlusion Relationship Learning
* MTCNet: Multi-task collaboration network for rotation-invariance face detection
* Multi- and dual-tuned microstripline-based transmit/receive switch for 7-Tesla magnetic resonance imaging
* Multi-Anchor Active Domain Adaptation for Semantic Segmentation
* Multi-Class Cell Detection Using Spatial Context Representation
* Multi-Class Multi-Instance Count Conditioned Adversarial Image Generation
* Multi-complementary and unlabeled learning for arbitrary losses and models
* Multi-Controller Deployment in SDN-Enabled 6G Space-Air-Ground Integrated Network
* Multi-Echo LiDAR for 3D Object Detection
* Multi-Expert Adversarial Attack Detection in Person Re-identification Using Context Inconsistency
* Multi-frame Motion Segmentation by Combining Two-Frame Results
* Multi-Fusion Residual Memory Network for Multimodal Human Sentiment Comprehension
* Multi-Graph Fusion and Learning for RGBT Image Saliency Detection
* Multi-Instance Pose Networks: Rethinking Top-Down Pose Estimation
* Multi-Label Multi-Task Deep Learning for Behavioral Coding
* Multi-Lane Detection and Tracking Using Temporal-Spatial Model and Particle Filtering
* Multi-Level Curriculum for Training A Distortion-Aware Barrel Distortion Rectification Model
* Multi-Modal Fusion Network for Rumor Detection with Texts and Images
* Multi-modal Interactive Video Retrieval with Temporal Queries
* Multi-Modal Multi-Action Video Recognition
* Multi-modal Semantic Inconsistency Detection in Social Media News Posts
* Multi-Modal Variational Graph Auto-Encoder for Recommendation Systems
* Multi-Modal Video Retrieval in Virtual Reality with VITRIVR-VR
* Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video
* Multi-Mode Modulator for Multi-Domain Few-Shot Classification, A
* Multi-object Tracking with a Hierarchical Single-Branch Network
* Multi-scale Cross-Modal Transformer Network for RGB-D Object Detection
* multi-scale learning method with dilated convolutional network for concrete surface cracks detection, A
* Multi-scale Matching Networks for Semantic Correspondence
* Multi-Scale Separable Network for Ultra-High-Definition Video Deblurring
* Multi-Scale Ship Detection Algorithm Based on a Lightweight Neural Network for Spaceborne SAR Images
* Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
* Multi-scale visual attention for attribute disambiguation in zero-shot learning
* Multi-Sensor Fusion Self-Supervised Deep Odometry and Depth Estimation
* Multi-Source Domain Adaptation for Object Detection
* Multi-Source Unsupervised Domain Adaptation via Pseudo Target Domain
* Multi-Target Adversarial Frameworks for Domain Adaptation in Semantic Segmentation
* Multi-task Facial Activity Patterns Learning for micro-expression recognition using Joint Temporal Local Cube Binary Pattern
* Multi-Task Self-Training for Learning General Representations
* Multi-Temporal Analysis of Changes of the Southern Part of the Baltic Sea Coast Using Aerial Remote Sensing Data
* Multi-VAE: Learning Disentangled View-common and View-peculiar Visual Representations for Multi-view Clustering
* Multi-view 3D Reconstruction with Transformers
* Multi-View Radar Semantic Segmentation
* Multicomponent Temporal Coherence Model for 3-D Phase Unwrapping in Time-Series InSAR of Seasonal Deformation Areas, A
* Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
* Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images
* Multimodal Deception Detection Using Real-Life Trial Data
* Multimodal Embedding for Lifelog Retrieval
* Multimodal Knowledge Expansion
* Multimodal Self-Assessed Personality Estimation During Crowded Mingle Scenarios Using Wearables Devices and Cameras
* Multimodal Unsupervised Image-to-Image Translation Without Independent Style Encoder
* Multiplayer VR Live Concert With Information Exchange Through Feedback Modulated by EEG Signals, A
* Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts
* Multiple Instance Learning for Emotion Recognition Using Physiological Signals
* Multiple Pairwise Ranking Networks for Personalized Video Summarization
* Multiple Positives Enhanced NCE Loss for Image-Text Retrieval, A
* Multiple UAV Flights across the Growing Season Can Characterize Fine Scale Phenological Heterogeneity within and among Vegetation Functional Groups
* Multiresolution Deep Implicit Functions for 3D Shape Representation
* Multiscale Vision Transformers
* MultiSiam: Self-supervised Multi-instance Siamese Representation Learning for Autonomous Driving
* Multisource Vegetation Inventory (MVI): A Satellite-Based Forest Inventory for the Northwest Territories Taiga Plains, The
* Multispectral illumination estimation using deep unrolling network
* MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions
* Multitask AET with Orthogonal Tangent Regularity for Dark Object Detection
* Multitask Deep Learning Reconstruction and Localization of Lesions in Limited Angle Diffuse Optical Tomography
* Multiview Pseudo-Labeling for Semi-supervised Learning from Video
* Multiview Video-Based 3-D Pose Estimation of Patients in Computer-Assisted Rehabilitation Environment (CAREN)
* MUSIQ: Multi-Scale Image Quality Transformer
* Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution
* Mutual Supervision for Dense Object Detection
* Mutual-Complementing Framework for Nuclei Detection and Segmentation in Pathology Image
* MVSNeRF: Fast Generalizable Radiance Field Reconstruction from Multi-View Stereo
* MVTN: Multi-View Transformation Network for 3D Shape Recognition
* N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras
* NAS-OoD: Neural Architecture Search for Out-of-Distribution Generalization
* NASOA: Towards Faster Task-Oriented Online Fine-Tuning with a Zoo of Models
* Naturalistic Physical Adversarial Patch for Object Detectors
* NEAT: Neural Attention Fields for End-to-End Autonomous Driving
* NeRD: Neural Reflectance Decomposition from Image Collections
* Nerfies: Deformable Neural Radiance Fields
* NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo
* Network Adjustment: Channel and Block Search Guided by Resource Utilization Ratio
* Neural Architecture Search for Joint Human Parsing and Pose Estimation
* Neural Articulated Radiance Field
* Neural Image Compression via Attentional Multi-scale Back Projection and Frequency Decomposition
* Neural Photofit: Gaze-based Mental Image Reconstruction
* Neural Radiance Flow for 4D View Synthesis and Video Processing
* Neural Strokes: Stylized Line Drawing of 3D Shapes
* Neural TMDlayer: Modeling Instantaneous flow of features via SDE Generators
* Neural Video Portrait Relighting in Real-time via Consistency Modeling
* Neural-GIF: Neural Generalized Implicit Functions for Animating People in Clothing
* NeuSpike-Net: High Speed Video Reconstruction via Bio-inspired Neuromorphic Cameras
* new framework of designing iterative techniques for image deblurring, A
* New Journey from SDRTV to HDRTV, A
* New Methodology for Bridge Inspections in Linear Infrastructures from Optical Images and HD Videos Obtained by UAV, A
* New Spatial Filtering Algorithm for Noisy and Missing GNSS Position Time Series Using Weighted Expectation Maximization Principal Component Analysis: A Case Study for Regional GNSS Network in Xinjiang Province, A
* NGC: A Unified Framework for Learning with Open-World Noisy Data
* Nightlights and Subnational Economic Activity: Estimating Departmental GDP in Paraguay
* No-Reference Quality Assessment of Pan-Sharpening Images with Multi-Level Deep Image Representations
* Non-Local Meets Global: An Iterative Paradigm for Hyperspectral Image Restoration
* Non-Rigid Neural Radiance Fields: Reconstruction and Novel View Synthesis of a Dynamic Scene From Monocular Video
* Non-Uniform Attention Network for Multi-modal Sentiment Analysis
* Nonlocal convolutional block attention module VNet for gliomas automatic segmentation
* Nonlocal Feature Selection Encoder-Decoder Network for Accurate InSAR Phase Filtering
* Normalization Matters in Weakly Supervised Object Localization
* Normalized Human Pose Features for Human Action Video Alignment
* Not All Operations Contribute Equally: Hierarchical Operation-adaptive Predictor for Neural Architecture Search
* Novel Chinese Sarcasm Detection Model Based on Retrospective Reader, A
* Novel Coding Architecture for Multi-Line LiDAR Point Clouds Based on Clustering and Convolutional LSTM Network, A
* Novel Gate Resource Allocation Method Using Improved PSO-Based QEA, A
* novel image quality assessment method and coefficient of quality for digital solutions of colour blindness, A
* Novel Moving Coprime Array Configurations for Real-Valued Sources
* Novel Sentiment Polarity Detection Framework for Chinese, A
* Novel Spectral Index for Automatic Canola Mapping by Using Sentinel-2 Imagery, A
* Novel Time-Domain Frequency Diverse Array HRWS Imaging Scheme for Spotlight SAR, A
* NPMs: Neural Parametric Models for 3D Deformable Shapes
* Numerical Prediction of Duality Principle with Bloch-Floquet Periodic Boundary Condition in Fully Anisotropic FDTD
* OadTR: Online Action Detection with Transformers
* OAST: Obstacle Avoidance System for Teleoperation of UAVs
* Object Tracking by Jointly Exploiting Frame and Event Domain
* Object-Based Approach to Map Young Forest and Shrubland Vegetation Based on Multi-Source Remote Sensing Data, An
* Object-Based Genetic Programming Approach for Cropland Field Extraction, An
* Objects as Cameras: Estimating High-Frequency Illumination from Shadows
* OC4-SO: A New Chlorophyll-a Algorithm for the Western Antarctic Peninsula Using Multi-Sensor Satellite Data
* Occlude Them All: Occlusion-Aware Attention Network for Occluded Person Re-ID
* Occluded Person Re-Identification with Single-scale Global Representations
* Occlusion-Aware Unsupervised Learning of Depth From 4-D Light Fields
* Occlusion-Aware Video Object Inpainting
* OCmst: One-class novelty detection using convolutional neural network and minimum spanning trees
* ODAM: Object Detection, Association, and Mapping using Posed RGB Video
* OMNet: Learning Overlapping Mask for Partial-to-Partial Point Cloud Registration
* Omni-GAN: On the Secrets of cGANs and Beyond
* Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans
* Omniscient Video Super-Resolution
* On Assisting Diagnoses of Pareidolia by Emulating Patient Behavior
* On Compositions of Transformations in Contrastive Self-Supervised Learning
* On Emotions as Features for Speech Overlaps Classification
* On Equivariant and Invariant Learning of Object Landmark Representations
* On Exposing the Challenging Long Tail in Future Prediction of Traffic Actors
* On Feature Decorrelation in Self-Supervised Learning
* On Generating Transferable Targeted Perturbations
* On Infinite Past Predictability of Cyclostationary Signals
* On the hidden treasure of dialog in video question answering
* On the Importance of Distractors for Few-Shot Classification
* On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation
* On the Prediction Policy for Timely Status Updates in Space-Air-Ground Integrated Transportation Systems
* On the Robustness of Vision Transformers to Adversarial Examples
* On The Security of Block Permutation and Co-XOR in Reversible Data Hiding
* On-the-Fly Facial Expression Prediction Using LSTM Encoded Appearance-Suppressed Dynamics
* Once Quantization-Aware Training: High Performance Extremely Low-bit Architecture Search
* One-pass Multi-view Clustering for Large-scale Data
* One-Stage Image Inpainting with Hybrid Attention
* Online Continual Learning with Natural Distribution Shifts: An Empirical Study with Visual Data
* Online Knowledge Distillation for Efficient Pose Estimation
* Online Multi-Granularity Distillation for GAN Compression
* Online Pseudo Label Generation by Hierarchical Cluster Dynamics for Adaptive Person Re-identification
* Online real-time pedestrian tracking from medium altitude aerial footage with camera motion cancellation
* Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization
* Online-trained Upsampler for Deep Low Complexity Video Compression
* OpenForensics: Large-Scale Challenging Dataset For Multi-Face Forgery Detection And Segmentation In-The-Wild
* OpenGAN: Open-Set Recognition via Open Data Generation
* Optimal Assignments in Mobility-on-Demand Systems Using Event-Driven Receding Horizon Control
* ORBIT: A Real-World Few-Shot Dataset for Teachable Object Recognition
* Oriented Object Detection in Remote Sensing Images with Anchor-Free Oriented Region Proposal Network
* Oriented R-CNN for Object Detection
* Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation
* Orthogonal Projection Loss
* Orthographic-Perspective Epipolar Geometry
* OSCAR-Net: Object-centric Scene Graph Attention for Image Attribution
* Out-of-boundary View Synthesis Towards Full-Frame Video Stabilization
* Out-of-Core Surface Reconstruction via Global TGV Minimization
* OVANet: One-vs-All Network for Universal Domain Adaptation
* Overfitting the Data: Compact Neural Video Delivery via Content-aware Feature Modulation
* Overview of the Special Issue on Applications of Remote Sensing Imagery for Urban Areas
* P2-Net: Joint Description and Detection of Local Features for Pixel and Point Matching
* Paint Transformer: Feed Forward Neural Painting with Stroke Prediction
* Painting from Part
* Pano-AVQA: Grounded Audio-Visual Question Answering on 360° Videos
* Panoptic Narrative Grounding
* Panoptic Segmentation of Satellite Image Time Series with Convolutional Temporal Attention Networks
* Parallax Attention for Unsupervised Stereo Correspondence Learning
* Parallel DBSCAN-Martingale Estimation of the Number of Concepts for Automatic Satellite Image Clustering
* Parallel Detection-and-Segmentation Learning for Weakly Supervised Instance Segmentation
* Parallel Multi-Resolution Fusion Network for Image Inpainting
* Parallel Rectangle Flip Attack: A Query-based Black-box Attack against Object Detection
* Parameter Adaptation and Situation Awareness of LTE-R Handover for High-Speed Railway Communication
* Parameter selection of Touzi decomposition and a distribution improved autoencoder for PolSAR image classification
* Parametric Contrastive Learning
* PARE: Part Attention Regressor for 3D Human Body Estimation
* Parsimonious Gap-Filling Models for Sub-Daily Actual Evapotranspiration Observations from Eddy-Covariance Systems
* Parsing Table Structures in the Wild
* Partial Off-policy Learning: Balance Accuracy and Diversity for Human-Oriented Image Captioning
* Partial Video Domain Adaptation with Partial Adversarial Temporal Attentive Network
* Participatory Design of Affective Technology: Interfacing Biomusic and Autism
* Partner-Assisted Learning for Few-Shot Image Classification
* PARTS: Unsupervised segmentation with slots, attention and independence maximization
* PASS: Protected Attribute Suppression System for Mitigating Bias in Face Recognition
* Patch Craft: Video Denoising by Deep Modeling and Patch Matching
* Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval from a Single Image
* Patching Your Clothes: Semantic-Aware Learning for Cloth-Changed Person Re-Identification
* PatchMatch-RL: Deep MVS with Pixelwise Depth, Normal, and Visibility
* Path tracking control for autonomous vehicles with saturated input: A fuzzy fixed-time learning control approach
* Pathdreamer: A World Model for Indoor Navigation
* PCAM: Product of Cross-Attention Matrices for Rigid Registration of Point Clouds
* Pedestrian Crossing Intention Prediction at Red-Light Using Pose Estimation
* Perception-Aware Multi-Sensor Fusion for 3D LiDAR Semantic Segmentation
* Perceptual Variousness Motion Deblurring with Light Global Context Refinement
* Perceptually Unimportant Information Reduction and Cosine Similarity-Based Quality Assessment of 3D-Synthesized Images
* Performance of a Link in a Field of Vehicular Interferers With Hardcore Headway Distance
* Performance of BDS B1 Frequency Standard Point Positioning during the Main Phase of Different Classified Geomagnetic Storms in China and the Surrounding Area
* PersEmoN: A Deep Network for Joint Analysis of Apparent Personality, Emotion and Their Relationship
* Persistent Homology based Graph Convolution Network for Fine-grained 3D Shape Segmentation
* Personality Assessment Based on Multimodal Attention Network Learning With Category-Based Mean Square Error
* Personalized and Invertible Face De-identification by Disentangled Identity Information Manipulation
* Personalized Fashion Recommendation Using Pairwise Attention
* Personalized Image Aesthetics Assessment via Meta-Learning With Bilevel Gradient Optimization
* Personalized Image Semantic Segmentation
* Personalized Trajectory Prediction via Distribution Discrimination
* Perturbed Self-Distillation: Weakly Supervised Large-Scale Point Cloud Semantic Segmentation
* PF-VTON: Toward High-Quality Parser-Free Virtual Try-On Network
* Phase retrieval from incomplete data via weighted nuclear norm minimization
* photo-based quality assessment model for the estimation of PM2.5 concentrations, A
* Photon-Starved Scene Inference using Single Photon Cameras
* Physics-based Differentiable Depth Sensor Simulation
* Physics-based Human Motion Estimation and Synthesis from Videos
* Physics-Enhanced Machine Learning for Virtual Fluorescence Microscopy
* Pi-NAS: Improving Neural Architecture Search by Reducing Supernet Training Consistency Shift
* PIAP-DF: Pixel-Interested and Anti Person-Specific Facial Action Unit Detection Net with Discrete Feedback Learning
* PicArrange: Visually Sort, Search, and Explore Private Images on a Mac Computer
* PICCOLO: Point Cloud-Centric Omnidirectional Localization
* PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering
* PIT: Position-Invariant Transform for Cross-FoV Domain Adaptation
* Pixel Contrastive-Consistent Semi-Supervised Semantic Segmentation
* Pixel Difference Networks for Efficient Edge Detection
* Pixel-Perfect Structure-from-Motion with Featuremetric Refinement
* PixelPyramids: Exact Inference Models from Lossless Image Pyramids
* PixelSynth: Generating a 3D-Consistent Experience from a Single Image
* Planar Surface Reconstruction from Sparse Views
* PlaneTR: Structure-Guided Transformers for 3D Plane Recovery
* Plant leaf disease detection using deep learning on mobile devices
* Platoon Trajectories Generation: A Unidirectional Interconnected LSTM-Based Car-Following Model
* PlenOctrees for Real-time Rendering of Neural Radiance Fields
* PnP-DETR: Towards Efficient Visual Analysis with Transformers
* PoGO-Net: Pose Graph Optimization with Graph Neural Networks
* Point Cloud Augmentation with Weighted Local Transformations
* Point Cloud Upsampling via a Coarse-to-Fine Network
* Point Transformer
* Point-Based Modeling of Human Clothing
* Point-set Distances for Learning Representations of 3D Point Clouds
* PointBA: Towards Backdoor Attacks in 3D Point Cloud
* PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers
* Poisson kernel: Avoiding self-smoothing in graph convolutional networks
* Polarimetric Helmholtz Stereopsis
* Polarization Estimation with a Single Vector Sensor for Radar Detection
* PolGAN: A deep-learning-based unsupervised forest height estimation based on the synergy of PolInSAR and LiDAR data
* Poly-NL: Linear Complexity Non-local Layers With 3rd Order Polynomials
* Pose Correction for Highly Accurate Visual Localization in Large-scale Indoor Spaces
* Pose Invariant Topological Memory for Visual Navigation
* Pose-Enhanced Relation Feature for Action Recognition in Still Images
* Power of Points for Modeling Humans in Clothing, The
* PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6D Pose Estimation
* PR-Net: Preference Reasoning for Personalized Video Highlight Detection
* PR-RRN: Pairwise-Regularized Residual-Recursive Networks for Non-rigid Structure-from-Motion
* Practical Guide to Supervised Deep Learning for Bioimage Analysis: Challenges and good practices, A
* Practical Relative Order Attack in Deep Ranking
* PreDet: Large-scale weakly supervised pre-training for detection
* Predicting Canopy Chlorophyll Content in Sugarcane Crops Using Machine Learning Algorithms and Spectral Vegetation Indices Derived from UAV Multispectral Imagery
* Predicting Individual Tree Diameter of Larch (Larix olgensis) from UAV-LiDAR Data Using Six Different Algorithms
* Predicting with Confidence on Unseen Distributions
* Prediction by Anticipation: An Action-Conditional Prediction Method based on Interaction Learning
* Prediction of Blood Glucose Using Contextual LifeLog Data
* Prediction of Landslide Displacement Based on the Combined VMD-Stacked LSTM-TAR Model
* Predictive Feature Learning for Future Segmentation Prediction
* Preoperative planning for jugular-foramen tumors: Preparation of a three-dimensional surgical drawing
* Preservational Learning Improves Self-supervised Medical Image Models by Reconstructing Diverse Contexts
* Pri3D: Can 3D Priors Help 2D Representation Learning?
* PrimitiveNet: Primitive Instance Segmentation with Local Primitive Embedding under Adversarial Metric
* Prior to Segment: Foreground Cues for Weakly Annotated Classes in Partially Supervised Instance Segmentation
* Privacy-Preserving Deep Action Recognition: An Adversarial Learning Framework and A New Dataset
* PRNU registration under scale and rotation transform based on convolutional neural networks
* Proactive Eavesdropping With Jamming Power Allocation in Training-Based Suspicious Communications
* Probabilistic Modeling for Human Mesh Recovery
* Probabilistic Monocular 3D Human Pose Estimation with Normalizing Flows
* Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network
* Procedure Planning in Instructional Videos via Contextual Modeling and Model-based Policy Learning
* Processing of VENµS Images of High Mountains: A Case Study for Cryospheric and Hydro-Climatic Applications in the Everest Region (Nepal)
* Procrustean Training for Imbalanced Deep Learning
* Product Quantizer Aware Inverted Index for Scalable Nearest Neighbor Search
* Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-Modal Pretraining
* ProFlip: Targeted Trojan Attack with Progressive Bit Flips
* Progressive Correspondence Pruning by Consensus Learning
* Progressive editing with stacked Generative Adversarial Network for multiple facial attribute editing
* Progressive Feature Learning for Facade Parsing With Occlusions
* Progressive GAN-Based Transfer Network for Low-Light Image Enhancement
* Progressive Joint Low-Light Enhancement and Noise Removal for Raw Images
* Progressive polarization based reflection removal via realistic training data generation
* Progressive Seed Generation Auto-Encoder for Unsupervised Point Cloud Learning
* Prostate Segmentation of Ultrasound Images Based on Interpretable-Guided Mathematical Model
* Prototypical Matching and Open Set Rejection for Zero-Shot Semantic Segmentation
* Provably Approximated Point Cloud Registration
* Proxy-Bridged Image Reconstruction Network for Anomaly Detection in Medical Images
* Pseudo-loss Confidence Metric for Semi-supervised Few-shot Learning
* Pseudo-mask Matters in Weakly-supervised Semantic Segmentation
* Psychophysiological Reactions to Persuasive Messages Deploying Persuasion Principles
* PT-CapsNet: A Novel Prediction-Tuning Capsule Network Suitable for Deeper Architectures
* PU-EVA: An Edge-Vector based Approximation Solution for Flexible-scale Point Cloud Upsampling
* PUNet: Novel and efficient deep neural network architecture for handwritten documents word spotting
* Purely Attention Based Local Feature Integration for Video Classification
* Pursuit of Knowledge: Discovering and Localizing Novel Categories using Dual Memory, The
* Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis
* PX-NET: Simple and Efficient Pixel-Wise Training of Photometric Stereo Networks
* PyMAF: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop
* Pyramid Architecture Search for Real-Time Image Deblurring
* Pyramid Point Cloud Transformer for Large-Scale Place Recognition
* Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object Detection
* Pyramid Spatial-Temporal Aggregation for Video-based Person Re-Identification
* Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
* Q-Match: Iterative Shape Matching via Quantum Annealing
* QoS-Guaranteed Adaptive Modulation and Coding for Wireless Scalable Video Multicast
* Quality-Oriented Task Allocation and Scheduling in Transcoding Servers With Heterogeneous Processors
* Quantitative Physical Ergonomics Assessment of Teleoperation Interfaces
* Quantitative Sound Speed Imaging of Cortical Bone and Soft Tissue: Results From Observational Data Sets
* Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks
* R-MSFM: Recurrent Multi-Scale Feature Modulation for Monocular Depth Estimating
* R-SLAM: Optimizing Eye Tracking from Rolling Shutter Video of the Retina
* Radar Signal Intrapulse Modulation Recognition Based on a Denoising-Guided Disentangled Network
* Radial Distortion Invariant Factorization for Structure from Motion
* Radio Map Assisted Path Planning for UAV Anti-Jamming Communications
* RAIN: Reinforced Hybrid Attention Inference Network for Motion Forecasting
* Random Forest Classification of Land Use, Land-Use Change and Forestry (LULUCF) Using Sentinel-2 Data: A Case Study of Czechia
* RandomRooms: Unsupervised Pre-training from Synthetic Shapes and Randomized Layouts for 3D Object Detection
* RangeDet: In Defense of Range View for LiDAR-based 3D Object Detection
* Rank & Sort Loss for Object Detection and Instance Segmentation
* RANK-NOSH: Efficient Predictor-Based Architecture Search via Non-Uniform Successive Halving
* Ranking Models in Unlabeled New Environments
* Rapid construction of 4D high-quality microstructural image for cement hydration using partial information registration
* RAPT360: Reinforcement Learning-Based Rate Adaptation for 360-Degree Video Streaming With Adaptive Prediction and Tiling
* Rating Vs. Paired Comparison for the Judgment of Dominance on First Impressions
* Rating-Aware Self-Organizing Maps
* Ratio of the Land Consumption Rate to the Population Growth Rate: A Framework for the Achievement of the Spatiotemporal Pattern in Poland and Lithuania, The
* Rational Polynomial Camera Model Warping for Deep Learning Based Satellite Multi-View Stereo Matching
* RDA: Robust Domain Adaptation via Fourier Adversarial Attacking
* RDI-Net: Relational Dynamic Inference Networks
* Re-Aging GAN: Toward Personalized Face Age Transformation
* Re-distributing Biased Pseudo Labels for Semi-supervised Semantic Segmentation: A Baseline Investigation
* Re-energizing Domain Discriminator with Sample Relabeling for Adversarial Domain Adaptation
* Real-Time 3-D Semantic Scene Parsing With LiDAR Sensors
* Real-Time Decision Making and Path Planning for Robotic Autonomous Luggage Trolley Collection at Airports
* Real-time Detection of Tiny Objects Based on a Weighted Bi-directional FPN
* Real-Time Fault Diagnosis of Pulse Rectifier in Traction System Based on Structural Model
* Real-Time FPGA Design for OMP Targeting 8K Image Reconstruction
* Real-time Image Enhancer via Learnable Spatial-aware 3D Lookup Tables
* Real-time Instance Segmentation with Discriminative Orientation Maps
* Real-Time Stability Performance Monitoring and Evaluation of Maglev Trains' Levitation System: A Data-Driven Approach
* Real-Time Tracking Algorithm for Aerial Vehicles Using Improved Convolutional Neural Network and Transfer Learning
* Real-time Vanishing Point Detector Integrating Under-parameterized RANSAC and Hough Transform
* Real-Time Video Deraining via Global Motion Compensation and Hybrid Multi-Scale Temporal Correlations
* Real-Time Video Emotion Recognition Based on Reinforcement Learning and Domain Knowledge
* Real-Time Video Inference on Edge Devices via Adaptive Model Streaming
* Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme
* Reality Transform Adversarial Generators for Image Splicing Forgery Detection and Localization
* RECALL: Replay-based Continual Learning in Semantic Segmentation
* Reciprocal Twin Networks for Pedestrian Motion Learning and Future Path Prediction
* Recognition of Multiple Anxiety Levels Based on Electroencephalograph, The
* Recognizing Very Small Face Images Using Convolution Neural Networks
* Reconcile Prediction Consistency for Balanced Object Detection
* ReconfigISP: Reconfigurable Camera Image Processing Pipeline
* Reconstructing 3D Contour Models of General Scenes from RGB-D Sequences
* Reconstructing Hand-Object Interactions in the Wild
* Recovering the Parameters of an LDPC Code From Noisy Intercepted Sequences
* RectiNet-v2: A stacked network architecture for document image dewarping
* ReCU: Reviving the Dead Weights in Binary Neural Networks
* Recurrent Mask Refinement for Few-Shot Medical Image Segmentation
* Recurrent Neural Network Based Collaborative Filtering for QoS Prediction in IoV
* Recursive Multi-Scale Channel-Spatial Attention for Fine-Grained Image Classification
* Recursively Conditional Gaussian for Ordinal Unsupervised Domain Adaptation
* ReDAL: Region-based and Diversity-aware Active Learning for Point Cloud Semantic Segmentation
* Reduced Biquaternion Convolutional Neural Network for Color Image Processing
* Reducing CACC Platoon Disturbances Caused by State Jitters by Combining Two Stages Driving State Recognition With Multiple Platoons' Strategies and Risk Prediction
* Reducing the Residual Topography Phase for the Robust Landscape Deformation Monitoring of Architectural Heritage Sites in Mountain Areas: The Pseudo-Combination SBAS Method
* Reference-Free DIBR-Synthesized Video Quality Metric in Spatial and Temporal Domains
* Refining Action Segmentation with Hierarchical Video Representations
* Refining activation downsampling with SoftPool
* Refraction and coordinate correction with the JONSWAP model for ICESat-2 bathymetry
* Region Similarity Representation Learning
* Region-aware Contrastive Learning for Semantic Segmentation
* Regression Guided by Relative Ranking Using Convolutional Neural Network (R^3CNN) for Facial Beauty Prediction
* Regularized Two Granularity Loss Function for Weakly Supervised Video Moment Retrieval
* Regularizing Nighttime Weirdness: Efficient Self-supervised Monocular Depth Estimation in the Dark
* Rehearsal revealed: The limits and merits of revisiting samples in continual learning
* Reinforcement learning cropping method based on comprehensive feature and aesthetics assessment
* Reinforcement Learning-Based Interactive Video Search
* Relating Adversarially Robust Generalization to Flat Minima
* Relational Embedding for Few-Shot Classification
* Relaxed Transformer Decoders for Direct Action Proposal Generation
* Relevance attack on detectors
* Reliable Vision-Based Grasping Target Recognition for Upper Limb Prostheses
* Reliably fast adversarial training via latent adversarial perturbation
* Remote Sensing Image Denoising Based on Deep and Shallow Feature Fusion and Attention Mechanism
* Remote Sensing Products Validated by Flux Tower Data in Amazon Rain Forest
* Remote Sensing to Characterize River Floodplain Structure and Function
* Removing Adversarial Noise in Class Activation Feature Space
* Removing the Bias of Integral Pose Regression
* RePCD-Net: Feature-Aware Recurrent Point Cloud Denoising Network
* RePOSE: Fast 6D Object Pose Refinement via Deep Texture Rendering
* Representative Color Transform for Image Enhancement
* Research on fatigue detection based on visual features
* Research on Shore-Based River Flow Velocity Inversion Model Using GNSS-R Raw Data
* Residual Attention: A Simple but Effective Method for Multi-Label Recognition
* Response of Vegetation Phenology to the Interaction of Temperature and Precipitation Changes in Qilian Mountains
* ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting
* ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refinement
* Rethinking 360° Image Visual Attention Modelling with Unsupervised Learning
* Rethinking and Improving Relative Position Encoding for Vision Transformer
* Rethinking Coarse-to-Fine Approach in Single Image Deblurring
* Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework
* Rethinking Deep Image Prior for Denoising
* Rethinking Noise Synthesis and Modeling in Raw Denoising
* Rethinking preventing class-collapsing in metric learning with margin-based losses
* Rethinking Self-supervised Correspondence Learning: A Video Frame-level Similarity Perspective
* Rethinking Shared Features and Re-ranking for Cross-Modality Person Re-identification
* Rethinking Spatial Dimensions of Vision Transformers
* Rethinking the Backdoor Attacks' Triggers: A Frequency Perspective
* Rethinking the Truly Unsupervised Image-to-Image Translation
* Rethinking Transformer-based Set Prediction for Object Detection
* Retinal vessel segmentation using a strip wise classification approach with grid search-based parameter selection
* RetinexDIP: A Unified Deep Framework for Low-Light Image Enhancement
* Retrieval of Crop Variables from Proximal Multispectral UAV Image Data Using PROSAIL in Maize Canopy
* RetrievalFuse: Neural 3D Scene Reconstruction with a Database
* Retrieve in Style: Unsupervised Facial Feature Transfer and Retrieval
* Retrospective Predictions of Rice and Other Crop Production in Madagascar Using Soil Moisture and an NDVI-Based Calendar from 2010-2017
* Revealing the Reciprocal Relations between Self-Supervised Stereo and Monocular Depth Estimation
* Reversible data hiding for JPEG images with a cascaded structure
* Review of Synthetic-Aperture Radar Image Formation Algorithms and Implementations: A Computational Perspective, A
* Review on Psychological Stress Detection Using Biosignals
* ReViewNet: A Fast and Resource Optimized Network for Enabling Safe Autonomous Driving in Hazy Weather Conditions
* Revisiting Adversarial Robustness Distillation: Robust Soft Labels Make Student Better
* Revisiting Image-Language Networks for Open-Ended Phrase Detection
* Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers
* Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation
* RFNet: Recurrent Forward Network for Dense Point Cloud Completion
* RFNet: Region-aware Fusion Network for Incomplete Multi-modal Brain Tumor Segmentation
* RGB-D Saliency Detection via Cascaded Mutual Information Minimization
* Rheology of the Northern Tibetan Plateau Lithosphere Inferred from the Post-Seismic Deformation Resulting from the 2001 Mw 7.8 Kokoxili Earthquake
* Right to Talk: An Audio-Visual Transformer Approach, The
* RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth
* RMSMP: A Novel Deep Neural Network Quantization Framework with Row-wise Mixed Schemes and Multiple Precisions
* Road Anomaly Detection by Partial Image Reconstruction with Segmentation Coupling
* Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation, The
* Robust 2D/3D Vehicle Parsing in Arbitrary Camera Views for CVIS
* Robust Automatic Monocular Vehicle Speed Estimation for Traffic Surveillance
* Robust detection of dehazed images via dual-stream CNNs with adaptive feature fusion
* Robust experience replay sampling for multi-agent reinforcement learning
* Robust face recognition for occluded real-world images using constrained probabilistic sparse network
* Robust Gaussian process regression with a bias model
* Robust Learning From Noisy Web Images Via Data Purification for Fine-Grained Recognition
* Robust Loss for Point Cloud Registration, A
* Robust object detection under harsh autonomous-driving environments
* Robust Object Detection via Instance-Level Temporal Cycle Confusion
* robust photo-based PM2.5 monitoring method by combining linear and non-linear learning, A
* Robust power line extraction from aerial image using object-based Gaussian-Markov random field with gravity property parameters
* Robust Small Object Detection on the Water Surface through Fusion of Camera and Millimeter Wave Radar
* Robust Small-scale Pedestrian Detection with Cued Recall via Memory Learning
* Robust Trust Region for Weakly Supervised Segmentation
* Robust Watermarking for Deep Neural Networks via Bi-level Optimization
* RobustNav: Towards Benchmarking Robustness in Embodied Navigation
* Robustness and Generalization via Generative Adversarial Training
* Robustness Certification for Point Cloud Models
* Robustness via Cross-Domain Ensembles
* Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs
* RPVNet: A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR Point Cloud Segmentation
* RVFace: Reliable Vector Guided Softmax Loss for Face Recognition
* S3VAADA: Submodular Subset Selection for Virtual Adversarial Active Domain Adaptation
* SA-ConvONet: Sign-Agnostic Optimization of Convolutional Occupancy Networks
* SaccadeCam: Adaptive Visual Attention for Monocular Depth Sensing
* SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-powered Intelligent PhlatCam
* SAF-Net: A spatio-temporal deep learning method for typhoon intensity prediction
* Safe and Efficient Lane Change Maneuver for Obstacle Avoidance Inspired From Human Driving Pattern
* Safe incomplete label distribution learning
* SafeDrive: A New Model for Driving Risk Analysis Based on Crash Avoidance
* Safety-aware Motion Prediction with Unseen Vehicles for Autonomous Driving
* Saliency-Associated Object Tracking
* Salient Object Ranking with Position-Preserved Attention
* SAM: Self Attention Mechanism for Scene Text Recognition Based on Swin Transformer
* Sample Efficient Detection and Classification of Adversarial Attacks via Self-Supervised Embeddings
* Sample-Centric Feature Generation for Semi-Supervised Few-Shot Learning
* Sampling Network Guided Cross-Entropy Method for Unsupervised Point Cloud Registration
* Sat2Vid: Street-view Panoramic Video Synthesis from a Single Satellite Image
* SAT: 2D Semantics Assisted Training for 3D Visual Grounding
* Satellite Multi-Sensor Data Fusion for Soil Clay Mapping Based on the Spectral Index and Spectral Bands Approaches
* Satellite-Based Diagnosis and Numerical Verification of Ozone Formation Regimes over Nine Megacities in East Asia
* Scalable Vision Transformers with Hierarchical Pooling
* scale-sensitive heatmap representation for multi-person pose estimation, A
* Scaling Semantic Segmentation Beyond 1K Classes on a Single GPU
* Scaling up instance annotation via label propagation
* Scaling-up Disentanglement for Image Translation
* Scene Context-Aware Salient Object Detection
* Scene Independency Matters: An Empirical Study of Scene Dependent and Scene Independent Evaluation for CNN-Based Change Detection
* Scene Synthesis via Uncertainty-Driven Attribute Synchronization
* Scene-specific crowd counting using synthetic training images
* Score-Based Point Cloud Denoising
* SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition
* Scribble-Supervised Semantic Segmentation by Uncertainty Reduction on Neural Representation and Self-Supervision on Neural Eigenspace
* Scribble-Supervised Semantic Segmentation Inference
* Sea Ice Concentration Estimation Methodology Utilizing ICESat-2 Photon-Counting Laser Altimeter in the Arctic, A
* Searching for Controllable Image Restoration Networks
* Searching for Robustness: Loss Learning for Noisy Classification Tasks
* Searching for Two-Stream Models in Multivariate Space for Video Recognition
* Seasonal Contrast and Interactive Effects of Potential Drivers on Land Surface Temperature in the Sichuan Basin, China
* Seasonal Contrast: Unsupervised Pre-Training from Uncurated Remote Sensing Data
* Security-Aware Information Dissemination With Fine-Grained Access Control in Cooperative Multi-RSU of VANETs
* Seeing Dynamic Scene in the Dark: A High-Quality Video Dataset with Mechatronic Alignment
* Seek Common Ground While Reserving Differences: A Model-Agnostic Module for Noisy Domain Adaptation
* Seeking Similarities over Differences: Similarity-based Domain Alignment for Adaptive Object Detection
* Segmentation information with attention integration for classification of breast tumor in ultrasound image
* Segmentation of lung airways based on deep learning methods
* Segmentation-grounded Scene Graph Generation
* Segmenter: Transformer for Semantic Segmentation
* Selective Feature Compression for Efficient Activity Recognition Inference
* Selective Nearest Neighbors Clustering
* Selective part-based correlation filter tracking algorithm with reinforcement learning
* Self Supervision to Distillation for Long-Tailed Visual Recognition
* Self-born Wiring for Neural Trees
* Self-Calibrating Neural Radiance Fields
* Self-Conditioned Probabilistic Learning of Video Rescaling
* Self-Knowledge Distillation with Progressive Refinement of Targets
* Self-Motivated Communication Agent for Real-World Vision-Dialog Navigation
* Self-Mutating Network for Domain Adaptive Segmentation of Aerial Images
* Self-Mutual Distillation Learning for Continuous Sign Language Recognition
* Self-Regulation for Semantic Segmentation
* Self-restrained triplet loss for accurate masked face recognition
* Self-Supervised 3D Face Reconstruction via Conditional Estimation
* Self-Supervised 3D Hand Pose Estimation from monocular RGB via Contrastive Learning
* Self-supervised 3D Skeleton Action Representation Learning with Motion Consistency and Continuity
* Self-Supervised Cryo-Electron Tomography Volumetric Image Restoration from Single Noisy Volume with Sparsity Constraint
* Self-supervised Domain Adaptation for Forgery Localization of JPEG Compressed Images
* Self-supervised Geometric Features Discovery via Interpretable Attention for Vehicle Re-Identification and Beyond
* Self-Supervised Image Prior Learning with GMM from a Single Noisy Image
* Self-supervised Monocular Depth Estimation for All Day Images using Domain Separation
* Self-supervised Neural Networks for Spectral Snapshot Compressive Imaging
* Self-Supervised Object Detection via Generative Image Synthesis
* Self-Supervised Pretraining of 3D Features on any Point-Cloud
* Self-supervised Product Quantization for Deep Unsupervised Image Retrieval
* Self-Supervised Real-to-Sim Scene Generation
* Self-Supervised Representation Learning from Flow Equivariance
* Self-supervised Transfer Learning for Hand Mesh Recovery from Binocular Images
* Self-Supervised Vessel Segmentation via Adversarial Learning
* Self-supervised Video Object Segmentation by Motion Grouping
* Self-Supervised Video Representation Learning with Meta-Contrastive Network
* Self-Supervised Visual Representations Learning by Contrastive Mask Prediction
* Self-weighting multi-view spectral clustering based on nuclear norm
* SelfReg: Self-supervised Contrastive Regularization for Domain Generalization
* SeLFVi: Self-supervised Light-Field Video Reconstruction from Stereo Video
* Semantic and context features integration for robust object tracking
* Semantic Aware Data Augmentation for Cell Nuclei Microscopical Images with Artificial Neural Networks
* Semantic clustering based deduction learning for image recognition and classification
* Semantic Concentration for Domain Adaptation
* Semantic Diversity Learning for Zero-Shot Multi-label Classification
* Semantic Perturbations with Normalizing Flows for Improved Generalization
* Semantic-embedded Unsupervised Spectral Reconstruction from Single RGB Images in the Wild
* Semantically Coherent Out-of-Distribution Detection
* Semantically guided self-supervised monocular depth estimation
* Semantically Robust Unpaired Image Translation for Data with Unmatched Semantics Statistics
* Semantics Disentangling for Generalized Zero-Shot Learning
* Semi-supervised Active Learning for Semi-supervised Models: Exploit Adversarial Examples with Graph-based Virtual Labels
* Semi-Supervised Active Learning with Temporal Output Discrepancy
* Semi-Supervised Federated Learning for Travel Mode Identification From GPS Trajectories
* Semi-supervised image super-resolution with attention CycleGAN
* Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples
* Semi-supervised node classification via adaptive graph smoothing networks
* Semi-supervised robust training with generalized perturbed neighborhood
* Semi-Supervised Segmentation of Radiation-Induced Pulmonary Fibrosis From Lung CT Scans With Multi-Scale Guided Dense Attention
* Semi-Supervised Semantic Segmentation with Pixel-Level Contrastive Learning from a Class-wise Memory Bank
* Semi-Supervised Single-Stage Controllable GANs for Conditional Fine-Grained Image Generation
* Semidefinite Relaxation for Source Localization by TOA in Unsynchronized Networks
* SemIE: Semantically-aware Image Extrapolation
* SemiHand: Semi-supervised Hand Pose Estimation with Consistency
* Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation
* Sensitive loss: Improving accuracy and fairness of face representations with discrimination-aware deep learning
* Sensor-Guided Optical Flow
* Sentinel-1 Satellite Radar Images: A New Source of Information for Study of River Channel Dynamics on the Lower Vistula River, Poland
* SENTRY: Selective Entropy Optimization via Committee Consistency for Unsupervised Domain Adaptation
* Separable Flow: Learning Motion Cost Volumes for Optical Flow Estimation
* Sequence-Level Reference Frames in Video Coding
* SGMNet: Learning Rotation-Invariant Point Cloud Representations via Sorted Gram Matrix
* SGPA: Structure-Guided Prior Adaptation for Category-Level 6D Object Pose Estimation
* Shallow Bayesian Meta Learning for Real-World Few-Shot Recognition
* Shape Self-Correction for Unsupervised Point Cloud Understanding
* Shape-aware Multi-Person Pose Estimation from Multi-View Images
* Shape-Biased Domain Generalization via Shock Graph Embeddings
* Shape-Constrained Method of Remote Sensing Monitoring of Marine Raft Aquaculture Areas on Multitemporal Synthetic Sentinel-1 Imagery
* ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation
* Shared Latent Space of Font Shapes and Their Noisy Impressions
* Ship Detection Method via Redesigned FCOS in Large-Scale SAR Images, A
* Short-Term Travel Speed Prediction for Urban Expressways: Hybrid Convolutional Neural Network Models
* SiamCDA: Complementarity- and Distractor-Aware RGB-T Tracking Based on Siamese Network
* Siamese Network Ensembles for Hyperspectral Target Detection with Pseudo Data Generation
* SibNet: Food instance counting and segmentation
* SIGN: Spatial-information Incorporated Generative Network for Generalized Zero-shot Semantic Segmentation
* Signature barcodes for online verification
* SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition
* SIGNET: Efficient Neural Representation for Light Fields
* Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation, A
* Simple Baseline for Weakly-Supervised Scene Graph Generation, A
* Simple Feature Augmentation for Domain Generalization, A
* Simple Framework for 3D Lensless Imaging with Programmable Masks, A
* Simple Statistical Intra-Seasonal Prediction Model for Sea Surface Variables Utilizing Satellite Remote Sensing, A
* Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer
* SimROD: A Simple Adaptation Method for Robust Object Detection
* SIMstack: A Generative Shape and Instance Model for Unordered Object Stacks
* Single Image 3D Shape Retrieval via Cross-Modal Instance and Category Contrastive Learning
* Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous Convolutions
* Single Image Super-Resolution Quality Assessment: A Real-World Dataset, Subjective Studies, and an Objective Metric
* Single View Physical Distance Estimation using Human Pose
* Single-shot Hyperspectral-Depth Imaging with Learned Diffractive Optics
* Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning
* Skeleton2Mesh: Kinematics Prior Injected Unsupervised Human Mesh Recovery
* Skeletonization Based on K-Nearest-Neighbors on Binary Image
* Sketch Your Own GAN
* Sketch2Mesh: Reconstructing and Editing 3D Shapes from Sketches
* SketchAA: Abstract Representation for Abstract Sketches
* Sketches by MoSSaRT: Representative selection from manifolds with gross sparse corruptions
* SketchLattice: Latticed Representation for Sketch Manipulation
* SLAMP: Stochastic Latent Appearance and Motion Prediction
* SLIDE: Single Image 3D Photography with Soft Layering and Depth-aware Inpainting
* SLIM: Self-Supervised LiDAR Scene Flow and Motion Segmentation
* Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods
* Smart Autodriver Algorithm for Real-Time Autonomous Vehicle Trajectory Control
* SmartShadow: Artistic Shadow Drawing Tool for Line Drawings
* Smoothing Linear Multi-Target Tracking Using Integrated Track Splitting Filter
* SNARF: Differentiable Forward Skinning for Animating Non-Rigid Neural Implicit Shapes
* SnowflakeNet: Point Cloud Completion by Snowflake Point Deconvolution with Skip-Transformer
* SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation
* Social Fabric: Tubelet Compositions for Video Relation Detection
* Social NCE: Contrastive Learning of Socially-aware Motion Representations
* SoDar: Multitarget Gesture Recognition Based on SIMO Doppler Radar
* Soft Exemplar Highlighting for Cross-View Image-Based Geo-Localization
* Solving Inefficiency of Self-supervised Representation Learning
* SOMA: Solving Optical Marker-Based MoCap Automatically
* Sonar image quality evaluation using deep neural network
* SOTR: Segmenting Objects with Transformers
* Source data-free domain adaptation for a faster R-CNN
* Space-Air-Ground Integrated Multi-Domain Network Resource Orchestration Based on Virtual Network Architecture: A DRL Method
* Space-Air-Sea-Ground Integrated Monitoring Network-Based Maritime Transportation Emergency Forecasting
* Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning
* Space-Time-Separable Graph Convolutional Network for Pose Forecasting
* Sparse attention block: Aggregating contextual information for object detection
* Sparse CapsNet with explicit regularizer
* Sparse Needlets for Lighting Estimation with Spherical Transport Loss
* Sparse-shot Learning with Exclusive Cross-Entropy for Extremely Many Localisations
* Sparse-to-dense Feature Matching: Intra and Inter domain Cross-modal Learning in Domain Adaptation for 3D Semantic Segmentation
* SPatchGAN: A Statistical Feature Based Discriminator for Unsupervised Image-to-Image Translation
* Spatial and Semantic Consistency Regularizations for Pedestrian Attribute Recognition
* Spatial Difference between Temperature and Snowfall Driven Spring Phenology of Alpine Grassland Land Surface Based on Process-Based Modeling on the Qinghai-Tibet Plateau
* Spatial Potential Energy Weighted Maximum Simplex Algorithm for Hyperspectral Endmember Extraction
* Spatial Uncertainty-Aware Semi-Supervised Crowd Counting
* Spatial-Driven Features Based on Image Dependencies for Person Re-Identification
* Spatial-Temporal Asynchronous Normalization for Unsupervised 3D Action Representation Learning
* Spatial-Temporal Consistency Network for Low-Latency Trajectory Forecasting
* Spatial-Temporal Deep Intention Destination Networks for Online Travel Planning
* Spatial-Temporal Transformer for Dynamic Scene Graph Generation
* Spatial-Temporal Variation in Paddy Evapotranspiration in Subtropical Climate Regions Based on the SEBAL Model: A Case Study of the Ganfu Plain Irrigation System, Southern China
* Spatially Conditioned Graphs for Detecting Human-Object Interactions
* Spatially-Adaptive Image Restoration using Distortion-Guided Networks
* Spatio-Temporal Dynamic Inference Network for Group Activity Recognition
* Spatio-Temporal Poisson Point Process: A Simple Model for the Alignment of Event Camera Data, The
* Spatio-Temporal Representation Factorization for Video-based Person Re-Identification
* Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds
* Spatiotemporal Multimodal Learning With 3D CNNs for Video Action Recognition
* Spatiotemporal Patterns and Driving Force of Urbanization and Its Impact on Urban Ecology
* Spatiotemporal Patterns of Cultivated Land Quality Integrated with Multi-Source Remote Sensing: A Case Study of Guangzhou, China
* Spatiotemporal Perturbation Based Dynamic Consistency for Semi-Supervised Temporal Action Detection
* SPEC: Seeing People in the Wild with an Estimated Camera
* Specialize and Fuse: Pyramidal Output Representation for Semantic Segmentation
* Specificity-preserving RGB-D Saliency Detection
* Spectral Leakage and Rethinking the Kernel Size in CNNs
* Spectral-Based Classification of Plant Species Groups and Functional Plant Parts in Managed Permanent Grassland
* Speech Drives Templates: Co-Speech Gesture Synthesis with Learned Templates
* Speech Emotion Recognition Enhanced Traffic Efficiency Solution for Autonomous Vehicles in a 5G-Enabled Space-Air-Ground Integrated Intelligent Transportation System
* Speech Intelligibility Enhancement By Non-Parallel Speech Style Conversion Using CWT and iMetricGAN Based CycleGAN
* SPEye: A Calibration-Free Gaze-Driven Text Entry Technique Based on Smooth Pursuit
* SPG-VTON: Semantic Prediction Guidance for Multi-Pose Virtual Try-on
* SPG: Unsupervised Domain Adaptation for 3D Object Detection via Semantic Point Generation
* Split-Delivery Capacitated Arc-Routing Problem With Time Windows
* Square Root Marginalization for Sliding-Window Bundle Adjustment
* SS-IL: Separated Softmax for Incremental Learning
* SSH: A Self-Supervised Framework for Image Harmonization
* SST-Wind Causal Relationship during the Development of the IOD in Observations and Model Simulations, The
* Stacked Homography Transformations for Multi-View Pedestrian Detection
* Stacking-based ensemble learning method for the recognition of the preceding vehicle lane-changing manoeuvre: A naturalistic driving study on the highway
* Standardized Max Logits: A Simple yet Effective Approach for Identifying Unexpected Road Obstacles in Urban-Scene Segmentation
* STAP: A Spatio-Temporal Correlative Estimating Model for Improving Quality of Traffic Data
* STAR: A Structure-aware Lightweight Transformer for Real-time Image Enhancement
* StarEnhancer: Learning Real-Time and Style-Aware Image Enhancement
* Statistically Consistent Saliency Estimation
* STEM: An approach to Multi-source Domain Adaptation with Guarantees
* StereOBJ-1M: Large-scale Stereo Image Dataset for 6D Object Pose Estimation
* Stochastic Partial Swap: Enhanced Model Generalization and Interpretability for Fine-grained Recognition
* Stochastic Scene-Aware Motion Prediction
* Stochastic Transformer Networks with Linear Competing Units: Application to end-to-end SL Translation
* STR-GQN: Scene Representation and Rendering for Unknown Cameras Based on Spatial Transformation Routing
* Striking a Balance between Stability and Plasticity for Class-Incremental Learning
* STRIVE: Scene Text Replacement In Videos
* StructDepth: Leveraging the structural regularities for self-supervised indoor depth estimation
* Structure-aware multiple salient region detection and localization for autonomous robotic manipulation
* Structure-Aware Positional Transformer for Visible-Infrared Person Re-Identification
* Structure-from-Sherds: Incremental 3D Reassembly of Axially Symmetric Pots from Unordered and Mixed Fragment Collections
* Structure-Preserving Deraining with Residue Channel Prior Guidance
* Structure-transformed Texture-enhanced Network for Person Image Synthesis
* Structured Bird's-Eye-View Traffic Scene Understanding from Onboard Images
* Structured Outdoor Architecture Reconstruction by Exploration and Classification
* Student Customized Knowledge Distillation: Bridging the Gap Between Student and Teacher
* Study of Atmospheric Carbon Dioxide Retrieval Method Based on Normalized Sensitivity
* Study of improved signal-based merge strategy in work zone areas based on Cellular Automata simulation
* Study of Possible Correlations between Seismo-Ionospheric Anomalies of GNSS Total Electron Content and Earthquake Energy, A
* Study of the Effect of Vegetation on Reducing Atmospheric Pollution Particles
* Study on Horse-Rider Interaction Based on Body Sensor Network in Competitive Equitation
* Study on the Classification and Change Detection Methods of Drylands in Arid and Semi-Arid Regions
* STVGBert: A Visual-linguistic Transformer based Framework for Spatio-temporal Video Grounding
* Style and Semantic Memory Mechanism for Domain Generalization, A
* StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
* StyleFormer: Real-time Arbitrary Style Transfer via Parametric Style Composition
* Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks
* Subcycle Waveform Modeling of Traffic Intersections Using Recurrent Attention Networks
* Suitability of PlanetScope Imagery for Mapping Rubber Plantations, The
* Summarize and Search: Learning Consensus-aware Dynamic Convolution for Co-Saliency Detection
* Summer Nighttime Anomalies of Ionospheric Electron Content at Midlatitudes: Comparing Years of Low and High Solar Activities Using Observations and Tidal/Planetary Wave Features
* SUnet++: Joint Demosaicing and Denoising of Extreme Low-Light Raw Image
* SUNet: Symmetric Undistortion Network for Rolling Shutter Correction
* Super Resolve Dynamic Scene from Continuous Spike Streams
* Super-resolution guided knowledge distillation for low-resolution image classification
* Super-Resolution Semantic Segmentation with Relation Calibrating Network
* Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar
* Super-Resolving Ocean Dynamics from Space with Computer Vision Algorithms
* Superpoint Network for Point Cloud Oversegmentation
* Supervised dimensionality reduction technology of generalized discriminant component analysis and its kernelization forms
* Support-Set Based Cross-Supervision for Video Grounding
* SurfaceNet: Adversarial SVBRDF Estimation from a Single Image
* SurfGen: Adversarial 3D Shape Synthesis with Explicit Surface Discriminators
* Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation, The
* surprising impact of mask-head architecture on novel class segmentation, The
* Survey on Deep Learning Techniques for Stereo-Based Depth Estimation, A
* Survey on Intrinsic Images: Delving Deep into Lambert and Beyond, A
* Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
* Switchable K-class Hyperplanes for Noise-Robust Representation Learning
* Symmetry-Driven hyper feature GCN for skeleton-based gait recognition
* Synchronization of Group-labelled Multi-graphs
* Syncretic Modality Collaborative Learning for Visible Infrared Person Re-Identification
* SynFace: Face Recognition with Synthetic Data
* Synthesis of Compositional Animations from Textual Descriptions
* Synthesized Feature based Few-Shot Class-Incremental Learning on a Mixture of Subspaces
* SynthMorph: Learning Contrast-Invariant Registration Without Acquired Images
* Systems and Methods for Prediction of Sellability of Fashion Products
* T-AutoML: Automated Machine Learning for Lesion Segmentation using Transformers in 3D Medical Imaging
* T-Net: Effective Permutation-Equivariant Network for Two-View Correspondence Learning
* T-SVDNet: Exploring High-Order Prototypical Correlations for Multi-Source Domain Adaptation
* TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment
* TAGNet: Triplet-Attention Graph Networks for Hashtag Recommendation
* Talk-to-Edit: Fine-Grained Facial Editing via Dialog
* TAM: Temporal Adaptive Module for Video Recognition
* Target Adaptive Context Aggregation for Video Scene Graph Generation
* Task Category Space for User-Centric Comparative Multimedia Search Evaluations, A
* Task Switching Network for Multi-task Learning
* Task-aware Part Mining Network for Few-Shot Learning
* TDPN: Texture and Detail-Preserving Network for Single Image Super-Resolution
* Teacher-Student Adversarial Depth Hallucination to Improve Face Recognition
* TeachText: CrossModal Generalized Distillation for Text-Video Retrieval
* Telling the What while Pointing to the Where: Multimodal Queries for Image Retrieval
* TempNet: Online Semantic Segmentation on Large-scale Point Cloud Series
* Temporal Action Detection with Multi-level Supervision
* Temporal Cue Guided Video Highlight Detection with Low-Rank Audio-Visual Fusion
* Temporal Knowledge Consistency for Unsupervised Visual Representation Learning
* Temporal-wise Attention Spiking Neural Networks for Event Streams Classification
* Temporally-Coherent Surface Reconstruction via Metric-Consistent Atlases
* Tensor Laplacian Regularized Low-Rank Representation for Non-Uniformly Distributed Data Subspace Clustering
* Tensor-Based Truthful Incentive Mechanism for Blockchain-Enabled Space-Air-Ground Integrated Vehicular Crowdsensing, A
* Testing using Privileged Information by Adapting Features with Statistical Dependence
* Text is Text, No Matter What: Unifying Text Recognition using Knowledge Distillation
* Text-instance graph: Exploring the relational semantics for text-based visual question answering
* Text-to-Traffic Generative Adversarial Network for Traffic Situation Generation
* TF-Blender: Temporal Feature Blender for Video Object Detection
* TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition
* THDA: Treasure Hunt Data Augmentation for Semantic Navigation
* Three Stages Detail Injection Network for Remote Sensing Images Pansharpening, A
* Three Steps to Multimodal Trajectory Prediction: Modality Clustering, Classification and Synthesis
* Three-Dimensional Geometry Reconstruction Method for Slowly Rotating Space Targets Utilizing ISAR Image Sequence
* Throughput Optimization in Heterogeneous Swarms of Unmanned Aircraft Systems for Advanced Aerial Mobility
* THUNDR: Transformer-based 3D HUmaN Reconstruction with Markers
* Time Series Surface Deformation of Changbaishan Volcano Based on Sentinel-1B SAR Data and Its Geological Significance
* Time-Equivariant Contrastive Video Representation Learning
* Time-Frequency Attention for Speech Emotion Recognition with Squeeze-and-Excitation Blocks
* Time-Multiplexed Coded Aperture Imaging: Learned Coded Aperture and Pixel Exposures for Compressive Imaging Systems
* TkML-AP: Adversarial Attacks to Top-k Multi-Label Learning
* TMCOSS: Thresholded Multi-Criteria Online Subset Selection for Data-Efficient Autonomous Driving
* TokenPose: Learning Keypoint Tokens for Human Pose Estimation
* Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
* TOOD: Task-aligned One-stage Object Detection
* Topic Scene Graph Generation by Attention Distillation from Caption
* Topologically Consistent Multi-View Face Inference Using Volumetric Sampling
* Toward a Visual Concept Vocabulary for GAN Latent Space
* Toward an Optimal Selection of Constraints for Terrestrial Reference Frame (TRF)
* Toward Automated Machine Learning-Based Hyperspectral Image Analysis in Crop Yield and Biomass Estimation
* Toward Detail-Oriented Image-Based Virtual Try-On with Arbitrary Poses
* Toward Human-Like Grasp: Dexterous Grasping via Semantic Representation of Object-Hand
* Toward Open-World Electroencephalogram Decoding Via Deep Learning: A comprehensive survey
* Toward Physical Layer Security and Efficiency for SAGIN: A WFRFT-Based Parallel Complex-Valued Spectrum Spreading Approach
* Toward Realistic Single-View 3D Object Reconstruction with Unsupervised Learning from Multiple Images
* Toward Spatially Unbiased Generative Models
* Towards A Universal Model for Cross-Dataset Crowd Counting
* Towards Accurate Alignment in Real-time 3D Hand-Mesh Reconstruction
* Towards Alleviating the Modeling Ambiguity of Unsupervised Monocular 3D Human Pose Estimation
* Towards an End-to-End Visual-to-Raw-Audio Generation With GAN
* Towards Better Explanations of Class Activation Mapping
* Towards Complete Scene and Regular Shape for Distortion Rectification by Curve-Aware Extrapolation
* Towards Consistent Soil Moisture Records from China's FengYun-3 Microwave Observations
* Towards Discovery and Attribution of Open-world GAN Generated Images
* Towards Discriminative Representation Learning for Unsupervised Person Re-identification
* Towards Early Detection of Tropospheric Aerosol Layers Using Monitoring with Ceilometer, Photometer, and Air Mass Trajectories
* Towards Efficient Graph Convolutional Networks for Point Cloud Handling
* Towards Face Encryption by Generating Adversarial Identity Masks
* Towards Flexible Blind JPEG Artifacts Removal
* Towards High Fidelity Monocular Face Reconstruction with Rich Reflectance using Self-supervised Learning and Ray Tracing
* Towards Interpretable Deep Metric Learning with Structural Matching
* Towards Learning Spatially Discriminative Feature Representations
* Towards Memory-Efficient Neural Networks via Multi-Level in situ Generation
* Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization
* Towards Novel Target Discovery Through Open-Set Domain Adaptation
* Towards Real-World Prohibited Item Detection: A Large-Scale X-ray Benchmark
* Towards Real-world X-ray Security Inspection: A High-Quality Benchmark And Lateral Inhibition Module For Prohibited Items Detection
* Towards Robustness of Deep Neural Networks via Regularization
* Towards Rotation Invariance in Object Detection
* Towards the Combination of C2RCC Processors for Improving Water Quality Retrieval in Inland and Coastal Areas
* Towards the Unseen: Iterative Text Recognition by Distilling from Errors
* Towards Understanding the Generative Capability of Adversarially Robust Classifiers
* Towards Vivid and Diverse Image Colorization with Generative Color Prior
* Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision
* Track without Appearance: Learn Box and Tracklet Embedding with Local and Global Motion Patterns for Vehicle Tracking
* TradeBot: Bandit learning for hyper-parameters optimization of high frequency trading strategy
* Train Time Delay Prediction for High-Speed Train Dispatching Based on Spatio-Temporal Graph Convolutional Network
* Training Multi-Object Detector by Estimating Bounding Box Distribution for Input Image
* Training Weakly Supervised Video Frame Interpolation with Events
* Transductive Few-Shot Classification on the Oblique Manifold
* TransFER: Learning Relation-aware Facial Expression Representations with Transformers
* TransferI2I: Transfer Learning for Image-to-Image Translation from Small Datasets
* TransForensics: Image Forgery Localization with Dense Self-Attention
* Transformer-based approach for joint handwriting and named entity recognition in historical document
* Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction
* Transformer-Based Coarse-to-Fine Wide-Swath SAR Image Registration Method under Weak Texture Conditions, A
* Transformer-based Dual Relation Graph for Multi-label Image Recognition
* Transformer-Based Language-Person Search With Multiple Region Slicing
* Transforms based Tensor Robust PCA: Corrupted Low-Rank Tensors Recovery via Convex Optimization
* Transfusion: A Novel SLAM Method Focused on Transparent Objects
* Transient Scattering Echo Simulation and ISAR Imaging for a Composite Target-Ocean Scene Based on the TDSBR Method
* Transparent Object Tracking Benchmark
* Transporting Causal Mechanisms for Unsupervised Domain Adaptation
* TransPose: Keypoint Localization via Transformer
* TransReID: Transformer-based Object Re-Identification
* TransVG: End-to-End Visual Grounding with Transformers
* TransView: Inside, Outside, and Across the Cropping View Boundaries
* TRAR: Routing the Attention Spans in Transformer for Visual Question Answering
* Trash to Treasure: Harvesting OOD Data with Cross-Modal Matching for Open-Set Semi-Supervised Learning
* TravelNet: Self-supervised Physically Plausible Hand Motion Learning from Monocular Color Images
* Tri-Attention fusion guided multi-modal segmentation network, A
* Triggering Failures: Out-Of-Distribution detection by learning from local adversarial attacks in Semantic Segmentation
* Tripartite Information Mining and Integration for Image Matting
* TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild
* TrivialAugment: Tuning-free Yet State-of-the-Art Data Augmentation
* Tropical Cyclone Impact and Forest Resilience in the Southwestern Pacific
* Troubling Future for Facial Recognition Software, The
* TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization
* Tune it the Right Way: Unsupervised Validation of Domain Adaptation via Soft Neighborhood Density
* Two-dimensional DFT with sliding and hopping windows for edge map generation of road images
* two-scaled fully convolutional learning network for road detection, A
* Two-Source Normalized Soil Thermal Inertia Model for Estimating Field-Scale Soil Moisture from MODIS and ASTER Data, A
* Two-Stage Attentive Network for Single Image Super-Resolution, A
* Two-stage aware attentional Siamese network for visual tracking
* Two-Stage Pansharpening Method for the Fusion of Remote-Sensing Images, A
* Two-Way alignment approach for unsupervised multi-Source domain adaptation, A
* Twofold Convolutional Regression Tracking Network With Temporal and Spatial Mechanism, A
* UASNet: Uncertainty Adaptive Sampling Network for Deep Stereo Matching
* UAV Image Stitching Based on Optimal Seam and Half-Projective Warp
* UAV-assisted data dissemination based on network coding in vehicular networks
* UAV-Assisted Physical Layer Security in Multi-Beam Satellite-Enabled Vehicle Communications
* UIT at VBS 2022: An Unified and Interactive Video Retrieval System with Temporal Search
* Ultra-High-Definition Image HDR Reconstruction via Collaborative Bilateral Learning
* Ultra-Wideband Imaging via Frequency Diverse Array with Low Sampling Rate
* UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model
* Unaligned Image-to-Image Translation by Learning to Reweight
* Uncertainty Estimation for Stereo Matching Based on Evidential Deep Learning
* Uncertainty-Aware Human Mesh Recovery from Video by Learning Part-Based 3D Dynamics
* Uncertainty-aware Pseudo Label Refinery for Domain Adaptive Semantic Segmentation
* Uncertainty-Guided Cross-Modal Learning for Robust Multispectral Pedestrian Detection
* Uncertainty-Guided Transformer Reasoning for Camouflaged Object Detection
* Unconditional Scene Graph Generation
* Unconstrained Scene Generation with Locally Conditioned Radiance Fields
* Understanding and Evaluating Racial Biases in Image Captioning
* Understanding and Mitigating Annotation Bias in Facial Expression Recognition
* Understanding and modeling finger vascular pattern imaging
* Understanding and Modeling Urban Mobility Dynamics via Disentangled Representation Learning
* Understanding Human Activities in Response to Typhoon Hato from Multi-Source Geospatial Big Data: A Case Study in Guangdong, China
* Understanding Robustness of Transformers for Image Classification
* Underwater Image Co-Enhancement With Correlation Feature Matching and Joint Learning
* Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation
* Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder, A
* Unified B-Spline Framework for Scale-Invariant Keypoint Detection, A
* Unified Graph Structured Models for Video Understanding
* Unified Objective for Novel Class Discovery, A
* unified perspective of classification-based loss and distance-based loss for cross-view gait recognition, A
* Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented Visual Dialogue
* Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd Counting
* Unifying Nonlocal Blocks for Neural Networks
* UNISURF: Unifying Neural Implicit Surfaces and Radiance Fields for Multi-View Reconstruction
* UniT: Multimodal Multitask Learning with a Unified Transformer
* Universal Adversarial Attack on Attention and the Resulting Dataset DAmageNet
* Universal and Flexible Optical Aberration Correction Using Deep-Prior Based Deconvolution
* Universal Cross-Domain Retrieval: Generalizing Across Classes and Domains
* Universal Representation Learning from Multiple Domains for Few-shot Classification
* Universal-Prototype Enhancing for Few-Shot Object Detection
* Unlimited Neighborhood Interaction for Heterogeneous Trajectory Prediction
* Unlocking the Potential of Ordinary Classifier: Class-specific Adversarial Erasing Framework for Weakly Supervised Semantic Segmentation
* Unmanned Aerial Vehicle (UAV) Remote Sensing in Grassland Ecosystem Monitoring: A Systematic Review
* Unmanned Aerial Vehicle (UAV)-Based Remote Sensing for Early-Stage Detection of Ganoderma
* Unmanned Aircraft System Airspace Structure and Safety Measures Based on Spatial Digital Twins
* Unpaired Learning for Deep Image Deraining with Rain Direction Regularizer
* Unpaired Learning for High Dynamic Range Image Tone Mapping
* Unraveling the Spatio-Temporal Relationship between Ecosystem Services and Socioeconomic Development in Dabie Mountain Area over the Last 10 years
* Unshuffling Data for Improved Generalization in Visual Question Answering
* Unsupervised 3D Pose Estimation for Hierarchical Dance Video Recognition
* Unsupervised Cross-Domain Person Re-Identification by Instance and Distribution Alignment
* Unsupervised Curriculum Domain Adaptation for No-Reference Video Quality Assessment
* Unsupervised Deep Learning Methods for Biological Image Reconstruction and Enhancement: An overview from a signal processing perspective
* Unsupervised Deep Video Denoising
* Unsupervised Dense Deformation Embedding Network for Template-Free Shape Correspondence
* Unsupervised Depth Completion with Calibrated Backprojection Layers
* Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency
* Unsupervised Few-Shot Action Recognition via Action-Appearance Aligned Meta-Adaptation
* Unsupervised Generative Adversarial Network with Background Enhancement and Irredundant Pooling for Hyperspectral Anomaly Detection
* Unsupervised Image Generation with Infinite Generative Adversarial Networks
* Unsupervised Layered Image Decomposition into Object Prototypes
* Unsupervised Learning of Fine Structure Generation for 3D Point Clouds by 2D Projection Matching
* Unsupervised Multi-scale Generative Adversarial Network for Remote Sensing Image Pan-Sharpening, An
* Unsupervised Non-Rigid Image Distortion Removal via Grid Deformation
* Unsupervised person re-identification with multi-label learning guided self-paced clustering
* Unsupervised Point Cloud Object Co-segmentation by Co-contrastive Learning and Mutual Attention Sampling
* Unsupervised Point Cloud Pre-training via Occlusion Completion
* Unsupervised Real-World Super-Resolution: A Domain Adaptation Perspective
* Unsupervised Segmentation incorporating Shape Prior via Generative Adversarial Networks
* Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals
* Use Remote Sensing and Machine Learning to Study the Changes of Broad-Leaved Forest Biomass and Their Climate Driving Forces in Nature Reserves of Northern Subtropics
* User Generated HDR Gaming Video Streaming: Dataset, Codec Comparison, and Challenges
* User-based network embedding for opinion spammer detection
* User-Guided Deep Human Image Matting Using Arbitrary Trimaps
* Using Ensemble-Based Systems with Near-Infrared Hyperspectral Data to Estimate Seasonal Snowpack Density
* Using Explainable AI to Identify Differences Between Clinical and Experimental Pain Detection Models Based on Facial Expressions
* Using Eye-Tracking Data to Predict Situation Awareness in Real Time During Takeover Transitions in Conditionally Automated Driving
* Using Graph-Theoretic Machine Learning to Predict Human Driver Behavior
* Using Multi-Source Nighttime Lights Data to Proxy for County-Level Economic Activity in China from 2012 to 2019
* using of deep neural networks and natural mechanisms of acoustic wave propagation for extinguishing flames, The
* Utilizing Deep Learning Towards Multi-Modal Bio-Sensing and Vision-Based Affective Computing
* UVStyle-Net: Unsupervised Few-shot Learning of 3D Style Similarity Measure for B-Reps
* V-DESIRR: Very Fast Deep Embedded Single Image Reflection Removal
* V-FIRST: A Flexible Interactive Retrieval System for Video at VBS 2022
* V2V-Based Cooperative Control of Uncertain, Disturbed and Constrained Nonlinear CAVs Platoon
* VaPiD: A Rapid Vanishing Point Detector via Learned Optimizers
* Variable-Rate Deep Image Compression through Spatially-Adaptive Feature Transform
* Variational Attention: Propagating Domain-Specific Knowledge for Multi-Domain Learning in Crowd Counting
* Variational Feature Disentangling for Fine-Grained Few-Shot Classification
* VariTex: Variational Neural Face Textures
* Varying Amplitude Vibration Phase Suppression Algorithm in ISAL Imaging
* VD-LAB: A view-decoupled network with local-global aggregation bridge for airborne laser scanning point cloud classification
* VD-PCR: Improving visual dialog with pronoun coreference resolution
* Vector Neurons: A General Framework for SO(3)-Equivariant Networks
* Vector-Decomposed Disentanglement for Domain-Invariant Object Detection
* Vegetation Browning Trends in Spring and Autumn over Xinjiang, China, during the Warming Hiatus
* Vehicle localisation and deep model for automatic calibration of monocular camera in expressway scenes
* Vehicle Speed Measurement Using Stereo Camera Pair
* Velocity-Based Path Following Control for Autonomous Vehicles to Avoid Exceeding Road Friction Limits Using Sliding Mode Method
* Velocity-to-velocity human motion forecasting
* VENet: Voting Enhancement Network for 3D Object Detection
* VERGE in VBS 2022
* Verification and Validation of Hybridspectral Radiometry Obtained from an Unmanned Surface Vessel (USV) in the Open and Coastal Oceans
* Vi2CLR: Video and Image for Visual Contrastive Learning of Representation
* Video Annotation for Visual Tracking via Selection and Refinement
* Video anomaly detection using deep residual-spatiotemporal translation network
* Video Autoencoder: Self-Supervised Disentanglement of Static 3D Structure and Motion
* Video Geo-Localization Employing Geo-Temporal Feature Learning and GPS Trajectory Smoothing
* Video Instance Segmentation with a Propose-Reduce Paradigm
* Video Matting via Consistency-Regularized Graph Neural Networks
* Video Object Segmentation with Dynamic Memory Networks and Adaptive Object Alignment
* Video Pose Distillation for Few-Shot, Fine-Grained Sports Action Recognition
* Video Question Answering Using Language-Guided Deep Compressed-Domain Video Feature
* Video Reenactment as Inductive Bias for Content-Motion Disentanglement
* Video Search with Context-Aware Ranker and Relevance Feedback
* Video Self-Stitching Graph Network for Temporal Action Localization
* Video-based Person Re-identification with Spatial and Temporal Memory Networks
* Video-Rate Dual-Modal Wide-Beam Harmonic Ultrasound and Photoacoustic Computed Tomography
* Videofall: A Hierarchical Search Engine for VBS2022
* VideoLT: Large-scale Long-tailed Video Recognition
* VidSfM: Robust and Accurate Structure-From-Motion for Monocular Videos
* VidTr: Video Transformer Without Convolutions
* Viewing Graph Solvability via Cycle Consistency
* ViewNet: Unsupervised Viewpoint Estimation from Conditional Generation
* Viewpoint Invariant Dense Matching for Visual Geolocalization
* Viewpoint-Agnostic Change Captioning with Cycle Consistency
* Viewport-Based CNN: A Multi-Task Approach for Assessing 360° Video Quality
* VIL-100: A New Dataset and A Baseline Model for Video Instance Lane Detection
* ViRMA: Virtual Reality Multimedia Analytics at Video Browser Showdown 2022
* Virtual light transport matrices for non-line-of-sight imaging
* Virtual Multi-Modality Self-Supervised Foreground Matting for Human-Object Interaction
* Virtual Reality Reminiscence Interface for Personal Lifelogs, A
* Virtual-Goal-Guided RRT for Visual Servoing of Mobile Robots With FOV Constraint
* Vis2Mesh: Efficient Mesh Reconstruction from Unstructured Point Clouds of Large Scenes with Learned Virtual View Visibility
* Visformer: The Vision-friendly Transformer
* Visio-Temporal Attention for Multi-Camera Multi-Target Association
* Vision Transformer with Progressive Sampling
* Vision Transformers for Dense Prediction
* Vision-Language Navigation with Random Environmental Mixup
* Vision-Language Transformer and Query Generation for Referring Segmentation
* VISIONE at Video Browser Showdown 2022
* Visual Alignment Constraint for Continuous Sign Language Recognition
* Visual Distant Supervision for Scene Graph Generation
* Visual Graph Memory with Unsupervised Representation for Visual Navigation
* Visual question answering with gated relation-aware auxiliary
* Visual Relationship Detection Using Part-and-Sum Transformers with Composite Queries
* Visual Saliency Transformer
* Visual Scene Graphs for Audio Source Separation
* Visual Transformers: Where Do Transformers Really Belong in Vision Models?
* Visual-Textual Attentive Semantic Consistency for Medical Report Generation
* ViViT: A Video Vision Transformer
* VLGrammar: Grounded Grammar Induction of Vision and Language
* VMNet: Voxel-Mesh Network for Geodesic-Aware 3D Semantic Segmentation
* Voids Filling of DEM with Multiattention Generative Adversarial Network Model
* VolumeFusion: Deep Depth Fusion for 3D Scene Reconstruction
* von Mises-Fisher Loss: An Exploration of Embedding Geometries for Supervised Learning
* Voxel Transformer for 3D Object Detection
* Voxel-based Network for Shape Completion by Leveraging Edge Generation
* VSAC: Efficient and Accurate Estimator for H and F
* Walk in the Cloud: Learning Curves for Point Clouds Shape Analysis
* Wallpaper Texture Generation and Style Transfer Based on Multi-Label Semantics
* Wanderlust: Online Continual Object Detection in the Real World
* Warp and Learn: Novel Views Generation for Vehicles and Other Objects
* Warp Consistency for Unsupervised Learning of Dense Correspondences
* Warp-Refine Propagation: Semi-Supervised Auto-Labeling via Cycle-Consistency
* WarpedGANSpace: Finding non-linear RBF paths in GAN latent space
* Wasserstein Coupled Graph Learning for Cross-Modal Retrieval
* Watch Only Once: An End-to-End Video Action Detection Framework
* Water Quality Chl-a Inversion Based on Spatio-Temporal Fusion and Convolutional Neural Network
* WaveFill: A Wavelet-based Generation Network for Image Inpainting
* Way to my Heart is through Contrastive Learning: Remote Photoplethysmography from Unlabelled Video, The
* Waypoint Models for Instruction-guided Navigation in Continuous Environments
* WB-DETR: Transformer-Based Detector without Backbone
* WDCCNet: Weighted Double-Classifier Constraint Neural Network for Mammographic Image Classification
* Weak Adaptation Learning: Addressing Cross-domain Data Insufficiency with Weak Annotator
* Weakly Supervised 3D Semantic Segmentation Using Cross-Image Consensus and Inter-Voxel Affinity Relations
* Weakly Supervised Amodal Segmenter with Boundary Uncertainty Estimation, A
* Weakly Supervised Contrastive Learning
* Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions
* Weakly Supervised Person Search with Region Siamese Networks
* Weakly Supervised Relative Spatial Reasoning for Visual Question Answering
* Weakly Supervised Representation Learning with Coarse Labels
* Weakly Supervised RGB-D Salient Object Detection With Prediction Consistency Training and Active Scribble Boosting
* Weakly Supervised Segmentation of Small Buildings with Point Labels
* Weakly Supervised Temporal Anomaly Segmentation with Dynamic Time Warping
* Weakly Supervised Text-based Person Re-Identification
* Weakly-Supervised Action Segmentation and Alignment via Transcript-Aware Union-of-Subspaces Learning
* Weakly-supervised semantic segmentation with superpixel guided local and global consistency
* Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning
* Wearable Photoplethysmography for Cardiovascular Monitoring
* Webly Supervised Fine-Grained Recognition: Benchmark Datasets and an Approach
* Weighted clustering ensemble: A review
* What an Ehm Leaks About You: Mapping Fillers into Personality Traits with Quantum Evolutionary Feature Selection Algorithms
* What Can We Learn from Nighttime Lights for Small Geographies? Measurement Errors and Heterogeneous Elasticities
* What You Can Learn by Staring at a Blank Wall
* When CNNs meet random RNNs: Towards multi-level analysis for RGB-D object and scene recognition
* When do GANs replicate? On the choice of dataset size
* When Pigs Fly: Contextual Reasoning in Synthetic and Natural Scenes
* Where Anthropogenic Activity Occurs, Anthropogenic Activity Dominates Vegetation Net Primary Productivity Change
* Where are you heading? Dynamic Trajectory Prediction with Expert Goal Examples
* Where2Act: From Pixels to Actions for Articulated 3D Objects
* Who's Waldo? Linking People Across Text and Images
* Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?
* Wide Weighted Attention Multi-Scale Network for Accurate MR Image Super-Resolution
* Wide-Angle Image Rectification: A Survey
* Wide-Area and Real-Time Object Search System of UAV
* Wide-Area Grid-Based Slant Ionospheric Delay Corrections for Precise Point Positioning
* Winter-Spring Phytoplankton Phenology Associated with the Kuroshio Extension Instability
* With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations
* Worldsheet: Wrapping the World in a 3D Sheet for View Synthesis from a Single Image
* X-World: Accessibility, Vision, and Autonomy Meet
* XQM: Search-Oriented vs. Classifier-Oriented Relevance Feedback on Mobile Phones
* XVFI: eXtreme Video Frame Interpolation
* You Don't Only Look Once: Constructing Spatial-Temporal Memory for Integrated 3D Object Detection and Tracking
* Your Eye in the Sky: Satellite Reconnaissance Comes in from the Cold
* YouRefIt: Embodied Reference Understanding with Language and Gesture
* Z-Score Normalization, Hubness, and Few-Shot Learning
* Zen-NAS: A Zero-Shot NAS for High-Performance Image Recognition
* Zero-Shot Day-Night Domain Adaptation with a Physics Prior
* zero-shot learning framework via cluster-prototype matching, A
* Zero-shot Natural Language Video Localization
* Zero-Shot Video Object Segmentation With Co-Attention Siamese Networks
* ZFlow: Gated Appearance Flow-based Virtual Try-on with 3D Priors
2600 for 2203


Last update: 14-Aug-22 22:16:46
Use price@usc.edu for comments.