Keith Price Bibliography journal Details for mult

Journals starting with mult

MultiCamera11 * *Activity Monitoring by Multi-Camera Surveillance Systems
* Determining operational measures from multi-camera surveillance systems using soft biometrics
* game-theoretic design for collaborative tracking in a video camera network, A
* HSV and RGB color histograms comparing for objects tracking among non overlapping FOVs, using CBTF
* Improved person detection in industrial environments using multiple self-calibrated cameras
* Multi-camera detection association for 3D localisation
* Multiple views based human motion tracking in surveillance videos
* Real time complex event detection for resource-limited multimedia sensor networks
8 for MultiCamera11

MultiEmbodied25 * *Multi-Agent Embodied Intelligent Systems Meet Generative-AI Era: Opportunities, Challenges and Futures
* AI Hiring with LLMs: A Context-Aware and Explainable Multi-Agent Framework for Resume Screening
* Deciding the Path: Leveraging Multi-Agent Systems for Solving Complex Tasks
* Efficient Task-Specific Conditional Diffusion Policies: Shortcut Model Acceleration and SO(3) Optimization
* LangCoop: Collaborative Driving with Language
* LLM-Enabled Multi-Agent Autonomous Mechatronics Design Framework, An
* Multi-Agent Systems for Robotic Autonomy with LLMs
* SwarmDiff: Swarm Robotic Trajectory Planning in Cluttered Environments via Diffusion Transformer
8 for MultiEmbodied25

MultInfoRetr( Vol No. ) * *International Journal of Multimedia Information Retrieval

MultInfoRetr(1) * Acquisition of Multimedia Ontology: An Application in Preservation of Cultural Heritage
* Bridging the gap between expert and novice users for video search
* Cost-sensitive learning in social image tagging: Review, New Ideas and Evaluation
* Directional local extrema patterns: a new descriptor for content based image retrieval
* efficient framework for location-based scene matching in image databases, An
* Exploiting contextual information for image re-ranking and rank aggregation
* Fast shape retrieval using a graph theoretic approach
* heterogeneous feature selection with structural sparsity for multimedia annotation and hashing: a survey, The
* Interactive search in image retrieval: a survey
* Large-scale near-duplicate image retrieval by kernel density estimation
* Leveraging visual concepts and query performance prediction for semantic-theme-based video retrieval
* Multimedia semantics-aware query-adaptive hashing with bits reconfigurability
* Multimodal Image Retrieval
* New Grand Challenge for Multimedia Information Retrieval: Bridging the Utility Gap
* Optical music recognition: state-of-the-art and open issues
* Semantics-based selection of everyday concepts in visual lifelogging
* study on video data mining, A
* Video concept detection by audio-visual grouplets
* Á trous gradient structure descriptor for content based image retrieval
19 for MultInfoRetr(1)

MultInfoRetr(2) * 3D object retrieval using salient views
* Best papers in multimedia information retrieval
* Beyond audio and video retrieval: Topic-oriented multimedia summarization
* Bundle min-Hashing
* Combining usage and content in an online recommendation system for music in the Long Tail
* Content analysis meets viewers: linking concept detection with demographics on YouTube
* Event-related image retrieval: exploring geographical and temporal distribution of user tags
* Exploiting semantics on external resources to gather visual examples for video retrieval
* Genre-specific modeling of visual features for efficient content based video shot classification and retrieval
* geometrical distance measure for determining the similarity of musical harmony, A
* High-level event recognition in unconstrained videos
* Hybrid music information retrieval
* intelligent content-based image retrieval system for clinical decision support in brain tumor diagnosis, An
* Intrinsic spatial pyramid matching for deformable 3D shape retrieval
* Location-aware music recommendation
* Minimal test collections for low-cost evaluation of Audio Music Similarity and Retrieval systems
* Mobile video concept classification
* Multimodal biomedical image retrieval using hierarchical classification and modality fusion
* Searching for images by video
* Tonal representations for music retrieval: from version identification to query-by-humming
* Very large scale nearest neighbor search: Ideas, strategies and challenges
* When music makes a scene
22 for MultInfoRetr(2)

MultInfoRetr(3) * ACM ICMR 2014 best papers in image retrieval
* Adaptive diversification for tag-based social image retrieval
* Context-assisted face clustering framework with human-in-the-loop
* Editorial of the special issue on cross-media analysis
* Image re-ranking system based on closed frequent patterns
* Improving the quality of K-NN graphs through vector sparsification: application to image databases
* incremental evolutionary learning method for optimizing content-based image indexing algorithms, An
* Indexing heterogeneous features with superimages
* Information extraction from multimedia web documents: An open-source platform and testbed
* Interactive cross and multimodal biomedical image retrieval based on automatic region-of-interest (ROI) identification and classification
* MET: media-embedded target for connecting paper to digital media
* Multimedia information retrieval: best papers and expanding frontiers
* Multivariate time series modeling of geometric features of spatio-temporal volumes for content based video retrieval
* Optimization of information retrieval for cross media contents in a best practice network
* Parallel incremental power mean SVM for the classification of large-scale image datasets
* Person instance graphs for mono-, cross- and multi-modal person recognition in multimedia data: application to speaker identification in TV broadcast
* Self-similarity-based partial near-duplicate video retrieval and alignment
* sparse kernel relevance model for automatic image annotation, A
* Statistical framework for content-based medical image retrieval based on wavelet orthogonal polynomial model with multiresolution structure
* Topic detection in cross-media: A semi-supervised co-clustering approach
* Video Browser Showdown: a live evaluation of interactive video search tools, The
21 for MultInfoRetr(3)

MultInfoRetr(4) * Aligning plot synopses to videos for story-based retrieval
* aMM: Towards adaptive ranking of multi-modal documents
* Bregman pooling: feature-space local pooling for image classification
* Building effective SVM concept detectors from clickthrough data for large-scale image retrieval
* Detection of social events in streams of social multimedia
* Distributed cross-media multiple binary subspace learning
* Generic multivariate model for color texture classification in RGB color space
* ImageCLEF annotation with explicit context-aware kernel maps
* influence of image descriptors' dimensions' value cardinalities on large-scale similarity search, The
* Large image modality labeling initiative using semi-supervised and optimized clustering
* Learning to detect concepts with Approximate Laplacian Eigenmaps in large-scale and online settings
* Multi-Bin search: improved large-scale content-based image retrieval
* novel framework for CBCD using integrated color and acoustic features, A
* On-the-fly learning for visual search of large-scale image and video datasets
* Optimizing visual dictionaries for effective image retrieval
* Region-based Image Retrieval Using Shape-Adaptive DCT
* Region-based Image Retrieval Using Shape-Adaptive DCT
* Special issue on concept detection with big data
* Special issue on video retrieval
* Studying the impact of sequence clustering on near-duplicate video retrieval: an experimental comparison
* VIDCAR: an unsupervised CBVR framework for identifying similar videos with prominent object motion
* Video classification with Densely extracted HOG/HOF/MBH features: An evaluation of the accuracy/computational efficiency trade-off
* Weakly supervised detection of video events using hidden conditional random fields
23 for MultInfoRetr(4)

MultInfoRetr(5) * Automatic environmental sound concepts discovery for video retrieval
* Blind late fusion in multimedia event retrieval
* Boosting local texture descriptors with Log-Gabor filters response for improved image retrieval
* Bundling centre for landmark image discovery
* Classification of color texture images based on modified WLD
* Deep shape-aware descriptor for nonrigid 3D object retrieval
* efficient method for video shot boundary detection and keyframe extraction using SIFT-point distribution histogram, An
* Image recommendation based on keyword relevance using absorbing Markov chain and image features
* Improving content-based image retrieval with compact global and local multi-features
* IR_URFS_VF: image recommendation with user relevance feedback session and visual features in vertical image search
* Learning content-social influential features for influence analysis
* Learning initial feature weights for CBIR using query augmentation
* Major events in multimedia information retrieval
* MGraph: multimodal event summarization in social media using topic models and graph-based ranking
* novel approach for shape-based object recognition with curvelet transform, A
* On the coupled use of signal and semantic concepts to bridge the semantic and user intention gaps for visual content retrieval
* On the use of commonsense ontology for multimedia event recounting
* Open and free datasets for multimedia retrieval
* Robust facial expression recognition system based on hidden Markov models
* Special issue on visual information retrieval
* Text-to-video: a semantic search engine for internet videos
* User-adaptive image retrieval via fusing pointwise and pairwise labels
22 for MultInfoRetr(5)

MultInfoRetr(6) * ACSIR: ANOVA Cosine Similarity Image Recommendation in vertical search
* Computational framework for emotional VAD prediction using regularized Extreme Learning Machine
* DBAHCL: database for Arabic handwritten characters and ligatures
* Editorial for the ICMR 2016 special issue
* Fast discrete curvelet transform-based anisotropic feature extraction for biomedical image indexing and retrieval
* Instance search retrospective with focus on TRECVID
* Investigating country-specific music preferences and music recommendation algorithms with the LFM-1b dataset
* Learning hierarchical video representation for action recognition
* Multi-frame twin-channel descriptor for person re-identification in real-time surveillance videos
* Multicontext-adaptive indexing and search for large-scale video navigation
* Multilingual visual sentiment concept clustering and analysis
* overview of approaches for content-based medical image retrieval, An
* overview of traffic sign detection and classification methods, An
* OVIS: ontology video surveillance indexing and retrieval system
* Query-by-example music information retrieval by score-based genre prediction and similarity measure
* Script identification algorithms: a survey
* Shot boundary detection using perceptual and semantic information
* survey of tag-based information retrieval, A
* survey on camera-captured scene text detection and extraction: towards Gurmukhi script, A
* Survey on handwritten documents word spotting, A
* Toward semantic content-based image retrieval using Dempster-Shafer theory in multi-label classification framework
* Unsupervised group feature selection for media classification
22 for MultInfoRetr(6)

MultInfoRetr(8) * 3D local circular difference patterns for biomedical image retrieval
* automatic feature extraction and fusion model: Application to electromyogram (EMG) signal classification, An
* Automatic visual pattern mining from categorical image dataset
* Balancing search space partitions by sparse coding for distributed redundant media indexing and retrieval
* Brain disease diagnosis using local binary pattern and steerable pyramid
* Color-independent classification of animation video
* complete person re-identification model using Kernel-PCA-based Gabor-filtered hybrid descriptors, A
* Content-based medical image retrieval of CT images of liver lesions using manifold learning
* Cross-specificity: modelling data semantics for cross-modal matching and retrieval
* Current challenges and visions in music recommender systems research
* Detection and visualization of misleading content on Twitter
* DHFML: deep heterogeneous feature metric learning for matching photograph and cartoon pairs
* Digital watermarking for deep neural networks
* Editorial for the ICMR 2017 special issue
* Editorial for the ICMR 2018 special issue
* efficient content-based medical image indexing and retrieval using local texture feature descriptors, An
* End-to-end cross-modality retrieval with CCA projections and pairwise ranking loss
* Estimating the information gap between textual and visual representations
* faceted approach to reachability analysis of graph modelled collections, A
* Hybrid descriptors and Weighted PCA-EFMNet for Face Verification in the Wild
* Improvement of image description using bidirectional LSTM
* Joint embeddings with multimodal cues for video-text retrieval
* Mining exoticism from visual content with fusion-based deep neural networks
* MSRC: multimodal spatial regression with semantic context for phrase grounding
* Multi-dimensional multi-directional mask maximum edge pattern for bio-medical image retrieval
* Multi-view collective tensor decomposition for cross-modal hashing
* Multimodal analysis of user behavior and browsed content under different image search intents
* Order, context and popularity bias in next-song recommendations
* Pedestrian detection using first- and second-order aggregate channel features
* Probabilistic selection of frames for early action recognition in videos
* review of semantic segmentation using deep neural networks, A
* review on robust video copy detection, A
* Robustness of DR-LDP over PCANet for face analysis
* Semi-supervised domain adaptation for pedestrian detection in video surveillance based on maximum independence assumption
* Spatiotemporal wavelet correlogram for human action recognition
* Survey on brain tumor segmentation and feature extraction of MR images
* survey paper on secret image sharing schemes, A
* Three-dimensional spatio-temporal trajectory descriptor for human action recognition
* Transferred Semantic Scores for Scalable Retrieval of Histopathological Breast Cancer Images
* Using visual features based on MPEG-7 and deep learning for movie recommendation
* Video instance search via spatial fusion of visual words and object proposals
41 for MultInfoRetr(8)

MultInfoRetr(9) * Characterization and classification of semantic image-text relations
* ContextNet: representation and exploration for painting classification and retrieval in context
* Editorial for the ICMR 2019 special issue
* Effective video hyperlinking by means of enriched feature sets and monomodal query combinations
* Focus-Aspect-Value model for predicting subjective visual attributes, The
* Hierarchical attentive deep neural networks for semantic music annotation through multiple music representations
* Hypergraph learning with collaborative representation for image search reranking
* Image annotation: the effects of content, lexicon and annotation method
* Learning visual features for relational CBIR
* Multi-level context extraction and attention-based contextual inter-modal fusion for multimodal sentiment analysis and emotion classification
* retrieval-based approach for diverse and image-specific adversary selection, A
* Single-image crowd counting: a comparative survey on deep learning-based approaches
* Special issue on deep learning in image and video retrieval
* study on deep learning spatiotemporal models and feature extraction techniques for video understanding, A
* survey of traditional and deep learning-based feature descriptors for high dimensional data in computer vision, A
* survey on instance segmentation: state of the art, A
16 for MultInfoRetr(9)

MultiSP( Vol No. ) * *Multidimensional Systems and Signal Processing

MultiSP(6) * On the Smoothness Constraint in the Intensity-Based Estimation of the Parallax Field

MultiSP(8) * Adaptive Morphological Representation of Signals: Polynomial and Wavelet Methods
* Grobner Bases and Multidimensional FIR Multirate Systems
* Low Bit-Rate Design Considerations for Wavelet-Based Image-Coding
* Multidimensional Filter Banks and Wavelets: Research Developments and Applications - Preface
* Multiresolution Vector Quantization for Video Coding
* Multiscale, Statistical Anomaly Detection Analysis and Algorithms for Linearized Inverse Scattering Problems
* New Bit-Rate Control of MPEG with Predictive and Adaptive Perceptual Quantization, A
* On the Scalability of 2-D Discrete Wavelet Transform Algorithms
* On Translation Invariant Subspaces and Critically Sampled Wavelet Transforms
* Reconstruction and Decomposition Algorithms for Biorthogonal Multiwavelets
* Zero-Phase Filter Bank and Wavelet Code R-Matrices: Properties, Triangular Decompositions, and a Fast Algorithm
* Zero-Phase Filter Bank and Wavelet Code R-Matrices: Properties, Triangular Decompositions, and a Fast Algorithm
12 for MultiSP(8)

MultiSP(9) * Low-Bit-Rate VQ: A Projection Based Approach
* ROI Search Method for Still Images Based on Set Descriptions, An

MultiTemp11 * *International Workshop on the Analysis of Multi-temporal Remote Sensing Images
* Active-learning based cascade classification of multitemporal images for updating land-cover maps
* Analysis of earth observation time series to investigate the relation between rainfall, vegetation dynamic and streamflow in the Uele' basin (Central African Republic)
* Analysis of LULC changes and urban expansion of the resort city of Al Ain using remote sensing and GIS
* Analysis of NOAA/AVHRR multitemporal images, climate conditions and cultivated land of sugarcane fields applied to agricultural monitoring
* Analytical description of pseudo-invariant features (PIFs)
* Assessing the impact of the orbital drift of SPOT-VGT1 by comparing with SPOT-VGT2 data
* Automated backdating of transportation networks with Landsat imagery
* Automatic interpolation of phenological phases in Germany
* Bathymetry from fusion of multi-temporal Landsat and radar altimetery
* Braided river dynamics determined using satellite imagery: Upper Rakaia River, Canterbury, New Zealand
* Change detection in very high resolution imagery based on dynamic time warping: An implementation for Haiti earthquake damage assessment
* Classification of dynamic evolutions from satellitar image time series based on similarity measures
* Clustering analysis applied to NDVI/NOAA multitemporal images to improve the monitoring process of sugarcane crops
* Clustering of satellite image time series under Time Warping
* Coarse to fine patches-based multitemporal analysis of very high resolution satellite images
* Comparison of two remote sensing time series analysis methods for monitoring forest decline
* Deriving plant phenology from remote sensing
* Detection of small changes in airborne hyperspectral imagery: Experimental results over urban areas
* Does evapotranspiration influence the strength of the North American monsoon? Multitemporal satellite analysis of evapotranspiration and its effects
* Dynamic mapping of cropland areas in Sub-Saharan Africa using MODIS time series
* Effect of the learning algorithm on the accuracy of sub-pixel land use classifications with multilayer perceptrons
* Effects of multitemporal scene changes on pansharpening fusion
* Exploring the capacity to grasp multi-annual seasonal variability of winter wheat in Continental Climates with MODIS
* Feature extraction for NDVI AVHRR/NOAA time series classification
* Generation of 250m MODIS LAI time series by temporal regression
* Greenland inland ice melt-off: Analysis of global gravity data from the GRACE satellites
* hyperspectral reflectance data based model inversion methodology to detect reniform nematodes in cotton, A
* Identification of grazed and mown grasslands using a time series of high-spatial-resolution remote sensing images
* impact of inter-annual variability in remote sensing time series on modeling tree species distributions, The
* Investigation of evolutionary feature subset selection in multi-temporal datasets for harmful algal bloom detection
* Land cover change detection thresholds for Landsat data samples
* Land cover classification by using multi-temporal COSMO-SkyMed data
* Low and high spatial resolution time series fusion for improved land cover map production
* method for change detection with multi-temporal satellite images based on Principal Component Analysis, A
* Monitoring a fuzzy object: The case of Lake Naivasha
* Monitoring African surface water dynamic using medium resolution daily data allows anomalies detection in nearly real time
* Monitoring crop growth inter-annual variability from MODIS time series: Performance comparison between crop specific green area index and current global leaf area index products
* Monitoring environmental change in the Andes based on SPOT-VGT and NOAA-AVHRR time series analysis
* Monitoring global vegetation with the Yearly Land Cover Dynamics (YLCD) method
* Monitoring land cover changes in Hulun Buir by using object-oriented method
* Multi-temporal analysis of a mangrove ecosystem in Southeastern Brazil using object-based classification applied to IKONOS II data
* Multi-temporal damage assessment of linear infrastructural objects using Dynamic Bayesian Networks
* Multi-temporal SAR classification according to change detection operators
* multilevel approach to change detection for port surveillance with very high resolution SAR images, A
* Multitemporal classification of natural vegetation cover in Brazilian Cerrado
* Multitemporal data management and exploitation infrastructure
* Multitemporal fusion of Landsat and MERIS images
* NDVI time series and Markov chains to model the change of fuzzy vegetative drought classes
* Phenology of the natural vegetation: A land cover specific approach for a reference dataset in Central Africa
* PhenoSat: A tool for vegetation temporal analysis from satellite image data
* Producing global land cover maps consistent over time to respond the needs of the climate modelling community
* Quantification of LAI interannual anomalies by adjusting climatological patterns
* robust approach for phenological change detection within satellite image time series, A
* robust change detection feature for Cosmo-SkyMed detected SAR images, A
* SAR imagery change detection method for Land Border Monitoring
* Semi-automated generation of a multi-temporal forest depletion layer with the Landcover Change Mapper (LCM)
* Snow cover monitoring in alpine regions with COSMO-SkyMed images by using a multitemporal approach and depolarization ratio
* Spatial and temporal mapping of leaf area index in Alpine pastures and meadows with satellite MODIS imagery
* Spatiotemporal dimensionality and time-space characterization of vegetation phenology from multitemporal MODIS EVI
* Spatiotemporal mining of ENVISAT SAR interferogram time series over the Haiyuan fault in China
* Spectral-Temporal Analysis by Response Surface applied to detect deforestation in the Brazilian Amazon
* Time-series analysis of rainforest clearing in Sabah, Borneo using Landsat imagery
* Tools for multitemporal analysis and classification of multisource satellite imagery
* Unravelling long-term vegetation change patterns in a binational watershed using multitemporal land cover data and historical photography
* Urbanization analysis by mutual information based change detection between SPOT 5 panchromatic images
* Use of multi-annual MODIS Land Surface Temperature data for the characterization of the heat requirements for grapevine varieties
* Using NASA'S Long Term Data Record version 3 for the monitoring of land surface vegetation
* Utilization of spectral measurements and phenological observations to detect grassland-habitats with a RapidEye intra-annual time-series
* Year-to-year variability of NDVI in croplands and grasslands across a regional grasslands-forest ecotone in Central Alberta, Canada
70 for MultiTemp11

MultiTemp15 * *International Workshop on the Analysis of Multi-temporal Remote Sensing Images
* 13 Years of changes in the extent and physiognomy of mangroves after shrimp farming abandonment, Bali
* 3D displacement retrieval on glacial areas by airborne multi-view photogrammetry
* Agricultural monitoring with polarimetric SAR time series
* Alpine algorithms-time series of innovative remote sensing products for Alpine areas: Snow cover leaf area index and soil moisture
* alternative representation of coarse-resolution remote sensing images for time-series processing, An
* Building profile reconstruction using TerraSAR-X data time-series and tomographic techniques
* Change analysis of dual polarimetric Sentinel-1 SAR image time series using stationary wavelet transform and change detection matrix
* Change detection in bi-temporal data by canonical information analysis
* Change detection of coral reef habitats from multi-temporal and multi-source satellite imagery in Bunaken, Indonesia
* Change detection using multiscale segmentation and Kullback-Leibler divergence: Application on road damage extraction
* Characteristics of spatial-temporal sprawl in specific Chinese coastal cities from 1979 to 2013
* Cloud removal in image time series through unmixing
* CloudSim: A fair benchmark for comparison of methods for times series reconstruction from cloud and atmospheric contamination
* Comparison between spatial and temporal estimation of entropy on polarimetric SAR images
* Consistent forest change maps 198-2000 from the AVHRR time series: Case studies for South America and Indonesia
* Coupling of phenological information and synthetically generated time-series for crop types as indicator for vegetation coverage information
* Data assimilation in multiscale complex systems
* Data fusion approach for Urban area identification using multisensor information
* Data stream mining for multitemporal remote sensing data
* Dealing whith occultation when accounting for observation error correlation in a wavelet space
* Deformation estimation on low coherence areas by means of polarimetric differential SAR interferometry
* Determining the effects of ENSO phenomena on Andean areas by applying radiometric indices on long time series
* dynamical model to classify the content of multitemporal images employing distributed computing techniques, A
* Elevation changes and X-band ice and snow penetration inferred from TanDEM-X data of the Mont-Blanc area
* Evaluating the temporal stability of synthetically generated time-series for crop types in Central Germany
* Exploiting satelitte image time series for monitoring ecological quality parameters of french reservoirs
* Exploring the validity of the long term data record V4 database for land surface monitoring
* Extracting characteristics of satellite image time series with decision trees
* Fine co-registration of VHR images for multitemporal Urban area analysis
* Fluctuations of Caucasian glaciers in 20th century
* Global snow cover mapping using a multi-temporal multi-sensor approach
* Ground echoes filtering using the completed local binary pattern and the support vector machine
* Improved crop classification using multitemporal RapidEye data
* Inpainting restoration for inland waters Mexico ecosystems
* keypoint approach for change detection between SAR images based on graph theory, A
* Land cover change dynamics and multi-factor analysis in high mountains basins of Colombian Andes
* Landscape features that prevent or foster urban sprawl
* Mapping the snow line altitude for large glacier samples from multitemporal Landsat imagery
* Modeling high rainfall regions for flash flood nowcasting
* Monitoring forest recovery with change metrics derived from Landsat time series stacks
* Multitemporal classification without new labels: A solution with optimal transport
* Multitemporal data mining: From biomass monitoring to nuclear proliferation detection
* Multivariate statistical modeling for multi-temporal SAR change detection using wavelet transforms
* Normalized difference phytoplankton index (NDPI) and spatio-temporal cloud filtering for multitemporal cyanobacteria pollution analysis on Erie Lake in 2014
* Numerical models to forecast the sugarcane production in regional scale based on time series of NDVI/AVHRR images
* Prediction of NDVI for grassland habitats by fusing RapidEye and Landsat imagery
* Primal sketch of image series with edge preserving filtering application to change detection
* Processing polarimetric SAR time series over urban areas with binary partition trees
* rapid mapping approach to quantify damages caused by the 2003 bam earthquake using high resolution multitemporal optical images, A
* Recent elevation and velocity changes of Astrolabe Glacier, Terre Adelie, Antarctica
* Region-based change detection of PolSAR images using analytic information-theoretic divergence
* Regional glacier mapping from time-series of Landsat type data
* Retrieving daily evapotranspiration from the combination of geostationary and polar-orbit satellite data
* Robust glacier displacements using knowledge-based image matching
* Satellite image time series classification and analysis using an adapted graph labeling
* scalable spatiotemporal inference framework based on statistical shape analysis for natural ecosystem monitoring by remote sensing, A
* Sparse-smooth decomposition models for multi-temporal SAR images
* Spatio-temporal characterization in satellite image time series
* statistical approach for predicting grassland degradation in disturbance-driven landscapes, A
* Superpixel-based change detection in high resolution SAR images using region covariance features
* swap randomization approach for mining motion field time series over the Argentiere glacier, A
* Temporal stability of mangrove multispectral signatures at fine scales: Stability of mangrove multispectral signatures
* Testing satellite rainfall estimates for yield simulation of a rainfed cereal in West Africa
* Time series analysis of multi-frequency SAR backscatter and bistatic coherence in the context of flood mapping
* Towards the large-scale assessment of vegetation biomass production stability
* Tree species discrimination in temperate woodland using high spatial resolution Formosat-2 time series
* Trends in 15-year MODIS NDVI time series for Mexico
68 for MultiTemp15

MultiTemp17 * *International Workshop on the Analysis of Multi-temporal Remote Sensing Images
* Agricultural monitoring using clustering techniques on satellite image time series of low spatial resolution
* Analysis of multitemporal Sentinel-2 images in the framework of the ESA Scientific Exploitation of Operational Missions
* Analysis of Riparian forest buffers dynamics in Colombian basins by Landsat Time Series
* Angular normalisation of PROBA-V 300m NDVI
* ASAP - Anomaly hot Spots of Agricultural Production, a new early warning decision support system developed by the Joint Research Centre
* Assessing hypertemporal SENTINEL-1 COHERENCE maps for LAND COVER monitoring
* Assessment of ALOS PALSAR 25-m mosaic data for land cover mapping
* Assessment of AquaCrop for winter wheat using satellite derived fCover data
* Assessment of time series consistency of terrestrial Essential Climate Variables
* Automatic production of large-scale cloud-free orthomosaics from multitemporal satellite images
* Automatic smoothing of remote sensing data
* Built-up areas mapping at global scale based on adaptive parametric thresholding of Sentinel-1 intensity coherence time series
* Change detection in a series of Sentinel-1 SAR data
* Circular change detection in image time series inspired by two-dimensional phase unwrapping
* Classification of anthropogenic landscapes
* Combined use of SAR and optical time series data for near real-time forest disturbance mapping
* Detecting the spread of invasive species in central Chile with a Sentinel-2 time-series
* Estimate yield at parcel level from S2 time serie in sub-Saharan smallholder farming systems
* Estimating total aboveground, stem and branch biomass using multi-frequency SAR
* European Space agency (ESA) Landsat MSS/TM/ETM+/OLI archive: 42 years of our history
* Evaluating an energy balance setting and random forest-based downscaling for the estimation of daily ET at sub-kilometer spatial resolution
* Filtering mislabeled data for improving time series classification
* Glacier ice loss monitored through the Planet cubesat constellation
* Global climatic drivers of vegetation based on wavelet analysis
* Handling coherence measures of displacement field time series: Application to Greenland ice sheet glaciers
* Harbour pattern of life analysis with time series of medium resolution satellite images
* Humid tropical forest monitoring with multi-temporal L-, C- and X-band SAR data
* Identifying crops in smallholder farms using time series of WorldView-2 images
* Image representation alternatives for the analysis of satellite image time series
* Investigating the control of ocean-atmospheric oscillations over global terrestrial evaporation using a simple supervised learning method
* Joint retrieval of surface reflectance and aerosol properties from PROBA-V observations, part I: Algorithm performance evaluation
* Joint Surface Reflectance and AeRosol properties retrieval in the PV-LAC framework, part II: Validation
* Land cover change detection in Satellite Image Time Series using an active learning method
* Land surface phenology from Copernicus Global Land time series
* Land-cover evolution class analysis in Image Time Series of Landsat and Sentinel-2 based on Latent Dirichlet Allocation
* Lava emplacement mapping with SAR and optical satellite data
* Leveraging Sentinel-1 time-series data for mapping agricultural land cover and land use in the tropics
* Mapping of season length anomalies in Mexico
* Mapping small reservoirs in semi-arid regions using multitemporal SAR: Methods and applications
* Mapping tree species of forests in southwest France using Sentinel-2 image time series
* Monitoring pasture intesification in Brazilian Amazon biome with MODIS time series
* Mountain crop monitoring with multitemporal Sentinel-1 and Sentinel-2 imagery
* Multi temporal data visualization in EO mobile apps
* Multi-temporal and multi-source alpine glacier cover classification
* Multitemporal Sentinel-2 data: remarks and observations
* non-linear data-driven approach to reveal global vegetation sensitivity to climate, A
* novel method for unsupervised multiple Change Detection in hyperspectral images based on binary Spectral Change Vectors, A
* On the use of guided regularized random forests to identify crops in smallholder farm fields
* Optimizing SAR change detection based on log-ratio features
* Potato monitoring in Belgium with WatchITGrow
* Potential of Sentinel-2 and SPOT5 (Take5) time series for the estimation of grasslands biodiversity indices
* Preliminary exploration of introducing spatial correlation information into the probabilistic patch-based similarity measure
* Proba-V cloud detection Round Robin: Validation results and recommendations
* Remote sensing monitoring of land restoration interventions in semi-arid environments using a before-after control-impact statistical design
* Retrospective analysis of long-term landscape evolution based on archive satellite imagery and historical maps
* RGB SAR product exploiting multitemporal: General processing and applications
* Sea Surface Temperature changes analysis, an Essential Climate Variable for Ecosystem Services provisioning
* SITS for estimating sugarcane production
* Spatial relationships between natural resources and land use dynamics in the Amazonian agricultural frontier
* Spatio-temporal evolution of crop fields in Sentinel-2 Satellite Image Time Series
* Spatiotemporal variations of alpine climate, snow cover and phenology
* Support for Multi-temporal and Multi-mission data processing: The ESA Research and Service Support
* Survey of current hyperspectral Earth observation applications from space and synergies with Sentinel-2
* Temporal analysis of SAR imagery for permanent and evolving Earth land cover behavior assessment
* Temporal relationships between daily precipitation and NDVI time series in Mexico
* Unsupervised change detection of remote sensing images using superpixel segmentation and variational Gaussian mixture model
* Urban area change detection based on generalized likelihood ratio test
* use of Landsat time series for identification of forest degradation levels in the eastern Brazilian Amazon (Paragominas), The
* Using Landsat-8 and Sentinel-1 data for Above Ground Biomass assessment in the Tamar valley and Dartmoor
* Variations in mangrove regeneration rates under different management plans: An analysis of Landsat time-series in the Matang Mangrove Forest Reserve, Peninsular Malaysia
71 for MultiTemp17

MultiView07 * *Beyond Multiview Geometry: Robust Estimation and Organization of Shapes from Multiple Cues
* Active Visual Object Reconstruction using D-, E-, and T-Optimal Next Best Views
* Constrained Optimization for Retinal Curvature Estimation Using an Affine Camera
* Joint Priors for Variational Shape and Appearance Modeling
* MRF and Gaussian Curvature Based Shape Representation for Shape Matching, An
* Multiview normal field integration using level set methods
* Opti-Acoustic Stereo Imaging, System Calibration and 3-D Reconstruction
* Robust Click-Point Linking: Matching Visually Dissimilar Local Regions
* Scene-space Feature Detectors
9 for MultiView07

Multiview17 * *Multiview Relationships in 3D Data
* Accurate Depth Map Estimation from Small Motions
* Camera Pose Filtering with Local Regression Geodesics on the Riemannian Manifold of Dual Quaternions
* Combining Exemplar-Based Approach and learning-Based Approach for Light Field Super-Resolution Using a Hybrid Imaging System
* Computer Vision Meets Geometric Modeling: Multi-view Reconstruction of Surface Points and Normals Using Affine Correspondences
* Content-Aware Metric for Stitched Panoramic Image Quality Assessment, A
* Edge SLAM: Edge Points Based Monocular Visual SLAM
* KPPF: Keypoint-Based Point-Pair-Feature for Scalable Automatic Global Registration of Large RGB-D Scans
* Multiview Absolute Pose Using 3D-2D Perspective Line Correspondences and Vertical Direction
* On Tablet 3D Structured Light Reconstruction and Registration
* Probabilistic Surfel Fusion for Dense LiDAR Mapping
* Use-Case Study on Multi-view Hypothesis Fusion for 3D Object Classification, A
12 for Multiview17

MultLearnApp18 * *Multimodal Learning and Applications Workshop
* Boosting LiDAR-Based Semantic Labeling by Cross-modal Training Data Generation
* CentralNet: A Multilayer Approach for Multimodal Fusion
* Generalized Bayesian Canonical Correlation Analysis with Missing Modalities
* Learning from #Barcelona Instagram Data What Locals and Tourists Post About Its Neighbourhoods
* Learning to Learn from Web Data Through Deep Semantic Embeddings
* Structured Listwise Approach to Learning to Rank for Image Tagging, A
* ThermalGAN: Multimodal Color-to-Thermal Image Translation for Person Re-identification in Multispectral Dataset
* Unpaired Thermal to Visible Spectrum Transfer Using Adversarial Training
* Visual-Semantic Alignment Across Domains Using a Semi-Supervised Approach
* Visually Indicated Sound Generation by Perceptually Optimized Classification
* Where and What Am I Eating? Image-Based Food Menu Recognition
12 for MultLearnApp18

MultMed * *IEEE Transactions on Multimedia

MultMed(1) * Content-Based Video Indexing Retrieval
* Detection of Moving Cast Shadows for Object Segmentation

MultMed(10) * Admission Control Scheme Based on Online Measurement for VBR Video Streams Over Wireless Home Networks, An
* Association and Temporal Rule Mining for Post-Filtering of Semantic Concept Detection in Video
* Association Rule-Based Method to Support Medical Image Diagnosis With Efficiency, An
* Audio-Visual Affective Expression Recognition Through Multistream Fused HMM
* Batch Nearest Neighbor Search for Video Retrieval
* Boosting-Based Multimodal Speaker Detection for Distributed Meeting Videos
* Channel Aware Multiuser Scalable Video Streaming Over Lossy Under-Provisioned Channels: Modeling and Analysis
* Color-Based Image Salient Region Segmentation Using Novel Region Merging Strategy
* Comprehensive Survey on Three-Dimensional Mesh Watermarking, A
* Compression of 3-D Point Visual Data Using Vector Quantization and Rate-Distortion Optimization
* Constrained Probabilistic Petri Net Framework for Human Activity Detection in Video, A
* Content-Aware Playout and Packet Scheduling for Video Streaming Over Wireless Links
* Content-Aware Prediction Algorithm With Inter-View Mode Decision for Multiview Video Coding
* Content-Based Image Retrieval Using Multiresolution Color and Texture Features
* Cross-Dimensional Perceptual Quality Assessment for Low Bit-Rate Videos
* Delay-Constrained and R-D Optimized Transrating for High-Definition Video Streaming Over WLANs
* DISCOV: A Framework for Discovering Objects in Video
* Discriminant Graph Structures for Facial Expression Recognition
* Document Image Processing for Paper Side Communications
* Efficient Deblocking With Coefficient Regularization, Shape-Adaptive Filtering, and Quantization Constraint
* Efficient Watermarking Method Based on Significant Difference of Wavelet Coefficient Quantization, An
* Energy-Constrained Distortion Reduction Optimization for Wavelet-Based Coded Image Transmission in Wireless Sensor Networks
* Face Annotation Using Transductive Kernel Fisher Discriminant
* Fast Best-Match Shape Searching in Rotation-Invariant Metric Spaces
* Fast Inter Mode Decision Using Spatial Property of Motion Field
* Fragile Watermarking With Error-Free Restoration Capability
* Graph-Based Multiplayer Detection and Tracking in Broadcast Soccer Videos
* Highly Efficient VLSI Architecture for H.264/AVC CAVLC Decoder, A
* Human Age Estimation With Regression on Discriminative Aging Manifold
* Image Retrieval Over Networks: Active Learning Using Ant Algorithm
* Image Retrieval With Relevance Feedback Based on Graph-Theoretic Region Correspondence Estimation
* Implementing the 2-D Wavelet Transform on SIMD-Enhanced General-Purpose Processors
* Improving Robustness of Quantization-Based Image Watermarking via Adaptive Receiver
* Integrated Mining of Visual Features, Speech Features, and Frequent Patterns for Semantic Video Annotation
* Interactive Transmission of JPEG2000 Images Using Web Proxy Caching
* Intra/Inter Macroblock Mode Decision for Error-Resilient Transcoding
* Joined Spectral Trees for Scalable SPIHT-Based Multispectral Image Compression
* Joint Source-Channel Video Coding Scheme Based on Distributed Source Coding, A
* Linear Rate Control and Optimum Statistical Multiplexing for H.264 Video Broadcast
* Low Complexity Detection of Discrete Cross Differences for Fast H.264/AVC Intra Prediction, A
* Low-Complexity Heterogeneous Video Transcoding Using Data Mining
* Mining Appearance Models Directly From Compressed Video
* Multilevel Asymmetric Scheme for Digital Fingerprinting, A
* Multimodal and Multilevel Ranking Scheme for Large-Scale Video Retrieval, A
* Multimodal Scheme for Program Segmentation and Representation in Broadcast Video Streams, A
* No-Reference PSNR Estimation for Quality Monitoring of Motion JPEG2000 Video Over Lossy Packet Networks
* Novel Framework for Semantic Annotation and Personalized Retrieval of Sports Video, A
* Optimizing Multiple Object Tracking and Best View Video Synthesis
* Paired Subimage Matching Watermarking Method on Ordered Dither Images and Its High-Quality Progressive Coding
* Partitioning of Multiple Fine-Grained Scalable Video Sequences Concurrently Streamed to Heterogeneous Clients
* Predicting Visual Focus of Attention From Intention in Remote Collaborative Tasks
* Real-Time Vision and Speech Driven Avatars for Multimedia Applications
* Recognizing Human Emotional State From Audiovisual Signals
* Recognizing Human Emotional State From Audiovisual Signals*
* Robust Audio-Visual Speech Recognition Based on Late Integration
* Robust Image Corner Detection Based on the Chord-to-Point Distance Accumulation Technique
* Scalable 3-D Terrain Visualization Through Reversible JPEG2000-Based Blind Data Hiding
* Selection of Concept Detectors for Video Search by Ontology-Enriched Semantic Spaces
* Shot Change Detection via Local Keypoint Matching
* Spatiotemporal Motion Analysis for the Detection and Classification of Moving Targets
* Synthesis of Silhouettes and Visual Hull Reconstruction for Articulated Humans
* Using Webcast Text for Semantic Event Detection in Broadcast Sports Video
* Video Annotation Based on Kernel Linear Neighborhood Propagation
* Video Error Concealment Using Spatio-Temporal Boundary Matching and Partial Differential Equation
* Video Semantic Event/Concept Detection Using a Subspace-Based Multimedia Data Mining Framework
* Video Streaming for Mobile Video Surveillance
* Video-Based Human Movement Analysis and Its Application to Surveillance Systems
* Vision-Based Augmented-Reality System For Multiuser Collaborative Environments, A
68 for MultMed(10)

MultMed(11) * 3-D Face Detection, Landmark Localization, and Registration Using a Point Distribution Model
* Architectures for Fast Transcoding of H.264/AVC to Quality-Scalable SVC Streams
* Attack on Watermarking Method Based on Significant Difference of Wavelet Coefficient Quantization
* Bandwidth Aggregation-Aware Dynamic QoS Negotiation for Real-Time Video Streaming in Next-Generation Wireless Networks
* Blind Robust 3-D Mesh Watermarking Based on Oblate Spheroidal Harmonics
* Capacity Gain of Mixed Multicast/Unicast Transport Schemes in a TV Distribution Network
* Character Identification in Feature-Length Films Using Global Face-Name Matching
* Coherent Phrase Model for Efficient Image Near-Duplicate Retrieval
* Community Streaming With Interactive Visual Overlays: System and Optimization
* Concealment of Whole-Picture Loss in Hierarchical B-Picture Scalable Video Coding
* Content-Aware Distortion-Fair Video Streaming in Congested Networks
* Content-Based Attention Ranking Using Visual and Contextual Attention Model for Baseball Videos
* Context-Aware Person Identification in Personal Photo Collections
* Control-Theoretic Approach to Rate Control for Streaming Videos, A
* Controlling Virtual Cameras Based on a Robust Model-Free Pose Acquisition Technique
* Design of a Scalable Multicast Scheme With an Application-Network Cross-Layer Approach
* Discriminant Subspace Analysis: An Adaptive Approach for Image Classification
* Dynamic Resource Allocation for MGS H.264/AVC Video Transmission Over Link-Adaptive Networks
* Effective Annotation and Search for Video Blogs with Integration of Context and Content Analysis
* Efficient Background Subtraction and Shadow Removal for Monochromatic Video Sequences
* Efficient Mode Selection Prior to the Actual Encoding for H.264/AVC Encoder, An
* Efficient Near-Duplicate Video Shot Detection Method Using Shot-Based Interest Points, An
* Ellipsoidal Harmonics for 3-D Shape Description and Retrieval
* Event Tactic Analysis Based on Broadcast Sports Video
* Expression-Invariant Face Recognition With Constrained Optical Flow Warping
* FaceSeg: Automatic Face Segmentation for Real-Time Video
* Fast Motion Estimation on Graphics Hardware for H.264 Video Encoding
* Fast-Mesh: A Low-Delay High-Bandwidth Mesh for Peer-to-Peer Live Streaming
* Generalized PCRTT Offline Bandwidth Smoothing Based on SVM and Systematic Video Segmentation
* Hierarchical Modeling and Adaptive Clustering for Real-Time Summarization of Rush Videos
* Human Perception of Audio-Visual Synthetic Character Emotion Expression in the Presence of Ambiguous and Conflicting Information
* Image Annotation Within the Context of Personal Photo Collections Using Hierarchical Event and Scene Models
* Image Retargeting Using Mesh Parametrization
* Island Multicast: Combining IP Multicast With Overlay Data Distribution
* Joint Source Coding and Network-Supported Distributed Error Control for Video Streaming in Wireless Multihop Networks
* LayerP2P: Using Layered Video Chunks in P2P Live Streaming
* Lipreading With Local Spatiotemporal Descriptors
* Liveness Enforcing Supervision of Video Streaming Systems Using Nonsequential Petri Nets
* Low-Complexity Cross-Layer Optimization Algorithm for Video Communication Over Wireless Networks, A
* Multicue Bayesian State Estimator for Gaze Prediction in Open Signed Video, A
* Multiuser Rate Allocation Games for Multimedia Communications
* No-Reference Video Quality Monitoring for H.264/AVC Coded Video
* Novel Video Summarization Based on Mining the Story-Structure and Semantic Relations Among Concept Entities, A
* Optimal Channel Adaptation of Scalable Video Over a Multicarrier-Based Multicell Environment
* Optimal Packet Loss Protection of Progressively Compressed 3-D Meshes
* Optimized H.264 Video Encoding and Packetization for Video Transmission Over Pipeline Forwarding Networks
* Picture Collage
* Quality-Driven Cross-Layer Solution for MPEG Video Streaming Over WiMAX Networks, A
* Real-Time Near-Duplicate Elimination for Web Video Search With Content and Context
* Registration Based on Scene Recognition and Natural Features Tracking Techniques for Wide-Area Augmented Reality Systems
* Reliable Application Layer Multicast Over Combined Wired and Wireless Networks
* Rhombic Dodecahedron Map: An Efficient Scheme for Encoding Panoramic Video, The
* Robust Scaling-Based Image Watermarking Using Maximum-Likelihood Decoder With Optimum Strength Factor
* Salient Region Detection by Modeling Distributions of Color and Orientation
* Scalable Video Multicast Using Expanding Window Fountain Codes
* Scale-Invariant Visual Language Modeling for Object Categorization
* Scene Detection in Videos Using Shot Clustering and Sequence Alignment
* Segmentation-Driven Image Fusion Based on Alpha-Stable Modeling of Wavelet Coefficients
* Sketch-Based Spatial Queries for Retrieving Human Locomotion Patterns From Continuously Archived GPS Data
* Smooth Control of Adaptive Media Playout for Video Streaming
* Spatial Correlation Model for Visual Information in Wireless Multimedia Sensor Networks, A
* Sports Video Mining via Multichannel Segmental Hidden Markov Models
* Statistical Scheduling of Offline Comparative Subjective Evaluations for Real-Time Multimedia
* Structural Descriptors for Category Level Object Detection
* Support Vector Machine Approach for Detection and Localization of Transmission Errors Within Standard H.263++ Decoders, A
* Syntactic Matching of Trajectories for Ambient Intelligence Applications
* Tensor-Based Transductive Learning for Multimodality Video Semantic Concept Detection
* Trade-Offs in Bit-Rate Allocation for Wireless Video Streaming
* Unified Traffic Model for MPEG-4 and H.264 Video Traces, A
* Using RTT Variability for Adaptive Cross-Layer Approach to Multimedia Delivery in Heterogeneous Networks
* Using Visual Context and Region Semantics for High-Level Concept Detection
71 for MultMed(11)

MultMed(12) * 3-D Audio-Visual Corpus of Affective Communication, A
* 3-D Model Search and Retrieval From Range Images Using Salient Features
* Adaptation of Multimedia Presentations for Different Display Sizes in the Presence of Preferences and Temporal Constraints
* Adaptive Computational Model for Salient Object Detection, An
* Affective Audio-Visual Words and Latent Topic Driving Model for Realizing Movie Affective Scene Classification
* Affective Visualization and Retrieval for Music Video
* Authentication of Scalable Video Streams With Low Communication Overhead
* Bayesian Approach to Automated Creation of Tactile Facial Images, A
* Blind Audiovisual Source Separation Based on Sparse Redundant Representations
* Bridging the Semantic Gap Between Image Contents and Tags
* Browsing Video Along Multiple Threads
* Camera Motion-Based Analysis of User Generated Video
* Combining Context, Consistency, and Diversity Cues for Interactive Image Categorization
* Comparison of Perceptually-Based Metrics for Objective Evaluation of Geometry Processing, A
* Constructing Concept Lexica With Small Semantic Gaps
* Controlling the Bit Rate of Multi-Object Videos With Noncooperative Game Theory
* Cross-Media Alignment of Names and Faces
* Digital Cinema Watermarking for Estimating the Position of the Pirate
* Dynamic FEC Algorithms for TFRC Flows
* Efficient and Robust Algorithm for Shape Indexing and Retrieval, An
* Emotion Recognition in Text for 3-D Facial Expression Rendering
* Energy Efficient H.263 Video Transmission in Power Saving Wireless LAN Infrastructure
* Estimating Cohesion in Small Groups Using Audio-Visual Nonverbal Behavior
* Fine-Granularity Transmission Distortion Modeling for Video Packet Scheduling Over Mesh Networks
* Framework of Enhancing Image Steganography With Picture Quality Optimization and Anti-Steganalysis Based on Simulated Annealing Algorithm, A
* Image Annotation by Graph-Based Inference With Integrated Multiple/Single Instance Representations
* Image Classification With Kernelized Spatial-Context
* Image-Based Approach to Video Copy Detection With Spatio-Temporal Post-Filtering, An
* Impact of Network Dynamics on User's Video Quality: Analytical Framework and QoS Provision
* In-Image Accessibility Indication
* Information-Theoretic Analysis of Input Strokes in Visual Object Cutout
* Joint Compressive Video Coding and Analysis
* Lightweight SCTP for Partially Reliable Overlay Video Multicast Service for Mobile Terminals, A
* Low-Complexity Analytical Modeling for Cross-Layer Adaptive Error Protection in Video Over WLAN, A
* Mining Compositional Features From GPS and Visual Cues for Event Recognition in Photo Collections
* Mining Group Nonverbal Conversational Patterns Using Probabilistic Topic Models
* Multi-View Video Summarization
* Multihop Packet Delay Bound Violation Modeling for Resource Allocation in Video Streaming Over Mesh Networks
* Multimedia Quality-Driven Network Resource Management Architecture for Wireless Sensor Networks With Stream Authentication, A
* Multitransform Architecture for H.264/AVC High-Profile Coders, A
* Natural Visible and Infrared Facial Expression Database for Expression Recognition and Emotion Inference, A
* Network Awareness of P2P Live Streaming Applications: A Measurement Study
* On Energy Efficient Encryption for Video Streaming in Wireless Sensor Networks
* On the Annotation of Web Videos by Efficient Near-Duplicate Search
* Practical Online Near-Duplicate Subsequence Detection for Continuous Video Streams
* Predicting Speaker Head Nods and the Effects of Affective Information
* Real-Time Framework for Video Time and Pitch Scale Modification, A
* Real-Time Visual Concept Classification
* Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study
* Robust Block-Based Image/Video Registration Approach for Mobile Imaging Devices, A
* Robust Symbolic Dual-View Facial Expression Recognition With Skin Wrinkles: Local Versus Global Approach
* Scalable Intraband and Composite Wavelet-Based Coding of Semiregular Meshes
* Sequence Multi-Labeling: A Unified Video Annotation Scheme With Spatial and Temporal Context
* SPANC: Optimizing Scheduling Delay for Peer-to-Peer Live Streaming
* Special Issue on Multimodal Affective Interaction
* Stochastic Approach to Image Retrieval Using Relevance Feedback and Particle Swarm Optimization, A
* Synchronization of Multiple Camera Videos Using Audio-Visual Features
* System for Real-Time Multimodal Analysis of Nonverbal Affective Social Interaction in User-Centric Media, A
* Towards a Relevant and Diverse Search of Social Images
* TURINstream: A Totally pUsh, Robust, and effIcieNt P2P Video Streaming Architecture
* Video Annotation Through Search and Graph Reinforcement Mining
* Video Précis: Highlighting Diverse Aspects of Videos
* Visualizing Image Collections Using High-Entropy Layout Distributions
63 for MultMed(12)

MultMed(13) * Adaptive Context-Tree-Based Statistical Filtering for Raster Map Image Denoising
* Adaptive Learning for Target Tracking and True Linking Discovering Across Multiple Non-Overlapping Cameras
* Adaptive Resource Allocation for Layer-Encoded IPTV Multicasting in IEEE 802.16 WiMAX Wireless Networks
* Algorithm and Architecture Design of Perception Engine for Video Coding Applications
* Analysis and Exploitation of Musician Social Networks for Recommendation and Discovery
* Audiovisual Discrimination Between Speech and Laughter: Why and When Visual Information Might Help
* Automated Assembly of Shredded Pieces From Multiple Photos
* Autonomous Framework to Produce and Distribute Personalized Team-Sport Video Summaries: A Basketball Case Study, An
* Balancing Attended and Global Stimuli in Perceived Video Quality Assessment
* Bayesian Visual Reranking
* Collaborative Face Recognition for Improved Face Annotation in Personal Photo Collections Shared on Online Social Networks
* ConnectBoard: Enabling Genuine Eye Contact and Accurate Gaze in Remote Collaboration
* Connotative Space for Supporting Movie Affective Recommendation, A
* Content-Aware Display Adaptation and Interactive Editing for Stereoscopic Images
* Cooperative Layered Video Multicast Using Randomized Distributed Space Time Codes
* Cost-Sensitive Multi-Label Learning for Audio Tag Annotation and Retrieval
* Cross-Layer Optimization for Downlink Wavelet Video Transmission
* Depth Image-Based Rendering With Advanced Texture Synthesis for 3-D Video
* Editing by Viewing: Automatic Home Video Summarization by Viewing Behavior Analysis
* Effective Method for Movable Projector Keystone Correction, An
* Effective Pseudonoise Sequence and Decoding Function for Imperceptibility and Robustness Enhancement in Time-Spread Echo-Based Audio Watermarking
* Effective Semantic Annotation by Image-to-Concept Distribution Model
* Efficient Algorithms for Multi-Sender Data Transmission in Swarm-Based Peer-to-Peer Streaming Systems
* Efficient Feature Detection and Effective Post-Verification for Large Scale Near-Duplicate Image Search
* Empowering Visual Categorization With the GPU
* Enabling Composition-Based Video-Conferencing for the Home
* Energy-Efficient Multicasting of Scalable Video Streams Over WiMAX Networks
* Event-Based Semantic Image Adaptation for User-Centric Mobile Display Devices
* Exploiting Visual-Audio-Textual Characteristics for Automatic TV Commercial Block Detection and Segmentation
* Exploring Distributional Discrepancy for Multidimensional Point Set Retrieval
* Exposing Digital Image Forgeries by Detecting Discrepancies in Motion Blur
* Fast Action Detection via Discriminative Random Forest Voting and Top-K Subvolume Search
* Fast Visual Retrieval Using Accelerated Sequence Matching
* Feature-Based Sparse Representation for Image Similarity Assessment
* Flash Translation Layer for NAND Flash-Based Multimedia Storage Devices, A
* Fuzzy Clustering Algorithm for Virtual Character Animation Representation, A
* Fuzzy Similarity-Based Emotional Classification of Color Images
* Game-Theoretic Strategies and Equilibriums in Multimedia Fingerprinting Social Networks
* Geometric Invariant Audio Watermarking Based on an LCM Feature
* Guided Face Cartoon Synthesis
* High-Quality Visualization for Geographically Distributed 3-D Teleimmersive Applications
* Human Psychology of Common Appraisal: The Reddit Score
* Image Quality Assessment by Separately Evaluating Detail Losses and Additive Impairments
* Image Retagging Using Collaborative Tag Propagation
* Impact of Spectrum Sensing Frequency and Packet-Loading Scheme on Multimedia Transmission Over Cognitive Radio Networks, The
* In-Network Packet Scheduling and Rate Allocation: A Content Delivery Perspective
* Integrating Visual Saliency and Consistency for Re-Ranking Image Search Results
* Interactive 3-D Audio System With Loudspeakers, An
* Interactive Image Segmentation With Multiple Linear Reconstructions in Windows
* Introduction to the ICME2010 Special Issue
* IRS: A Detour Routing System to Improve Quality of Online Games
* Kernel Framework for Content-Based Artist Recommendation System in Music, A
* Layer-Aware Forward Error Correction for Mobile Broadcast of Layered Media
* Layered Internet Video Adaptation (LIVA): Network-Assisted Bandwidth Sharing and Transient Loss Protection for Video Streaming
* Layered Multicast With Inter-Layer Network Coding for Multimedia Streaming
* Learning Visual Contexts for Image Annotation From Flickr Groups
* Less is More: Efficient 3-D Object Retrieval With Query View Selection
* Low Complexity Sign Detection and Text Localization Method for Mobile Applications, A
* Low-Complexity Inverse Transforms of Video Codecs in an Embedded Programmable Platform
* Markup SVG: An Online Content-Aware Image Abstraction and Annotation Tool
* MIMiC: Multimodal Interactive Motion Controller
* Missing Image Data Reconstruction Based on Adaptive Inverse Projection via Sparse Representation
* MobiUP: An Upsampling-Based System Architecture for High-Quality Video Streaming on Mobile Devices
* Moving Region Segmentation From Compressed Video Using Global Motion Estimation and Markov Random Fields
* Multi-Core Platforms for Beamforming and Wave Field Synthesis
* Multi-Gesture Interaction System Using a 3-D Iris Disk Model for Gaze Estimation and an Active Appearance Model for 3-D Hand Pointing, A
* Multi-Resolution Design for Large-Scale and High-Resolution Monitoring
* Nongeometric Distortion Smoothing Approach for Depth Map Preprocessing
* Object Retrieval Using Visual Query Context
* On Complexity Modeling of H.264/AVC Video Decoding and Its Application for Energy Efficient Decoding
* On Distributed Multimedia Scheduling With Constrained Control Channels
* On-the-Fly Erasure Coding for Real-Time Video Applications
* One-Pulse FEC Coding for Robust CELP-Coded Speech Transmission Over Erasure Channels
* Online Buffer Fullness Estimation Aided Adaptive Media Playout for Video Streaming
* Online Video Stream Abstraction and Stylization
* Optimal Bandwidth Assignment for Multiple-Description-Coded Video
* Optimal Layered Video IPTV Multicast Streaming Over Mobile WiMAX Systems
* Optimizing FEC Transmission Strategy for Minimizing Delay in Lossless Sequential Streaming
* Optimizing Multi-Rate Peer-to-Peer Video Conferencing Applications
* Optimizing Visual Search Reranking via Pairwise Learning
* Perceptually Guided Fast Compression of 3-D Motion Capture Data
* Performance Evaluation of IPTV Over Wireless Home Networks
* Practical Image Quality Metric Applied to Image Coding
* Prioritized Distributed Video Delivery With Randomized Network Coding
* Probabilistic Novelty Detection for Acoustic Surveillance Under Real-World Conditions
* Rate and Distortion Modeling of CGS Coded Scalable Video Content
* Reduced-Reference Image Quality Assessment Using Reorganized DCT-Based Image Representation
* Robust Camera Calibration and Player Tracking in Broadcast Basketball Video
* Robust Luby Transform Encoding Pattern-Aware Symbol Packetization Algorithm for Video Streaming Over Wireless Network, A
* Robust Spatial Matching for Object Retrieval and Its Parallel Implementation on GPU
* Routing-Aware Multiple Description Video Coding Over Mobile Ad-Hoc Networks
* Scalable Video Multicast in Hybrid 3G/Ad-Hoc Networks
* Selection of Network Coding Nodes for Minimal Playback Delay in Streaming Overlays
* Semi-Automatic Tagging of Photo Albums via Exemplar Selection and Tag Inference
* Sensitivity Analysis of the Human Visual System for Depth Cues in Stereoscopic 3-D Displays
* Spatial Audio Object Coding With Two-Step Coding Structure for Interactive Audio Service
* Spatial Correlation-Based Image Compression Framework for Wireless Multimedia Sensor Networks, A
* Special Section on Interactive Multimedia
* Spread Spectrum Visual Sensor Network Resource Management Using an End-to-End Cross-Layer Design
* Stratification-Based Keyframe Cliques for Effective and Efficient Video Representation
* Subjective Quality Evaluation via Paired Comparison: Application to Scalable Video Coding
* Superchunk-Based Efficient Search in P2P-VoD System
* Survey of Audio-Based Music Classification and Annotation, A
* Tag Tagging: Towards More Descriptive Keywords of Image Content
* Temporal Color Consistency-Based Video Reproduction for Dichromats
* Text-Video Completion Using Structure Repair and Texture Propagation
* Touch Interface Exploiting Time-Frequency Classification Using Zak Transform for Source Localization on Solids, A
* Training Surrogate Sensors in Musical Gesture Acquisition Systems
* Two-Level Downlink Scheduling for Real-Time Multimedia Services in LTE Networks
* Unequal Error Protection Using Fountain Codes With Applications to Video Communication
* Unifying Low-Level and High-Level Music Similarity Measures
* Unsupervised Alignment of News Video and Text Using Visual Patterns and Textual Concepts
* Utilizing Related Samples to Enhance Interactive Concept-Based Video Search
* Video Inpainting on Digitized Vintage Films via Maintaining Spatiotemporal Continuity
* Virtual Contour Guided Video Object Inpainting Using Posture Mapping and Retrieval
* Web Image and Video Mining Towards Universal and Robust Age Estimator
116 for MultMed(13)

MultMed(14) * Adaptive Workload Equalization in Multi-Camera Surveillance Systems
* Advanced Hierarchical Motion Estimation Scheme With Lossless Frame Recompression and Early-Level Termination for Beyond High-Definition Video Coding, An
* Advanced IPTV Services Personalization Through Context-Aware Content Recommendation
* Aesthetics-Based Stereoscopic Photo Cropping for Heterogeneous Displays
* Affine Model Based Motion Compensation Prediction for Zoom
* Analysis and Evaluation of Adaptive LDPC AL-FEC Codes for Content Download Services
* Analytical Framework for Improving the Quality of Streaming Over TCP
* Analytical Modeling for Delay-Sensitive Video Over WLAN
* Assessment of Stereoscopic Crosstalk Perception
* Asymmetric Coding of Multi-View Video Plus Depth Based 3-D Video for View Rendering
* Automatic Light Scene Setting Through Image-Based Sparse Light Effect Approximation
* Automatic Role Recognition in Multiparty Conversations: An Approach Based on Turn Organization, Prosody, and Conditional Random Fields
* Bayesian Visual Reranking
* Bottom-Up Saliency Detection Model Based on Human Visual Sensitivity and Amplitude Spectrum
* Bridging the Semantic Gap via Functional Brain Imaging
* Causal Flow
* Content-Based Analysis Improves Audiovisual Archive Retrieval
* Content-Based Image Compression for Arbitrary-Resolution Display Devices
* Cooperation and Coalition in Multimedia Fingerprinting Colluder Social Networks
* Coordinate Live Streaming and Storage Sharing for Social Media Content Distribution
* Correlation-Aware QoS Routing With Differential Coding for Wireless Video Sensor Networks
* Cross-Layer Framework for QoS Support in Wireless Multimedia Sensor Networks
* Delay-Cognizant Interactive Streaming of Multiview Video With Free Viewpoint Synthesis
* Depth Video Coding Using Adaptive Geometry Based Intra Prediction for 3-D Video Systems
* Design and Synthesis for Multimedia Systems Using the Targeted Dataflow Interchange Format
* Difficulty Guided Image Retrieval Using Linear Multiple Feature Embedding
* Discovering Image Semantics in Codebook Derivative Space
* Discriminating Joint Feature Analysis for Multimedia Data Understanding
* Dynamic Sub-GOP Forward Error Correction Code for Real-Time Video Applications
* Effective Codebooks for Human Action Representation and Classification in Unconstrained Videos
* Efficient and Rate-Distortion Optimal Wavelet Packet Basis Selection in JPEG2000
* Efficient Frame Concealment for Depth Image-Based 3-D Video Transmission
* Efficient Genre-Specific Semantic Video Indexing
* Efficient Parallel Framework for H.264/AVC Deblocking Filter on Many-Core Platform
* Efficient Video Coding Using Legacy Algorithmic Approaches
* Energy-Efficient Resource Allocation and Scheduling for Multicast of Scalable Video Over Wireless Networks
* Enhanced 3-D Modeling for Landmark Image Classification
* Enhanced Bag-of-Visual Word Vector Space Model to Represent Visual Content in Athletics Images, An
* Error Weighted Semi-Coupled Hidden Markov Model for Audio-Visual Emotion Recognition
* Event Driven Web Video Summarization by Tag Localization and Key-Shot Identification
* Exploring Contextual Redundancy in Improving Object-Based Video Coding for Video Sensor Networks Surveillance
* Exploring Locality of Reference in P2P VoD Systems
* Fast and Accurate Video Semantic-Indexing System Using Fast MAP Adaptation and GMM Supervectors, A
* Fast Dynamic Range Compression With Local Contrast Preservation Algorithm and Its Application to Real-Time Video Enhancement, A
* Fast Mode Decision for H.264/AVC Based on Rate-Distortion Clustering
* Feature Combination in Kernel Space for Distance Based Image Hashing
* Finding Celebrities in Billions of Web Images
* Frame Rate Optimization Framework for Improving Continuity in Video Streaming, A
* Gammatone Cepstral Coefficients: Biologically Inspired Features for Non-Speech Audio Classification
* Generic Framework for Video Annotation via Semi-Supervised Learning, A
* Global 1-Mbps Peer-Assisted Streaming: Fine-Grain Measurement of a Configurable Platform
* Harvesting Social Images for Bi-Concept Search
* Hidden-Concept Driven Multilabel Image Annotation and Label Ranking
* HodgeRank on Random Graphs for Subjective Video Quality Assessment
* Hybrid Algorithm for Effective Lossless Compression of Video Display Frames, A
* Interactive Video Indexing With Statistical Active Learning
* Introduction to the ICME 2011 Special Issue
* Introduction to the Special Section on Smart, Social, and Converged TV
* Investigating the Effects of Multiple Factors Towards More Accurate 3-D Object Retrieval
* Joint Demosaicing and Subpixel-Based Down-Sampling for Bayer Images: A Fast Frequency-Domain Analysis Approach
* Joint Source-Channel Coding and Optimization for Layered Video Broadcasting to Heterogeneous Devices
* Kernel Cross-Modal Factor Analysis for Information Fusion With Application to Bimodal Emotion Recognition
* Large-Scale Vehicle Detection, Indexing, and Search in Urban Surveillance Videos
* Learn to Personalized Image Search From the Photo Sharing Websites
* Learn2Dance: Learning Statistical Music-to-Dance Mappings for Choreography Synthesis
* Learn2Dance: Learning Statistical Music-to-Dance Mappings for Choreography Synthesis
* Learning Hierarchical Semantic Description Via Mixed-Norm Regularization for Image Understanding
* Learning Semantics From Multimedia Web Resources: An Introduction to the Special Issue
* Long-Term Incremental Web-Supervised Learning of Visual Concepts via Random Savannas
* Low-Complexity Video Quality Assessment Using Temporal Quality Variations
* Low-Decoding-Latency Buffer Compression for Graphics Processing Units
* Low-Delay Peer-To-Peer Media Streaming Based on Network Coding Over Randomized Multicast Trees
* Low-Latency Video Streaming With Congestion Control in Mobile Ad-Hoc Networks
* Managing Digital Rights for P2P Live Broadcast and Recording on the Internet
* Matrix-Based Approach to Unsupervised Human Action Categorization, A
* Model-Based Shot Boundary Detection Technique Using Frame Transition Parameters, A
* Movie2Comics: Towards a Lively Video Content Presentation
* Moving Object Detection and Tracking Using a Spatio-Temporal Graph in H.264/AVC Bitstreams for Video Surveillance
* Multi-Camera Approach to Image-Based Rendering and 3-D/Multiview Display of Ancient Chinese Artifacts, A
* Multimodal Video Indexing and Retrieval Using Directed Information
* Multipitch Estimation of Piano Music by Exemplar-Based Sparse Representation
* Multiple Description of Coded Video for Path Diversity Streaming Adaptation
* Nonrigid Structure-From-Motion From 2-D Images Using Markov Chain Monte Carlo
* Nonverbal Behavior Approach to Identify Emergent Leaders in Small Groups, A
* Normalized Energy Density-Based Forensic Detection of Resampled Images
* Novel Large-Scale Digital Forensics Service Platform for Internet Videos, A
* Novel Multiple Kernel Learning Framework for Heterogeneous Feature Fusion and Variable Selection, A
* Object Co-Segmentation Based on Shortest Path Algorithm and Saliency Model
* Optimizing Selective ARQ for H.264 Live Streaming: A Novel Method for Predicting Loss-Impact in Real Time
* P2P-Based IPTV Services: Design, Deployment, and QoE Measurement
* Parallel Lasso for Large-Scale Video Concept Detection
* Path Modeling and Retrieval in Distributed Video Surveillance Databases
* Photo Stream Alignment and Summarization for Collaborative Photo Collection and Sharing
* Preference-Aware View Recommendation System for Scenic Photos Based on Bag-of-Aesthetics-Preserving Features
* Pricing and Investment for Online TV Content Platforms
* Privacy Enabled Digital Rights Management Without Trusted Third Party Assumption
* Probabilistic Motion Diffusion of Labeling Priors for Coherent Video Segmentation
* Prototype-Based Image Search Reranking
* Purposive Hidden-Object-Game: Embedding Human Computation in Popular Game
* QoE Prediction Model and its Application in Video Quality Adaptation Over UMTS Networks
* Quality-Centric TCP-Friendly Congestion Control for Multimedia Transmission, A
* Quantitative Characterization of Semantic Gaps for Learning Complexity Estimation and Inference Model Selection
* Query Difficulty Prediction for Web Image Search
* Reading Users' Minds From Their Eyes: A Method for Implicit Image Annotation
* Real-Time Head and Hand Tracking Based on 2.5D Data
* Recommender System for Sport Videos Based on User Audiovisual Consumption
* Reducing DRAM Image Data Access Energy Consumption in Video Processing
* Rhythm of Motion Extraction and Rhythm-Based Cross-Media Alignment for Dance Videos
* Robust Face-Name Graph Matching for Movie Character Identification
* Robust Image Coding Based Upon Compressive Sensing
* Robust Watermarking of Compressed and Encrypted JPEG2000 Images
* Robustly Extracting Captions in Videos Based on Stroke-Like Edges and Spatio-Temporal Analysis
* S3-MKL: Scalable Semi-Supervised Multiple Kernel Learning for Real-World Image Applications
* Sampling and Ontologically Pooling Web Images for Visual Concept Learning
* Scalable Comic-Like Video Summaries and Layout Disturbance
* Search and Retrieval of Rich Media Objects Supporting Multiple Multimodal Queries
* Secure and Efficient Authentication Scheme for Access Control in Mobile Pay-TV Systems, A
* Semantic Model Vectors for Complex Video Event Recognition
* Single Image Realism Assessment and Recoloring by Color Compatibility
* Sketch-Based Annotation and Visualization in Video Authoring
* Sliding-Window Designs for Vertex-Based Shape Coding
* Sparse Ensemble Learning for Concept Detection
* Structure Tensor Series-Based Large Scale Near-Duplicate Video Retrieval
* Summarizing Rushes Videos by Motion, Object, and Event Understanding
* Tag-Based Image Retrieval Improved by Augmented Features and Group-Based Refinement
* Tennis Real Play
* Throughput Scaling of Convolution for Error-Tolerant Multimedia Applications
* Towards Cross-Version Harmonic Analysis of Music
* Towards Scalable Summarization of Consumer Videos Via Sparse Dictionary Selection
* Understanding Kin Relationships in a Photo
* Unsupervised Salient Object Segmentation Based on Kernel Density Estimation and Two-Phase Graph Cut
* Unsupervised Semantic Feature Discovery for Image Object Retrieval and Tag Refinement
* User-Aware Image Tag Refinement via Ternary Semantic Analysis
* Video Completion Using Bandlet Transform
* Visual Sentences for Pose Retrieval Over Low-Resolution Cross-Media Dance Collections
* Visually Summarizing Web Pages Through Internal and External Images
* Weakly Supervised Graph Propagation Towards Collective Image Parsing
* Web Image Annotation Via Subspace-Sparsity Collaborated Feature Selection
* Web Video Geolocation by Geotagged Social Resources
* Web-Based Classifiers for Human Action Recognition
* Wireless H.264 Video Quality Enhancement Through Optimal Prioritized Packet Fragmentation
141 for MultMed(14)

MultMed(15) * Access Point-Based FEC Mechanism for Video Transmission Over Wireless LANs, An
* Active Bucket Categorization for High Recall Video Retrieval
* Adaptive Cloud Downloading Service, An
* Adaptive Mobile Cloud Computing to Enable Rich Mobile Multimedia Applications
* Aesthetic Image Enhancement by Dependence-Aware Object Recomposition
* Affective Labeling in a Content-Based Recommender System for Images
* AMES-Cloud: A Framework of Adaptive Mobile Video Streaming and Efficient Social Video Sharing in the Clouds
* Appearance-Based QR Code Beautifier
* Attribute-Based Access to Scalable Media in Cloud-Assisted Content Sharing Networks
* Automatic Training Image Acquisition and Effective Feature Selection From Community-Contributed Photos for Facial Attribute Detection
* Beyond Text QA: Multimedia Answer Generation by Harvesting Web Information
* Bootstrapping Visual Categorization With Relevant Negatives
* Branch and Data Herding: Reducing Control and Memory Divergence for Error-Tolerant GPU Applications
* Capacity Management of Seed Servers in Peer-to-Peer Streaming Systems With Scalable Video Streams
* Casual Stereoscopic Photo Authoring
* Cloud-Based Image Coding for Mobile Devices: Toward Thousands to One Compression
* CloudMoV: Cloud-Based Mobile Social TV
* Co-Salient Object Detection From Multiple Images
* Collusion-Resistant Conditional Access System for Flexible-Pay-Per-Channel Pay-TV Broadcasting, A
* Compressing 3D Trees With Rendering Efficiency Based on Differential Data
* Compressive Video Streaming: Design and Rate-Energy-Distortion Analysis
* Connectivity, Online Social Capital, and Mood: A Bayesian Nonparametric Analysis
* Consistent Stereo Matching Under Varying Radiometric Conditions
* Content-Based Photo Quality Assessment
* Context-Aware Video Retargeting via Graph Model
* Continuous Birdsong Recognition Using Gaussian Mixture Modeling of Image Shape Features
* Cooperative Delivery Techniques to Support Video-on-Demand Service in IPTV Networks
* Correspondence Matching of Multi-View Video Sequences Using Mutual Information Based Similarity Measure
* Crowdsourcing Multimedia QoE Evaluation: A Trusted Framework
* Cube2Video: Navigate Between Cubic Panoramas in Real-Time
* Design QoS-Aware Multi-Path Provisioning Strategies for Efficient Cloud-Assisted SVC Video Streaming to Heterogeneous Clients
* Differential Coding-Based Scheduling Framework for Wireless Multimedia Sensor Networks, A
* Directive Contrast Based Multimodal Medical Image Fusion in NSCT Domain
* Discovering Video Shot Categories by Unsupervised Stochastic Graph Partition
* Downlink Power Control for Multi-User VBR Video Streaming in Cellular Networks
* Edge-Preserving Texture Suppression Filter Based on Joint Filtering Schemes
* Effective CU Size Decision Method for HEVC Encoders, An
* Effective Multiple Feature Hashing for Large-Scale Near-Duplicate Video Retrieval
* Efficient Fine-Granular Scalable Coding of 3D Mesh Sequences
* Efficient Resource Provisioning and Rate Selection for Stream Mining in a Community Cloud
* Emotional Accompaniment Generation System Based on Harmonic Progression
* Empirical Model of Multiview Video Coding Efficiency for Wireless Multimedia Sensor Networks, An
* Energy and Quality-Aware Multimedia Signal Processing
* Error Tolerant Multimedia Stream Processing: There's Plenty of Room at the Top (of the System Stack)
* Example-Based Color Transfer for Gradient Meshes
* Example-Based Super-Resolution With Soft Information and Decision
* Exploiting Semantic and Visual Context for Effective Video Annotation
* Face Expression Recognition by Cross Modal Data Association
* Fairness Resource Allocation in Blind Wireless Multimedia Communications
* Fast and Efficient Transcoding Based on Low-Complexity Background Modeling and Adaptive Block Classification
* Fast Intra-Coding for H.264/AVC by Using Projection-Based Predicted Block Residuals
* FAST Rate Allocation for JPEG2000 Video Transmission Over Time-Varying Channels
* Feature Processing and Modeling for 6D Motion Gesture Recognition
* Feature Selection for Multimedia Analysis by Sharing Information Among Multiple Tasks
* Fluorescence Tomography Reconstruction With Simultaneous Positron Emission Tomography Priors
* From Logo to Object Segmentation
* Fully Automatic and Frame-Accurate Video Synchronization Using Bitrate Sequences
* Generating Visual Summaries of Geographic Areas Using Community-Contributed Images
* GPS Estimation for Places of Interest From Social Users' Uploaded Photos
* GPS/HPS-and Wi-Fi Fingerprint-Based Location Recognition for Check-In Applications Over Smartphones in Cloud-Based LBSs
* GPU-Accelerated Real-Time Tracking of Full-Body Motion With Multi-Layer Search
* Gram-Based String Paradigm for Efficient Video Subsequence Search, A
* Graph-Based Topic-Focused Retrieval in Distributed Camera Network
* Group Delay Based Methods for Speaker Segregation and its Application in Multimedia Information Retrieval
* Guest Editorial for Special Section on Multimodal Biomedical Imaging: Algorithms and Applications
* Guest Editorial: Special section on cloud-based mobile media: Infrastructure, services, and applications
* Guest Editorial: Special Section on New Software/Hardware Paradigms for Error-Tolerant Multimedia Systems
* Hessian Regularized Support Vector Machines for Mobile Image Annotation on the Cloud
* Image Re-Attentionizing
* Inferring Contexts From Facebook Interactions: A Social Publicity Scenario
* Integrating Non-Repetitive LT Encoders With Modified Distribution to Achieve Unequal Erasure Protection
* Integration of Multivariate Data Streams With Bandpower Signals
* Interaction Design for Mobile Visual Search
* Interactive Multimodal Visual Search on Mobile Device
* Interactive Multiview Video System With Low Complexity 2D Look Around at Decoder
* Interactive Schematic Summaries for Faceted Exploration of Surveillance Video
* Joint Bit Allocation and Rate Control for Coding Multi-View Video Plus Depth Based 3D Video
* Joint Multimodal Group Analysis Framework for Modeling Corticomuscular Activity, A
* Joint Social and Content Recommendation for User-Generated Videos in Online Social Network
* Joint Spatio-Temporal Alignment of Sequences
* JPIP Proxy Server With Prefetching Strategies Based on User-Navigation Model and Semantic Map
* Just Noticeable Difference Estimation for Images With Free-Energy Principle
* Kinect-Like Depth Data Compression
* Latent Mixture of Discriminative Experts
* Learning a Contextual Multi-Thread Model for Movie/TV Scene Segmentation
* Learning Crowdsourced User Preferences for Visual Summarization of Image Collections
* Learning Query-Specific Distance Functions for Large-Scale Web Image Search
* Learning Semantic Signatures for 3D Object Retrieval
* Learning to Distribute Vocabulary Indexing for Scalable Visual Search
* Learning to Photograph: A Compositional Perspective
* Learning to Produce 3D Media From a Captured 2D Video
* Learning to Reassemble Shredded Documents
* Linking Brain Responses to Naturalistic Music Through Analysis of Ongoing EEG and Stimulus Features
* Local Disparity Estimation With Three-Moded Cross Census and Advanced Support Weight
* Localization of Taps on Solid Surfaces for Human-Computer Touch Interfaces
* Low-Complexity Bit-Plane Entropy Coding and Rate Control for 3-D DWT Based Video Coding, A
* Low-Cost Eye Gaze Prediction System for Interactive Networked Video Streaming
* LP-SR: Approaching Optimal Storage and Retrieval for Video-on-Demand
* Markov Decision Process Based Energy-Efficient On-Line Scheduling for Slice-Parallel Video Decoders on Multicore Systems
* Measurement and Modeling of Video Watching Time in a Large-Scale Internet Video-on-Demand System
* Message Passing Matching Dynamics for Overlapping Point Identification
* Mixed Reality Virtual Clothes Try-On System, A
* Mode Decision-Based Algorithm for Complexity Control in H.264/AVC
* Modeling and Analysis of Skype Video Calls: Rate Control and Video Quality
* Modeling Functional Roles Dynamics in Small Group Interactions
* Modeling of Driver Behavior in Real World Scenarios Using Multiple Noninvasive Sensors
* Monitoring of Tumor Response to Au Nanorod-Indocyanine Green Conjugates Mediated Therapy With Fluorescence Imaging and Positron Emission Tomography
* MSIDX: Multi-Sort Indexing for Efficient Content-Based Image Search and Retrieval
* Multi-Feature Fusion via Hierarchical Regression for Multimedia Analysis
* Multimedia Event Detection Using A Classifier-Specific Intermediate Representation
* Multimedia Fusion With Mean-Covariance Analysis
* Multimedia Information Retrieval Based on Late Semantic Fusion Approaches: Experiments on a Wikipedia Image Collection
* Multimodal Analysis for Identification and Segmentation of Moving-Sounding Objects
* Multimodal Approach to Speaker Diarization on TV Talk-Shows, A
* Multimodal Photoacoustic Tomography
* Multimodal Saliency and Fusion for Movie Summarization Based on Aural, Visual, and Textual Attention
* NetClust: A Framework for Scalable and Pareto-Optimal Media Server Placement
* Network and Device Aware QoS Approach for Cloud-Based Mobile Streaming, A
* Network Coding Meets Multimedia: A Review
* New Fast Encoding Algorithm Based on an Efficient Motion Estimation Process for the Scalable Video Coding Standard, A
* Non-Parametric Super-Resolution Using a Bi-Sensor Camera
* On a Highly Efficient RDO-Based Mode Decision Pipeline Design for AVS
* On the Investigation of Cloud-Based Mobile Media Environments with Service-Populating and QoS-Aware Mechanisms
* On-Device Mobile Visual Location Recognition by Integrating Vision and Inertial Sensors
* Online Allocation of Communication and Computation Resources for Real-Time Multimedia Services
* Optimization Framework for QoS-Enabled Adaptive Video Streaming Over OpenFlow Networks, An
* Optimizing Cloud Resources for Delivering IPTV Services Through Virtualization
* Patch-Based Image Warping for Content-Aware Retargeting
* Personal Clothing Retrieval on Photo Collections by Color and Attributes
* Preserving Motion-Tolerant Contextual Visual Saliency for Video Resizing
* Proxy-Based Multi-Stream Scalable Video Adaptation Over Wireless Networks Using Subjective Quality and Rate Models
* QoE-Driven Cache Management for HTTP Adaptive Bit Rate Streaming Over Wireless Networks
* Quantitative Model and Analysis of Information Confusion in Social Networks, A
* Quantitative Study of Music Listening Behavior in a Social and Affective Context
* Query-Adaptive Image Search With Hash Codes
* Query-Document-Dependent Fusion: A Case Study of Multimodal Music Retrieval
* Raptor Codes Based Unequal Protection for Compressed Video According to Packet Priority
* Real-Time, Full 3-D Reconstruction of Moving Foreground Objects From Multiple Consumer Depth Cameras
* Reduced-Reference Image Quality Assessment with Visual Information Fidelity
* Reversible Data Hiding With Optimal Value Transfer
* Review of Recent Advances in Registration Techniques Applied to Minimally Invasive Therapy, A
* Robust and Energy Efficient Multimedia Systems via Likelihood Processing
* Robust and Scalable Visual Category and Action Recognition System Using Kernel Discriminant Analysis With Spectral Regression, A
* Robust Part-Based Hand Gesture Recognition Using Kinect Sensor
* Robust Spatial Consistency Graph Model for Partial Duplicate Image Retrieval
* Robust Technique for Motion-Based Video Sequences Temporal Alignment, A
* Saliency Detection Model Using Low-Level Features Based on Wavelet Transform, A
* Scalable Content-Based Music Retrieval Using Chord Progression Histogram and Tree-Structure LSH
* Scalable Face Image Retrieval Using Attribute-Enhanced Sparse Codewords
* Scalable Precision Analysis Framework, A
* Scalable Resource Allocation for SVC Video Streaming Over Multiuser MIMO-OFDM Networks
* Script-to-Movie: A Computational Framework for Story Movie Composition
* Segmentation and Rectification of Pictures in the Camera-Captured Images of Printed Documents
* Self-Learning Approach to Single Image Super-Resolution, A
* Sensing Trending Topics in Twitter
* Sequential Error Concealment for Video/Images by Sparse Linear Prediction
* Shape Similarity Analysis by Self-Tuning Locally Constrained Mixed-Diffusion
* Simplification Resilient LDPC-Coded Sparse-QIM Watermarking for 3D-Meshes
* Smooth Nonnegative Matrix Factorization for Unsupervised Audiovisual Document Structuring
* Speaking Effect Removal on Emotion Recognition From Facial Expressions Based on Eigenface Conversion
* Spectral Hashing With Semantically Consistent Graph for Image Indexing
* Style Transfer Via Image Component Analysis
* Toward Blind Scheduling in Mobile Media Cloud: Fairness, Simplicity, and Asymptotic Optimality
* Towards Cross-Domain Learning for Social Video Popularity Prediction
* Tracking Human Under Occlusion Based on Adaptive Multiple Kernels With Projected Gradients
* Tracking Large-Scale Video Remix in Real-World Events
* Transcranial Ultrasound and Magnetic Resonance Image Fusion With Virtual Navigator
* Travel Recommendation by Mining People Attributes and Travel Group Types From Community-Contributed Photos
* Two-Level Hierarchical Alignment for Semi-Coupled HMM-Based Audiovisual Emotion Recognition With Temporal Course
* Understanding the Characteristics of Internet Short Video Sharing: A YouTube-Based Measurement Study
* Understanding the External Links of Video Sharing Sites: Measurement and Analysis
* Unsupervised Hierarchical Feature Learning Framework for One-Shot Image Recognition, An
* Video Aesthetic Quality Assessment by Temporal Integration of Photo- and Motion-Based Features
* Video Error Concealment Using a Computation-Efficient Low Saliency Prior
* Video-to-Shot Tag Propagation by Graph Sparse Group Lasso
* VideoPuzzle: Descriptive One-Shot Video Composition
* Visual Speech Synthesis Using a Variable-Order Switching Shared Gaussian Process Dynamical Model
* Visually Favorable Tone-Mapping With High Compression Performance in Bit-Depth Scalable Video Coding
* Web Multimedia Object Classification Using Cross-Domain Correlation Knowledge
* YouTube Lens: Crowdsourced Personality Impressions and Audiovisual Analysis of Vlogs, The
180 for MultMed(15)

MultMed(16) * 3-D Interfaces to Improve the Performance of Visual Known-Item Search
* Accelerating Index-Based Audio Identification
* Acceptability-Based QoE Models for Mobile Video
* Accurate and Robust Range Image Registration Algorithm for 3D Object Modeling, An
* Adaptive Learning for Celebrity Identification With Video Context
* Adaptive Mechanism for Optimal Content Download in Wireless Networks, An
* Adaptive Thread Scheduling Mechanism With Low-Power Register File for Mobile GPUs, An
* Adaptive Watermarking and Tree Structure Based Image Quality Estimation
* Advanced Moving Object Detection Algorithm for Automatic Traffic Monitoring in Real-World Limited Bandwidth Networks, An
* Analysis and Predictive Modeling of Body Language Behavior in Dyadic Interactions From Multimodal Interlocutor Cues
* Analysis of Buffer Starvation With Application to Objective QoE Optimization of Streaming Services
* Analytical Approach for Voice Capacity Estimation Over WiFi Network Using ITU-T E-Model, An
* Assessment of Learned Score Features for Modeling Expressive Dynamics in Music, An
* Asymmetric Pruning for Learning Cascade Detectors
* Atmospheric Perspective Effect Enhancement of Landscape Photographs Through Depth-Aware Contrast Manipulation
* Audio Properties of Perceived Boundaries in Music
* Augmenting Image Descriptions Using Structured Prediction Output
* Automatic Estimation of Multiple Motion Fields From Video Sequences Using a Region Matching Based Approach
* Automatic Human Mocap Data Classification
* Bag-of-Importance Model With Locality-Constrained Coding Based Feature Learning for Video Summarization, A
* Band Codes for Energy-Efficient Network Coding With Application to P2P Mobile Streaming
* Best Practices for QoE Crowdtesting: QoE Assessment With Crowdsourcing
* BM25 With Exponential IDF for Instance Search
* Broadcasting Oneself: Visual Discovery of Vlogging Styles
* CAVVA: Computational Affective Video-in-Video Advertising
* CBM: Online Strategies on Cost-Aware Buffer Management for Mobile Video Streaming
* Channel Time Allocation PSO for Gigabit Multimedia Wireless Networks
* Classification of Cinematographic Shots Using Lie Algebra and its Application to Complex Event Recognition
* Cloud Mobile Media: Reflections and Outlook
* Coding Structure and Replication Optimization for Interactive Multiview Video Streaming
* Comprehensive Study Over VLAD and Product Quantization in Large-Scale Image Retrieval, A
* Compressing Encrypted Images With Auxiliary Information
* Conceptlets: Selective Semantics for Classifying Video Events
* Concurrent Single-Label Image Classification and Annotation via Efficient Multi-Layer Group Sparse Coding
* Content-Based Prediction of Movie Style, Aesthetics, and Affect: Data Set and Baseline Experiments
* Contextual Object Detection With Spatial Context Prototypes
* Contextual Query Expansion for Image Retrieval
* Corpus Development for Affective Video Indexing
* Correlation-Aware Packet Scheduling in Multi-Camera Networks
* Corruptive Artifacts Suppression for Example-Based Color Transfer
* Creating Experts From the Crowd: Techniques for Finding Workers for Difficult Tasks
* Creating the Sydney York Morphological and Acoustic Recordings of Ears Database
* Cross-Modal Approach for Extracting Semantic Relationships Between Concepts Using Tagged Images, A
* Data-Driven Approach for Facial Expression Retargeting in Video, A
* Depth-Based Multiview Distributed Video Coding
* Depth-Discrepancy-Compensated Inter-Prediction With Adaptive Segment Management for Multiview Depth Video Coding
* Discrete Cosine Transform Locality-Sensitive Hashes for Face Retrieval
* Discriminative Soft Bag-of-Visual Phrase for Mobile Landmark Recognition
* Discriminative Structure Learning for Semantic Concept Detection With Graph Embedding
* Distortion-Fair Cross-Layer Resource Allocation for Scalable Video Transmission in OFDMA Wireless Networks
* Distributed QoS Architectures for Multimedia Streaming Over Software Defined Networks
* Distributed Rate Allocation in Inter-Session Network Coding
* Distributed Scheduling for Low-Delay and Loss-Resilient Media Streaming With Network Coding
* Dynamic Load Balancing for Real-Time Video Encoding on Heterogeneous CPU+GPU Systems
* Dynamic Request Redirection and Elastic Service Scaling in Cloud-Centric Media Networks
* Dynamic Texture Recognition Using Multiscale Binarized Statistical Image Features
* Effective Results Ranking for Mobile Query by Singing/Humming Using a Hybrid Recommendation Mechanism
* Effective Video Retargeting With Jittery Assessment
* Efficient H.264/AVC Video Coding with Adaptive Transforms
* Efficient MRF Energy Propagation for Video Segmentation via Bilateral Filters
* Efficient Multi-View Generation Method From a Single-View Video Based on Affine Geometry Information, An
* Efficient Patch-Wise Non-Uniform Deblurring for a Single Image
* Efficient Viewer-Centric Depth Adjustment Based on Virtual Fronto-Parallel Planar Projection in Stereo 3D Images
* Enabling Geometry-Based 3-D Tele-Immersion With Fast Mesh Compression and Linear Rateless Coding
* Example-Based Human Motion Extrapolation and Motion Repairing Using Contour Manifold
* Example-Based Video Stereolization With Foreground Segmentation and Depth Propagation
* Exploiting Click Constraints and Multi-view Features for Image Re-ranking
* Exploiting Web Images for Semantic Video Indexing Via Robust Sample-Specific Loss
* Extracting Primary Objects by Video Co-Segmentation
* Face Distortion Recovery Based on Online Learning Database for Conversational Video
* Fashion Parsing With Weak Color-Category Labels
* Fast HEVC Inter CU Selection Method Based on Pyramid Motion Divergence, A
* Fast Single Image Super-Resolution via Self-Example Learning and Sparse Representation
* Gaze-Based Relevance Feedback for Realizing Region-Based Image Retrieval
* Generalized Equalization Model for Image Enhancement
* Generative Model for Concurrent Image Retrieval and ROI Segmentation, A
* Glottal and Vocal Tract Characteristics of Voice Impersonators
* Guest Editorial: Special Section on Music Data Mining
* Guest Editorial: Special Section on Socio-Mobile Media Analysis and Retrieval
* H.264 High-Profile Intra-Prediction with Adaptive Selection Between the Parallel and Pipelined Executions of Prediction Modes, An
* Hire me: Computational Inference of Hirability in Employment Interviews Based on Nonverbal Behavior
* Illumination Robust Video Foreground Prediction Based on Color Recovering
* Image Alignment by Piecewise Planar Region Matching
* Image Attribute Adaptation
* Image Relevance Prediction Using Query-Context Bag-of-Object Retrieval Model
* Image Similarity Using Sparse Representation and Compression Distance
* Impact of Random and Burst Packet Losses on H.264 Scalable Video Coding
* In-Network Quality Optimization for Adaptive Video Streaming Services
* Instant Mobile Video Search With Layered Audio-Video Indexing and Progressive Transmission
* Intent-Aware Video Search Result Optimization
* Interference Reduction in Reverberant Speech Separation With Visual Voice Activity Detection
* Interruption Probability of Wireless Video Streaming With Limited Video Lengths
* Investigating Redundant Internet Video Streaming Traffic on iOS Devices: Causes and Solutions
* Iterative Pricing-Based Rate Allocation for Video Streams With Fluctuating Bandwidth Availability
* Joint Sampling Rate and Bit-Depth Optimization in Compressive Video Sampling
* Kernel-Based MMSE Multimedia Signal Reconstruction and Its Application to Spatial Error Concealment
* Layered Wireless Video Relying on Minimum-Distortion Inter-Layer FEC Coding
* Learning Effective Event Models to Recognize a Large Number of Human Actions
* Learning High-Level Feature by Deep Belief Networks for 3-D Model Retrieval and Recognition
* Learning Salient Features for Speech Emotion Recognition Using Convolutional Neural Networks
* Loss-Resilient Coding of Texture and Depth for Free-Viewpoint Video Conferencing
* Low Complexity Adaptive View Synthesis Optimization in HEVC Based 3D Video Coding
* Low Transmission Overhead Framework of Mobile Visual Search Based on Vocabulary Decomposition, A
* Low-Complexity Packet Scheduling Algorithms for Streaming Scalable Media Based on Time Utility Function
* Mining Crowdsourced First Impressions in Online Social Video
* Mobile Landmark Search with 3D Models
* Model-Assisted Cross-Layer Design of an Energy-Efficient Mobile Video Cloud, A
* Motion Vector Recovery for Video Error Concealment by Using Iterative Dynamic-Programming Optimization
* MRF-Based Fast HEVC Inter CU Decision With the Variance of Absolute Differences
* Multi-Array Camera Disparity Enhancement
* Multi-Label Learning With Fused Multimodal Bi-Relational Graph
* Multi-Objective Optimization for Multimodal Visualization
* Multi-Source-Driven Asynchronous Diffusion Model for Video-Sharing in Online Social Networks
* Multimodal Interactive Continuous Scoring of Subjective 3D Video Quality of Experience
* Multipath Video Real-Time Streaming by Field-Based Anycast Routing
* Near-Duplicate Subsequence Matching Between the Continuous Stream and Large Video Dataset
* New Reference Frame Recompression Algorithm and Its VLSI Architecture for UHDTV Video Codec, A
* Noise Robust Face Hallucination via Locality-Constrained Representation
* Non-Blind Structure-Preserving Substitution Watermarking of H.264/CAVLC Inter-Frames
* Non-Rigid Structure-From-Motion With Uniqueness Constraint and Low Rank Matrix Fitting Factorization
* Normalized Correlation-Based Quantization Modulation for Robust Watermarking
* Novel Efficient HEVC Decoding Solution on General-Purpose Processors
* On a Hashing-Based Enhancement of Source Separation Algorithms Over Finite Fields With Network Coding Perspectives
* On Designing Paired Comparison Experiments for Subjective Multimedia Quality Assessment
* On the Quality of Service of Cloud Gaming Systems
* Online HodgeRank on Random Graphs for Crowdsourceable QoE Evaluation
* Optimized Motion Energy Estimation for Group of Pictures in Multi-Level Error Protection of H.264/AVC Video Bitstreams
* ParCast+: Parallel Video Unicast in MIMO-OFDM WLANs
* Parsing the Hand in Depth Images
* Per-Cluster Ensemble Kernel Learning for Multi-Modal Image Clustering With Group-Dependent Feature Selection
* Person Identity Label Propagation in Stereo Videos
* Personalized Geo-Specific Tag Recommendation for Photos on Social Websites
* Physical Metaphor for Streaming Media Retargeting
* PicWords: Render a Picture by Packing Keywords
* Point Cloud Encoding for 3D Building Model Retrieval
* Point of Interest Detection and Visual Distance Estimation for Sensor-Rich Video
* Post-Processing for Blocking Artifact Reduction Based on Inter-Block Correlation
* Predicting Failing Queries in Video Search
* Prior-Free Weighting Scheme for Binary Code Ranking, A
* Prototype-Based Modeling for Facial Expression Analysis
* Quaternionic Signal Processing Techniques for Automatic Evaluation of Dance Performances From MoCap Data
* Random Network Coding for Multimedia Delivery Services in LTE/LTE-Advanced
* Rate-Distortion Optimized Mode Switching for Error-Resilient Multi-View Video Plus Depth Based 3-D Video Coding
* Recursive On-Line 2D PCA and Its Application to Long-Term Background Subtraction
* Reducing Operational Costs in Cloud Social TV: An Opportunity for Cloud Cloning
* Regularity Preserved Superpixels and Supervoxels
* Relevant Window-Based Bitmap Compression in P2P Systems: Framework and Solution
* Representative Discovery of Structure Cues for Weakly-Supervised Image Segmentation
* Resource Allocation for Personalized Video Summarization
* Reversible Data Hiding in Encrypted JPEG Bitstream
* Robust Multi-Speaker Tracking via Dictionary Learning and Identity Modeling
* Robust Semi-Automatic Depth Map Generation in Unconstrained Images and Video Sequences for 2D to Stereoscopic 3D Conversion
* Scalable Mobile Visual Classification by Kernel Preserving Projection Over High-Dimensional Features
* Screen Content Coding Based on HEVC Framework
* Self-Learning Based Image Decomposition With Applications to Single Image Denoising
* Self-Sorting Map: An Efficient Algorithm for Presenting Multimedia Data in Structured Layouts
* Semi-Supervised Multiple Feature Analysis for Action Recognition
* Similarity Assessment Model for Chinese Sign Language Videos
* Simple and Efficient Re-Scrambling Scheme for DTV Programs, A
* Simple Method to Determine if a Music Information Retrieval System is a Horse, A
* Simultaneous-Speaker Voice Activity Detection and Localization Using Mid-Fusion of SVM and HMMs
* Social Image Analysis From a Non-IID Perspective
* Socialized Mobile Photography: Learning to Photograph With Social Context via Mobile Devices
* Solving a Special Type of Jigsaw Puzzles: Banknote Reconstruction From a Large Number of Fragments
* Space-Time Facet Model for Human Activity Classification
* Sparse Multi-Modal Hashing
* Sphere Image for 3-D Model Retrieval
* Sport Type Classification of Mobile Videos
* Standard-Compliant Low-Pass Temporal Filter to Reduce the Perceived Flicker Artifact
* Stationary Probability Model for Microscopic Parallelism in JPEG2000
* Systematic Evaluation of the Bag-of-Frames Representation for Music Information Retrieval, A
* Texture Modeling Using Contourlets and Finite Mixtures of Generalized Gaussian Distributions and Applications
* Topic-Sensitive Influencer Mining in Interest-Based Social Media Networks via Hypergraph Learning
* Touch Saliency: Characteristics and Prediction
* Towards Codebook-Free: Scalable Cascaded Hashing for Mobile Image Search
* Towards Mobile Document Image Retrieval for Digital Library
* Trace Transform Based Method for Color Image Domain Identification
* UMSM: A Traffic Reduction Method on Multi-View Video Streaming for Multiple Users
* Unified Framework of Latent Feature Learning in Social Media, A
* Unsupervised Music Structure Annotation by Time Series Structure Features and Segment Similarity
* Using Audio-Derived Affective Offset to Enhance TV Recommendation
* Using Dynamically Promoted Experts for Music Recommendation
* Variational Bayesian Methods For Multimedia Problems
* Video Activity-Based Traffic Policing: A New Paradigm
* Video Annotation via Image Groups from the Web
* Video Event Detection Using Motion Relativity and Feature Selection
* Video Object Co-Segmentation via Subspace Clustering and Quadratic Pseudo-Boolean Optimization in an MRF Framework
* Visual Protection of HEVC Video by Selective Encryption of CABAC Binstrings
* Weakly Supervised Multi-Graph Learning for Robust Image Reranking
* Weakly Supervised Photo Cropping
190 for MultMed(16)

MultMed(17) * Accuracy of Subjects in a Quality Experiment: A Theoretical Subject Model, The
* Adaptive Optimal Shape Prior for Easy Interactive Object Segmentation
* Adaptive Prioritized Random Linear Coding and Scheduling for Layered Data Delivery From Multiple Servers
* Adaptive Scalable Video Transmission Strategy in Energy Harvesting Communication System
* Anchor View Allocation for Collaborative Free Viewpoint Video Streaming
* Asymmetric Cyclical Hashing for Large Scale Image Retrieval
* Audio Assisted Robust Visual Tracking With Adaptive Particle Filtering
* Author Topic Model-Based Collaborative Filtering for Personalized POI Recommendations
* Automatic Recognition of Emergent Social Roles in Small Group Interactions
* Automatic Visual Concept Learning for Social Event Understanding
* Auxiliary Metadata Delivery in View Synthesis Using Depth No-Synthesis-Error Model
* Barcode Modulation Method for Data Transmission in Mobile Devices
* Battery Aware Video Delivery Techniques Using Rate Adaptation and Base Station Reconfiguration
* Beyond Multimedia Adaptation: Quality of Experience-Aware Multi-Sensorial Media Delivery
* Biased Discriminant Analysis With Feature Line Embedding for Relevance Feedback-Based Image Retrieval
* Bucket-Filling: An Asymptotically Optimal Video-on-Demand Network With Source Coding
* Characterization of SURF and BRISK Interest Point Distribution for Distributed Feature Extraction in Visual Sensor Networks
* Cloud-Assisted Live Streaming for Crowdsourced Multimedia Content
* Cloud-Based Multimedia Content Protection System
* Compact Image Fingerprint Via Multiple Kernel Hashing
* Competence-Based Song Recommendation: Matching Songs to One's Singing Skill
* Connection Discovery Using Big Data of User-Shared Images in Social Media
* Content-Aware Video2Comics With Manga-Style Layout
* Content-Based Video Quality Prediction for HEVC Encoded Videos Streamed Over Packet Networks
* Context-Adaptive Binary Arithmetic Coding With Fixed-Length Codewords
* Contextual Online Learning for Multimedia Content Aggregation
* Continuous Learning Framework for Activity Recognition Using Deep Hybrid Feature Models, A
* Control-Theoretic Approach to Adaptive Video Streaming in Dense Wireless Networks, A
* Controlling a Robotic Fish Via a Natural User Interface for Informal Science Education
* Covariance-Based Descriptors for Efficient 3D Shape Matching, Retrieval, and Classification
* CPCDN: Content Delivery Powered by Context and User Intelligence
* Cross Indexing With Grouplets
* Cross-Domain Feature Learning in Multimedia
* Cross-Layer Resource Allocation for Video Streaming Over OFDMA Cognitive Radio Networks
* Cross-OSN User Modeling by Homogeneous Behavior Quantification and Local Social Regularization
* Cross-Platform Multi-Modal Topic Modeling for Personalized Inter-Platform Recommendation
* Database Saliency for Fast Image Retrieval
* Deep Head Pose: Gaze-Direction Estimation in Multimodal Video
* Deep Learning and Music Adversaries
* Deep Multimodal Learning for Affective Analysis and Retrieval
* DeepBag: Recognizing Handbag Models
* Demonstration of OpenFlow-Controlled Network Orchestration for Adaptive SVC Video Manycast
* Depth Sensation Enhancement for Multiple Virtual View Rendering
* Describing Multimedia Content Using Attention-Based Encoder-Decoder Networks
* Detection and Classification of Acoustic Scenes and Events
* Disparity Vector Correction for View Synthesis Prediction-Based 3-D Video Transmission
* Distributed Online Hybrid Cloud Management for Profit-Driven Multimedia Cloud Computing
* Dynamic Time Warping for Music Conducting Gestures Evaluation
* Effective Image Retrieval System Using Dot-Diffused Block Truncation Coding Features
* Efficient 3-D Scene Prefetching From Learning User Access Patterns
* Efficient Cascaded Filtering Retrieval Method for Big Audio Data, An
* Efficient Heuristic Methods for Multimodal Fusion and Concept Fusion in Video Concept Detection
* Efficient In-Loop Filtering Across Tile Boundaries for Multi-Core HEVC Hardware Decoders With 4 K/8 K-UHD Video Applications
* Efficient Inter-View Bit Allocation Methods for Stereo Image Coding
* Efficient Mining of Optimal AND/OR Patterns for Visual Recognition
* Efficient QR Code Beautification With High Quality Visual Content
* Enabling Enriched TV Shopping Experience via Computational and Temporal Aware View-Centric Multimedia Abstraction
* Energy-Efficient Coarse-Grained Reconfigurable Processing Unit for Multiple-Standard Video Decoding, An
* Energy-Efficient Coarse-Grained Reconfigurable Processing Unit for Multiple-Standard Video Decoding, An
* Energy-Efficient HTTP Adaptive Video Streaming With Networking Cost Constraint Over Heterogeneous Wireless Networks, An
* Enhancing Video Event Recognition Using Automatically Constructed Semantic-Visual Knowledge Base
* Estimation of Signal Distortion Using Effective Sampling Density for Light Field-Based Free Viewpoint Video
* EventMask: A Game-Based Framework for Event-Saliency Identification in Images
* Exploitation and Exploration Balanced Hierarchical Summary for Landmark Images
* Exploiting the Deep-Link Commentsphere to Support Non-Linear Video Access
* Exploiting Web Images for Semantic Video Indexing Via Robust Sample-Specific Loss
* Face Recognition and Retrieval Using Cross-Age Reference Coding With Cross-Age Celebrity Dataset
* Faithful Disocclusion Filling in Depth Image Based Rendering Using Superpixel-Based Inpainting
* Fashion Parsing With Video Context
* Fast HEVC Inter CU Decision Based on Latent SAD Estimation
* Fast Image Retrieval: Query Pruning and Early Termination
* Fast Object Retrieval Using Direct Spatial Matching
* Fine-Grained Image Search
* Framework for Composition and Enforcement of Privacy-Aware and Context-Driven Authorization Mechanism for Multimedia Big Data, A
* Geolocalized Modeling for Dish Recognition
* Gestalt Rule Feature Points
* Global-Scale Location Prediction for Social Images Using Geo-Visual Ranking
* Guest Editorial Multimedia: The Biggest Big Data
* Guest Editorial: Deep Learning for Multimedia Computing
* Hash-Based Block Matching for Screen Content Coding
* Head Motion Modeling for Human Behavior Analysis in Dyadic Interaction
* Hessian Semi-Supervised Sparse Feature Selection Based on L_2,1/2 -Matrix Norm
* Heterogeneous Feature Selection With Multi-Modal Deep Neural Networks and Sparse Group LASSO
* Hybrid Mobile Visual Search System With Compact Global Signatures, A
* Improving Multimedia Content Delivery via Augmentation With Social Information: The Social Prefetcher Approach
* Intelligent Acoustic Interfaces With Multisensor Acquisition for Immersive Reproduction
* Interactive Multimodal Learning for Venue Recommendation
* Interactive Streaming of Sequences of High Resolution JPEG2000 Images
* Joint Online Transcoding and Delivery Approach for Dynamic Adaptive Streaming, A
* Joint Super Resolution and Denoising From a Single Depth Image
* Joint Time-Domain Resource Partitioning, Rate Allocation, and Video Quality Adaptation in Heterogeneous Cellular Networks
* Knowing Verb From Object: Retagging With Transfer Learning on Verb-Object Concept Images
* Landmark Classification With Hierarchical Multi-Modal Exemplar Feature
* Large-Margin Multi-Modal Deep Learning for RGB-D Object Recognition
* Large-Scale Image Retrieval Based on Compressed Camera Identification
* Learning Compact Hash Codes for Multimodal Representations Using Orthogonal Deep Structure
* Learning Consistent Feature Representation for Cross-Modal Multimedia Retrieval
* Learning Cross Space Mapping via DNN Using Large Scale Click-Through Logs
* Learning Feature Hierarchies: A Layer-Wise Tag-Embedded Approach
* Learning Representative Deep Features for Image Set Analysis
* Learning Spatial and Temporal Extents of Human Actions for Action Detection
* Learning-Based Joint Super-Resolution and Deblocking for a Highly Compressed Image
* Let Your Body Speak: Communicative Cue Extraction on Natural Interaction Using RGBD Data
* Loss Visibility Optimized Real-Time Video Transmission Over MIMO Systems
* Mining Latent Attributes From Click-Through Logs for Image Recognition
* Multi-Resolution Disparity Processing and Fusion for Large High-Resolution Stereo Image
* Multi-Task CNN Model for Attribute Prediction
* Multi-View Video Summarization Using Bipartite Matching Constrained Optimum-Path Forest Clustering
* Multifaceted Approach to Social Multimedia-Based Prediction of Elections, A
* Multimedia Summarization for Social Events in Microblog Stream
* Multimodal Multi-Channel On-Line Speaker Diarization Using Sensor Fusion Through SVM
* Multiple Emotion Tagging for Multimedia Data by Exploiting High-Order Dependencies Among Emotions
* New Technique for Multi-Oriented Scene Text Line Detection and Tracking in Video, A
* Non-Rigid Structure-From-Motion on Degenerate Deformations With Low-Rank Shape Deformation Model
* Novel Efficient HEVC Decoding Solution on General-Purpose Processors
* Novel No-Reference Video Quality Metric for Evaluating Temporal Jerkiness due to Frame Freezing, A
* Novel Traffic Rate Measurement Algorithm for Quality of Experience-Aware Video Admission Control, A
* Object Tracking With Multi-View Support Vector Machines
* On Achieving Short Channel Switching Delay and Playback Lag in IP-Based TV Systems
* On Generating Content-Oriented Geo Features for Sensor-Rich Outdoor Video Search
* On-Road Pedestrian Tracking Across Multiple Driving Recorders
* Optimized Comics-Based Storytelling for Temporal Image Sequences
* Optimized Packet Scheduling in Multiview Video Navigation Systems
* Optimizing HTTP-Based Adaptive Streaming in Vehicular Environment Using Markov Decision Process
* Partial-Duplicate Clustering and Visual Pattern Discovery on Web Scale Image Database
* Pattern-Based Near-Duplicate Video Retrieval and Localization on Web-Scale Videos
* Perceived Synchronization of Mulsemedia Services
* Perceptual Quality Assessment for 3D Triangle Mesh Based on Curvature
* PixNet: A Localized Feature Representation for Classification and Visual Search
* Predicting Eye Fixations on Webpage With an Ensemble of Early Features and High-Level Representations from Deep Network
* Predictive Texture Synthesis-Based Intra Coding Scheme for Advanced Video Coding
* Probabilistic Skimlets Fusion for Summarizing Multiple Consumer Landmark Videos
* Profit Optimization for Wireless Video Broadcasting Systems Based on Polymatroidal Analysis
* Pseudo-Multiple-Exposure-Based Tone Fusion With Local Region Adjustment
* Query Difficulty Estimation for Image Search With Query Reconstruction Error
* Query-Dependent Aesthetic Model With Deep Learning for Photo Quality Assessment
* Rate and Power Allocation for Joint Coding and Transmission in Wireless Video Chat Applications
* Rate Distortion Optimized Inter-View Frame Level Bit Allocation Method for MV-HEVC
* Rating Image Aesthetics Using Deep Learning
* Real-Time Piano Music Transcription Based on Computer Vision
* Recognition of Genuine Smiles
* Reduced Reference Stereoscopic Image Quality Assessment Based on Binocular Perceptual Information
* Relational User Attribute Inference in Social Media
* Retargeting Semantically-Rich Photos
* RGB-D Object Recognition via Incorporating Latent Data Structure and Prior Knowledge
* Robust Face Recognition via Multimodal Deep Face Representation
* Secure and Robust Two-Phase Image Authentication
* Semantic-Based Location Recommendation With Multimodal Venue Semantics
* Semantic-Improved Color Imaging Applications: It Is All About Context
* Simple Countermeasures to Mitigate the Effect of Pollution Attack in Network Coding-Based Peer-to-Peer Live Streaming
* Sketch-Based Image Retrieval Through Hypothesis-Driven Object Boundary Selection With HLR Descriptor
* Smart Streaming for Online Video Services
* Spatio-Temporal Video Segmentation of Static Scenes and Its Applications
* Spatio-Temporally Consistent Color and Structure Optimization for Multiview Video Color Correction
* Structure-Preserving Hybrid Digital-Analog Video Delivery in Wireless Networks
* Structured Visual Feature Learning for Classification via Supervised Probabilistic Tensor Factorization
* Structured-Patch Optimization for Dense Correspondence
* Study of Multimodal Addressee Detection in Human-Human-Computer Interaction, A
* Super Fast Event Recognition in Internet Videos
* Superpixel-Based Hand Gesture Recognition With Kinect Depth Camera
* TCD-TIMIT: An Audio-Visual Corpus of Continuous Speech
* Tennis Ball Tracking Using a Two-Layered Data Association Approach
* Topological Spatial Verification for Instance Search
* Towards Cost-Efficient Video Transcoding in Media Cloud: Insights Learned From User Viewing Patterns
* Towards Effective Image Classification Using Class-Specific Codebooks and Distinctive Local Features
* Towards Practical Self-Embedding for JPEG-Compressed Digital Images
* Transition of Visual Attention Assessment in Stereoscopic Images With Evaluation of Subjective Visual Quality and Discomfort
* Tri-Subject Kinship Verification: Understanding the Core of A Family
* Unconstrained Multimodal Multi-Label Learning
* Understanding Blooming Human Groups in Social Networks
* Uniting Keypoints: Local Visual Information Fusion for Large-Scale Image Search
* Unravelling the Impact of Temporal and Geographical Locality in Content Caching Systems
* Unreeling Xunlei Kankan: Understanding Hybrid CDN-P2P Video-on-Demand Streaming
* Unsupervised Celebrity Face Naming in Web Videos
* Unsupervised Web Topic Detection Using A Ranked Clustering-Like Pattern Across Similarity Cascades
* Uploader Intent for Online Video: Typology, Inference, and Applications
* Using Free Energy Principle For Blind Image Quality Assessment
* Utility-Based H.264/SVC Video Streaming Over Multi-Channel Cognitive Radio Networks
* Utility-Based Optimized Cross-Layer Scheme for Real-Time Video Transmission Over HSDPA
* Video Delivery Performance of a Large-Scale VoD System and the Implications on Content Delivery
* Video Object Segmentation Via Dense Trajectories
* Video Popularity Dynamics and Its Implication for Replication
* Visual Object Tracking by Structure Complexity Coefficients
* Visual Tracking Using Strong Classifier and Structural Local Sparse Descriptors
* Weakly Supervised Deep Metric Learning for Community-Contributed Image Retrieval
* Weighted Component Hashing of Binary Aggregated Descriptors for Fast Visual Search
* Wireless Video Multicast With Cooperative and Incremental Transmission of Parity Packets
* Word-of-Mouth Understanding: Entity-Centric Multimodal Aspect-Opinion Mining in Social Media
* YouTube Video Promotion by Cross-Network Association: @Britney to Advertise Gangnam Style
189 for MultMed(17)

MultMed(18) * 3D Ear Identification Using Block-Wise Statistics-Based Features and LC-KSVD
* 6-DOF Image Localization From Massive Geo-Tagged Reference Images
* Adaptive Video Streaming With Optimized Bitstream Extraction and PID-Based Quality Control
* All-Zero Block Detection Scheme for Low-Complexity HEVC Encoders, An
* Analytics-Driven Visualization on Digital Directory via Screen-Smart Device Interactions
* Animal Detection From Highly Cluttered Natural Scenes Using Spatiotemporal Object Region Proposals and Patch Verification
* Animating Still Landscape Photographs Through Cloud Motion Creation
* Audio Recapture Detection With Convolutional Neural Networks
* Audiovisual Spatial-Audio Analysis by Means of Sound Localization and Imaging: A Multimedia Healthcare Framework in Abdominal Sound Mapping
* Background Basis Selection-Based Foreground Detection Method, A
* Background Subtraction Using Background Sets With Image- and Color-Space Reduction
* Bandwidth-Efficient Packet Scheduling for Live Streaming With Network Coding
* Bi-level Protected Compressive Sampling
* Binocular Responses for No-Reference 3D Image Quality Assessment
* Blind Image Quality Assessment Using Statistical Structural and Luminance Features
* Blind Quality Assessment of Tone-Mapped Images Via Analysis of Information, Naturalness, and Structure
* Bridging Music and Image via Cross-Modal Ranking Analysis
* CCR: Clustering and Collaborative Representation for Fast Single Image Super-Resolution
* Characterization of Band Codes for Pollution-Resilient Peer-to-Peer Video Streaming
* Classification-Based Record Linkage With Pseudonymized Data for Epidemiological Cancer Registries
* Clothes Co-Parsing Via Joint Image Segmentation and Labeling With Application to Clothing Retrieval
* Clothing Cosegmentation for Shopping Images With Cluttered Background
* Cloud-Based Actor Identification With Batch-Orthogonal Local-Sensitive Hashing and Sparse Representation
* Clustering-Based Content Adaptive Tiles Under On-chip Memory Constraints
* Collaborative Wireless Freeview Video Streaming With Network Coding
* Combined Deblocking Filter and SAO Hardware Architecture for HEVC, A
* Comparison and Evaluation of Sonification Strategies for Guidance Tasks
* Complexity Control Based on a Fast Coding Unit Decision Method in the HEVC Video Coding Standard
* Compressed-Sensed-Domain L1-PCA Video Surveillance
* Computational Model for Object-Based Visual Saliency: Spreading Attention Along Gestalt Cues, A
* ConfidentCare: A Clinical Decision Support System for Personalized Breast Cancer Screening
* Consistent Coding Scheme for Single-Image Super-Resolution Via Independent Dictionaries
* Constellation Design Methodology Based on QoS and User Demand in High-Altitude Platform Broadband Networks, A
* Content-Based Guided Image Filtering, Weighted Semi-Global Optimization, and Efficient Disparity Refinement for Fast and Accurate Disparity Estimation
* Context-Aware Framework for Reducing Bandwidth Usage of Mobile Video Chats, A
* Context-Aware Hypergraph Modeling for Re-identification and Summarization
* Coping With Heterogeneous Video Contributors and Viewers in Crowdsourced Live Streaming: A Cloud-Based Approach
* Core Failure Mitigation in Integer Sum-of-Product Computations on Cloud Computing Systems
* Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation
* Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation
* Cross-Modal Retrieval via Deep and Bidirectional Representation Learning
* CSPS: An Adaptive Pooling Method for Image Classification
* DAC-Mobi: Data-Assisted Communications of Mobile Images with Cloud Computing Support
* Data Hiding Robust to Mobile Communication Vocoders
* Data-Driven Crowd Understanding: A Baseline for a Large-Scale Crowd Dataset
* Dealing With User Heterogeneity in P2P Multi-Party Video Conferencing: Layered Distribution Versus Partitioned Simulcast
* Decision-Tree-Based Perceptual Video Quality Prediction Model and Its Application in FEC for Wireless Multimedia Communications, A
* Deep Aging Face Verification With Large Gaps
* Deep Learning for Surface Material Classification Using Haptic and Visual Information
* Deep Neural Network-Driven Feature Learning Method for Multi-view Facial Expression Recognition, A
* Deep Relative Attributes
* Delay-Optimized Video Traffic Routing in Software-Defined Interdatacenter Networks
* Democratic Diffusion Aggregation for Image Retrieval
* Depth Map Down-Sampling and Coding Based on Synthesized View Distortion
* Differentially Private Online Learning for Cloud-Based Video Recommendation With Multimedia Big Data in Social Networks
* Discriminative Dictionary Learning With Common Label Alignment for Cross-Modal Retrieval
* Distance-Computation-Free Search Scheme for Binary Code Databases, A
* Do Personality and Culture Influence Perceived Video Quality and Enjoyment?
* DPcode: Privacy-Preserving Frequent Visual Patterns Publication on Cloud
* Effective Active Skeleton Representation for Low Latency Human Action Recognition
* Efficient Bit Rate Transcoding for High Efficiency Video Coding
* Efficient Cache Placement Strategy in Two-Tier Wireless Content Delivery Network
* Efficient Image Sharpness Assessment Based on Content Aware Total Variation
* Efficient Residual DPCM Using an L_1 Robust Linear Prediction in Screen Content Video Coding
* Efficient Summarization From Multiple Georeferenced User-Generated Videos
* Enabling Secure and Fast Indexing for Privacy-Assured Healthcare Monitoring via Compressive Sensing
* Energy-Aware and Bandwidth-Efficient Hybrid Video Streaming Over Mobile Networks
* Energy-Efficient Resource Allocation Optimization for Multimedia Heterogeneous Cloud Radio Access Networks
* Error Mitigation Technique for Erasure Channels Based on a Wavelet Representation of the Speech Excitation Signal, An
* Estimating 3D Gaze Directions Using Unlabeled Eye Images via Synthetic Iris Appearance Fitting
* Estimating Snow Cover From Publicly Available Images
* Exemplar-AMMs: Recognizing Crowd Movements From Pedestrian Trajectories
* Exploiting Perceptual Anchoring for Color Image Enhancement
* Face and Hair Region Labeling Using Semi-Supervised Spectral Clustering-Based Multiple Segmentations
* Factorization Algorithms for Temporal Psychovisual Modulation Display
* Fast Covariant VLAD for Image Search
* Fast Learning-Based Single Image Super-Resolution
* Filtering of Brand-Related Microblogs Using Social-Smooth Multiview Embedding
* Flickr Circles: Aesthetic Tendency Discovery by Multi-View Regularized Topic Modeling
* Folksonomy-Based Visual Ontology Construction and Its Applications
* Frame Interpolation for Cloud-Based Mobile Video Streaming
* Free-Energy Principle Inspired Video Quality Metric and Its Use in Video Coding
* Game Theoretic Resource Allocation in Media Cloud With Mobile Social Users
* GameFlow: Narrative Visualization of NBA Basketball Games
* Geometric Approach to Server Selection for Interactive Video Streaming, A
* Guest Editorial: Cloud-Based Video Processing and Content Sharing
* Guest Editorial: Multimedia-Based Healthcare
* Guest Editorial: Visual Analytics in Multimedia: Opportunities and Research Challenges
* Guided Image Contrast Enhancement Based on Retrieved Images in Cloud
* HEMS: Hierarchical Exemplar-Based Matching-Synthesis for Object-Aware Image Reconstruction
* Hierarchical Visualization of Video Search Results for Topic-Based Browsing
* High-Throughput and Multi-Parallel VLSI Architecture for HEVC Deblocking Filter, A
* High-Throughput Hardware Design of a One-Dimensional SPIHT Algorithm, A
* Higher-Order Image Co-segmentation
* Hirability in the Wild: Analysis of Online Conversational Video Resumes
* Holons Visual Representation for Image Retrieval
* Human Visual System-Based Saliency Detection for High Dynamic Range Content
* Hybrid Zero Block Detection for High Efficiency Video Coding
* Image Classification by Cross-Media Active Learning with Privileged Information
* Image Classification by Selective Regularized Subspace Learning
* Image Co-segmentation via Saliency Co-fusion
* Image Interpolation Based on Non-local Geometric Similarities and Directional Gradients
* Image Retargeting for Preserving Robust Local Feature: Application to Mobile Visual Search
* Image Sharpness Assessment by Sparse Representation
* In-Network View Synthesis for Interactive Multiview Video Systems
* Inter-Prediction Optimizations for Video Coding Using Adaptive Coding Unit Visiting Order
* Interactive Multilabel Image Segmentation via Robust Multilayer Graph Constraints
* Interactive Spiral Tape Video Summarization, An
* Joint Inference of Objects and Scenes With Efficient Learning of Text-Object-Scene Relations
* Kernel Combined Sparse Representation for Disease Recognition
* Keypoint Detection in RGBD Images Based on an Anisotropic Scale Space
* Keypoint Encoding for Improved Feature Extraction From Compressed Video at Low Bitrates
* Knowledge-Based Coding of Objects for Multisource Surveillance Video Data
* lambda-Domain Rate Control Algorithm for HEVC Scalable Extension
* Learning Blind Quality Evaluator for Stereoscopic Images Using Joint Sparse Representation
* Learning Cascaded Deep Auto-Encoder Networks for Face Alignment
* Learning Geographical Hierarchy Features via a Compositional Model
* Learning Personalized Models for Facial Expression Analysis and Gesture Recognition
* Link Adaptation for High-Quality Uncompressed Video Streaming in 60-GHz Wireless Networks
* Locality Sensitive Low-Rank Model for Image Tag Completion, A
* Looking Into Saliency Model via Space-Time Visualization
* Low-Power Video Recording System With Multiple Operation Modes for H.264 and Light-Weight Compression, A
* mDASH: A Markov Decision-Based Rate Adaptation Approach for Dynamic HTTP Streaming
* Mean-Shift and Sparse Sampling-Based SMC-PHD Filtering for Audio Informed Visual Speaker Tracking
* Media Query Processing for the Internet-of-Things: Coupling of Device Energy Consumption and Cloud Infrastructure Billing
* Modeling Dynamics of Online Video Popularity
* Monet: A System for Reliving Your Memories by Theme-Based Photo Storytelling
* MoshViz: A Detail+Overview Approach to Visualize Music Elements
* Multi-Instance Multi-Label Learning Combining Hierarchical Context and its Application to Image Annotation
* Multi-Modal Event Topic Model for Social Event Analysis
* Multi-Perspective Cost-Sensitive Context-Aware Multi-Instance Sparse Coding and Its Application to Sensitive Video Recognition
* Multimedia Pivot Tables for Multimedia Analytics on Image Collections
* Multimodal Personality Recognition in Collaborative Goal-Oriented Tasks
* Multimodal Web Aesthetics Assessment Based on Structural SVM and Multitask Fusion Learning
* Multiple Human Identification and Cosegmentation: A Human-Oriented CRF Approach With Poselets
* Multiple Stage Residual Model for Image Classification and Vector Compression
* Multiple Video Delivery in m-Health Emergency Applications
* Multiplicative Watermark Decoder in Contourlet Domain Using the Normal Inverse Gaussian Distribution
* Multiview and 3D Video Compression Using Neighboring Block Based Disparity Vectors
* Multiview Skeletal Interaction Recognition Using Active Joint Interaction Graph
* Muscular Movement Model-Based Automatic 3D/4D Facial Expression Recognition
* Neyman-Pearson-Based Early Mode Decision for HEVC Encoding
* No-Reference Retargeted Image Quality Assessment Based on Pairwise Rank Learning
* Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion, A
* Novel UEP Fountain Coding Scheme for Scalable Multimedia Transmission, A
* Object Instance Search in Videos via Spatio-Temporal Trajectory Discovery
* On Branded Handbag Recognition
* On Constructing z -Dimensional DIBR-Synthesized Images
* On Data-Driven Delay Estimation for Media Cloud
* On Evaluating Perceptual Quality of Online User-Generated Videos
* On the Optimal Linear Network Coding Design for Information Theoretically Secure Unicast Streaming
* Optimal Incentive Design for Cloud-Enabled Multimedia Crowdsourcing
* Optimality of Greedy Algorithm for Generating Just-Noticeable Difference Surfaces
* Perceiving Graphical and Pictorial Information via Hearing and Touch
* Perceptual Annoyance Models for Videos With Combinations of Spatial and Temporal Artifacts
* Person Reidentification via Ranking Aggregation of Similarity Pulling and Dissimilarity Pushing
* PhenoTree: Interactive Visual Analytics for Hierarchical Phenotyping From Large-Scale Electronic Health Records
* Predicting the Performance in Decision-Making Tasks: From Individual Cues to Group Interaction
* Probabilistic Approach for Predicting the Size of Coding Units in the Quad-Tree Structure of the Quality and Spatial Scalable HEVC
* Pseudo 2D String Matching Technique for High Efficiency Screen Content Coding
* QoE Evaluation of Multimedia Services Based on Audiovisual Quality and User Interest
* Quadtree Degeneration for HEVC
* Quality of Experience Driven Multi-User Video Streaming in Cellular Cognitive Radio Networks With Single Channel Access
* Query-Adaptive Small Object Search Using Object Proposals and Shape-Aware Descriptors
* Rating Prediction Based on Social Sentiment From Textual Reviews
* Region-Aware 3-D Warping for DIBR
* Reliable Methodology to Collect Ground Truth Data of Image Aesthetic Appeal, A
* Resource Allocation With Video Traffic Prediction in Cloud-Based Space Systems
* Resource-Efficient Mobile Multimedia Streaming With Adaptive Network Selection
* Reversible Data Hiding in Encrypted Images by Reversible Image Transformation
* Robust DT CWT-Based DIBR 3D Video Watermarking Using Chrominance Embedding
* Robust Fingertip Detection in a Complex Environment
* Robust Latent Poisson Deconvolution From Multiple Features for Web Topic Detection
* SALIC: Social Active Learning for Image Classification
* Saliency-Guided Quality Assessment of Screen Content Images
* Scalable Video Event Retrieval by Visual State Binary Embedding
* Semantic Discriminative Metric Learning for Image Similarity Measurement
* Semi-Supervised Bi-Dictionary Learning for Image Classification With Smooth Representation-Based Label Propagation
* Sensing Matrix Optimization Based on Equiangular Tight Frames With Consideration of Sparse Representation Error
* Significance Evaluation of Video Data Over Media Cloud Based on Compressed Sensing
* Sketch-Based Image Retrieval by Salient Contour Reinforcement
* Social Diffusion Analysis With Common-Interest Model for Image Annotation
* Social Friend Recommendation Based on Multiple Network Correlation
* Sparse Kernel Reduced-Rank Regression for Bimodal Emotion Recognition From Facial Expression and Speech
* Sparse Pose Regression via Componentwise Clustering Feature Point Representation
* Spin Contour
* SSIM-Based Game Theory Approach for Rate-Distortion Optimized Intra Frame CTU-Level Bit Allocation
* Survey on Visual Analytics of Social Media Data, A
* Tag-Based Image Search by Social Re-ranking
* TagBook: A Semantic Video Representation Without Supervision for Event Detection
* Task-Driven Progressive Part Localization for Fine-Grained Object Recognition
* Tensor Manifold Discriminant Projections for Acceleration-Based Human Activity Recognition
* Tiling in Interactive Panoramic Video: Approaches and Evaluation
* Time-Domain Attribute-Based Access Control for Cloud-Based Video Content Sharing: A Cryptographic Approach
* Toward Cost-Efficient Content Placement in Media Cloud: Modeling and Analysis
* Trend-Aware Video Caching Through Online Learning
* Universal Framework for Salient Object Detection, A
* User-Service Rating Prediction by Exploring Social Users' Rating Behaviors
* View-Level Rate Distortion Model for Multi-View/3D Video, A
* Visual Analytics of Political Networks From Face-Tracking of News Video
* Visual Movie Analytics
* Visual Understanding via Multi-Feature Shared Learning With Global Consistency
* Visual Voice Activity Detection in the Wild
* Visualization-Based Active Learning for Video Annotation
* Visualizing and Analyzing Video Content With Interactive Scalable Maps
* Zero-Shot Person Re-identification via Cross-View Consistency
206 for MultMed(18)

MultMed(19) * Accelerating Image-Domain-Warping Virtual View Synthesis on GPGPU
* Accurate Depth Extraction Method for Multiple Light-Coding-Based Depth Cameras
* Active Sampling Exploiting Reliable Informativeness for Subjective Image Quality Assessment Based on Pairwise Comparison
* Adaptive Fusion Algorithm for Visible and Infrared Videos Based on Entropy and the Cumulative Distribution of Gray Levels, An
* Adaptive LSTAR Model for Long-Range Variable Bit Rate Video Traffic Prediction
* Adaptive Video Streaming With Network Coding Enabled Named Data Networking
* Analog Coded SoftCast: A Network Slice Design for Multimedia Broadcast/Multicast
* Asymmetric Binary Coding for Image Search
* Attentive Contexts for Object Detection
* Audio Identification by Sampling Sub-fingerprints and Counting Matches
* Automated Online Exam Proctoring
* Automatic Mesh Animation Preview With User Voting-Based Refinement
* Automatic Synchronization of Multi-user Photo Galleries
* Background-Driven Salient Object Detection
* Bayesian Hierarchical Regression Models for QoE Estimation and Prediction in Audiovisual Communications
* Being a Supercook: Joint Food Attributes and Multimodal Content Modeling for Recipe Retrieval and Exploration
* Blind Image Quality Assessment Based on Rank-Order Regularized Regression
* Blind Stereo Quality Assessment Based on Learned Features From Binocular Combined Images
* Cartoon and Texture Decomposition-Based Color Transfer for Fabric Images
* Collective First-Person Vision for Automatic Gaze Analysis in Multiparty Conversations
* Color Enhancement With Adaptive Illumination Estimation for Low-Backlighted Displays
* Color-Guided Depth Recovery via Joint Local Structural and Nonlocal Low-Rank Regularization
* Compact Hash Codes for Efficient Visual Descriptors Retrieval in Large Scale Databases
* Compete or Collaborate: Architectures for Collaborative DASH Video Over Future Networks
* Comprehensive Feature-Based Robust Video Fingerprinting Using Tensor Model
* Compressed Sensing for Efficient Encoding of Dense 3D Meshes Using Model-Based Bayesian Learning
* Context-Associative Hierarchical Memory Model for Human Activity Recognition and Prediction
* Continuous Probability Distribution Prediction of Image Emotions via Multitask Shared Sparse Regression
* Convolutional Neural Network-Based Chinese Text Detection Algorithm via Text Structure Modeling, A
* Cost-Effective Low-Delay Design for Multiparty Cloud Video Conferencing
* Cross-Layer Resource Allocation for Scalable Video Over OFDMA Wireless Networks: Tradeoff Between Quality Fairness and Efficiency
* Cross-Modal Hashing via Rank-Order Preserving
* Cross-Modal Retrieval Using Multiordered Discriminative Structured Subspace Learning
* Crowdsourcing Thousands of Specialized Labels: A Bayesian Active Training Approach
* CrowdTranscoding: Online Video Transcoding With Massive Viewers
* Dancelets Mining for Video Recommendation Based on Dance Styles
* DCAR: A Discriminative and Compact Audio Representation for Audio Processing
* Deep Coupled Metric Learning for Cross-Modal Matching
* Deep Multimetric Learning for Shape-Based 3D Model Retrieval
* Deep Video Hashing
* Depth-Preserving Stereo Image Retargeting Based on Pixel Fusion
* Detecting Dominant Vanishing Points in Natural Scenes with Application to Composition-Sensitive Image Retrieval
* Detecting Low-Quality Workers in QoE Crowdtesting: A Worker Behavior-Based Approach
* Dictionary Learning-Based 3D Morphable Shape Model, A
* Discrete Multimodal Hashing With Canonical Views for Robust Mobile Landmark Search
* Discriminative Multi-instance Multitask Learning for 3D Action Recognition
* Distributed Compressive Sensing for Cloud-Based Wireless Image Transmission
* Distributed Content Based Video Identification in Peer-to-Peer Networks: Requirements and Solutions
* Diversified Visual Attention Networks for Fine-Grained Object Classification
* Dynamic Adaptive Video Streaming: Towards a Systematic Comparison of ICN and TCP/IP
* Dynamic Manga: Animating Still Manga via Camera Movement
* Dynamic Topic Model and Matrix Factorization-Based Travel Recommendation Method Exploiting Ubiquitous Data, A
* Edge Caching for Layered Video Contents in Mobile Social Networks
* Efficient Unsupervised Temporal Segmentation of Motion Data
* Estimating Heart Rate and Rhythm via 3D Motion Tracking in Depth Video
* Exploiting Web Images for Dataset Construction: A Domain Robust Approach
* Exploring Viewer Gazing Patterns for Touch-Based Mobile Gamecasting
* Fast Algorithm and VLSI Architecture of Rate Distortion Optimization in H.265-HEVC
* Fast and Adaptive 3D Reconstruction With Extensively High Completeness
* Fast Image Dehazing Method Based on Linear Transformation
* Fast, Compact, and Discriminative: Evaluation of Binary Descriptors for Mobile Applications
* Focus-Plus-Context Techniques for Picoprojection-Based Interaction
* FreeScup: A Novel Platform for Assisting Sculpture Pose Design
* Frequency-Selective Mesh-to-Grid Resampling for Image Communication
* Fusion of Magnetic and Visual Sensors for Indoor Localization: Infrastructure-Free and More Effective
* Generalized Residual Vector Quantization and Aggregating Tree for Large Scale Search
* GHEVC: An Efficient HEVC Decoder for Graphics Processing Units
* GIFT: Towards Scalable 3D Shape Retrieval
* Graph PCA Hashing for Similarity Search
* Guest Editorial: Large-Scale Multimedia Data Retrieval, Classification, and Understanding
* Guest Editorial: Video Over Future Networks
* Hashing With Pairwise Correlation Learning and Reconstruction
* Hierarchical Bayesian Theme Models for Multipose Facial Expression Recognition
* Hierarchical MK Splines: Algorithm and Applications to Data Fitting
* Hierarchical Spatio-Temporal Model for Human Activity Recognition, A
* HNIP: Compact Deep Invariant Representations for Video Matching, Localization, and Retrieval
* Human Facial Age Estimation by Cost-Sensitive Label Ranking and Trace Norm Regularization
* Image Location Inference by Multisaliency Enhancement
* Image-Based Appraisal of Real Estate Properties
* Imbalance Compensation Framework for Background Subtraction, An
* Implicit Analysis of Perceptual Multimedia Experience Based on Physiological Response: A Review
* Improved Depth-Assisted Error Concealment Algorithm for 3D Video Transmission
* Inferring Emotional Tags From Social Images With User Demographics
* Instrument Learning and Sparse NMD for Automatic Polyphonic Music Transcription
* Integration of Diverse Data Sources for Spatial PM2.5 Data Interpolation
* Interactive Screen Video Streaming-Based Pervasive Mobile Workstyle
* Inverse Sparse Group Lasso Model for Robust Object Tracking
* Joint Admission Control and Routing Via Approximate Dynamic Programming for Streaming Video Over Software-Defined Networking
* Joint Compression of Near-Duplicate Videos
* Joint Deep Boltzmann Machine (jDBM) Model for Person Identification Using Mobile Phone Data, A
* Joint Image-Text News Topic Detection and Tracking by Multimodal Topic And-Or Graph
* Known-Artist Live Song Identification Using Audio Hashprints
* Large-Scale Tracking for Images With Few Textures
* Learning Efficient Binary Codes From High-Level Feature Representations for Multilabel Image Retrieval
* Learning Sparse Representation for No-Reference Quality Assessment of Multiply Distorted Stereoscopic Images
* Learning to Predict High-Quality Edge Maps for Room Layout Estimation
* Live Broadcast With Community Interactions: Bottlenecks and Optimizations
* Local Pattern Collocations Using Regional Co-occurrence Factorization
* Many Shades of Negativity, The
* Matryoshka Peek: Toward Learning Fine-Grained, Robust, Discriminative Features for Product Search
* Maximum a Posterior and Perceptually Motivated Reconstruction Algorithm: A Generic Framework
* Media Quality Assessment by Perceptual Gaze-Shift Patterns Discovery
* Methodology for Designing and Evaluating Cloud Scheduling Strategies in Distributed Videoconferencing Systems, A
* Mining Fashion Outfit Composition Using an End-to-End Deep Learning Approach on Set Data
* Mirror Mirror on the Wall... An Unobtrusive Intelligent Multisensory Mirror for Well-Being Status Self-Assessment and Visualization
* Mobile Live Video Streaming Optimization via Crowdsourcing Brokerage
* Modeling Restaurant Context for Food Recognition
* Motion Classification-Based Fast Motion Estimation for High-Efficiency Video Coding
* Motion-Homogeneous-Based Fast Transcoding Method From H.264: AVC to HEVC
* Multi-View Surveillance Video Summarization via Joint Embedding and Sparse Optimization
* Multimedia Classification Using Bipolar Relation Graphs
* Multimodal 2D+3D Facial Expression Recognition With Deep Fusion Convolutional Neural Network
* Multimodal Video-to-Near-Scene Annotation
* Multipath Cooperative Communications Networks for Augmented and Virtual Reality Transmission
* MuVi: Multiview Video Aware Transmission Over MIMO Wireless Systems
* Neighborhood Matching for Image Retrieval
* No-Reference and Robust Image Sharpness Evaluation Based on Multiscale Spatial and Spectral Features
* Nonlinear Discrete Hashing
* Nonlinear Sparse Hashing
* Nonparametric Sparse Matrix Decomposition for Cross-View Dimensionality Reduction
* Novel Data Hiding Algorithm for High Dynamic Range Images, A
* Novel Transient Wrinkle Detection Algorithm and Its Application for Expression Synthesis, A
* Novel Visual and Statistical Image Features for Microblogs News Verification
* Nuclear Norm-Based 2DLPP for Image Classification
* Object Localization Based on Proposal Fusion
* Object-Based Visual Saliency via Laplacian Regularized Kernel Regression
* Occlusion-Aware Real-Time Object Tracking
* On Market-Driven Hybrid-P2P Video Streaming
* Online MoCap Data Coding With Bit Allocation, Rate Control, and Motion-Adaptive Post-Processing
* Online Variable Coding Length Product Quantization for Fast Nearest Neighbor Search in Mobile Retrieval
* Optimal Representations for Adaptive Streaming in Interactive Multiview Video Systems
* Optimized Adaptive Streaming of Multi-video Stream Bundles
* Overlapping Community Detection for Multimedia Social Networks
* Parametric Planning Model for Video Quality Evaluation of IPTV Services Combining Channel and Video Characteristics
* Parametric Quality-Estimation Model for Adaptive-Bitrate-Streaming Services
* PBC: Polygon-Based Classifier for Fine-Grained Categorization
* Perceptual Pruning: A Context-Aware Transcoder for Immersive Video Conferencing Systems
* Personalized Egocentric Video Summarization of Cultural Tour on User Preferences Input
* Personalized Social Image Recommendation Method Based on User-Image-Tag Model
* Photo Aesthetics Analysis via DCNN Feature Encoding
* Photo Filter Recommendation by Category-Aware Aesthetic Learning
* Picking Neural Activations for Fine-Grained Recognition
* Pipeline-Based Ray-Tracing Runtime System for HSA-Compliant Frameworks, A
* PLTD: Patch-Based Low-Rank Tensor Decomposition for Hyperspectral Images
* Predicting Image Memorability Through Adaptive Transfer Learning From External Sources
* Predicting Popularity of Online Videos Using Support Vector Regression
* Privacy Preserving Cloth Try-On Using Mobile Augmented Reality
* Probabilistic Approach to People-Centric Photo Selection and Sequencing, A
* Progressive Pseudo-analog Transmission for Mobile Video Streaming
* Real-Time Correlation Filter Tracking by Efficient Dense Belief Propagation With Structure Preserving
* Recognizing and Presenting the Storytelling Video Structure With Deep Multimodal Networks
* Reducing Latency for Multimedia Broadcast Services Over Mobile Networks
* Redundancy Allocation Based on the Weighted Mismatch-Rate Slope for Multiple Description Video Coding
* Reliable Video Streaming With Strict Playout Deadline in Multihop Wireless Networks
* Resource Provisioning and Profit Maximization for Transcoding in Clouds: A Two-Timescale Approach
* Retrieval Compensated Group Structured Sparsity for Image Super-Resolution
* Retrieval From and Understanding of Large-Scale Multi-modal Medical Datasets: A Review
* Robust Generalized Low-Rank Decomposition of Multimatrices for Image Recovery
* Saliency Detection by Fully Learning a Continuous Conditional Random Field
* Saliency Detection for 3D Surface Geometry Using Semi-regular Meshes
* Saliency Prior Context Model for Real-Time Object Tracking, A
* Salient Object Segmentation via Effective Integration of Saliency and Objectness
* Scalable Image Retrieval by Sparse Product Quantization
* SDNHAS: An SDN-Enabled Architecture to Optimize QoE in HTTP Adaptive Streaming
* Segment-Based Storage and Transcoding Trade-off Strategy for Multi-version VoD Systems in the Cloud, A
* Semisupervised Online Multikernel Similarity Learning for Image Retrieval
* Sequential Deep Trajectory Descriptor for Action Recognition With Three-Stream CNN
* Signal Dependent Transform Based on SVD for HEVC Intracoding
* Single Image Super-Resolution via Adaptive Transform-Based Nonlocal Self-Similarity Modeling and Learning-Based Gradient Regularization
* Single Image Super-Resolution via Locally Regularized Anchored Neighborhood Regression and Nonlocal Means
* Skin Segmentation Algorithm Based on Stacked Autoencoders, A
* Sleep Apnea Detection via Depth Video and Audio Feature Learning
* Social Attribute Aware Incentive Mechanism for Device-to-Device Video Distribution
* Social Force Model-Based MCMC-OCSVM Particle PHD Filter for Multiple Human Tracking
* Social-Aware Rate Based Content Sharing Mode Selection for D2D Content Sharing Scenarios
* Social-Aware Video Recommendation for Online Social Groups
* Socially Aware Energy-Efficient Mobile Edge Collaboration for Video Distribution
* Sound-Event Classification Using Robust Texture Features for Robot Hearing
* Sparse Multigraph Embedding for Multimodal Feature Representation
* Sparse Recovery-Based Error Concealment
* Sparse Representation Model Using the Complete Marginal Fisher Analysis Framework and Its Applications to Visual Recognition, A
* SRLSP: A Face Image Super-Resolution Algorithm Using Smooth Regression With Local Structure Prior
* Statistically Indifferent Quality Variation: An Approach for Reducing Multimedia Distribution Cost for Adaptive Video Streaming Services
* Stochastic Multiview Hashing for Large-Scale Near-Duplicate Video Retrieval
* Structural Variation Classification Model for Image Quality Assessment, A
* Structure-Preserving Image Super-Resolution via Contextualized Multitask Learning
* Sufficient Image Appearance Transfer Combining Color and Texture
* Supervised Local Descriptor Learning for Human Action Recognition
* Texture Plus Depth Video Coding Using Camera Global Motion Information
* Toward Encrypted Cloud Media Center With Secure Deduplication
* Toward Physiology-Aware DASH: Bandwidth-Compliant Prioritized Clinical Multimedia Communication in Ambulances
* Toward QoE-Assured 4K Video-on-Demand Delivery Through Mobile Edge Virtualization With Adaptive Prefetching
* Tradeoffs Between Cost and Performance for CDN Provisioning Based on Coordinate Transformation
* Trip Outfits Advisor: Location-Oriented Clothing Recommendation
* Two-Stage Friend Recommendation Based on Network Alignment and Series Expansion of Probabilistic Topic Model
* Two-View 3D Reconstruction for Food Volume Estimation
* Unimodal Stopping Model-Based Early SKIP Mode Decision for High-Efficiency Video Coding
* Utility-Driven Adaptive Preprocessing for Screen Content Video Compression
* Video Captioning With Attention-Based LSTM and Semantic Consistency
* Video eCommerce: Toward Large Scale Online Video Advertising
* Video Encoder Architecture for Low-Delay Live-Streaming Events
* Video Object Segmentation via Global Consistency Aware Query Strategy
* VideoWhisper: Toward Discriminative Unsupervised Video Feature Learning With Attention-Based Recurrent Neural Networks
* Vision-Based Fingertip Tracking Utilizing Curvature Points Clustering and Hash Model Representation
* Visual Importance and Distortion Guided Deep Image Quality Assessment Framework
* Visual Tracking via Nonnegative Multiple Coding
* Visualizing Video Sounds With Sound Word Animation to Enrich User Experience
* Voronoi-Based Compact Image Descriptors: Efficient Region-of-Interest Retrieval With VLAD and Deep-Learning-Based Descriptors
* VRFP: On-the-Fly Video Retrieval Using Web Images and Fast Fisher Vector Products
* Wavelet-Based L_infty Semi-regular Mesh Coding
* Weakly Supervised Learning of Deformable Part-Based Models for Object Detection via Region Proposals
* Who Are Your Real Friends: Analyzing and Distinguishing Between Offline and Online Friendships From Social Multimedia Data
* Words Matter: Scene Text for Image Classification and Retrieval
213 for MultMed(19)

MultMed(20) * 3DQoE-Oriented and Energy-Efficient 2D plus Depth Based 3D Video Streaming Over Centrally Controlled Networks
* Accessible Melanoma Detection Using Smartphones and Mobile Image Analysis
* Active Learning for Crowdsourced QoE Modeling
* AENet: Learning Deep Audio Features for Video Analysis
* Aesthetics-Driven Stereoscopic 3-D Image Recomposition With Depth Adaptation
* Analysis of Structural Characteristics for Quality Assessment of Multiply Distorted Images
* Anomaly Detection Based on Stacked Sparse Coding With Intraframe Classification Strategy
* Arbitrary-Oriented Scene Text Detection via Rotation Proposals
* Audio-Visual System for Object-Based Audio: From Recording to Listening, An
* Automatic Image Cropping for Visual Aesthetic Enhancement Using Deep Neural Networks and Cascaded Regression
* Background Modeling and Referencing for Moving Cameras-Captured Surveillance Video Coding in HEVC
* Bag of Surrogate Parts Feature for Visual Recognition
* Behavioral Analysis of Kinetic Telepresence for Small Symmetric Group-to-Group Meetings
* Bilevel Feature Learning for Video Saliency Detection
* Blackthorn: Large-Scale Interactive Multimodal Learning
* Blind Image Quality Assessment via Vector Regression and Object Oriented Pooling
* Blind Quality Assessment Based on Pseudo-Reference Image
* Blind Quality Index for Multiply Distorted Images Using Biorder Structure Degradation and Nonlocal Statistics
* Building Emotional Machines: Recognizing Image Emotions Through Deep Neural Networks
* Bundled Object Context for Referring Expressions
* BVI-HD: A Video Quality Database for HEVC Compressed and Texture Synthesized Content
* CCL: Cross-modal Correlation Learning With Multigrained Fusion by Hierarchical Network
* Check Out This Place: Inferring Ambiance From Airbnb Photos
* Closed-Form Optimization on Saliency-Guided Image Compression for HEVC-MSP
* CNN-Based Joint Clustering and Representation Learning with Feature Drift Compensation for Large-Scale Image Data
* Coherent Deep-Net Fusion To Classify Shots In Concert Videos
* Collaborative Scheduling-Based Parallel Solution for HEVC Encoding on Multicore Platforms, A
* Collective Density Clustering for Coherent Motion Detection
* Content-Adaptive Joint Image Compression and Encryption Scheme, A
* Content-Attention Representation by Factorized Action-Scene Network for Action Recognition
* Content-Aware Delivery of Scalable Video in Network Coding Enabled Named Data Networks
* Controllable Multicast for Adaptive Scalable Video Streaming in Software-Defined Networks
* Convolutional Neural Network for Intermediate View Enhancement in Multiview Streaming
* Cooperative Bargaining Game-Based Multiuser Bandwidth Allocation for Dynamic Adaptive Streaming Over HTTP
* Cost-Constrained Video Quality Satisfaction Study on Mobile Devices, A
* Cost-Distortion Optimization and Resource Control in Pseudo-Analog Visual Communications
* Cross-Domain Collaborative Learning via Discriminative Nonparametric Bayesian Model
* Cross-Media Similarity Evaluation for Web Image Retrieval in the Wild
* Cross-Space Distortion Directed Color Image Compression
* CTU-Level Complexity Control for High Efficiency Video Coding
* CUNet: A Compact Unsupervised Network For Image Classification
* DASH Adaptation Algorithm Based on Adaptive Forgetting Factor Estimation
* Data Analysis in Multimedia Quality Assessment: Revisiting the Statistical Tests
* Data Driven 2-D-to-3-D Video Conversion for Soccer
* Data-Driven Lightweight Interest Point Selection for Large-Scale Visual Search
* Deep Age Estimation: From Classification to Ranking
* Deep Salient Object Detection With Dense Connections and Distraction Diagnosis
* Deep Spatiotemporal Perspective for Understanding Crowd Behavior, A
* Deep Temporal Multimodal Fusion for Medical Procedure Monitoring Using Wearable Sensors
* Deep-Structured Event Modeling for User-Generated Photos
* Depth Assisted Adaptive Workload Balancing for Parallel View Synthesis
* Depth Pooling Based Large-Scale 3-D Action Recognition with Convolutional Neural Networks
* Depth-Adaptive Deep Neural Network for Semantic Segmentation
* Detecting and Removing Visual Distractors for Video Aesthetic Enhancement
* Detecting Socially Significant Music Events Using Temporally Noisy Labels
* Detecting Topic Authoritative Social Media Users: A Multilayer Network Approach
* Discovering Triangles in Portraits for Supporting Photographic Creation
* Discovery of Repeated Melodic Phrases in Folk Singing Recordings
* Discriminative Part Selection for Human Action Recognition
* Disseminating Multilayer Multimedia Content Over Challenged Networks
* Distributed Consolidation of Highly Incomplete Dynamic Point Clouds Based on Rank Minimization
* Dual-Graph Regularized Discriminative Multitask Tracker
* Dynamic Resource Allocation by Batch Optimization for Value-Added Video Services Over SDN
* Dynamic Texture Recognition Using Volume Local Binary Count Patterns With an Application to 2D Face Spoofing Detection
* Edge Computing Framework for Cooperative Video Processing in Multimedia IoT Systems
* Editorial IEEE Transactions on Multimedia Special Section on Video Analytics: Challenges, Algorithms, and Applications
* Editorial Introduction to the Special Issue on Multimedia Big Data for Extreme Events
* Efficient and Robust Image Coding and Transmission Based on Scrambled Block Compressive Sensing
* Efficient Architecture of In-Loop Filters for Multicore Scalable HEVC Hardware Decoders, An
* Efficient Audio Rendering Using Angular Region-Wise Source Enhancement for 360° Video
* EgoGesture: A New Dataset and Benchmark for Egocentric Hand Gesture Recognition
* Energy-Aware Mobile Edge Computing and Routing for Low-Latency Visual Data Processing
* Event-Based Perceptual Quality Assessment for HTTP-Based Video Streaming With Playback Interruption
* Expanding-Window BATS Code for Scalable Video Multicasting Over Erasure Networks
* Explicit Shape Regression With Characteristic Number for Facial Landmark Localization
* Exploiting Pseudo-Quadtree Structure for Accelerating HEVC Spatial Resolution Downscaling Transcoder
* Exploiting Video Quality Information With Lightweight Network Coordination for HTTP-Based Adaptive Video Streaming
* Exploiting Web Images for Video Highlight Detection With Triplet Deep Ranking
* Extracting Key Segments of Videos for Event Detection by Learning From Web Sources
* F-DES: Fast and Deep Event Summarization
* Fast Forgery Detection Algorithm Based on Exponential-Fourier Moments for Video Region Duplication, A
* Fast Uyghur Text Detector for Complex Background Images, A
* Fast-PADMA: Rapidly Adapting Facial Affect Model From Similar Individuals
* Feature Descriptor Based on Local Normalized Difference for Real-World Texture Classification, A
* Field-of-Experts Filters Guided Tensor Completion
* Foveation-Based Wireless Soft Image Delivery
* Free-Viewpoint Television System for Horizontal Virtual Navigation, A
* Full-Reference Objective Quality Assessment of Tone-Mapped Images
* Fully Convolutional Network for Multiscale Temporal Action Proposals
* Fusing Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks
* General Knowledge Embedded Image Representation Learning
* Generalized Semi-supervised and Structured Subspace Learning for Cross-Modal Retrieval
* Geo-Distinctive Visual Element Matching for Location Estimation of Images
* Geodesic Path-Based Diffusion Acceleration for Image Denoising
* GLA: Global-Local Attention for Image Description
* Grab, Pay, and Eat: Semantic Food Detection for Smart Restaurants
* Group-Sensitive Triplet Embedding for Vehicle Reidentification
* H.264 and H.265 Video Bandwidth Prediction
* HEVC Selective Encryption Using RC6 Block Cipher Technique
* Hierarchical Parsing Net: Semantic Scene Parsing From Global Scene to Objects
* High-Quality Soft Video Delivery With GMRF-Based Overhead Reduction
* Highly Accurate Image Reconstruction for Multimodal Noise Suppression Using Semisupervised Learning on Big Data
* Hole Filling With Multiple Reference Views in DIBR View Synthesis
* Holographic Data Coding: Benchmarking and Extending HEVC With Adapted Transforms
* Hybrid Digital-Analog Video Delivery With Shannon-Kotel'nikov Mapping
* Hybrid Intraprediction Based on Local and Nonlocal Correlations
* IF-MCA: Importance Factor-Based Multiple Correspondence Analysis for Multimedia Data Analytics
* Image Style Classification Based on Learnt Deep Correlation Features
* Impact Localization on Rigid Surfaces Using Hermitian Angle Distribution for Human-Computer Interface Applications
* Improved Image-Based Localization Using SFM and Modified Coordinate System Transfer
* Improving Existing Collaborative Filtering Recommendations via Serendipity-Based Algorithm
* Improving Multipath Video Transmission With Raptor Codes in Heterogeneous Wireless Networks
* Improving Video Saliency Detection via Localized Estimation and Spatiotemporal Refinement
* Information Bottleneck Approach to Optimize the Dictionary of Visual Data, An
* Intelligent Detail Enhancement for Exposure Fusion
* Interactive Image Segmentation Using Semi-transparent Wearable Glasses
* Interpreting Video Recommendation Mechanisms by Mining View Count Traces
* Iterative Feedback Control-Based Salient Object Segmentation
* Iterative Framework of Cascaded Deblocking and Superresolution for Compressed Images, An
* Joint Coding-Transmission Optimization for a Video Surveillance System With Multiple Cameras
* Joint Dynamic Rate Control and Transmission Scheduling for Scalable Video Multirate Multicast Over Wireless Networks
* Joint Intra and Multiple Description Coding for Packet Loss Resilient Video Transmission
* Joint Latent Dirichlet Allocation for Social Tags
* Joint Optimization of Radio and Virtual Machine Resources With Uncertain User Demands in Mobile Cloud Computing
* Joint Sponsor Scheduling in Cellular and Edge Caching Networks for Mobile Video Delivery
* JPEG Image Encryption with Improved Format Compatibility and File Size Preservation
* Key-Frame-Based Background Sprite Generation for Hole Filling in Depth Image-Based Rendering
* Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation
* Label Distribution-Based Facial Attractiveness Computation by Deep Residual Learning
* Large Margin Learning in Set-to-Set Similarity Comparison for Person Reidentification
* Learning Deep Spatio-Temporal Dependence for Semantic Video Segmentation
* Learning From Cross-Domain Media Streams for Event-of-Interest Discovery
* Learning From Hierarchical Spatiotemporal Descriptors for Micro-Expression Recognition
* Leveraging Structural Context Models and Ranking Score Fusion for Human Interaction Prediction
* Light Field Coding With Field-of-View Scalability and Exemplar-Based Interlayer Prediction
* Local Wavelet Acoustic Pattern: A Novel Time-Frequency Descriptor for Birdsong Recognition
* Lossless Compression of Color Filter Array Mosaic Images With Visualization via JPEG 2000
* Low-Rank Linear Embedding for Image Recognition
* Maya Codical Glyph Segmentation: A Crowdsourcing Approach
* Measuring Crowd Collectiveness by Macroscopic and Microscopic Motion Consistencies
* MixedEmotions: An Open-Source Toolbox for Multimodal Emotion Analysis
* Mobile Instant Video Clip Sharing With Screen Scrolling: Measurement and Enhancement
* Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification
* Multigranular Event Recognition of Personal Photo Albums
* Multilabel Image Classification With Regional Latent Semantic Dependencies
* Multimodal Framework for Analyzing the Affect of a Group of People
* Multimodal Recurrent Neural Networks With Information Transfer Layers for Indoor Scene Labeling
* Multiple Speaker Tracking in Spatial Audio via PHD Filtering and Depth-Audio Fusion
* Multiscale Deep Alternative Neural Network for Large-Scale Video Classification
* Multisensor Image Fusion and Enhancement in Spectral Total Variation Domain
* Multistage Object Detection With Group Recursive Learning
* Multistage Pooling for Blind Quality Prediction of Asymmetric Multiply-Distorted Stereoscopic Images
* Multiview Label Sharing for Visual Representations and Classifications
* Multiview Multimodal System for Monitoring Patient Sleep, A
* Multiview Video Transmission Over Underwater Acoustic Path
* Music Popularity: Metrics, Characteristics, and Audio-Based Prediction
* Naturalness Preserved Nonuniform Illumination Estimation for Image Enhancement Based on Retinex
* New Model-Based Method for Multi-View Human Body Tracking and Its Application to View Transfer in Image-Based Rendering, A
* No-Reference Image Quality Assessment Using Orthogonal Color Planes Patterns
* No-Reference Image Sharpness Assessment Based on Maximum Gradient and Variability of Gradients
* No-Reference View Synthesis Quality Prediction for 3-D Videos Based on Color-Depth Interactions
* Noncoverage Field Model for Improving the Rendering Quality of Virtual Views, A
* Nonnegative OPLS for Supervised Design of Filter Banks: Application to Image and Audio Feature Extraction
* Novel Digital Watermarking Based on General Non-Negative Matrix Factorization, A
* Novel No-Reference Metric for Estimating the Impact of Frame Freezing Artifacts on Perceptual Quality of Streamed Videos, A
* Object Detection and Tracking Under Occlusion for Object-Level RGB-D Video Segmentation
* On Influential Trends in Interactive Video Retrieval: Video Browser Showdown 2015-2017
* On the Minimization of Glass-to-Glass and Glass-to-Algorithm Delay in Video Communication
* Online Modeling of Esthetic Communities Using Deep Perception Graph Analytics
* Online Multimodal Multiexpert Learning for Social Event Tracking
* Optimal Transmission Estimation via Fog Density Perception for Efficient Single Image Defogging
* Optimal Transmission Topology Construction and Secure Linear Network Coding Design for Virtual-Source Multicast With Integral Link Rates
* Optimized Data Representation for Interactive Multiview Navigation
* Optimizing Multistage Discriminative Dictionaries for Blind Image Quality Assessment
* Optimizing Quality of Experience for Adaptive Bitrate Streaming via Viewer Interest Inference
* Parallax-Tolerant Image Stitching Based on Robust Elastic Warping
* Pedestrian Detection via Body Part Semantic and Contextual Information With DNN
* Perceptual Quality Maximization for Video Calls With Packet Losses by Optimizing FEC, Frame Rate, and Quantization
* Personalized Classifier for Food Image Recognition
* Photo Stylistic Brush: Robust Style Transfer via Superpixel-Based Bipartite Graph
* PQTable: Nonexhaustive Fast Search for Product-Quantized Codes Using Hash Tables
* Predicting Microblog Sentiments via Weakly Supervised Multimodal Deep Learning
* Predicting Visual Features From Text for Image and Video Caption Retrieval
* Prediction of the Leadership Style of an Emergent Leader Using Audio and Visual Nonverbal Features
* PROVID: Progressive and Multimodal Vehicle Reidentification for Large-Scale Urban Surveillance
* QoE-Driven Mobile Edge Caching Placement for Adaptive Video Streaming
* Quality Assessment of DIBR-Synthesized Images by Measuring Local Geometric Distortions and Global Sharpness
* Quality of Experience in a Stereoscopic Multiview Environment
* Quality-Guided Fusion-Based Co-Saliency Estimation for Image Co-Segmentation and Colocalization
* Quasi-Homography Warps in Image Stitching
* Query Adaptive Multiview Object Instance Search and Localization Using Sketches
* Query-Adaptive Image Retrieval by Deep-Weighted Hashing
* Query-Free Clothing Retrieval via Implicit Relevance Feedback
* Ranking-Preserving Low-Rank Factorization for Image Annotation With Missing Labels
* Real-Time Long-Term Tracking With Prediction-Detection-Correction
* Real-Time, Curvature-Sensitive Surface Simplification Using Depth Images
* Recognition of Emotions in User-Generated Videos With Kernelized Features
* Recurrent Spatial Pyramid CNN for Optical Flow Estimation
* Reduced-Reference Image Quality Assessment in Free-Energy Principle and Sparse Representation
* Region-Based Multiple Description Coding for Multiview Video Plus Depth Video
* Regularized Semi-non-negative Matrix Factorization for Hashing
* Reliable and Reversible Image Privacy Protection Based on False Colors, A
* Removing Haze Particles From Single Image via Exponential Inference With Support Vector Data Description
* RETRIEVAL: An Online Performance Evaluation Tool for Information Retrieval Methods
* Reversible Data Hiding in Encrypted Three-Dimensional Mesh Models
* Robust 3-D Human Detection in Complex Environments With a Depth Camera
* Robust 3D Action Recognition Through Sampling Local Appearances and Global Distributions
* Robust Coverless Image Steganography Based on DCT and LDA Topic Classification
* Robust Detection of Extreme Events Using Twitter: Worldwide Earthquake Monitoring
* Robust Multiview Synthesis for Wide-Baseline Camera Arrays
* Robust Sparse and Dense Nonrigid Structure From Motion
* Robust Tracking and Redetection: Collaboratively Modeling the Target and Its Context
* Robust Visual Tracking via Smooth Manifold Kernel Sparse Learning
* Saliency Detection in Face Videos: A Data-Driven Approach
* Scale-Aware Edge-Preserving Image Filtering via Iterative Global Optimization
* Scale-Aware Fast R-CNN for Pedestrian Detection
* Scale-Aware Fast R-CNN for Pedestrian Detection
* Scene Text Detection Using Superpixel-Based Stroke Feature Transform and Deep Learning Based Region Classification
* SeaShips: A Large-Scale Precisely Annotated Dataset for Ship Detection
* Seeds-Based Part Segmentation by Seeds Propagation and Region Convexity Decomposition
* Semi-Supervised Image Classification With Self-Paced Cross-Task Networks
* Server Allocation Problem for Session-Based Multiplayer Cloud Gaming, The
* Single Image Dehazing Using Ranking Convolutional Neural Network
* Snowflake Removal for Videos via Global and Local Low-Rank Decomposition
* SNR-Constrained Heuristics for Optimizing the Scaling Parameter of Robust Audio Watermarking
* Social-Aware Movie Recommendation via Multimodal Network Learning
* Spatio-Temporal Disocclusion Filling Using Novel Sprite Cells
* Spatio-Temporal Saliency Networks for Dynamic Saliency Prediction
* Spatiotemporal Saliency Estimation by Spectral Foreground Detection
* Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching
* Spherical Superpixel Segmentation
* SPIHT Algorithm With Adaptive Selection of Compression Ratio Depending on DWT Coefficients
* Spring-Electric Graph Model for Socialized Group Photography, A
* Statistical Study of View Preferences for Online Videos With Cross-Platform Information
* Step Count and Pulse Rate Detection Based on the Contactless Image Measurement Method
* Structure-Guided Image Inpainting Using Homography Transformation
* Summarization of User-Generated Sports Video by Using Deep Action Recognition Features
* Super Resolution by Comprehensively Exploiting Dependencies of Wavelet Coefficients
* Superpixel-Based Single Nighttime Image Haze Removal
* Supervised Distributed Hashing for Large-Scale Multimedia Retrieval
* SVD-Based Adaptive QIM Watermarking on Stereo Audio Signals
* Text2Video: An End-to-end Learning Framework for Expressing Text With Videos
* Thin-Feature-Aware Transport-Velocity Formulation for SPH-Based Liquid Animation
* Three-Dimensional Attention-Based Deep Ranking Model for Video Highlight Detection
* Toward Intelligent Product Retrieval for TV-to-Online (T2O) Application: A Transfer Metric Learning Approach
* Toward Rendering-Latency Reduction for Composable Web Services via Priority-Based Object Caching
* Towards Individual QoE for Multiparty Videoconferencing
* Traffic-Optimized Data Placement for Social Media
* Twitter100k: A Real-World Dataset for Weakly Supervised Cross-Media Retrieval
* Two-Stream 3-D convNet Fusion for Action Recognition in Videos With Arbitrary Size and Length
* Ultrasonic Communication Using Consumer Hardware
* Understanding Dynamic Cross-OSN Associations for Cold-Start Recommendation
* Unequal Error Protection for Scalable Video Storage in the Cloud
* Universal String Matching Approach to Screen Content Coding, A
* Unsupervised Discovery of Character Dictionaries in Animation Movies
* Unsupervised Salient Object Detection via Inferring from Imperfect Saliency Models
* Variational Fusion of Time-of-Flight and Stereo Data for Depth Estimation Using Edge-Selective Joint Filtering
* Visual Sentiment Prediction Based on Automatic Discovery of Affective Regions
* Vocabulary for Growth: Topic Modeling of Content Popularity Evolution, A
* Worst Case Driven Display Frame Compression for Energy-Efficient Ultra-HD Display Processing
* You Are What You Eat: Exploring Rich Recipe Information for Cross-Region Food Analysis
261 for MultMed(20)

MultMed(22) * 2-D Skeleton-Based Action Recognition via Two-Branch Stacked LSTM-RNNs
* 2D Pose-Based Real-Time Human Action Recognition With Occlusion-Handling
* 3D Room Layout Estimation From a Single RGB Image
* Accurate and Robust Video Saliency Detection via Self-Paced Diffusion
* Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration
* Adaptation-Oriented Feature Projection for One-Shot Action Recognition
* Adaptive Image Sampling Using Deep Learning and Its Application on X-Ray Fluorescence Image Reconstruction
* Adaptive Single Image Dehazing Using Joint Local-Global Illumination Adjustment
* Adversarial Attribute-Text Embedding for Person Search With Natural Language Query
* Affective Video Content Analysis With Adaptive Fusion Recurrent Network
* Asymmetric Joint GANs for Normalizing Face Illumination From a Single Image
* ATMFN: Adaptive-Threshold-Based Multi-Model Fusion Network for Compressed Face Hallucination
* Attentive Sequence to Sequence Translator for Localizing Video Clips by Natural Language, An
* Audio-Visual Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking
* Automated Colorization of a Grayscale Image With Seed Points Propagation
* Bidirectional Attention-Recognition Model for Fine-Grained Object Classification
* Blind Night-Time Image Quality Assessment: Subjective and Objective Approaches
* Blind Watermarking for 3-D Printed Objects by Locally Modifying Layer Thickness
* CGR-GAN: CG Facial Image Regeneration for Antiforensics Based on Generative Adversarial Network
* Character-Oriented Video Summarization With Visual and Textual Cues
* CI-GNN: Building a Category-Instance Graph for Zero-Shot Video Classification
* CKD: Cross-Task Knowledge Distillation for Text-to-Image Synthesis
* Co-Prediction-Based Compression Scheme for Correlated Images, A
* Coarse-to-Fine Localization of Temporal Action Proposals
* Collaborative Content Placement Among Wireless Edge Caching Stations With Time-to-Live Cache
* Compact Hash Code Learning With Binary Deep Neural Network
* Concentrated Local Part Discovery With Fine-Grained Part Representation for Person Re-Identification
* Content-Based Light Field Image Compression Method With Gaussian Process Regression
* Contextualized CNN for Scene-Aware Depth Estimation From Single RGB Image
* Convolutional Networks With Channel and STIPs Attention Model for Action Recognition in Videos
* Cuboid CNN Model with an Attention Mechanism for Skeleton-Based Action Recognition, A
* Cycle-IR: Deep Cyclic Image Retargeting
* Deep Co-Saliency Detection via Stacked Autoencoder-Enabled Fusion and Self-Trained CNNs
* Deep Dual-Channel Neural Network for Image-Based Smoke Detection
* Deep Fusion Feature Representation Learning With Hard Mining Center-Triplet Loss for Person Re-Identification
* Deep Gesture Video Generation With Learning on Regions of Interest
* Deep Manifold-to-Manifold Transforming Network for Skeleton-Based Action Recognition
* Deep Metric Learning With Density Adaptivity
* Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos
* Deep Multi-Scale Context Aware Feature Aggregation for Curved Scene Text Detection
* Deep Multimodality Learning for UAV Video Aesthetic Quality Assessment
* Deep Position-Sensitive Tracking
* Deep Reference Generation With Multi-Domain Hierarchical Constraints for Inter Prediction
* Deep Reinforcement Learning for Image Hashing
* Deep Top-k Ranking for Image-Sentence Matching
* Deep0Tag: Deep Multiple Instance Learning for Zero-Shot Image Tagging
* DeepFacade: A Deep Learning Approach to Facade Parsing With Symmetric Loss
* DeepQoE: A Multimodal Learning Framework for Video Quality of Experience (QoE) Prediction
* Design of Compressed Sensing System With Probability-Based Prior Information
* Detecting Social Signals in User-Shared Images for Connection Discovery Using Deep Learning
* Dilated Inception Network for Visual Saliency Prediction, A
* Disentangled Spectrum Variations Networks for NIR-VIS Face Recognition
* Distance-Driven Alliance for a P2P Live Video System, A
* Distinct Feature Extraction for Video-Based Gait Phase Classification
* Dual Convolutional LSTM Network for Referring Image Segmentation
* Dynamic Objectives Learning for Facial Expression Recognition
* Dynamic Spectrum Access for Multimedia Transmission Over Multi-User, Multi-Channel Cognitive Radio Networks
* EEG-Based Study on Perception of Video Distortion Under Various Content Motion Conditions, An
* Efficient and Secure Image Communication System Based on Compressed Sensing for IoT Monitoring Applications
* Efficient Mobile Video Streaming via Context-Aware RaptorQ-Based Unequal Error Protection
* Efficient NVoD Scheme Using Implicit Error Correction and Subchannels for Wireless Networks, An
* Efficient Supervised Discrete Multi-View Hashing for Large-Scale Multimedia Search
* Energy Compaction-Based Image Compression Using Convolutional AutoEncoder
* Enhanced Intra Prediction for Video Coding by Using Multiple Neural Networks
* Enhancing the Quality of Image Tagging Using a Visio-Textual Knowledge Base
* Ensemble Tracking Based on Diverse Collaborative Framework With Multi-Cue Dynamic Fusion
* Equalized Margin Loss for Face Recognition, An
* Exploiting Vulnerabilities of Deep Neural Networks for Privacy Protection
* Exploring Discriminative Representations for Image Emotion Recognition With CNNs
* Exploring Global and Local Linguistic Representations for Text-to-Image Synthesis
* Fast Depth and Inter Mode Prediction for Quality Scalable High Efficiency Video Coding
* Fast FoV-Switching DASH System Based on Tiling Mechanism for Practical Omnidirectional Video Services, A
* Fast User-Guided Single Image Reflection Removal via Edge-Aware Cascaded Networks
* Feature Matching With Intra-Group Sparse Model
* Feature-Flow Interpretation of Deep Convolutional Neural Networks
* FFTMI: Features Fusion for Natural Tone-Mapped Images Quality Evaluation
* Fine-Grained Classification of Internet Video Traffic From QoS Perspective Using Fractal Spectrum
* Flexible Deep CNN Framework for Image Restoration, A
* Flexibly Connectable Light Field System For Free View Exploration
* Flickr Image Community Analytics by Deep Noise-Refined Matrix Factorization
* Food Recommendation: Framework, Existing Solutions, and Challenges
* Frame Augmented Alternating Attention Network for Video Question Answering
* Fuzzy Least Squares Support Vector Machine With Adaptive Membership for Object Tracking
* GAIM: Graph Attention Interaction Model for Collective Activity Recognition
* Generative Adversarial Network-Based Intra Prediction for Video Coding
* Generative Model Driven Representation Learning in a Hybrid Framework for Environmental Audio Scene and Sound Event Recognition
* GENPass: A Multi-Source Deep Learning Model for Password Guessing
* Gestures In-The-Wild: Detecting Conversational Hand Gestures in Crowded Scenes Using a Multimodal Fusion of Bags of Video Trajectories and Body Worn Acceleration
* GLNet: Global Local Network for Weakly Supervised Action Localization
* Guide to Match: Multi-Layer Feature Matching With a Hybrid Gaussian Mixture Model
* Hierarchical Attention Network for Visually-Aware Food Recommendation
* Hierarchical Coding of Convolutional Features for Scene Recognition
* Hierarchical Context Features Embedding for Object Detection
* Hierarchical Prototype Learning for Zero-Shot Recognition
* How Do We Experience Crossmodal Correspondent Mulsemedia Content?
* Illumination-Adaptive Person Re-Identification
* Image Compression Based on Compressive Sensing: End-to-End Comparison With JPEG
* Image Retargetability
* Image Vectorization With Real-Time Thin-Plate Spline
* Importance of Context When Recommending TV Content: Dataset and Algorithms, The
* Improved Deep Hashing With Soft Pairwise Similarity for Multi-Label Image Retrieval
* Improved Reversible Data Hiding in Encrypted Images Using Parametric Binary Tree Labeling, An
* Incentive Mechanism for Cooperative Scalable Video Coding (SVC) Multicast Based on Contract Theory
* Interact as You Intend: Intention-Driven Human-Object Interaction Detection
* Interpretable Fast Multi-Scale Deep Decoder for the Standard HEVC Bitstreams, The
* Intra Coding Strategy for Video Error Resiliency: Behavioral Analysis
* Iterative Deep Neural Network Quantization With Lipschitz Constraint
* iWave: CNN-Based Wavelet-Like Transform for Image Compression
* Joint Deep Learning of Facial Expression Synthesis and Recognition
* Joint Learning in the Spatio-Temporal and Frequency Domains for Skeleton-Based Action Recognition
* Jointly Learning Kernel Representation Tensor and Affinity Matrix for Multi-View Clustering
* Jointly Sparse Locality Regression for Image Feature Extraction
* Kernel-Based Mixture Mapping for Image and Text Association
* Kernelized Fuzzy Modal Variation for Local Change Detection From Video Scenes
* Knowledge-Augmented Multimodal Deep Regression Bayesian Networks for Emotion Video Tagging
* Knowledge-Based Topic Model for Multi-Modal Social Event Analysis
* Latency-Aware Adaptive Video Summarization for Mobile Edge Clouds
* Learning Discriminative and Generative Shape Embeddings for Three-Dimensional Shape Retrieval
* Learning How to Smile: Expression Video Generation With Conditional Adversarial Recurrent Nets
* Learning Local Quality-Aware Structures of Salient Regions for Stereoscopic Images via Deep Neural Networks
* Learning Non-Locally Regularized Compressed Sensing Network With Half-Quadratic Splitting
* Learning Normal Patterns via Adversarial Attention-Based Autoencoder for Abnormal Event Detection in Videos
* Learning Reliable Visual Saliency For Model Explanations
* Learning Scene Attribute for Scene Recognition
* Learning the Traditional Art of Chinese Calligraphy via Three-Dimensional Reconstruction and Assessment
* Learning-Based User Clustering and Link Allocation for Content Recommendation Based on D2D Multicast Communications
* Leveraging Virtual and Real Person for Unsupervised Person Re-Identification
* Light Field Super-Resolution Using Edge-Preserved Graph-Based Regularization
* Locally Confined Modality Fusion Network With a Global Perspective for Multimodal Human Affective Computing
* Loopy Residual Hashing: Filling the Quantization Gap for Image Retrieval
* Low-Light Image Enhancement With Semi-Decoupled Decomposition
* Low-Rank Regularized Multi-Representation Learning for Fashion Compatibility Prediction
* Massive-Scale Genre Communities Learning Using a Noise-Tolerant Deep Architecture
* MLC STT-MRAM-Aware Memory Subsystem for Smart Image Applications
* Mobile Streaming of Live 360-Degree Videos
* Moving Cast Shadows Segmentation Using Illumination Invariant Feature
* MRFN: Multi-Receptive-Field Network for Fast and Accurate Single Image Super-Resolution
* MSTGAR: Multioperator-Based Stereoscopic Thumbnail Generation With Arbitrary Resolution
* Multi-Attribute Blind Quality Evaluator for Tone-Mapped Images, A
* Multi-Direction Dictionary Learning Based Depth Map Super-Resolution With Autoregressive Modeling
* Multi-Focus Image Fusion by Hessian Matrix Based Decomposition
* Multi-Level Correlation Adversarial Hashing for Cross-Modal Retrieval
* Multi-Level Policy and Reward-Based Deep Reinforcement Learning Framework for Image Captioning
* Multi-Party WebRTC Services Using Delay and Bandwidth Aware SDN-Assisted IP Multicasting of Scalable Video Over 5G Networks
* Multi-Pathway Generative Adversarial Hashing for Unsupervised Cross-Modal Retrieval
* Multi-Scale Based Context-Aware Net for Action Detection
* Multi-Task Learning for Acoustic Event Detection Using Event and Frame Position Information
* Multi-Task Neural Approach for Emotion Attribution, Classification, and Summarization, A
* Multi-View Saliency Guided Deep Neural Network for 3-D Object Retrieval and Classification
* Multimedia Intelligence: When Multimedia Meets Artificial Intelligence
* Multiscale Superpixel-Based Hyperspectral Image Classification Using Recurrent Neural Networks With Stacked Autoencoders
* Neighborhood Pyramid Preserving Hashing
* Neighbourhood Structure Preserving Cross-Modal Embedding for Video Hyperlinking
* New Method and Benchmark for Detecting Co-Saliency Within a Single Image, A
* No-Reference Quality Evaluation of Stereoscopic Video Based on Spatio-Temporal Texture
* Novel Convolutional Neural Network for Image Steganalysis With Shared Normalization, A
* Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval
* Online Robust Principal Component Analysis With Change Point Detection
* Optimizing Fixation Prediction Using Recurrent Neural Networks for 360° Video Streaming in Head-Mounted Virtual Reality
* Oriented Spatial Transformer Network for Pedestrian Detection Using Fish-Eye Camera
* Part-Aware Fine-Grained Object Categorization Using Weakly Supervised Part Detection Network
* Partition-Aware Adaptive Switching Neural Networks for Post-Processing in HEVC
* Patch-Based Image Hallucination for Super Resolution With Detail Reconstruction From Similar Sample Images
* Pay Attention to the Activations: A Modular Attention Mechanism for Fine-Grained Image Recognition
* PDR-Net: Perception-Inspired Single Image Dehazing Network With Refinement
* PixelRL: Fully Convolutional Network With Reinforcement Learning for Image Processing
* PointHop: An Explainable Machine Learning Method for Point Cloud Classification
* Prediction of Saliency Map for Head and Eye Movements in 360 Degree Images, The
* Pruning 3D Filters For Accelerating 3D ConvNets
* PTB-TIR: A Thermal Infrared Pedestrian Tracking Benchmark
* QoE Analysis of Dense Multiview Video With Head-Mounted Devices
* Radiance-Reflectance Combined Optimization and Structure-Guided L_0-Norm for Single Image Dehazing
* Rate Constrained Multiple-QP Optimization for HEVC
* Rate-Distortion Optimal Joint Texture and Depth Map Coding for 3-D Video Streaming
* Realistic Facial Expression Reconstruction for VR HMD Users
* Reasoning on the Relation: Enhancing Visual Representation for Visual Question Answering and Cross-Modal Retrieval
* Reassembling Shredded Document Stripes Using Word-Path Metric and Greedy Composition Optimal Matching Solver
* Recall What You See Continually Using GridLSTM in Image Captioning
* Reduced Reference Stereoscopic Image Quality Assessment Using Sparse Representation and Natural Scene Statistics
* Referring Image Segmentation by Generative Adversarial Learning
* Refined TV-L1 Optical Flow Estimation Using Joint Filtering
* Relation Attention for Temporal Action Localization
* Representing Modifiable and Reusable Musical Content on the Web With Constrained Multi-Hierarchical Structures
* Reversible Data Hiding in Encrypted Images Based on Multi-MSB Prediction and Huffman Coding
* RGB-T Image Saliency Detection via Collaborative Graph Learning
* Rich Features Embedding for Cross-Modal Retrieval: A Simple Baseline
* Robust QoE-Driven DASH Over OFDMA Networks
* Robust Visual Tracking via Constrained Multi-Kernel Correlation Filters
* Role of the Input in Natural Language Video Description, The
* Saliency Detection via a Multiple Self-Weighted Graph-Based Manifold Ranking
* Salient Object Detection via Multiple Instance Joint Re-Learning
* Screen Content Compression Based on Enhanced Soft Context Formation
* SDN-Based Caching Decision Policy for Video Caching in Information-Centric Networking, An
* Semantic Segmentation Guided Pixel Fusion for Image Retargeting
* Semi-Supervised Cross-Modal Retrieval With Label Prediction
* Sensor-Augmented Neural Adaptive Bitrate Video Streaming on UAVs
* Sentiment Recognition for Short Annotated GIFs Using Visual-Textual Fusion
* Show, Tell, and Polish: Ruminant Decoding for Image Captioning
* Similarity-Aware and Variational Deep Adversarial Learning for Robust Facial Age Estimation
* Single-Image Super-Resolution Method Based on Progressive-Iterative Approximation, A
* Sketch-Based Shape Retrieval via Best View Selection and a Cross-Domain Similarity Measure
* Snapshot High Dynamic Range Imaging via Sparse Representations and Feature Learning
* Spatio-Temporal Attention Networks for Action Recognition and Detection
* Spatio-Temporal VLAD Encoding of Visual Events Using Temporal Ordering of the Mid-Level Deep Semantics
* Spatiotemporal Recurrent Convolutional Networks for Recognizing Spontaneous Micro-Expressions
* Spatiotemporal Recurrent Convolutional Networks for Recognizing Spontaneous Micro-Expressions
* STAT: Spatial-Temporal Attention Mechanism for Video Captioning
* STAT: Spatial-Temporal Attention Mechanism for Video Captioning
* Statistical Learning Based Congestion Control for Real-Time Video Communication
* Steered Mixture-of-Experts for Light Field Images and Video: Representation and Coding
* Steganographic Security Analysis From Side Channel Steganalysis and Its Complementary Attacks
* Stereoscopic Image Stitching via Disparity-Constrained Warping and Blending
* STNReID: Deep Convolutional Networks With Pairwise Spatial Transformer Networks for Partial Person Re-Identification
* Strong Baseline and Batch Normalization Neck for Deep Person Re-Identification, A
* Study on 2D Feature-Based Hash Learning
* Style-Controlled Synthesis of Clothing Segments for Fashion Image Manipulation
* Tamper-Proofing Video With Hierarchical Attention Autoencoder Hashing on Blockchain
* Tile-Based Joint Caching and Delivery of 360° Videos in Heterogeneous Networks
* Toward Making Unsupervised Graph Hashing Discriminative
* Towards Efficient Front-End Visual Sensing for Digital Retina: A Model-Centric Paradigm
* Towards Improving Robustness of Deep Neural Networks to Adversarial Perturbations
* Training Objective Image and Video Quality Estimators Using Multiple Databases
* Two-Stage Triplet Network Training Framework for Image Retrieval, A
* Ultra-Low Complexity and High Efficiency Approach for Lossless Alpha Channel Coding, An
* Uni-and-Bi-Directional Video Prediction via Learning Object-Centric Transformation
* Unified Deep Metric Representation for Mesh Saliency Detection and Non-Rigid Shape Matching, A
* Unmanned Aircraft System Aided Adaptive Video Streaming: A Joint Optimization Approach
* Unsupervised Real-Time Framework of Human Pose Tracking From Range Image Sequences, An
* Unsupervised Variational Video Hashing With 1D-CNN-LSTM Networks
* Unsupervised Video Summarization With Cycle-Consistent Adversarial LSTM Networks
* Using Blockchain for Improved Video Integrity Verification
* Using Cell Phone Pictures of Sheet Music To Retrieve MIDI Passages
* Vabis: Video Adaptation Bitrate System for Time-Critical Live Streaming
* Variational Single Image Dehazing for Enhanced Visualization
* Vibrotactile Quality Assessment: Hybrid Metric Design Based on SNR and SSIM
* Video Anomaly Detection and Localization Based on an Adaptive Intra-Frame Classification Network
* Video Storytelling: Textual Summaries for Events
* VINet: A Visually Interpretable Image Diagnosis Network
* Visual Font Pairing
* Visual Relationship Embedding Network for Image Paragraph Generation
* Visual-Texual Emotion Analysis With Deep Coupled Video and Danmu Neural Networks
* WeGAN: Deep Image Hashing With Weighted Generative Adversarial Networks
* Weighted and Class-Specific Maximum Mean Discrepancy for Unsupervised Domain Adaptation
* What Image Features Boost Housing Market Predictions?
* WiderPerson: A Diverse Dataset for Dense Pedestrian Detection in the Wild
* WSCNet: Weakly Supervised Coupled Networks for Visual Sentiment Classification and Detection
246 for MultMed(22)

MultMed(28) * 3D Semantic Gaussian via Geometric-Semantic Hypergraph Computation
* 3D-SceneQ: Empowering 3D LLM With Query-Guided Adaptive Pruning and Multi-Modal Feature Enhancement
* 3D-VMSS: Distributed Trust and Visually Meaningful Secret Sharing for 3D Mesh Models
* 3DResT: A Strong Baseline for Semi-Supervised 3D Referring Expression Segmentation
* Accelerating Adaptive Diffusion and Uncertainty Modeling for Underwater Image Enhancement
* Adaptive in Adapter: Boosting Open-Vocabulary Semantic Segmentation With Adaptive Dropout Adapter
* Adaptive Multi-Modal Visual Tracking With Dynamic Semantic Prompts
* Adaptive Multimodal Semantic Balancing Framework for Sentiment Analysis
* Adaptive Use of Convex or Non-Convex Optimization in Deep Unfolding Network for Image Compressive Sensing
* Adaptive Visual Prompting for Effective Satellite Video Tracking
* Adaptively Clustering Neighbor Elements for Image-Text Generation
* Admitting Ignorance Helps the Video Question Answering Models to Answer
* Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation
* Adversarial Pruning Networks for Compact 3D Gaussian Splatting
* AesPrompt: Zero-Shot Image Aesthetics Assessment With Multi-Granularity Aesthetic Prompt Learning
* Ambiguity-Aware Point Cloud Segmentation by Adaptive Margin Contrastive Learning
* AMFOR: Adaptive Multi-Granularity Fusion and Occlusion Reconstruction for Person Re-Identification
* Anisotropic Optical Flow Guided Adaptive Multi-Stage Video Inpainting
* Arbitrary-Scale Fusion Operator for High-Resolution Hyperspectral Imaging
* ASK-HOI: Affordance-Scene Knowledge Prompting for Human-Object Interaction Detection
* ASR-Enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval
* AttenCraft: Attention-Based Disentanglement of Multiple Concepts for Text-to-Image Customization
* Attribute-Centric Cross-Modal Alignment for Weakly Supervised Text-Based Person Re-ID
* Auto-DBPA: Density-Aware Ball-Pivoting Algorithm With Adaptive Radius Using Contextual Bandits for Object and Scene Reconstruction
* AV2TS: A Multivariate Time Series Modeling Framework for Audio-Visual Segmentation
* Balancing Optimization Strategies and Practical Goals: An Efficient Scene Text Detector
* BCNet: Butterfly-Shaped Convolutions Network for Lightweight Edge Detection
* Beneficial Noise Learning for Robust Multimodal Fusion
* Beyond Mere Tuning: Harnessing the Full Potential of Prompts for Text-Video Retrieval
* Bimodal Commuting Alignment Network for Zero-Shot Recognition
* Boosting Universal Adversarial Attack on Deep Neural Networks
* Bootstrap Deep Spectral Clustering With Optimal Transport
* Breaking the Curse of Knowledge: Towards Effective Multimodal Recommendation Using Knowledge Soft Integration
* Bridging Component Learning With Degradation Modelling for Blind Image Super-Resolution
* C-CTX: Cubic-Checkerboard Context Entropy Model for Learned Image Compression
* CaASR: A Causal Lens for Refining Temporal Action Segmentation
* CAD-Mesher: A Convenient, Accurate, Dense Mesh-Based Mapping Module in SLAM for Dynamic Environments
* CapHDR2IR: Caption-Driven Transfer From Visible Light to Infrared Domain
* Cas-OVD: Cascaded Open-Vocabulary Detection of Small Objects Using Multi-Refined Region Proposal Network in Autonomous Driving
* Causality-Inspired Graph Neural Networks for Cross-Modal Retrieval
* CCDM: Continuous Conditional Diffusion Models for Image Generation
* Class-Aware Diversified Augmentation for Open-Set Single Domain Generalization
* Classifier Enhancement Using Extended Context and Domain Experts for Semantic Segmentation
* ClickEnhance: Efficient 3D Interactive Segmentation With Click-Specific Encoder and Contrastive Learning
* CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution
* Closely-Coupled Reconstruction and Generation for Blind Face Restoration
* CMANet: Context-Aware Mutual Attention Network for Referring Image Segmentation
* CMI-Net: Cross-View Message Token Interaction Network for 3D Shape Recognition
* CNN-Based 360° Scene Recognition for Automatic Generation of Omnidirectional Scent Effects
* CoLeQ: Improving Data-Free Quantization via Contrastive Learning
* Color Correction Meets Cross-Spectral Refinement: A Distribution-Aware Diffusion for Underwater Image Restoration
* ColView: Consistent Text-Guided Grayscale Scene Colorization From Multi-View Images
* Compositional Text-to-Image Synthesis With Training-Free Layout-Guided Diffusion
* CompoVis: Is Cross-Modal Semantic Alignment of CLIP Optimal? A Visual Analysis Attempt
* Compressed Video Quality Assessment With Fine-Grained Artifact Perception and Evaluation
* Compression Framework for Light 3D Scene Graph Generation via Pruning-as-Search and Distillation
* Constructing Enhanced Mutual Information for Online Class-Incremental Learning
* Context Modeling With Multimodal Prompts for Emotion Recognition in Conversation
* Continual Conceptual Entity Learning for Text-to-Image Generative Models
* Contrastive Diversity Augmentation for Single Domain Generalization
* COP: CrOss-View Attention Prompt for Zero-Shot Sketch-Based Image Retrieval
* Copycat vs. Original: Multi-Modal Pretraining and Variable Importance in Box-Office Prediction
* Correspondence Calibrating and Dynamic Consistency Learning for Noisy Cross-Modal Retrieval
* Counterfactual Co-Occurring Learning for Bias Mitigation in Weakly-Supervised Object Localization
* Coverless Image Steganography Technique Based on Multi-Object Mapping Rules, A
* Cross-Modal Explicit Invariance Coordinated Dual Constraint Reconstruction for Heterogeneous Incomplete Multimodal Sentiment Analysis
* Cross-Modal Fusion With Mixture-of-Experts for Efficient RGB-D Salient Object Detection
* Cross-Modal Spherical Aggregation for Weakly Supervised Remote Sensing Shadow Removal
* Cross-View and Multi-Step Interaction for Change Captioning
* CrossHypergraph: Consistent High-Order Semantic Network for Few-Shot Image Classification
* CSP: Channel and Space Pruning for Compressing Deep Convolutional Neural Networks
* D3BSR: Blind Super-Resolution via Diffusion-Based Disentangled Degradation Representation
* Deep Distance Weighted Sampling Hashing for Cross-Modal Retrieval
* Deep Multi-View Clustering With Intra-View Similarity and Cross-View Correlation Learning
* Deep Neighbor Discriminant Binary Embedding for Multi-Label Image Retrieval
* Deep Reinforcement Learning for Lunar Polar Low-Light Enhancement
* Deep Semantic Tuplet-Based Hashing by Hypergraph Modeling for Cross-Modal Retrieval
* Deep Video Coding With Bit-Depth Scalability
* DEEP: Decoupled Semantic Prompt Learning, Guiding and Embedding for Multi-Spectral Object Re-Identification
* DeepFake Detection With Multi-View Fusion and Graph Convolutional Network
* DesCLIP: Robust Continual Learning via General Attribute Descriptions for VLM-Based Visual Recognition
* DESSM: Dual Encoder-Based State Space Model for Image Inpainting
* Det-Agent: Open-Vocabulary Object Localization and Detection With Reinforcement Learning Agent
* DHSNet: Denoised-Modulated Hybrid-Semantic Scale-Aware Network for Low-Light Image Enhancement
* DiffusionTrend: A Minimalist Approach to Virtual Fashion Try-On
* DiffW: Multi-Encoder Based on Conditional Diffusion Model for Robust Image Watermarking
* DINVMark: A Deep Invertible Network for Video Watermarking
* Disreo: Provably Secure No-Box-Extraction Linguistic Steganography Based on Distribution Reorganization
* Distortion-Sensitive Masked Autoencoder for Omnidirectional Video Quality Assessment
* Distributed Cloud Storage and Medical Image Encryption Algorithm Application: Medical Image Sharing System, A
* Distributed Deep Point Cloud Feature Compression for Vehicle-to-Vehicle Cooperative Perception
* DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks
* Domain Generalization for Face Anti-Spoofing via Content-Aware Composite Prompt Engineering
* Double-Chain Graph Convolution Transformer for 3D Human Pose Estimation
* DPAKS: Reliable DETR Guided by Prior Auxiliary Knowledge for Small Object Detection
* DreamJourney: Perpetual View Generation With Video Diffusion Models
* DT-JRD: Deep Transformer-Based Just Recognizable Difference Prediction Model for Video Coding for Machines
* DTSNet: Dynamic Transformer Slimming for Efficient Vision Recognition
* Dual Feature Fusion for Incomplete Multi-View Multi-Label Learning
* Dual Visual Prompting With Context-Modulated Diffusion Prompts
* Dual-Domain Adaptation Networks for Realistic Image Super-Resolution
* Dual-Domain Modulation Network for Lightweight Image Super-Resolution
* DV-Net: Detecting and Distinguishing Copy-Move Regions From Dual Views
* DVD: A Debiased Visual Dialog Model via Disentangling Knowledge Features
* Dynamic Query Management and Internal Consistency Representation Based Transformer for Online Vectorized HD Map Construction
* EdinoGait: Transferring Large Visual Models to Event-Based Vision for Enhancing Gait Recognition
* EEformer: Early Exiting for Transformer With Global-Local Exits and Progressive Fine-Tuning
* Efficient Approximation of Earth Mover's Distance Based on Nearest Neighbor Search
* Efficient Arbitrary-Scale Image Super-Resolution via Functional Tensor Decomposition
* Efficient Oriented Object Detection via Wavelet-Based Energy Label Reassignment and Dual Prediction Strategy
* Efficient Single Image Dehazing Based on Gradient Line Prior
* Efficient VVC Intra Partitioning With Shared Feature Extraction and Complexity-Aware Threshold Decision
* EFIN: A Novel Enhanced Feature Interaction Network for Temporal Sentence Grounding in Videos
* Elliptic Curve Integrated Encryption Based 3D Mesh Model Privacy Preservation Scheme via Geometric Projection
* EmoSpeaker: One-Shot Fine-Grained Emotion-Controlled Talking Face Generation
* Enabling Real-World Supervised Video Anomaly Detection: New Open-Set Benchmark and New Framework
* Energy-Driven Explicit Alignment Network: A Blended-Target Domain Adaptation Approach
* Enhance Panoramic Object Detection Using Planar Image Datasets
* Enhanced Audio-Visual Speech Synthesis via Multi-Discriminative Learning
* Enhancing Cross-Domain Correspondence for Unsupervised Image-to-Image Translation
* ErasableMask: A Robust and Erasable Privacy Protection Scheme Against Black-Box Face Recognition Models
* ExpDiff: Generating High-Fidelity 3D Facial Expression Meshes and BRDF Textures via Diffusion Model
* Exploiting Multimodal Knowledge Graph for Multimodal Machine Translation
* Exploiting Regional Information Transformer for Single Image Deraining
* Exploiting Temporal Audio-Visual Correlation Embedding for Audio-Driven One-Shot Talking Head Animation
* Exploring Cross-Modal Mutual Prompt Learning for Video Quality Assessment
* F&S-Net: A Dual Mission (Fusion and Super-Resolution) Framework Under Various Input Resolution
* F2M: Improving Skin Disease Recognition by Fusing Multi-Source and Multi-Scale Image Features
* Fairness-Aware Multicategory 360° Video Streaming in Cloud-Edge Collaboration Networks
* Fast and Effective Overwrite Attack Against DNN-Based Image Watermarking Models
* Feature Dispersion Adaptation With Pre-Pooling Prototype for Continual Image Classification
* FGDepth: Fine-Grained Boundary Perception Enhancement in Self-Supervised Indoor Depth Estimation
* FluencyVE: Marrying Temporal-Aware Mamba with Bypass Attention for Video Editing
* FoodDiff: A Collaborative Relationship Perception Framework for Food Image Synthesis Using Diffusion Models
* Forgery-Aware and Edge-Guided Diffusion Model for General and Robust Image Forgery Localization
* From Sight to Insight: Enhancing Confusable Structure Segmentation via Vision-Language Mutual Prompting
* From Sight to Insight: Unleashing Eye-Tracking in Weakly Supervised Video Salient Object Detection
* Fusion of Infrared and Visible Images Based on Iterative Dual-Branch Attention and Modality Discrepancy Guidance
* Fusion-Driven Task Mutual-Guidance Network for Few-Shot Hyperspectral Image Classification
* Fusion-Enhanced Network for Infrared and Visible High-Level Vision Tasks, A
* Gain From Give Up: Intuitive Data Augmentation Framework for Image Retrieval
* Generalizable and Adaptive Continual Learning Framework for AI-Generated Image Detection
* Generalized Trusted Multi-View Classification Framework With Hierarchical Opinion Aggregation
* Generalizing Stylized Motion Generation Method by Introducing Metadata-Independent Learning and Unified Multiple Motion Dataset
* Generic-to-Personalised Learning for Multimodal Image Synthesis With Bidirectional Variational GAN
* GeoEdgeFormer: 3D Point Cloud Saliency Detection via Edge-Enhanced Graph-Transformer Network
* Geometry-Aware 3D Gaussian Representation for Real-Time Rendering of Large-Scale Scenes
* GeoTree: A Dynamic Tree-Based Geometry Problem Solver Through LLM-Symbolic Reasoning
* Graph-Free Multiview Clustering with Anchors
* HAAP: Vision-Context Hierarchical Attention Autoregressive With Adaptive Permutation for Scene Text Recognition
* HCFMaNet: A Novel Holistic Cross-Modal Fusion Mamba Network for Multi-Modal Medical Image Fusion
* HDMDN: Hierarchical-Decoupling Based Meta-Knowledge Single Image Dehazing Network
* Heterogeneous Multimodal Federated Learning With Missing Modality via Mask-Restoration and Self-Guidance
* Hierarchical Concept Bottleneck with Compensation Concept Learning
* Hierarchical Scene Graph Generation With Coarse-to-Fine Reasoning
* Hierarchical Semantic-Visual Fusion of Visible and Near-Infrared Images for Long-Range Haze Removal
* High-Capacity Generative Image Steganography Approach for Hiding Multiple Secret Images
* High-Capacity Reversible Data Hiding for JPEG Images Using Ternary Matrix Embedding
* High-Frequency Prioritized Sparse Attention Network for Image Restoration
* HitBack: Transformer With Hierarchical-Semantic Cross Attention and Background Contrast for Weakly Supervised Wildlife Semantic Segmentation
* HMS^2Net: Heterogeneous Multimodal State Space Network via CLIP for Dynamic Scene Classification in Livestreaming
* How Vision-Language Tasks Benefit From Large Pre-Trained Models: A Survey
* HP-C4D: A Fast Camera and 4D Radar Fusion Framework With Height Prediction for 3D Object Detection
* Hybrid Debiasing Transformer With Adaptive Regularization for Video Moment Localization
* IA2GNN: Imbalance-Aware Adaptive Graph Construction for Multi-Modal Image Fusion
* ICDSR: Integrated Conditional Diffusion Model for Single Image Super-Resolution
* ICPL-ReID: Identity-Conditional Prompt Learning for Multi-Spectral Object Re-Identification
* Identity Clue Refinement and Enhancement for Visible-Infrared Person Re-Identification
* Illumination-Guided Grouped Attention and Masked Progressive Denoising for Low-Light Image Enhancement
* Image Coding for Machines via Feature-Preserving Rate-Distortion Optimization
* Image Enhancement Based on Pigment Representation
* Image Singularity Scattering Representation Learning Classification
* Image Super-Resolution Using Hierarchical Cross-Scale Self-Similarity
* IMMPOI: An Interest-Aware Multimodal Adaptive Fusion Framework for POI Recommendations
* Imperceptible Protection Against Style Imitation From Diffusion Models
* Independent Block-Wise Attribution for Vision Transformer Interpretability Through Semantic Relevance
* Inference-Time Rule Eraser: Fair Recognition via Distilling and Removing Biased Rules
* Information Disclosure Risk of Thumbnail-Preserving Encryption
* Infrared UAV Target Tracking With Dynamic Feature Refinement and Global Contextual Attention Knowledge Distillation
* Integrating Object Interaction Self-Attention and GAN-Based Debiasing for Visual Question Answering
* Interpretable Multi-View Feature Representation via Physical Partial Differential Equation
* Interpretable Multi-View Representation Learning Towards Complex Scenes: From Homogeneity to Heterogeneity
* Intra-Sample and Intra-Modal Enhancement for Multimodal Sentiment Analysis With Missing Modalities
* Investigate Interactive Semantic Segmentation via an Uncertainty Mining View
* Invisible Backdoor Attack With Siamese Tuning on Pre-Trained Vision-Language Models
* ISDNet: High-Fidelity Single-View Reconstruction of Indoor Scenes via Instance Separation and Deformation
* Isharah: A Large-Scale Multi-Scene Dataset for Continuous Sign Language Recognition
* Joint Attribute Graph Reasoning and Aggregation for Composed Image Retrieval
* Joint Information Interaction and Semantic Fusion for Multi-View Unsupervised Feature Selection
* Joint JPEG Compression and Encryption With DC Groups' Random Cross-Permutation and ZRVs' Inter-Block Permutation
* KANM^2L: Enhancing Multi-Modal Recommendation With KAN and Dilated Attention
* Knowledge-Enhanced Dynamic Scene Graph Attention Network for Fake News Video Detection
* Knowledge-Enhanced Graph Contrastive Learning for Recommendations
* L-CLIPScore: A Lightweight Embedding-Based Captioning Metric for Evaluating and Training
* Language-Guided Multimodal Spiking Neural Networks for Event-Based Action Recognition
* Learned Image Compression via Local-to-Global Cross-Component Prior
* Learning Compact Representations With an Information Bottleneck for Camouflaged Object Detection
* Learning Dual Modality Interactions for Event-Based Motion Deblurring
* Learning Multi-View Anomaly Detection With Efficient Adaptive Selection
* Learning to Prompt With Refining Text Knowledge for Zero-Shot Video Action Recognition
* LETTER: Self-Harmonized Representation Learning for Multimodal Recommendation
* Leveraging Static-Dynamic Scene Parsing to Enhance Progressive Symbolic Reasoning for Video Question Answering
* LEViT: Locally Enhanced Vision Transformer for Efficient Object Re-Identification
* Light CNN-Transformer Dual-Branch Network for Real-Time Semantic Segmentation
* LightingGen: A DMX Based Generation Method for Entertainment Stage Lighting
* LLMI3D: MLLM-Based 3D Perception From a Single 2D Image
* Long Video Understanding With Learnable Retrieval in Video-Language Models
* Long-Short Match for Lost Control in UAV Multi-Object Tracking
* Long-Tailed Continual Learning for Visual Food Recognition
* Low-Distortion Steganography in Neural Networks
* Low-Light Image Enhancement Using a Retinex-Based Variational Model with Weighted L_p Norm Constraint
* LSTD: Long Short-Term Temporal Diffusion for Video Generation
* MAC: A Benchmark for Multiple Attributes Compositional Zero-Shot Learning
* Mamba-Based Progressive-Recovery Framework for Multimodal Low Light Image Enhancement
* Mask-Aware Kernel Learning for Action Recognition
* MCSF-Net: A Multi-Color Space Fusion Network for Underwater Image Enhancement
* MDT-FI: Mask-Guided Dual-Branch Transformer With Texture and Structure Feature Interaction for Image Inpainting
* Metaphorical Visual Question Answering: Benchmark and Knowledge-Enhanced Metaphor Understanding Method
* Mitigating Hallucinations in Large Vision-Language Models via Visual-Enhanced Contrastive Decoding
* Mitigating Inherent Bias of Answer Heuristic Based Frameworks in Knowledge-Based Visual Question Answering
* Mixed-Curvature Metric Learning for Image Retrieval
* MMCPose: Multimodal Condition-Driven 3D Human Pose Estimation Via Diffusion Models
* MME-Based Piecewise Data Transformation and 2D Mapping Optimization for Reversible Data Hiding
* MMToT: Multi-Modal Token-of-Thought Reasoning for Large Models
* Modality Adaptive Network for Arbitrary Modality Salient Object Detection
* Modality-Aware Gated Attention Network for Audio-Visual Event Localization
* Modality-Collaborative Low-Rank Decomposers for Few-Shot Video Domain Adaptation
* Modeling User Perception for Multi-Quality Tile-Based 360° Video Streaming
* MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
* MoPD: Mixture-of-Prompts Distillation for Vision-Language Models
* More is Not Always Better: Toward General Cross-Modal Saliency Prediction for Immersive Communications
* MotionFlow: Efficient Motion Generation With Latent Flow Matching
* MPSS: A Model Pruning Method for Semantic Image Segmentation Networks
* MRQE-Net: A Mixture of Reduced Biquaternion Experts Network for General Digital Photography Image Fusion
* Msa-Splatting: Multi-Scale Adaptive Gaussian Splatting for High-Fidelity View Synthesis
* MSF-Mamba: Motion-Aware State Fusion Mamba for Efficient Micro-Gesture Recognition
* MSG-Net: Structure-Guided Enhancement for Underwater Images Based on Multi-View Feature Interaction
* MSSG: Multi-Scale Speaker Graph Network for Active Speaker Detection
* Multi-Clue Sliding Window Attention for Camouflaged Object Detection
* Multi-Constraint Relational Semantic Alignment Toward Image-Text Retrieval
* Multi-Granularity Query Network With Adaptive Category Feature Embedding for Behavior Recognition
* Multi-Granularity Semantic Complementarity Fusion Hashing for Cross-Modal Retrieval
* Multi-Granularity Superpoint Graph Learning for Weakly Supervised 3D Semantic Segmentation
* Multi-Modal Knowledge Distillation Hashing Based on CLIP for Weakly Supervised Image Retrieval
* Multi-Modal Motion Retrieval by Learning a Fine-Grained Joint Embedding Space
* Multi-Modal Prompt-Tuning Framework for Non-Overlapping Multi-Domain Recommendation, A
* Multi-Modal Refined Prompting for Advancing Knowledge-Based Visual Question Answering
* Multi-Scale Spatial Channel Joint Representation for General Multi-Modality Image Fusion With Self-Supervision
* Multi-View Aligned Clustering via Sample-Bundled Optimization: Anchor Graph Enhancement and Contrastive Propagation
* Multi-Way Cascade-Attention Network for Multi-Modal Sequential Recommendation
* Multidimensional Media Adaptation Framework for Live Holographic Communication, A
* Multimodal Emotion Recognition with Temporal Slicing Encoder and Attention-Enhanced Synergy Integration
* Multimodal Industrial Anomaly Detection via Attention-Enhanced Memory-Guided Network
* Multimodal Multi-Graph Fusion Learning for Alzheimer's Disease Diagnosis
* Multimodal Recommendation via Modality-Shared Encoding and Multi-Dimensional Loss Optimization
* Multiscale Feature Fusion Spatial-Channel Attention Network for Infrared Small Target Segmentation
* Multiscale Spatial-Frequency Learning for Degradation Decoupling in RS Image Restoration
* MUVOD: A Novel Multi-View Video Object Segmentation Dataset and a Benchmark for 3D Segmentation
* Natural Cognizing Video: A Decoupling and Integration Network for General Event Boundary Captioning
* Negative Semantic Guided Identity Boundary Construction for Open-World Person Re-Identification
* Next Chain Prediction: A Generative Recommendation Model With Sequence-Chain Attention
* NIDC: General Task Backbone for Neuroimaging Analysis via Interpretable Deep Clustering
* Noise Aware Audio-Visual Speech Denoising
* Novel Robust Reversible Watermarking Scheme Using Fractional-Order Polar Complex Exponential Transform, A
* On the Adversarial Robustness of Learning-Based Image Compression Against Rate-Distortion Attacks
* One-Step Multi-View Clustering With Adaptive Low-Rank Anchor-Graph Learning
* Open Set Industrial Surface Defect Recognition With High Frequency Feature Enhancement and Class Mutual-Information Constraint
* Optimizing a 4D Lookup Table for Low-Light Video Enhancement via Wavelet Priori
* Orientation-Aware Task-Decoupled Learning for Oriented Object Detection
* Outliers Adaptation Exploration and Centroids Matching Label Refinement for Unsupervised Person Re-Identification
* Partition Map-Based Fast Block Partitioning for VVC Inter Coding
* PASK: Sparse Framework for Crafting Natural Adversarial Example
* Patch-Discontinuity Mining for Generalized Deepfake Detection
* PC-NSVC: An End-to-End Neural Scalable Vibrotactile Codec With Psychohaptic Calibration
* PI-Net: Point-to-Image Knowledge Distillation for Camera-Based 3D Semantic Scene Completion
* PIC-CMH: Efficient Prompt-Infused Continual Cross-Modal Hashing
* PixelBoost: Leveraging Brownian Motion for Realistic-Image Super-Resolution
* Plug-and-Play Model-Agnostic Embedding Enhancement Approach for Explainable Recommendation, A
* Posture-Movement-Frequency-Enhanced Graph Convolutional Network for Gait Emotion Recognition
* PPDSA: Privacy-Preserving Cross-Modal Retrieval With Disentangled Soft-Label Alignment
* Practical No-Box Adversarial Attacks With Training-Free Hybrid Image Transformation
* Progressive Learning of Instance-Level Proxy Semantics for Few-Shot Action Recognition
* Progressively Alleviating Noise for Unsupervised Cross-Domain Image Retrieval
* Prompt Learning with Knowledge Regularization for Pre-Trained Vision-Language Models
* Prompt-Image-Caption Consistency for AI-Generated Image Quality Assessment
* PromptSR: Cascade Prompting for Lightweight Image Super-Resolution
* PromptTrack: Streaming Spatial-Temporal Prompt Learning for RGB-T Tracking
* Prototype Perturbation for Relaxing Alignment Constraints in Backward-Compatible Learning
* Prototype-Based Asymmetric Hierarchical Matching for Text-Video Retrieval
* PSAM: Parameter-Free Spatiotemporal Attention Mechanism for Video Question Answering
* PseR: Pseudo-Label Refinement for Point-Supervised Temporal Action Detection
* Pseudo-Label Similarity Graph-Driven Multi-View Contrastive Clustering
* Purified Zero-Shot Sketch-Based Image Retrieval
* PyUIE: A Coarse-to-Fine Deep Pyramid Network for Underwater Image Enhancement
* Quality Evaluation of AI-Generated Images: Subjective Study and Objective Methodology
* Question Understanding and Temporality Guiding for Video Question Answering
* Ranking Vision-Language Models in Fully Unlabeled Tasks
* Ranking-Based Self-Supervised Representation Learning for Skeleton-Based Action Recognition
* RcFormer: Reconfigurable Self-Attention Transformer for Image Restoration
* RCNet: Reliable Co-Training Network for Weakly Supervised Change Detection
* Real-Scene Image Dehazing via Laplacian Pyramid-Based Conditional Diffusion Model
* Rectifying Adversarial Sample With Low Entropy Prior for Test-Time Defense
* ReE3D: Boosting Novel View Synthesis for Monocular Images Using Residual Encoders
* RefHCM: A Unified Model for Referring Perceptions in Human-Centric Scenarios
* REFusion: A Dual-Stage Network With Repeated Key Feature Embedding for Infrared and Visible Image Fusion
* RegR-PCQA: Deep Learning Based Colored Point Cloud Quality Assessment Using 3D-to-2D Regularized Representation
* Regularized-Aware Discriminative Transformer Tracker for Satellite Videos
* ReIDMamba: Learning Discriminative Features With Visual State Space Model for Person Re-Identification
* Reinforcement and Complementary Space Search for Multimodal Black-Box Attack
* Rendered 2D Semantic and Generative Priors Guided 3D Multi-Object Grounding
* REPAIR: Rank Correlation and Noisy Pair Half-Replacing With Memory for Noisy Correspondence
* Rethinking Class-Incremental Learning From a Dynamic Imbalanced Learning Perspective
* Rethinking Point Cloud Representation Learning for Freeing Transformer to Perceive Local
* Rethinking the Upsampling Layer in Hyperspectral Image Super Resolution
* Rethinking Vision Transformer for Large-Scale Fine-Grained Image Retrieval
* RVQ-NVC: Taming RVQ-VAE for High Fidelity Neural Vibrotactile Compression
* S2ML: Spatio-Spectral Mutual Learning for Depth Completion
* SA-BCT: Self-Adapting Backward-Compatible Training
* Scale-Aware Attention and Multi-Modal Prompt Learning With Fusion Adapter for RGBT Tracking
* Scene-Text Grounding for Text-Based Video Question Answering
* SCL: Semantic Coherence Learning for Video Question Answering
* Screen Detection From Egocentric Image Streams Leveraging Multi-View Vision Language Model
* Self-Guided Discriminative Locality Preserving Projections
* Self-Paced Attribute Prototype Contrastive Learning for Cross-Modal Materials Perception
* Semantic Distribution and Authenticity Discrepancy Alignment for AI-Generated Image Detection
* Semantic-Aware and Multi-Guided Network for Infrared-Visible Image Fusion, A
* Separating Domain-Private Classes for Universal Unsupervised Cross-Domain 3D Model Retrieval
* ShadowNeRF: Learning Neural Radiance Field With Sight Degradation and Recovery
* Sharpness-Aware Dynamic Anchor Selection for Generalized Category Discovery
* Signed Relation Graph Based Dynamical Interacting System Modeling for Multi-Agent Trajectory Prediction
* SimDEM: Audio-Driven Emotional Talking Head Generation With Simplified and Decoupled Expression Modeling
* Simulate, Refocus and Ensemble: An Attention-Refocusing Scheme for Domain Generalization
* SingingHead: A Large-Scale 4D Dataset for Singing Head Animation
* Slice-and-Align for Clothes-Irrelevant Features: A Clothes-Changing Person Re-Identification Approach Without Additional Input
* SLSM-Net: Sparse LiDAR Point Clouds Supervised Stereo Matching
* SOI-Net: Structural Optimization-Inspired Interpretable Network for Incomplete Multi-View Clustering
* SORT-LFR: Revisiting SORT for Multi-Object Tracking in Low-Frame-Rate Videos
* Sounding Depressed? Personalized Deep Learning Model for Depression Detection From Speech and Text
* Sparse Transformer for Ultra-Sparse Sampled Video Compressive Sensing
* Spatially-Guided Temporal Aggregation for Robust Event-RGB Optical Flow Estimation
* SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding
* SSMPD: Semi-Supervised Learning for Multispectral Pedestrian Detection
* SSVD: Efficient Video Deinterlacing With Spatiotemporal Synchronization and Refinement
* STAR: Skeletal Token Alignment and Rearrangement for Interaction Recognition
* StereoMamba+: A Novel Stereo Image Super-Resolution Framework With Adaptive Dependency Capture and Enhanced Feature Fusion
* STNMamba: Mamba-Based Spatial-Temporal Normality Learning for Video Anomaly Detection
* StructGS: Adaptive Spherical Harmonics and Rendering Enhancements for Superior 3D Gaussian Splatting
* Structure-Preserving Frequency-Regularized Text-Guided Optimal Transport for Unpaired Rain Streaks and Raindrops Removal
* T-Mamba: A Unified Framework With Long-Range Dependency in Dual-Domain for 2D & 3D Tooth Segmentation
* TAS-DAQ: Task-Adaptive Sparse Prediction With Dense Query Auxiliary Supervisory for Efficient 3D Object Detection
* Task-Generalized Adaptive Cross-Domain Learning for Multimodal Image Fusion
* TEDFuse: Task-Driven Equivariant Consistency Decomposition Network for Multi-Modal Image Fusion
* Temporal Consistency-Aware Dynamic Point Clouds Color Attribute Enhancement
* Temporal Prompt Learning With Depth Memory for Video Mirror Detection
* Tensor Wheel Completion With Low-Rank Factor Prior and Adaptive Graph Regularizer for Hyperspectral Image Recovery
* Tensor-Based Graph Learning With Consistency and Specificity for Multi-View Clustering
* Text-KeyPoint Human Representation Based Multi-Level Semantic Guided Model for Human Mesh Recovery
* Text-Pass Filter: An Efficient Scene Text Detector
* TextBridge: A Text-Centered Framework for Enhanced Multimodal Integration and Retrieval
* TextDCTv2: Scene Text Detection With Patch-Based Discrete Cosine Transform Representation
* TextRSR: Enhanced Arbitrary-Shaped Scene Text Representation via Robust Subspace Recovery
* TFFN: Three-Branch Feature Fusion Network for Stereoscopic Omnidirectional Image Quality Assessment
* TMT: Tri-Modal Translation Between Speech, Image, and Text by Processing Different Modalities as Different Languages
* Topology-Aware Modeling for Unsupervised Simulation-to-Reality Point Cloud Recognition
* Toward Bidirectional Adaptability for Few-Shot Class-Incremental Learning With Forward-Backward Knowledge Transfer
* Toward Copyright Leakage Mitigation for Spherical Panoramic Images
* Toward Multi-Source Sky-Ground Re-Identification: A New Benchmark and an Innovative Approach
* Toward Smooth Depth Driven by Selective Attention and Selective Aggregation
* Toward Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering
* Towards General Cross-Modal Visual Coding for Emergency Communications
* Towards Mitigation of False Negatives in Text-to-Image Person Re-Identification
* Towards Structure-Aware Model for Multi-Modal Knowledge Graph Completion
* Towards Ultra-High-Definition Image Deraining: A Benchmark and an Efficient Method
* Training-Free Dual Hyperbolic Adapters for Better Cross-Modal Reasoning
* Trajectory-Aware Attack: Explainable Adversarial Attack Against Multiple Object Trackers
* Transferable Backdoor Attack on Any CLIP Model With Any Target Class by Pre-Trained Hack Network
* Transferring and Refining Visual-Semantic Priors via Graph-Enhanced CLIP for 3D Hand Pose Estimation
* Transformer-Based Tracker Integrating Motion and Representation Information, A
* TransGOP-R: Transformer-Based Real-World Gaze Object Prediction
* TransZSIS: Superpixel-Guided Irregular Patch-Pair Features Learning With Transformer for Zero-Shot Instance Segmentation in Robotic Environments
* Triple-View Knowledge Distillation for Semi-Supervised Semantic Segmentation
* Trustworthy Continuous Sign Language Recognition
* Tuning-Free High-Resolution Video Diffusion With Spatial-Temporal Latent Grouping
* Twin Tensor Learning for Consistency and Inconsistency: A Unified Affinity Learning Framework for Multi-View Clustering
* Uncertainty-Aware Audio-Visual Segmentation With Dynamic Fusion for Multimodal Alignment
* Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation
* UniAlign: A Universal Cross-Modality Knowledge Alignment Framework for Fine-Grained Action Recognition
* UniCrossGait: Unified Cross-Modal Gait Recognition Based on Knowledge Distillation
* Unsupervised Point Cloud Reconstruction via Recurrent Multi-Step Moving Strategy
* Useg-PanoDepth: Unified 360° Depth Estimation for Indoor and Outdoor Scenes With Semantic Assistance
* USTC-TD: A Test Dataset and Benchmark for Image and Video Coding in 2020s
* VDMamba: Vector Decomposition in Vision Mamba for Image Deraining and Beyond
* Video-Based Instantaneous Heart Rate Measurement With Enhanced Time-Frequency Representations
* Video-Level Cross-Modal Temporal-Navigation for RGBT Tracking
* Viewpoint-Centric Approach to Transition and Quality Delays in Multi-View Video Streaming, A
* VikitaFusion: Object Recognition Based on Heterogeneous Visual-Kinesthetic-Tactile Information
* Visibility-Based Geometry Pruning of Neural Plenoptic Scene Representations
* Visual Context and Commonsense-Guided Causal Chain-of-Thoughts for Visual Commonsense Reasoning
* Visual Position Prompt for MLLM Based Visual Grounding
* VSIS-RDPA: Verifiable Secret Image Sharing Based on Polynomial Interpolation for Resisting Dishonest Participant Attacks
* Watch Where You Move: Region-Aware Dynamic Aggregation and Excitation for Gait Recognition
* When Multi-Focus Image Fusion Meets Nonlinear Spiking Neural P Systems
* YACT-Net: Asymmetric YUV Color Transfer for Reference-Based Colorization
403 for MultMed(28)

MultMed(4) * Image-Based Virtual World Generation
* Virtualized Reality: Constructing Virtual Worlds From Real Scenes
* Visually Searching the Web for Content

MultMed(5) * Classifying color edges in video into shadow-geometry, highlight, or material transitions
* Similarity Retrieval of Trademark Images

MultMed(6) * Isolated regions in video coding

MultMed(7) * Detection and Representation of Scenes in Videos

MultMed(9) * 3-D Head Model Retrieval Using a Single Face View Query
* Active Rearranged Capturing of Image-Based Rendering Scenes: Theory and Practice
* Adaptive Media-Aware Retransmission Timeout Estimation Method for Low-Delay Packet Video, An
* Adding Semantics to Detectors for Video Retrieval
* Audio-Visual Affect Recognition
* Audio-Visual Event Recognition in Surveillance Video Sequences
* Automatic Meeting Segmentation Using Dynamic Bayesian Networks
* Automatically-Determined Region of Interest in JPEG 2000
* Bayesian Approach for Morphology-Based 2-D Human Motion Capture
* Can High-Level Concepts Fill the Semantic Gap in Video Retrieval? A Case Study With Broadcast News
* Combination of Warping Robust Elastic Graph Matching and Kernel-Based Projection Discriminant Analysis for Face Recognition
* Comments on An SVD-Based Watermarking Scheme for Protecting Rightful Ownership
* Content-Based Copy Retrieval Using Distortion-Based Probabilistic Similarity Search
* Content-Based Image Retrieval by Feature Adaptation and Relevance Feedback
* Content-Based Retrieval of 3-D Objects Using Spin Image Signatures
* Delay-Distortion Optimization for Content-Adaptive Video Streaming
* Digital Image Tracing by Sequential Multiple Watermarking
* Discrete Wavelet Transform on Consumer-Level Graphics Hardware
* Edge Potential Functions (EPF) and Genetic Algorithms (GA) for Edge-Based Matching of Visual Objects
* Efficient Mode Decision Algorithm for H.264/AVC Encoding Optimization, An
* Efficient Short Video Repeat Identification With Application to News Video Structure Analysis
* Encoding of Affine Motion Vectors
* End-to-End Embedded Approach for Multicast/Broadcast of Scalable Video over Multiuser CDMA Wireless Networks, An
* Enhanced Eigen-Audioframes for Audiovisual Scene Change Detection
* Face Modeling and Animation Language for MPEG-4 XMT Framework
* Generic Framework for Efficient 2-D and 3-D Facial Expression Analogy, A
* Head-Size Equalization for Improved Visual Perception in Video Conferencing
* Human Behavior Analysis for Highlight Ranking in Broadcast Racket Sports Video
* Hybrid Model to Detect Zero Quantized DCT Coefficients in H.264
* Image Collection Organization and Its Application to Indexing, Browsing, Summarization, and Semantic Retrieval
* Incorporating Concept Ontology for Hierarchical Video Classification, Annotation, and Visualization
* Joint Design of Source Rate Control and QoS-Aware Congestion Control for Video Streaming Over the Internet
* Learned Lexicon-Driven Paradigm for Interactive Video Retrieval, A
* Learning Personal Preference From Viewer's Operations for Browsing and Its Application to Baseball Video Retrieval and Summarization
* Lecture Video Enhancement and Editing by Integrating Posture, Gesture, and Text
* Major Cast Detection in Video Using Both Speaker and Face Information
* Modeling and Mining of Users' Capture Intention for Home Videos
* Modeling Human Judgment of Digital Imagery for Multimedia Retrieval
* Motion Flow-Based Video Retrieval
* Moving Cast Shadows Detection Using Ratio Edge
* Moving-Object Detection, Association, and Selection in Home Videos
* Multistreaming of 3-D Scenes With Optimized Transmission and Rendering Scalability
* Near-Duplicate Keyframe Identification With Interest Point Matching and Pattern Learning
* New Model-Based Digital Halftoning and Data Hiding Designed With LMS Optimization, A
* Novel 4-D Perceptual Quantization Modeling for H.264 Bit-Rate Control, A
* Novel Point-Oriented Inner Searches for Fast Block Motion Estimation
* On Transcoding a B-Frame to a P-Frame in the Compressed Domain
* Optimized Content-Aware Authentication Scheme for Streaming JPEG-2000 Images Over Lossy Networks, An
* Pattern-Based Data Hiding for Binary Image Authentication by Connectivity-Preserving
* Perceptual Temporal Quality Metric for Compressed Video
* Perceptually Optimized 3-D Transmission Over Wireless Networks
* Quad-Tree Motion Estimation in the Frequency Domain Using Gradient Correlation
* Real-Time Motion Trajectory-Based Indexing and Retrieval of Video Sequences
* Real-Time Whiteboard Capture and Processing Using a Video Camera for Remote Collaboration
* Robust Biometric Person Identification Using Automatic Classifier Fusion of Speech, Mouth, and Face Experts
* Rule Based Technique for Extraction of Visual Attention Regions Based on Real-Time Clustering, A
* Scalable, Wavelet-Based Video: From Server to Hardware-Accelerated Client
* Scene Parsing Using Region-Based Generative Models
* Scene-Change Aware Dynamic Bandwidth Allocation for Real-Time VBR Video Transmission Over IEEE 802.15.3 Wireless Home Networks
* Security and Robustness Enhancement for Image Data Hiding
* Semantic Image and Video Indexing in Broad Domains
* Shape Indexing and Recognition Based on Regional Analysis
* Spatiotemporal Visual Considerations for Video Coding
* Summarization of Visual Content in Instructional Videos
* Super-Resolution of Face Images Using Kernel PCA-Based Prior
* Target Tracking Using a Joint Acoustic Video System
* Two-Dimensional Channel Coding Scheme for MCTF-Based Scalable Video Coding
* Video Packet Selection and Scheduling for Multipath Streaming
* Video Segmentation via Temporal Pattern Classification
* Virtual Viewpoint Replay for a Soccer Match by View Interpolation From Multiple Cameras
* Visual Salience-Guided Mesh Decomposition
* Watermarked 3-D Mesh Quality Assessment
* Watermarking Digital 3-D Volumes in the Discrete Fourier Transform Domain
* Word-Level Parallel Architecture of JPEG 2000 Embedded Block Coding Decoder
74 for MultMed(9)

MultMedMag * *IEEE MultiMedia Magazine
* MPEG-21 and Its Interoperability with Rights-Information Standards
* SignTutor: An Interactive System for Sign Language Tutoring

MultMedMag(12) * MPEG Standard for Rich Media Services, An
* MPEG-A: Multimedia Application Formats
* VERL: An Ontology Framework for Representing and Annotating Video Events
* What's New with MPEG?

MultMedMag(15) * Dynamic Video Transcoding in Mobile Environments

MultMedMag(16) * Dynamic Pictorially Enriched Ontologies for Digital Video Libraries
* Ecosystem for Semantics, An
* Folk Song Retrieval System with a Gesture-Based Interface, A
* Hybrid Tagging and Browsing Approaches for Efficient Manual Image Annotation
* Learning Video Preferences Using Visual Features and Closed Captions
* Multimedia at Work: Harvesting Resources for Recording Concurrent Videoconferences
* Novel Approach to Steganography in High- Dynamic-Range Images, A
* Standards: The MPEG Open Access Application Format
8 for MultMedMag(16)

MultMedMag(17) * AR-Immersive Cinema at the Aula Natura Visitors Center
* Archive and Preservation of Media Content Using MPEG-A
* Cross-Modal Approach to Cleansing Weakly Tagged Images, A
* Crowdsourcing What Is Where: Community-Contributed Photos as Volunteered Geographic Information
* Data-Driven Approaches to Community-Contributed Video Applications
* Hiding Multitone Watermarks in Halftone Images
* Intelligent Multimedia Presentation in Ubiquitous Multidevice Scenarios
* Keyframe-Based Video Summary Using Visual Attention Clues
* Landscaping Future Interaction: Special issue on Mobile and Ubiquitous Multimedia
* Local Wavelet Features for Statistical Object Classification and Localization
* Mobile Multimedia Technology to Aid Those with Alzheimer's Disease, A
* Mobility Management for Video Streaming on Heterogeneous Networks
* Modeling Media Synchronization with Semiotic Agents
* New Paradigm for Content Producers, A
* Optimal Rate Allocation for Video Transmission over Wireless Ad Hoc Networks
* Picture Context Capturing for Mobile Databases
* Platform for Context-Aware and Digital Rights Management-Enabled Content Adaptation, A
* Question Answering over Community-Contributed Web Videos
* Social Surroundings: Bridging the Virtual and Physical Divide
* System Concept for Socially Enriched Access to Soccer Video Collections, A
* Video Annotation and Retrieval Using Ontologies and Rule Learning
* Video in the Web: Technical Challenges and Standardization
* Visual Navigation for Mobile Devices
23 for MultMedMag(17)

MultMedMag(18) * Augmenting Live Broadcast Sports with 3D Tracking Information
* Cluster-Based Landmark and Event Detection for Tagged Photo Collections
* Converting 2D Video to 3D: An Efficient Path to a 3D Experience
* Data-Hiding in Halftone Images Using Adaptive Noise-Balanced Error Diffusion
* Discovering the Thematic Object in Commercial Videos
* Enhancing Bag-of-Words Models with Semantics-Preserving Metric Learning
* Film Analysis of Archived Documentaries
* Implementation and Analysis of a Peer-to-Peer Retransmissions System for Live Video Services
* Large-Scale Multimedia Retrieval and Mining
* Mining Event Structures from Web Videos
* Mixed-Reality System for Broadcasting Sports Video to Mobile Devices, A
* Mobile Visual Search: Architectures, Technologies, and the Emerging MPEG Standard
* MPEG-DASH Standard for Multimedia Streaming Over the Internet, The
* Music Generation with Markov Models
* Naming People in News Videos with Label Propagation
* Online Video Recommendation through Tag-Cloud Aggregation
* Personalized Coverage of Large Athletic Events
* Preserving Wayang Kulit for Future Generations
* Real-Time Video Copy-Location Detection in Large-Scale Repositories
* Semantic Annotation Architecture for Accessible Multimedia Resources
* Using Modality Replacement to Facilitate Communication between Visually and Hearing-Impaired People
* Visual Content Identification and Search
* Visual Reranking: From Objectives to Strategies
* Visual Rhythm Detection and Its Applications in Interactive Multimedia
* Vocabulary Hierarchy Optimization and Transfer for Scalable Image Search
* Web-Scale Multimedia Analysis: Does Content Matter?
* Weighted Subspace Filtering and Ranking Algorithms for Video Concept Retrieval
* You Can Judge an Artist by an Album Cover: Using Images for Music Annotation
28 for MultMedMag(18)

MultMedMag(19) * Anatomy of an Optical Biopsy Semantic Retrieval System, The
* Boosting, Sparsity- Constrained Bilinear Model for Object Recognition, A
* Building Reliable and Reusable Test Collections for Image Retrieval: The Wikipedia Task at ImageCLEF
* Collecting Large, Richly Annotated Facial-Expression Databases from Movies
* Combining Face and Eye Detectors in a High- Performance Face-Detection System
* Current Developments and Future Trends in Audio Authentication
* Digital Image Scrambling Using 2D Cellular Automata
* Efficient Image Copy Detection Using Multiscale Fingerprints
* Face Matching and Retrieval in Forensics Applications
* Finding Information in Multimedia Meeting Records
* Image Retrieval in Forensics: Tattoo Image Database Application
* Immersive Environment: An Emerging Future of Telecommunications
* Indexing Large Online Multimedia Repositories Using Semantic Expansion and Visual Analysis
* Microsoft Kinect Sensor and Its Effect
* Mobile Media in Action: Remote Target Localization and Tracking
* Posterity Logging of Face Imagery for Video Surveillance
* Profiling Online Auction Sellers Using Image-Editing Styles
* Real-Time Compressed- Domain Video Watermarking Resistance to Geometric Distortions
* Threefold Dataset for Activity and Workflow Recognition in Complex Industrial Environments, A
* Using Texture Analysis for Medical Diagnosis
* Where Is the User in Multimedia Retrieval?
21 for MultMedMag(19)

MultMedMag(20) * 3D Imaging Techniques and Multimedia Applications: Guest editor's introduction
* Affect in Media: Embodied Media Interaction in Performance and Public Art
* Applications of Face Analysis and Modeling in Media Production
* Character Behavior Planning and Visual Simulation in Virtual 3D Space
* Classification and Analysis of 3D Teleimmersive Activities
* Depth Sensing for 3DTV: A Survey
* Immersive 3D Holoscopic Video System
* In-Kernel Relay for Scalable One-to-Many Streaming
* JPEG's JPSearch Standard: Harmonizing Image Management and Search
* Large Visual Repository Search with Hash Collision Design Optimization
* Large-Scale Image Phylogeny: Tracing Image Ancestral Relationships
* Large-Scale Near-Duplicate Web Video Retrieval: Challenges and Approaches
* Learning to Rerank Web Images
* MMT: An Emerging MPEG Standard for Multimedia Delivery over the Internet
* New Writing Experience: Finger Writing in the Air Using a Kinect Sensor, A
* Partial-Duplicate Image Retrieval via Saliency-Guided Visual Matching
* Scalable Media Coding Enabling Content-Aware Networking
* Scalable Mobile Video Retrieval with Sparse Projection Learning and Pseudo Label Mining
* Securing Multimedia Content Using Joint Compression and Encryption
* Software-Based Solution for Distributing and Displaying 3D UHD Films, A
* Standards-Based Architectures for Content Management
* Unified Access to Media Metadata on the Web
* Video Copy-Detection and Localization with a Scalable Cascading Framework
* Video Delivery Challenges and Opportunities in 4G Networks
* Viewport: A Distributed, Immersive Teleconferencing System with Infrared Dot Pattern
* Walking in Colors: Human Gait Recognition Using Kinect and CBIR
* Web-Scale Image Retrieval Using Compact Tensor Aggregation of Visual Descriptors
* Web-Scale Near-Duplicate Search: Techniques and Applications
28 for MultMedMag(20)

MultMedMag(21) * Clustering Faces in Movies Using an Automatically Constructed Social Network
* Compact Descriptors for Visual Search
* Context-Adaptive Modeling for Wavelet-Domain Distributed Video Coding
* Critical Multimedia
* Efficient BOF Generation and Compression for On-Device Mobile Visual Location Recognition
* Fashion Analysis: Current Techniques and Future Directions
* Finding the Needle in the Image Stack: Performance Metrics for Big Data Image Analysis
* Future of Smart Photography, The
* Graph-Based Residence Location Inference for Social Media Users
* How Many Visual Concepts?
* Joint Video and Text Parsing for Understanding Events and Answering Queries
* Large-Scale Geosocial Multimedia
* Latent Subspace Projection Pursuit with Online Optimization for Robust Visual Tracking
* Local Stereo Matching with Improved Matching Cost and Disparity Refinement
* Memory-Efficient Image Databases for Mobile Visual Search
* Mobile Photo Recommendation and Logbook Generation Using Context-Tagged Images
* Multimedia Grand Challenge 2013, The
* Multimedia Semantic Retrieval Mobile System Based on HCFGs, A
* Multimodal Feature Fusion for 3D Shape Recognition and Retrieval
* Multimodal Spatio-Temporal Theme Modeling for Landmark Analysis
* New Paradigm for Querying Blobs in Vehicular Networks, A
* Next-Generation 3D Formats with Depth Map Support
* Objective Self
* Online Learning a High-Quality Dictionary and Classifier Jointly for Multitask Object Tracking
* Projected Residual Vector Quantization for ANN Search
* Real-Time Gaze Estimation with Online Calibration
* Scalable Extensions of HEVC for Ultra-High-Definition Video Delivery, The
* Self-Recognized Image Protection Technique that Resists Large-Scale Cropping
* Standardization of Biometric Template Protection
* Toward Experiential Mobile Media Processing
* Toward Haptic Cinematography: Enhancing Movie Experiences with Camera-Based Haptic Effects
* Toward Multiscreen Social TV with Geolocation-Aware Social Sense
* Training Quality-Aware Filters for No-Reference Image Quality Assessment
* User-Centric Media Retrieval Competition: The Video Browser Showdown 2012-2014, A
* View-Based 3D Object Retrieval: Challenges and Approaches
* Visions for Augmented Cultural Heritage Experience
36 for MultMedMag(21)

MultMedMag(22) * Bidirectional Mesh-Based Frame Rate Up-Conversion
* CitySensing: Fusing City Data for Visual Storytelling
* Cross-Platform Social Event Detection
* Data-Driven Scene Understanding with Adaptively Retrieved Exemplars
* Designing an Interactive Audio Interface for Climate Science
* Effects of Auditory Feedback on Menu Selection in Hand-Gesture Interfaces
* Effects of Ecological Auditory Feedback on Rhythmic Walking Interaction, The
* Emerging Multimedia Research and Applications
* Emotional and Social Signals: A Neglected Frontier in Multimedia Computing?
* Experiments with Distributed Theatre
* Green Metadata Standard for Energy-Efficient Video Consumption, The
* Integrating Multimedia into Autism Intervention
* Interactive Sonification in Rowing: Acoustic Feedback for On-Water Training
* Interleaved Time Bases in Hypermedia Synchronization
* Let's Share a Story: Socially Enhanced Multimedia Storytelling
* Let's Weave the Visual Web
* Machine Intelligence Approach to Virtual Ballet Training, A
* Manipulating Ultra-High Definition Video Traffic
* Multimedia Big Data
* Multimedia Big Data Computing
* Multimedia Search: From Relevance to Usefulness
* Novel Markov Logic Rule Induction Strategy for Characterizing Sports Video Footage, A
* Optimizing the Perceptual Quality of Real-Time Multimedia Applications
* Photos to Remember, Photos to Forget
* Saliency-Guided Deep Framework for Image Quality Assessment
* Social Multimedia and Storytelling
* Sonic Trampoline: How Audio Feedback Impacts the User's Experience of Jumping
* Sonification of Surface Tapping Changes Behavior, Surface Perception, and Emotion
* Survey of Current YouTube Video Characteristics, A
* Syncing Shared Multimedia through Audiovisual Bimodal Segmentation
* Teaching Privacy: Multimedia Making a Difference
* Variable Markov Oracle: Algorithms for Human Gesture Applications, The
* Viewpoint Sequence Recommendation Based on Contextual Information for Multiview Video
* Wearable Auditory Biofeedback Device for Blind and Sighted Individuals
34 for MultMedMag(22)

MultMedMag(23) * Collaborative Sparse Coding for Multiview Action Recognition
* Computational Modeling of Affective Qualities of Abstract Paintings
* Example-Based Image Textural Style Transfer
* Expressive Modulation of Neutral Visual Speech
* Extended Guided Filtering for Depth Map Upsampling
* Eye-Controlled Interfaces for Multimedia Interaction
* Fast Summarization of User-Generated Videos: Exploiting Semantic, Emotional, and Quality Clues
* Fusing Incomplete Multisensor Heterogeneous Data to Estimate Urban Traffic
* Guest editors' introduction: Perception, Aesthetics, and Emotion in Multimedia Quality Modeling
* Image Encryption Algorithm Based on Autoblocking and Electrocardiography, An
* JPEG Pleno: Toward an Efficient Representation of Visual Reality
* JPEG XT: A New Family of JPEG Backward-Compatible Standards
* Multimedia Hashing and Networking
* Multimedia Memory Cues for Augmenting Human Memory
* Multimodal Ensemble Fusion for Disambiguation and Retrieval
* Nonlocal In-Loop Filter: The Way Toward Next-Generation Video Coding?
* Nonparametric Quality Assessment of Natural Images
* Novel Semi-Supervised Dimensionality Reduction Framework, A
* Person-Centered Multimedia Computing: A New Paradigm Inspired by Assistive and Rehabilitative Applications
* Planogram Compliance Checking Based on Detection of Recurring Patterns
* Scale-Aware Spatially Guided Mapping
* Selecting Interesting Image Regions to Automatically Create Cinemagraphs
* Ubiquitous Multimedia: Emerging Research on Multimedia Computing
* Unsupervised Speaker Identification for TV News
* Visual Attention Retargeting
25 for MultMedMag(23)

MultMedMag(24) * Audience Behavior Mining: Integrating TV Ratings with Multimedia Content
* Augmented Reality in Reality
* Benchmarking Initiative for Multimedia Evaluation: MediaEval 2016, The
* Beyond 1 Million Nodes: A Crowdsourced Video Content Delivery Network
* ChildGuard: A Child-Safety Monitoring System
* Continuing Reinvention of Content-Based Retrieval: Multimedia Is Not Dead, The
* Crowdsensing Multimedia Data: Security and Privacy Issues
* Cryptanalyzing an Image-Scrambling Encryption Algorithm of Pixel Bits
* Deep Learning Triggers a New Era in Industrial Robotics
* Dynamic Deployment and Optimization of Virtual Content Delivery Networks
* Evaluating Responsive Web Design's Impact on Blind Users
* Extreme-Dynamic-Range Sensing: Real-Time Adaptation to Extreme Signals
* Flow Watermarking for Antinoise and Multistream Tracing in Anonymous Networks
* Future of Multimedia Distribution: An Interview with Baochun Li, Diego R. Lopez, and Christian Timmerer, The
* JPEG at 25: Still Going Strong
* Latest Multimedia Research from ISM 2016, The
* Light-Field Journey to Virtual Reality, A
* Multimedia Content Delivery with Network Function Virtualization: The Energy Perspective
* Multimedia Technologies for Enriched Music Performance, Production, and Consumption
* Multisensory Experiences in HCI
* Network Function Virtualization and Software-Defined Networking: Advancing Multimedia Distribution
* NFV-Based Video Quality Assessment Method over 5G Small Cell Networks, An
* Nonlinear Discrete Cross-Modal Hashing for Visual-Textual Data
* Object-Detection-Based Video Compression for Wireless Surveillance Systems
* Pooling-Based Quantitative Approach to Evaluating Binarization Algorithms
* Price-Based Controller for Utility-Aware HTTP Adaptive Streaming
* QoE-Aware Bandwidth Allocation for Video Traffic Using Sigmoidal Programming
* Querying Users as Oracles in Tag Engines for Personalized Image Tagging
* Selective Privacy-Preserving Approach for Multimedia Data, A
* vCache: Supporting Cost-Efficient Adaptive Bitrate Streaming
* When Cloud Media Meet Network Function Virtualization: Challenges and Applications
* Word of Mouth Mobile Crowdsourcing: Increasing Awareness of Physical, Cyber, and Social Interactions
32 for MultMedMag(24)

MultMedMag(25) * 360-Degree Virtual-Reality Cameras for the Masses
* Adding a New Dimension to HTTP Adaptive Streaming Through Multiple-Source Capabilities
* Behavior Analysis through Multimodal Sensing for Care of Parkinson's and Alzheimer's Patients
* Biometrics: In Search of Identity and Security (Q & A)
* Clustering of Musical Pieces Through Complex Networks: An Assessment Over Guitar Solos
* Crossmodal Approach to Multimodal Fusion in Video Hyperlinking, A
* Cryptanalyzing an Image Encryption Algorithm Based on Autoblocking and Electrocardiography
* Deep Medical Image Computing in Preventive and Precision Medicine
* Discovering Latent Aspects for Diversity-Induced Image Retrieval
* Generalized Multi-Instance Control Mapping for Interactive Media Systems
* Health Media: From Multimedia Signals to Personal Health Insights
* Image and Video Captioning with Augmented Neural Architectures
* Integrating Vision and Language for First-Impression Personality Analysis
* Multimedia for Disaster Information Management
* Multiview Cross-Media Hashing with Semantic Consistency
* Non-uniform Watermark Sharing Based on Optimal Iterative BTC for Image Tampering Recovery
* pDisVPL: Probabilistic Discriminative Visual Part Learning for Image Classification
* Rhythm: A Unified Measurement Platform for Human Organizations
* Sensing Technologies for Monitoring Serious Mental Illnesses
* Social Relationship Labeling Based on Multimodal Behaviors and Social Interactions
* Technical Evaluation of HoloLens for Multimedia: A First Look
* Toward Real-Time Delivery of Immersive Sports Content
* Vision and Language Integration Meets Multimedia Fusion
* Visual Nonverbal Behavior Analysis: The Path Forward
* Watermarking Mechanism With High Capacity for Three-Dimensional Mesh Objects Using Integer Planning, A
25 for MultMedMag(25)

MultMedMag(26) * 3-D Scene Management Method Based on the Triangular Mesh for Large-Scale Web3D Scenes, A
* AI-Oriented Large-Scale Video Management for Smart City: Technologies, Standards, and Beyond
* Arbitrary Screen-Aware Manga Reading Framework with Parameter-Optimized Panel Extraction
* Cloud Resource Optimization for Processing Multiple Streams of Visual Data
* Compact Descriptors for Video Analysis: The Emerging MPEG Standard
* Coping With the Challenges of Delivering Multiple Sensorial Media
* Discovering Latent Topics With Saliency-Weighted LDA for Image Scene Understanding
* Edge Caching and Computing in 5G for Mobile AR/VR and Tactile Internet
* Emotion-Aware Video QoE Assessment Via Transfer Learning
* Enhancing Video QoE Over High-Speed Train Using Segment-Based Prefetching and Caching
* Gender Differences in Multimodal Contact-Free Deception Detection
* Hierarchical Deep Cosegmentation of Primary Objects in Aerial Videos
* Multi-Bitrate Video Caching for D2D-Enabled Cellular Networks
* Multimedia for Autonomous Driving
* Multipoint Cooperative Transmission for Virtual Reality in 5G New Radio
* Person Reidentification by Deep Structured Prediction: A Fully Parameterized Approach
* QoE-Oriented Multimedia Assessment: A Facial Expression Recognition Approach
* Rank-Based Encoding Features for Stereo Matching
* Residual-Based Post-Processing for HEVC
* Retrieval System of Medicine Molecules Based on Graph Similarity, A
* Smart Media Transport: A Burgeoning Intelligent System for Next Generation Multimedia Convergence Service Over Heterogeneous Networks in China
* ToothPic: Camera-Based Image Retrieval on Large Scales
* Towards a QoE Model to Evaluate Holographic Augmented Reality Devices
* Ubiquitous Intelligent Cameras: Between Legal Nightmare and Social Empowerment
* Who is the Film's Director? Authorship Recognition Based on Shot Features
25 for MultMedMag(26)

MultMedMag(27) * Adversarial Learning-Based Semantic Correlation Representation for Cross-Modal Retrieval
* Artificial Intelligence Fights Crime and Terrorism at a New Level
* Attribute-Guided Feature Learning Network for Vehicle Reidentification
* Building a Manga Dataset Manga109 With Annotations for Multimedia Applications
* Compression-Then-Encryption-Based Secure Watermarking Technique for Smart Healthcare System
* Deep Residual Split Directed Graph Convolutional Neural Networks for Action Recognition
* Detecting Disaster-Related Tweets Via Multimodal Adversarial Neural Network
* Do I Smell Coffee? The Tale of a 360° Mulsemedia Experience
* Domain Adaptation With Foreground/Background Cues and Gated Discriminators
* Effective Approach for Nonrigid Structure From Motion With Complex Deformation, An
* End-to-End Framework for Clothing Collocation Based on Semantic Feature Fusion, An
* Glasses-Free 3-D and Augmented Reality Display Advances: From Theory to Implementation
* Image Retrieval via Gated Multiscale NetVLAD for Social Media Applications
* Joint Watermarking-Encryption-ECC for Patient Record Security in Wavelet Domain
* Key-Point Sequence Lossless Compression for Intelligent Video Analysis
* Learning Quintuplet Loss for Large-Scale Visual Geolocalization
* Legal and Ethical Challenges in Multimedia Research
* Leveraging Smart Devices for Scene Text Preserved Image Stylization: A Deep Gaming Approach
* Leveraging Smart Devices for Scene Text Preserved Image Stylization: A Deep Gaming Approach
* Metric Learning-Based Multimodal Audio-Visual Emotion Recognition
* Multilabel Text Classification With Incomplete Labels: A Safe Generative Model With Label Manifold Regularization and Confidence Constraint
* Multimedia and the Tactile Internet
* Multimedia Data Privacy Against Machines
* PGAN: Part-Based Nondirect Coupling Embedded GAN for Person Reidentification
* Style Transfer of Urban Road Images Using Generative Adversarial Networks With Structural Details
* Toward Sensing Emotions With Deep Visual Analysis: A Long-Term Psychological Modeling Approach
* Urban Multimedia Computing: Emerging Methods in Multimedia Computing for Urban Data Analysis and Applications
* Wall Screen: An Ultra-High Definition Video-Card for the Internet of Things
* WarpClothingOut: A Stepwise Framework for Clothes Translation From the Human Body to Tiled Images
* Wavelet-Based Quality-Constrained ECG Data Compression System Without Decoding Process
30 for MultMedMag(27)

MultMedMag(28) * AffectiveNet: Affective-Motion Feature Learning for Microexpression Recognition
* Characteristic Analysis of 2D Lag-Complex Logistic Map and Its Application in Image Encryption
* Class-Balanced Text to Image Synthesis With Attentive Generative Adversarial Network
* Destruction and Reconstruction Learning for Facial Expression Recognition
* EGGAN: Learning Latent Space for Fine-Grained Expression Manipulation
* Emotion Detection for Conversations Based on Reinforcement Learning Framework
* End-to-End Learning for Multimodal Emotion Recognition in Video With Adaptive Loss
* Enhancing QoE for Viewport-Adaptive 360-Degree Video Streaming: Perception Analysis and Implementation
* Facial Expression Recognition With Multiscale Graph Convolutional Networks
* Feature-Guided Spatial Attention Upsampling for Real-Time Stereo Matching Network
* From Semantic to Spatial Awareness: Vehicle Reidentification With Multiple Attention Mechanisms
* Generalized Face Antispoofing by Learning to Fuse Features From High- and Low-Frequency Domains
* Gradient-Based Intraprediction Fusion for Video Coding
* Implicit Emotion Relationship Mining Based on Optimal and Majority Synthesis From Multimodal Data Prediction
* Improved Speaker and Navigator for Vision-and-Language Navigation
* Introduction to the Special Issue on MMAC: Multimodal Affective Computing of Large-Scale Multimedia Data
* Large Dataset With a New Framework for Abandoned Object Detection in Complex Scenarios, A
* Learning-Based Satisfied User Ratio Prediction for Symmetrically and Asymmetrically Compressed Stereoscopic Images
* LFI-Augmenter: Intelligent Light Field Image Editing With Interleaved Spatial-Angular Convolution
* Magnitude and Angle Combined Optical Flow Feature for Microexpression Spotting, A
* Modeling Incongruity between Modalities for Multimodal Sarcasm Detection
* Multichannel Steganography in Digital Images for Multiple Receivers
* Multimedia in Virtual Reality and Augmented Reality
* Multimodal and Context-Aware Emotion Perception Model With Multiplicative Fusion
* Multimodal Event-Aware Network for Sentiment Analysis in Tourism
* Multimodal Political Deception Detection
* Multimodal Semantics-Based Supervised Latent Dirichlet Allocation for Event Classification
* Neighborhood Adaptive Loss Function for Deep Learning-Based Point Cloud Coding With Implicit and Explicit Quantization
* No-Reference Nonuniform Distorted Video Quality Assessment Based on Deep Multiple Instance Learning
* On the User-Centric Comparative Remote Evaluation of Interactive Video Search Systems
* Prediction With Multicross Component for Future Video Coding
* Real Testbed for Autonomous Anomaly Detection in Power Grid Using Low-Cost Unmanned Aerial Vehicles and Aerial Imaging
* Semantic Place Prediction With User Attribute in Social Media
* Sentiment-Aware Emoji Insertion Via Sequence Tagging
* Single Image Dehazing Via Region Adaptive Two-Shot Network
* State Representation Learning With Adjacent State Consistency Loss for Deep Reinforcement Learning
* Survey on Facial Expression Recognition: History, Applications, and Challenges
* Video Compression With CNN-Based Postprocessing
38 for MultMedMag(28)

MultMedMag(29) * Comprehensive Framework of Early and Late Fusion for Image-Sentence Retrieval
* Context- and Knowledge-Aware Graph Convolutional Network for Multimodal Emotion Recognition
* Deep Multigraph Hierarchical Enhanced Semantic Representation for Cross-Modal Retrieval
* Detection of Risky Situations for Frail Adults With Hybrid Neural Networks on Multimodal Health Data
* DHNet: Double MPEG-4 Compression Detection via Multiple DCT Histograms
* DIBR Zero-Watermarking Based on Invariant Feature and Geometric Rectification
* Dual Expression Fusion: A Universal Microexpression Recognition Framework
* Efficient Low-Complexity Convolutional Neural Network Filter, An
* Efficient Multimedia Frame-Skipping Architecture Using Deep Learning in Vehicular Networks
* Emotion Recognition With Multimodal Transformer Fusion Framework Based on Acoustic and Lexical Information
* Enhanced Local and Global Learning for Rotation-Invariant Point Cloud Representation
* Exploiting the Structure Information of Suppositional Mesh for Unsupervised Multiview Stereo
* Fast Skin Segmentation on Low-Resolution Grayscale Images for Remote PhotoPlethysmoGraphy
* FLeak-Seg: Automated Fundus Fluorescein Leakage Segmentation via Cross-Modal Attention Learning
* Garment Style Creator: Using StarGAN for Image-to-Image Translation of Multidomain Garments
* Generating Dance Videos Using Pose Transfer Generative Adversarial Network With Multiple Scale Region Extractor and Learnable Region Normalization
* Integrity of Multimedia and Multimodal Data: From Capture to Use
* LIMAN: Local Information-Based Multiattention Network for 3D Shape Recognition
* MPEG Immersive Video Standard: Current Status and Future Outlook, The
* Multimedia Monitoring System of Obstructive Sleep Apnea via a Deep Active Learning Model
* Multimodal Fusion-Based Deep Learning Network for Effective Diagnosis of Alzheimer's Disease
* Next Frontier For MPEG-5 LCEVC: From HDR and Immersive Video to the Metaverse, The
* Novel Security Framework for Medical Data in IoT Ecosystems, A
* Postgraduate Student Depression Assessment by Multimedia Gait Analysis
* Privacy-Preserving Image Classification Using an Isotropic Network
* Privacy-Preserving Video Fall Detection via Chaotic Compressed Sensing and GAN-Based Feature Enhancement
* Robust Image Denoising Method With Multiview Texture-Aware Convolutional Neural Networks, A
* Scene-Adaptive Instance Modification for Semisupervised Pedestrian Detection
* Self-Supervised Cross-Modal Distillation for Thermal Infrared Tracking
* Transferring Deep Gaussian Denoiser for Compressed Sensing MRI Reconstruction
* Translational Symmetry-Aware Facade Parsing for 3-D Building Reconstruction
* Unpaired Image-to-Image Translation Using Negative Learning for Noisy Patches
* Views Meet Labels: Personalized Relation Refinement Network for Multiview Multilabel Learning
* Visual Surveillance for Human Fall Detection in Healthcare IoT
* Why Accuracy is Not Enough: The Need for Consistency in Object Detection
* Why VR Games Sickness? An Empirical Study of Capturing and Analyzing VR Games Head Movement Dataset
36 for MultMedMag(29)

MultMedMag(30) * Anchor-Free Tracker Based on Space-Time Memory Network
* Artistic Line Drawing Rendering With Priors of Depth and Edge Density
* Bandwidth-Aware High-Efficiency Video Coding Design Scheme on a Multiprocessor System on Chip
* CADW: CGAN-Based Attack on Deep Robust Image Watermarking
* Content-Aware Latent Semantic Direction Fusion for Multi-Attribute Editing
* Could Head Motions Affect Quality When Viewing 360° Videos?
* Deep Blind Chest X-Ray Image Quality Assessment With Region-of-Interest-Guided Attention
* Distributed Architecture for an Elderly Accompaniment Service Based on IoT Devices, AI, and Cloud Services
* Edge Distraction-aware Salient Object Detection
* Edge Intelligence-Empowered Immersive Media
* Edge-Assisted Virtual Viewpoint Generation for Immersive Light Field
* Enabling Manageable and Secure Hybrid P2P-CDN Video-on-Demand Streaming Services Through Coordinating Blockchain and Zero Knowledge
* Encoding of Media Value Chain Processes Through Blockchains and MPEG-21 Smart Contracts for Media
* Improved Interaction Estimation and Optimization Method for Surveillance Video Synopsis, An
* JPEG AI Standard: Providing Efficient Human and Machine Visual Data Consumption, The
* Learning 3-D Face Shape From Diverse Sources With Cross-Domain Face Synthesis
* Learning From Coding Features: High Efficiency Rate Control for AOMedia Video 1
* Multiview Language Bias Reduction for Visual Question Answering
* Novel Learning Dictionary for Sparse Coding-Based Key Point Detection, A
* Optimizing Multidimensional Perceptual Quality in Online Interactive Multimedia
* Passthrough Mixed Reality With Oculus Quest 2: A Case Study on Learning Piano
* Perceptual Authentication Hashing for Digital Images With Contrastive Unsupervised Learning
* PP8K: A New Dataset for 8K UHD Video Compression and Processing
* Recent Advances in Immersive Multimedia
* Reversible Modal Conversion Model for Thermal Infrared Tracking
* Reviving Standard-Dynamic-Range Videos for High-Dynamic-Range Devices: A Learning Paradigm With Hybrid Attention Mechanisms
* Short-Long-Term Propagation-Based Video Inpainting
* Specular Detection and Rendering for Immersive Multimedia
* VR2Gather: A Collaborative, Social Virtual Reality System for Adaptive, Multiparty Real-Time Communication
29 for MultMedMag(30)

MultMedMag(31) * Adaptive Detachable Partition-Based Reference Frame Recompression for Video Coding
* aVCSR: Adaptive Video Compressive Sensing Using Region-of-Interest Detection in the Compressed Domain
* ConvNet-HIDE: Deep-Learning-Based Dual Watermarking for Health-Care Images
* Convolutional Neural Network Ensemble for Video Source Camera Forensics, A
* Cryptanalyzing an Image Encryption Algorithm Underpinned by 2-D Lag-Complex Logistic Map
* Cryptanalyzing an Image Encryption Algorithm Underpinned by a 3-D Boolean Convolution Neural Network
* Depth-Guided Aggregation for Real-Time Binocular Depth Estimation Network
* Development of an Image Encryption Algorithm Based on Compressed Sensing and Chaotic Mapping
* Exploiting Illumination Knowledge in the Real World for Low-Light Image Enhancement
* Feature Fusion-Based Data Augmentation Method for Small Object Detection
* Generative Adversarial Networks for Biomedical Imaging
* Generative AI for 3-D Point Clouds
* High-Performance Embedded System Design for QR Code Recognition With Deep Learning
* Hyperspectral Anomaly Detection Based on a Beta Wavelet Graph Neural Network
* Image-Relevant Entities Knowledge-Aware News Image Captioning
* JPEG AI: The First International Standard for Image Coding Based on an End-to-End Learning-Based Approach
* Multimodal Agents: From Vision to Reality
* Multimodal Integration of an Enhanced Novel Pulmonary Auscultation Real-Time Diagnostic System
* Multimodal Scene Recognition Method Based on Self-Attention and Distillation, The
* On Perceived AV Synchronization in 360° Multimedia
* Perceptual Hashing With Deep and Texture Features
* Retinex-Guided Channel Grouping-Based Patch Swap for Arbitrary Style Transfer
* Robust Color Image Hashing With Nonnegative Matrix Factorization and Saliency Map for Copy Detection
* Robust Image Registration for Power Equipment Using Large-Gap Fracture Contours
* S5: Sketch-to-Image Synthesis via Scene and Size Sensing
* Self-Supervised Cross-Modal Distillation for Thermal Infrared Tracking
* Software-Defined Networking-Driven Reliable Transmission Architecture for Enhancing Real-Time Video Streaming Quality, A
* Spatiotemporal Feature Fusion for Video Summarization
* Specific Diverse Text-to-Image Synthesis via Exemplar Guidance
* Uncertainty-Guided Different Levels of Pseudolabels for Semisupervised Medical Image Segmentation
* Vehicle Reidentification Based on Convolution and Vision Transformer Feature Fusion
* You-Only-Look-Once Multiple-Strategy Printed Circuit Board Defect Detection Model
32 for MultMedMag(31)

MultMedMag(32) * Advanced Defect Analysis With Self-Supervised Pretraining and Knowledge Distillation
* Application of Deep Learning in Steel Cable Surface Damage Detection: A Case Study of the YOLOv8-SWRSD Model
* Bilateral Two-Dimensional Multiview Discriminant Analysis for Image Recognition
* CAMUL: Context-Aware Multiconditional Instance Synthesis for Image Segmentation
* Comic Speaker Prediction Based on Visual Relations and Natural Language Processing Tasks
* Comparative Study of Feature Impact on a Perceptual Quality Evaluation in Smoky Laparoscopic Videos Using Artificial Neural Networks, A
* Conversation Between the Founding and the Current EICs on 30 Years of Multimedia Computing, A
* Deformable Two-Stage Generative Adversarial Network Augmentation for Drosophila Image Recognition in Complex Environments
* Denoising Convoluted Neural Network-Assisted ECG Signal Watermarking for Secure Transmission in E-Health-Care Applications
* Detection of Violent Content in Videos Using Attention-Augmented 3-D Convolutional Networks
* DiLien: Domain-Incremental Low-Light Image Enhancement
* EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation
* Enhanced Aortic CT Synthesis Based on Multiscale Information Fusion
* Enhancing Adversarial Attack Defenses for Image Identification Using Generative Adversarial Networks
* Exploring Point Cloud Voxelization for 3-D Object Detection
* Feature-Attention-Mechanism-Based Attack for Deep Robust Watermarking
* Few-Shot Segmentation via Information Interaction Enhancement and Multiscale Feature Aggregation
* FGCM: Modality-Behavior Fusion Model Integrated with Graph Contrastive Learning for Multimodal Recommendation
* High-Quality Dynamic Human Novel View Synthesis Based on Multiview Video
* MSCT: Multiscale Conv-Transformer for Underwater Image Enhancement
* Multi-Modal Deep Fusion Network: Enhancing Graft Survival Prediction in Non-Alcoholic Fatty Liver Disease Patients Prior to Liver Transplantation, A
* Object Detection Algorithm of Literatures' Key Information Based on Optimized Cascade R-CNN, An
* Pro-MA: Progressively Margin-Based Attribution in Pretrained Vision-Language Models
* QformerID: Quaternion Transformer-Based Image Denoising
* Quality Assessment for Text-to-Image Generation: A Survey
* Real-Time Portable Diagnostics for Seborrheic Dermatitis via Hierarchical Few-Shot Learning
* Robust and Multilayer PowerPoint Watermarking for Source Tracing
* Scalable Neural Light Field With Layer Add-ons of Multilayer Perceptron
* SD-Prompt: Learnable and Adaptive Prompts for Enhancing Subject-Driven Text-to-Image Synthesis
* SF-Mamba: A Semantic-Flow Foreground-Aware Mamba for Semantic Segmentation of Remote Sensing Images
* Stereoscopic Visual Attention Model Based on Bioinspiration
* STERR-GAN: Spatiotemporal Rerendering for Facial Video Restoration
* Terrain Segmentation Network in Wild Environments With Hybrid Plus Downsampling
* Thirty Years of Multimedia Computing: From Academic Vision to Ubiquitous Intelligence
* Vision-Language-Guided Adaptive Cross-Modal Fusion for Multispectral Object Detection Under Adverse Weather Conditions
35 for MultMedMag(32)

MultMedMag(33) * DRT: Dilated Recurrent Transformer for Event-Based Video Reconstruction
* ExamVision: A Dual-Task System for Academic Misconduct and Stress Detection in Online Exams
* Image Painter: An Optimized Stroke-Based Algorithm for Artistic Image Stylization
* Improving Scene Knowledge Referring Expression Comprehension With Large Language Models
* Multiscale Asymmetric Concurrent Network for Static Hand Gesture Recognition With Cross-Channel Attention Mechanism, A
* Pseudo and Reordered Subaperture Image Formats for Light Field Residual Compression
* Toward Improving Arbitrary-Shaped Text Detection With Boundary Adaptation in Noisy Scene Images
* Transformer-Based Decoupled Modality Feature Learning for Visible-Infrared Person Re-Identification
8 for MultMedMag(33)

MultMedMag(9) * Applications of video-content analysis and retrieval

MultSys( Vol No. ) * *Multimedia Systems

MultSys(1) * Automatic partitioning of full-motion video

MultSys(5) * Advances in Fractal Compression for Multimedia Applications
* Alive System: Wireless, Full-Body Interaction with Autonomous Agents, The
* Towards Video-Based Immersive Environments

MultSys(7) * Curvature Scale Space Image in Shape Similarity Retrieval
* feature-based algorithm for detecting and classifying production effects, A

MultSys(8) * Relevance Feedback for Image Retrieval: A Comprenhensive Review

MultToolApp( Vol No. ) * *Multimedia Tools and Applications

MultToolApp(14) * Audio Partitioning and Transcription for Broadcast Data Indexation
* Fixed Queries Array: A Fast and Economical Data Structure for Proximity Searching
* Guest Editorial: Content-Based Multimedia Indexing and Retrieval
* Multi-Modal Dialog Scene Detection Using Hidden Markov Models for Content-Based Multimedia Indexing
* Regions-of-Interest and Spatial Layout for Content-Based Image Retrieval
* Shot Change Detection Using Scene-Based Constraint
* ToCAI Description Scheme for Indexing and Retrieval of Multimedia Documents, The
7 for MultToolApp(14)

MultToolApp(27) * survey of MPEG-1 audio, video and semantic analysis techniques, A

MultToolApp(3) * Content-Based Representation and Retrieval of Visual Media: A State-of-the-Art Review
* Content-Based Retrieval for Trademark Registration
* Fractal-Based Clustering Approach in Large Visual Database-Systems, A
* Introduction to Special Issue on Representation and Retrieval of Visual Media in Multimedia Systems

MultToolApp(4) * Application of Video Semantics and Theme Representation in Automated Video Editing, The
* Automatic Video Database Indexing and Retrieval
* Introduction to Special Issue on Representation and Retrieval of Visual Media in Multimedia Systems II
* Supporting Content-Based Retrieval in Large Image Database-Systems
* Techniques for Fast Partitioning of Compressed and Uncompressed Video
* VIMS: A Video Information Management System

MultToolApp(41) * Independent query refinement and feature re-weighting using positive and negative examples for content-based image retrieval

MultToolApp(5) * Annotation Engine for Supporting Video Database Population, An
* Similarity Is a Geometer

MultToolApp(7) * Approach to a Content Based Retrieval of Multimedia Data, An
* Conceptual Modeling and Querying in Multimedia Databases

Index for "m"

Last update:24-Jul-26 16:37:25
Use price@usc.edu for comments.