19.6.1 Cross-Modal Indexing, Cross-Modal Retrieval

Chapter Contents (Back)
Cross-Modal Retrieval.

Costa Pereira, J., Coviello, E.[Emanuele], Doyle, G., Rasiwasia, N., Lanckriet, G.R.G.[Gert R.G.], Levy, R., Vasconcelos, N.M.,
On the Role of Correlation and Abstraction in Cross-Modal Multimedia Retrieval,
PAMI(36), No. 3, March 2014, pp. 521-535.
IEEE DOI 1403
image matching. E.g. use image to search for text. Correlation matching. Semantic matching. Semantic correlation matching. BibRef

Costa Pereira, J.[Jose], Vasconcelos, N.M.[Nuno M.],
Cross-modal domain adaptation for text-based regularization of image semantics in image retrieval systems,
CVIU(124), No. 1, 2014, pp. 123-135.
Elsevier DOI 1406
Content-based image retrieval BibRef

Zhai, X.H.[Xiao-Hua], Peng, Y.X.[Yu-Xin], Xiao, J.G.[Jian-Guo],
Learning Cross-Media Joint Representation With Sparse and Semisupervised Regularization,
CirSysVideo(24), No. 6, June 2014, pp. 965-978.
IEEE DOI 1407
Correlation BibRef

Peng, Y.X.[Yu-Xin], Qi, J.W.[Jin-Wei],
Quintuple-Media Joint Correlation Learning With Deep Compression and Regularization,
CirSysVideo(30), No. 8, August 2020, pp. 2709-2722.
IEEE DOI 2008
Media, Correlation, Semantics, Solid modeling, Data models, Image coding, Cross-media retrieval, network regularization BibRef

Peng, Y., Zhai, X., Zhao, Y., Huang, X.,
Semi-Supervised Cross-Media Feature Learning With Unified Patch Graph Regularization,
CirSysVideo(26), No. 3, March 2016, pp. 583-596.
IEEE DOI 1603
Correlation BibRef

Bellini, P.[Pierfrancesco], Cenni, D.[Daniele], Nesi, P.[Paolo],
Optimization of information retrieval for cross media contents in a best practice network,
MultInfoRetr(3), No. 3, September 2014, pp. 147-159.
Springer DOI 1408
BibRef

Kang, C., Xiang, S., Liao, S., Xu, C., Pan, C.,
Learning Consistent Feature Representation for Cross-Modal Multimedia Retrieval,
MultMed(17), No. 3, March 2015, pp. 370-381.
IEEE DOI 1502
Algorithm design and analysis BibRef

He, Y., Xiang, S., Kang, C., Wang, J., Pan, C.,
Cross-Modal Retrieval via Deep and Bidirectional Representation Learning,
MultMed(18), No. 7, July 2016, pp. 1363-1377.
IEEE DOI 1608
backpropagation BibRef

Zhang, S., Wang, X., Lin, Y., Tian, Q.,
Cross Indexing With Grouplets,
MultMed(17), No. 11, November 2015, pp. 1969-1979.
IEEE DOI 1511
Feature extraction BibRef

Chu, L., Zhang, Y., Li, G., Wang, S., Zhang, W., Huang, Q.,
Effective Multimodality Fusion Framework for Cross-Media Topic Detection,
CirSysVideo(26), No. 3, March 2016, pp. 556-569.
IEEE DOI 1603
Complexity theory BibRef

Ding, K.[Kun], Fan, B.[Bin], Huo, C.L.[Chun-Lei], Xiang, S.M.[Shi-Ming], Pan, C.H.[Chun-Hong],
Cross-Modal Hashing via Rank-Order Preserving,
MultMed(19), No. 3, March 2017, pp. 571-585.
IEEE DOI 1702
Binary codes BibRef

Jiang, B.[Bin], Yang, J.C.[Jia-Chen], Lv, Z.H.[Zhi-Han], Tian, K.[Kun], Meng, Q.G.[Qing-Gang], Yan, Y.[Yan],
Internet cross-media retrieval based on deep learning,
JVCIR(48), No. 1, 2017, pp. 356-366.
Elsevier DOI 1708
Cross-media, retrieval BibRef

Hu, Y., Zheng, L., Yang, Y., Huang, Y.,
Twitter100k: A Real-World Dataset for Weakly Supervised Cross-Media Retrieval,
MultMed(20), No. 4, April 2018, pp. 927-938.
IEEE DOI 1804
Electronic publishing, Encyclopedias, Internet, Optical character recognition software, Training, Visualization, weakly supervised method BibRef

Verma, Y.[Yashaswi], Jha, A.[Abhishek], Jawahar, C.V.,
Cross-specificity: modelling data semantics for cross-modal matching and retrieval,
MultInfoRetr(8), No. 2, June 2018, pp. 139-146.
Springer DOI 1805
BibRef

Dorfer, M.[Matthias], Schlüter, J.[Jan], Vall, A.[Andreu], Korzeniowski, F.[Filip], Widmer, G.[Gerhard],
End-to-end cross-modality retrieval with CCA projections and pairwise ranking loss,
MultInfoRetr(8), No. 2, June 2018, pp. 117-128.
Springer DOI 1805
BibRef

Lu, X.[Xu], Zhang, H.X.[Hua-Xiang], Sun, J.[Jiande], Wang, Z.H.[Zhen-Hua], Guo, P.[Peilian], Wan, W.[Wenbo],
Discriminative correlation hashing for supervised cross-modal retrieval,
SP:IC(65), 2018, pp. 221-230.
Elsevier DOI 1805
Cross-modal retrieval, Hashing, Subspace learning, Discriminant analysis BibRef

Wang, L.[Li], Zhu, L.[Lei], Dong, X.[Xiao], Liu, L.[Li], Sun, J.[Jiande], Zhang, H.X.[Hua-Xiang],
Joint Feature Selection and Graph Regularization for Modality-Dependent Cross-Modal Retrieval,
JVCIR(54), 2018, pp. 213-222.
Elsevier DOI 1806
Cross-modal retrieval, Feature selection, Subspace learning, Graph regularization BibRef

Zhong, F.M.[Fang-Ming], Chen, Z.K.[Zhi-Kui], Min, G.Y.[Ge-Yong],
Deep Discrete Cross-Modal Hashing for Cross-Media Retrieval,
PR(83), 2018, pp. 64-77.
Elsevier DOI 1808
Cross-modal retrieval, deep learning, discrete hashing, alternative optimization BibRef

Yuan, X.[Xu], Wang, G.Z.[Guang-Ze], Chen, Z.K.[Zhi-Kui], Zhong, F.M.[Fang-Ming],
CHOP: An orthogonal hashing method for zero-shot cross-modal retrieval,
PRL(145), 2021, pp. 247-253.
Elsevier DOI 2104
Zero-shot, Cross-modal retrieval, Orthogonal projection BibRef

Vukotic, V., Raymond, C., Gravier, G.,
A Crossmodal Approach to Multimodal Fusion in Video Hyperlinking,
MultMedMag(25), No. 2, April 2018, pp. 11-23.
IEEE DOI 1808
Computer architecture, Task analysis, Neural networks, Visualization, Streaming media, Hypertext systems, Training, multimedia BibRef

Liu, R., Wei, S., Zhao, Y., Zhu, Z., Wang, J.,
Multiview Cross-Media Hashing with Semantic Consistency,
MultMedMag(25), No. 2, April 2018, pp. 71-86.
IEEE DOI 1808
Media, Semantics, Correlation, Multimedia communication, Optimization, Feature extraction, hashing, cross-media, multiview, searching BibRef

Wang, D.[Di], Wang, Q.[Quan], Gao, X.[Xinbo],
Robust and Flexible Discrete Hashing for Cross-Modal Similarity Search,
CirSysVideo(28), No. 10, October 2018, pp. 2703-2715.
IEEE DOI 1811
Robustness, Training, Binary codes, Quantization (signal), Linear programming, Matrix decomposition, Sparse matrices, Hashing, unsupervised learning BibRef

Wang, D.[Di], Gao, X.B.[Xin-Bo], Wang, X.M.[Xiu-Mei], He, L.H.[Li-Huo],
Label Consistent Matrix Factorization Hashing for Large-Scale Cross-Modal Similarity Search,
PAMI(41), No. 10, October 2019, pp. 2466-2479.
IEEE DOI 1909
Semantics, Correlation, Training, Transforms, Binary codes, Image reconstruction, Sparse matrices, Hashing, multimodal, cross-modal BibRef

Wang, D.[Di], Gao, X.B.[Xin-Bo], Wang, X.M.[Xiu-Mei], He, L.[Lihuo], Yuan, B.[Bo],
Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval,
IP(25), No. 10, October 2016, pp. 4540-4554.
IEEE DOI 1610
Internet BibRef

Wang, D.[Di], Wang, Q.[Quan], He, L.[Lihuo], Gao, X.B.[Xin-Bo], Tian, Y.[Yumin],
Joint and individual matrix factorization hashing for large-scale cross-modal retrieval,
PR(107), 2020, pp. 107479.
Elsevier DOI 2008
Hashing, Multimodal, Retrieval, Cross-modal, Matrix factorization BibRef

Dong, F.[Fei], Nie, X.S.[Xiu-Shan], Liu, X.B.[Xing-Bo], Geng, L.L.[Lei-Lei], Wang, Q.[Qian],
Cross-modal hashing based on category structure preserving,
JVCIR(57), 2018, pp. 28-33.
Elsevier DOI 1812
Cross-modal retrieval, Supervised hashing, Category-specific structure preserving BibRef

Zhang, M.J.[Mei-Jia], Zhang, H.X.[Hua-Xiang], Li, J.Z.[Jun-Zheng], Wang, L.[Li], Fang, Y.X.[Yi-Xian], Sun, J.[Jiande],
Supervised graph regularization based cross media retrieval with intra and inter-class correlation,
JVCIR(58), 2019, pp. 1-11.
Elsevier DOI 1901
Cross media retrieval, Subspace learning, Supervised graph regularization BibRef

Yao, T.[Tao], Wang, G.[Gang], Yan, L.S.[Lian-Shan], Kong, X.W.[Xiang-Wei], Su, Q.T.[Qing-Tang], Zhang, C.M.[Cai-Ming], Tian, Q.[Qi],
Online latent semantic hashing for cross-media retrieval,
PR(89), 2019, pp. 1-11.
Elsevier DOI 1902
Cross-media retrieval, Online learning, Hashing, Latent semantic concept BibRef

Yao, T.[Tao], Kong, X.W.[Xiang-Wei], Fu, H.Y.[Hai-Yan], Tian, Q.[Qi],
Discrete Semantic Alignment Hashing for Cross-Media Retrieval,
Cyber(50), No. 12, December 2020, pp. 4896-4907.
IEEE DOI 2012
Semantics, Hash functions, Correlation, Quantization (signal), Optimization, Task analysis, Internet, Attribute, hashing BibRef

Dutta, T.[Titir], Biswas, S.[Soma],
Cross-modal retrieval in challenging scenarios using attributes,
PRL(125), 2019, pp. 618-624.
Elsevier DOI 1909
Cross-modal retrieval, Attributes, Unseen query, Low-resolution data BibRef

Liu, H.P.[Hua-Ping], Wang, F.[Feng], Zhang, X.Y.[Xin-Yu], Sun, F.C.[Fu-Chun],
Weakly-paired deep dictionary learning for cross-modal retrieval,
PRL(130), 2020, pp. 199-206.
Elsevier DOI 2002
Deep dictionary learning, Cross-modal retrieval, Weak pairing BibRef

Zhang, H.[Hong], Wang, T.[Ting], Dai, G.[Gang],
Semi-supervised cross-modal common representation learning with vector-valued manifold regularization,
PRL(130), 2020, pp. 335-344.
Elsevier DOI 2002
Cross-media retrieval, Vector-valued RKHS, Manifold regularization, Semi-supervised, Kernel method BibRef

Chaudhuri, U.[Ushasi], Banerjee, B.[Biplab], Bhattacharya, A.[Avik], Datcu, M.[Mihai],
CMIR-NET: A deep learning based model for cross-modal retrieval in remote sensing,
PRL(131), 2020, pp. 456-462.
Elsevier DOI 2004
Remote sensing, Cross-modal retrieval, Deep learning, Panchromatic, Multispectral, Audio samples BibRef

Chi, J.Z.[Jing-Ze], Peng, Y.X.[Yu-Xin],
Zero-Shot Cross-Media Embedding Learning With Dual Adversarial Distribution Network,
CirSysVideo(30), No. 4, April 2020, pp. 1173-1187.
IEEE DOI 2004
Semantics, Media, Correlation, Training, Dogs, Measurement, Cross-media retrieval, zero-shot learning, maximum mean discrepancy BibRef

Wu, F.[Fei], Jing, X.Y.[Xiao-Yuan], Wu, Z.Y.[Zhi-Yong], Ji, Y.[Yimu], Dong, X.[Xiwei], Luo, X.[Xiaokai], Huang, Q.H.[Qing-Hua], Wang, R.[Ruchuan],
Modality-specific and shared generative adversarial network for cross-modal retrieval,
PR(104), 2020, pp. 107335.
Elsevier DOI 2005
Cross-modal retrieval, Generative adversarial networks (GAN), Modality-specific feature learning, Modality-shared feature learning BibRef

Wu, F.[Fei], Luo, X.[Xiaokai], Huang, Q.H.[Qing-Hua], Wei, P.F.[Peng-Fei], Sun, Y.[Ying], Dong, X.[Xiwei], Wu, Z.Y.[Zhi-Yong],
Semantic Preserving Generative Adversarial Network for Cross-Modal Hashing,
ICIP21(2743-2747)
IEEE DOI 2201
Measurement, Quantization (signal), Image processing, Semantics, Focusing, Network architecture, cross-modal hashing, semantic preserving BibRef

Zhong, F.M.[Fang-Ming], Chen, Z.K.[Zhi-Kui], Min, G.Y.[Ge-Yong], Xia, F.[Feng],
A novel strategy to balance the results of cross-modal hashing,
PR(107), 2020, pp. 107523.
Elsevier DOI 2008
Cross-modal hashing, Semantic gap, Semantic augmentation, Cross-modal retrieval BibRef

Peng, Y., Chi, J.,
Unsupervised Cross-Media Retrieval Using Domain Adaptation With Scene Graph,
CirSysVideo(30), No. 11, November 2020, pp. 4368-4379.
IEEE DOI 2011
Media, Correlation, Visualization, Genomics, Bioinformatics, Training data, Training, Cross-media retrieval, domain adaptation, scene graph BibRef

Zhu, L., Song, J., Zhu, X., Zhang, C., Zhang, S., Yuan, X.,
Adversarial Learning-Based Semantic Correlation Representation for Cross-Modal Retrieval,
MultMedMag(27), No. 4, October 2020, pp. 79-90.
IEEE DOI 2012
Correlation, Semantics, Computer science, Internet, Streaming media BibRef

Chaudhuri, U.[Ushasi], Banerjee, B.[Biplab], Bhattacharya, A.[Avik], Datcu, M.[Mihai],
CrossATNet: A novel cross-attention based framework for sketch-based image retrieval,
IVC(104), 2020, pp. 104003.
Elsevier DOI 2012
Neural networks, Sketch-based image retrieval, Cross-modal retrieval, Deep-learning, Cross-attention network, Cross-triplets BibRef

Zhang, Y., Zhou, W., Wang, M., Tian, Q., Li, H.,
Deep Relation Embedding for Cross-Modal Retrieval,
IP(30), 2021, pp. 617-627.
IEEE DOI 2012
Semantics, Feature extraction, Visualization, Computational modeling, Task analysis, Training, Optimization, relation BibRef

Zhang, L.[Lei], Chen, L.T.[Lei-Ting], Ou, W.H.[Wei-Hua], Zhou, C.[Chuan],
Semi-supervised cross-modal representation learning with GAN-based Asymmetric Transfer Network,
JVCIR(73), 2020, pp. 102899.
Elsevier DOI 2012
Cross-modal retrieval, Modality gap, Generative adversarial network BibRef

Wang, L.[Lu], Yang, J.[Jie], Zareapoor, M.[Masoumeh], Zheng, Z.L.[Zhong-Long],
Cluster-wise unsupervised hashing for cross-modal similarity search,
PR(111), 2021, pp. 107732.
Elsevier DOI 2012
Cross-modal similarity retrieval, Multi-view clustering, The cluster-wise code-prototypes, Cross-modal hashing, BibRef

Meng, M., Wang, H., Yu, J., Chen, H., Wu, J.,
Asymmetric Supervised Consistent and Specific Hashing for Cross-Modal Retrieval,
IP(30), 2021, pp. 986-1000.
IEEE DOI 2012
Semantics, Optimization, Quantization (signal), Correlation, Symmetric matrices, Image coding, Sparse matrices, multimedia BibRef

Matsubara, T.[Takashi],
Target-Oriented Deformation of Visual-Semantic Embedding Space,
IEICE(E104-D), No. 1, January 2021, pp. 24-33.
WWW Link. 2101
BibRef

Nie, X., Wang, B., Li, J., Hao, F., Jian, M., Yin, Y.,
Deep Multiscale Fusion Hashing for Cross-Modal Retrieval,
CirSysVideo(31), No. 1, January 2021, pp. 401-410.
IEEE DOI 2101
Semantics, Machine learning, Training data, Media, Correlation, Retrieval, hashing, deep learning, cross-modal BibRef

Liu, X.[Xin], Hu, Z.K.[Zhi-Kai], Ling, H.B.[Hai-Bin], Cheung, Y.M.[Yiu-Ming],
MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient Cross-Modal Retrieval,
PAMI(43), No. 3, March 2021, pp. 964-981.
IEEE DOI 2102
Lips, Semantics, Adaptation models, Task analysis, Encoding, Correlation, Cross-modal retrieval, semantic correlation matrix BibRef

Wu, Y., Wang, S., Song, G., Huang, Q.,
Augmented Adversarial Training for Cross-Modal Retrieval,
MultMed(23), 2021, pp. 559-571.
IEEE DOI 2102
image representation, image retrieval, neural nets, text analysis, adversarial training process, adversa-rial training BibRef

Lin, Q., Cao, W., He, Z., He, Z.,
Mask Cross-Modal Hashing Networks,
MultMed(23), 2021, pp. 550-558.
IEEE DOI 2102
deep learning (artificial intelligence), feature extraction, file organisation, image retrieval, text analysis, cross-modal retrieval BibRef

Qi, M., Qin, J., Yang, Y., Wang, Y., Luo, J.,
Semantics-Aware Spatial-Temporal Binaries for Cross-Modal Video Retrieval,
IP(30), 2021, pp. 2989-3004.
IEEE DOI 2102
Semantics, Binary codes, Feature extraction, Visualization, Task analysis, Natural languages, Stochastic processes, natural language BibRef

Wu, J.L.[Jian-Long], Xie, X.X.[Xing-Xu], Nie, L.Q.[Li-Qiang], Lin, Z.C.[Zhou-Chen], Zha, H.B.[Hong-Bin],
Reconstruction regularized low-rank subspace learning for cross-modal retrieval,
PR(113), 2021, pp. 107813.
Elsevier DOI 2103
Cross-modal retrieval, Low-rank subspace learning, Reconstruction regularization BibRef

Zou, X.T.[Xi-Tao], Wang, X.Z.[Xin-Zhi], Bakker, E.M.[Erwin M.], Wu, S.[Song],
Multi-label semantics preserving based deep cross-modal hashing,
SP:IC(93), 2021, pp. 116131.
Elsevier DOI 2103
Multi-modal retrieval, Deep cross-modal hashing, Multi-label semantic learning BibRef

Shu, X.[Xin], Zhao, G.Y.[Guo-Ying],
Scalable multi-label canonical correlation analysis for cross-modal retrieval,
PR(115), 2021, pp. 107905.
Elsevier DOI 2104
Canonical correlation analysis, Semantic transformation, Cross-modal retrieval, Singular value decomposition BibRef

Song, G.[Ge], Tan, X.Y.[Xiao-Yang],
Real-world Cross-modal Retrieval via Sequential Learning,
MultMed(23), 2021, pp. 1708-1721.
IEEE DOI 2106
BibRef
Earlier:
Sequential Learning for Cross-Modal Retrieval,
CroMoL19(4531-4539)
IEEE DOI 2004
Plugs, Task analysis, Data models, Learning systems, Brain modeling, Adaptation models, Technological innovation, meta learning. information retrieval, learning (artificial intelligence), multimodal data, meta learning BibRef

Chen, W.[Wei], Liu, Y.[Yu], Bakker, E.M.[Erwin M.], Lew, M.S.[Michael S.],
Integrating information theory and adversarial learning for cross-modal retrieval,
PR(117), 2021, pp. 107983.
Elsevier DOI 2106
Cross-modal retrieval, Shannon information theory, Adversarial learning, Modality uncertainty, Data imbalance BibRef

Huang, Z.Y.[Zhen-Yu], Zhou, J.T.[Joey Tianyi], Zhu, H.Y.[Hong-Yuan], Zhang, C.Q.[Chang-Qing], Lv, J.C.[Jian-Cheng], Peng, X.[Xi],
Deep Spectral Representation Learning From Multi-View Data,
IP(30), 2021, pp. 5352-5362.
IEEE DOI 2106
Deep learning, Laplace equations, Neural networks, Collaboration, Data models, Task analysis, cross-modal retrieval BibRef

Wen, X.[Xin], Han, Z.Z.[Zhi-Zhong], Liu, Y.S.[Yu-Shen],
CMPD: Using Cross Memory Network With Pair Discrimination for Image-Text Retrieval,
CirSysVideo(31), No. 6, June 2021, pp. 2427-2437.
IEEE DOI 2106
Semantics, Task analysis, Training, Generators, Optimization, Marine vehicles, Retrieval, cross-modal retrieval, adversarial learning BibRef

Liu, J.H.[Jun-Hao], Yang, M.[Min], Li, C.M.[Cheng-Ming], Xu, R.F.[Rui-Feng],
Improving Cross-Modal Image-Text Retrieval With Teacher-Student Learning,
CirSysVideo(31), No. 8, August 2021, pp. 3242-3253.
IEEE DOI 2108
Semantics, Task analysis, Data models, Neural networks, Correlation, Binary codes, Feature extraction, teacher-student learning BibRef

Song, G.[Ge], Tan, X.Y.[Xiao-Yang], Zhao, J.[Jun], Yang, M.[Ming],
Deep robust multilevel semantic hashing for multi-label cross-modal retrieval,
PR(120), 2021, pp. 108084.
Elsevier DOI 2109
Hashing, Multi-label, Cross-modal retrieval, Deep learning BibRef

Fang, Y.[Yuzhi],
Robust multimodal discrete hashing for cross-modal similarity search,
JVCIR(79), 2021, pp. 103256.
Elsevier DOI 2109
Hashing, Robust, Cross-modal retrieval, Unsupervised learning BibRef

Nie, X.S.[Xiu-Shan], Liu, X.B.[Xing-Bo], Xi, X.M.[Xiao-Ming], Li, C.L.[Cheng-Long], Yin, Y.L.[Yi-Long],
Fast Unmediated Hashing for Cross-Modal Retrieval,
CirSysVideo(31), No. 9, September 2021, pp. 3669-3678.
IEEE DOI 2109
Semantics, Training, Optimization, Training data, Binary codes, Correlation, Videos, Cross-modal retrieval, hashing, unmediated, double supervision BibRef

Zhang, D.L.[Dong-Lin], Wu, X.J.[Xiao-Jun], Yin, H.F.[He-Feng], Kittler, J.V.[Josef V.],
MOON: Multi-hash codes joint learning for cross-media retrieval,
PRL(151), 2021, pp. 19-25.
Elsevier DOI 2110
Cross-media retrieval, Hashing, Discrete optimization, Joint learning BibRef

Hu, P.[Peng], Peng, X.[Xi], Zhu, H.Y.[Hong-Yuan], Lin, J.[Jie], Zhen, L.[Liangli], Peng, D.[Dezhong],
Joint Versus Independent Multiview Hashing for Cross-View Retrieval,
Cyber(51), No. 10, October 2021, pp. 4982-4993.
IEEE DOI 2110
Semantics, Decoding, Training, Computer science, Kernel, Logistics, Cybernetics, Common hamming space, cross-view retrieval, multiview representation learning BibRef

Zhang, D.L.[Dong-Lin], Wu, X.J.[Xiao-Jun],
Robust and discrete matrix factorization hashing for cross-modal retrieval,
PR(122), 2022, pp. 108343.
Elsevier DOI 2112
Cross-modal retrieval, Hashing, Autoencoder, Discrete optimization, BibRef

Zhang, D.L.[Dong-Lin], Wu, X.J.[Xiao-Jun], Liu, Z.[Zhen], Yu, J.[Jun], Kitter, J.V.[Josef V.],
Fast Discrete Cross-Modal Hashing Based on Label Relaxation and Matrix Factorization,
ICPR21(4845-4850)
IEEE DOI 2105
Technological innovation, Quantization (signal), Databases, Instruments, Semantics, Binary codes, Media BibRef

Zhang, L.[Li], Wu, X.Q.[Xiang-Qian],
Multi-task framework based on feature separation and reconstruction for cross-modal retrieval,
PR(122), 2022, pp. 108217.
Elsevier DOI 2112
Cross-modal retrieval, Feature separation, Image reconstruction, Text reconstruction BibRef

Liu, F.[Fangcen], Gao, C.Q.[Chen-Qiang], Sun, Y.Q.[Yong-Qing], Zhao, Y.[Yue], Yang, F.[Feng], Qin, A.[Anyong], Meng, D.Y.[De-Yu],
Infrared and Visible Cross-Modal Image Retrieval Through Shared Features,
CirSysVideo(31), No. 11, November 2021, pp. 4485-4496.
IEEE DOI 2112
Image retrieval, Feature extraction, Task analysis, Imaging, Semantics, Image color analysis, Cameras, maximum mean discrepancy BibRef

Wang, C.Y.[Chao-Yi], Li, L.[Liang], Yan, C.G.[Cheng-Gang], Wang, Z.[Zhan], Sun, Y.Q.[Yao-Qi], Zhang, J.Y.[Ji-Yong],
Cross-modal semantic correlation learning by Bi-CNN network,
IET-IPR(15), No. 14, 2021, pp. 3674-3684.
DOI Link 2112
BibRef

Chakraborty, B.[Bela], Wang, P.[Peng], Wang, L.[Lei],
Inter-Modality Fusion Based Attention for Zero-Shot Cross-Modal Retrieval,
ICIP21(2648-2652)
IEEE DOI 2201
Training, Heating systems, Image processing, Semantics, Pipelines, MIMICs, Zero-shot Learning, Inter-Modality Fusion, Cross-modal Retrieval BibRef

Zhang, P.F.[Peng-Fei], Li, Y.[Yang], Huang, Z.[Zi], Xu, X.S.[Xin-Shun],
Aggregation-Based Graph Convolutional Hashing for Unsupervised Cross-Modal Retrieval,
MultMed(24), 2022, pp. 466-479.
IEEE DOI 2202
Semantics, Convolutional codes, Binary codes, Convolution, Measurement, Feature extraction, Sparse matrices, Multimodal, graph convolutional networks BibRef

Shin, A.[Andrew], Ishii, M.[Masato], Narihira, T.[Takuya],
Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision,
IJCV(130), No. 2, February 2022, pp. 435-454.
Springer DOI 2202
BibRef

Ji, Z.[Zhong], Wang, H.R.[Hao-Ran], Han, J.G.[Jun-Gong], Pang, Y.W.[Yan-Wei],
SMAN: Stacked Multimodal Attention Network for Cross-Modal Image-Text Retrieval,
Cyber(52), No. 2, February 2022, pp. 1086-1097.
IEEE DOI 2202
Visualization, Semantics, Feature extraction, Correlation, Task analysis, Extraterrestrial measurements, Deep learning, vision and language BibRef

Ma, J.J.[Jing-Jing], Shi, D.[Duanpeng], Tang, X.[Xu], Zhang, X.R.[Xiang-Rong], Jiao, L.C.[Li-Cheng],
Dual Modality Collaborative Learning for Cross-Source Remote Sensing Retrieval,
RS(14), No. 6, 2022, pp. xx-yy.
DOI Link 2204
BibRef

Huang, Y.[Yan], Wang, J.D.[Jing-Dong], Wang, L.[Liang],
Few-Shot Image and Sentence Matching via Aligned Cross-Modal Memory,
PAMI(44), No. 6, June 2022, pp. 2968-2983.
IEEE DOI 2205
BibRef
Earlier: A1, A3, Only:
ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching,
ICCV19(5773-5782)
IEEE DOI 2004
Adaptation models, Task analysis, Pattern matching, Logic gates, Visualization, Image color analysis, Data models, similarity gated fusion. image matching, learning (artificial intelligence), storage management, few-shot content, sentence matching tasks, Micromechanical devices BibRef

Xu, X.[Xing], Lin, K.Y.[Kai-Yi], Yang, Y.[Yang], Hanjalic, A.[Alan], Shen, H.T.[Heng Tao],
Joint Feature Synthesis and Embedding: Adversarial Cross-Modal Retrieval Revisited,
PAMI(44), No. 6, June 2022, pp. 3030-3047.
IEEE DOI 2205
Art, Generative adversarial networks, Training, Correlation, Visualization, Standards, Cross-modal retrieval, knowledge transfer BibRef

Duan, Y.X.[You-Xiang], Chen, N.[Ning], Zhang, P.Y.[Pei-Ying], Kumar, N.[Neeraj], Chang, L.[Lunjie], Wen, W.[Wu],
MS2GAH: Multi-label semantic supervised graph attention hashing for robust cross-modal retrieval,
PR(128), 2022, pp. 108676.
Elsevier DOI 2205
Cross-modal retrieval, Deep hashing, Graph attention network BibRef


Wang, Y.[Yun], Zhang, T.[Tong], Zhang, X.[Xueya], Cui, Z.[Zhen], Huang, Y.[Yuge], Shen, P.C.[Peng-Cheng], Li, S.X.[Shao-Xin], Yang, J.[Jian],
Wasserstein Coupled Graph Learning for Cross-Modal Retrieval,
ICCV21(1793-1802)
IEEE DOI 2203
Training, Representation learning, Analytical models, Dictionaries, Correlation, Computational modeling, Vision + language, BibRef

Cai, G.[Guanyu], Zhang, J.[Jun], Jiang, X.Y.[Xin-Yang], Gong, Y.[Yifei], He, L.[Lianghua], Yu, F.[Fufu], Peng, P.[Pai], Guo, X.W.[Xiao-Wei], Huang, F.Y.[Fei-Yue], Sun, X.[Xing],
Ask amp;Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query,
ICCV21(1815-1824)
IEEE DOI 2203
Training, Codes, Computational modeling, Image retrieval, Search problems, Robustness, Vision + language, Image and video retrieval BibRef

Wen, K.Y.[Ke-Yu], Xia, J.[Jin], Huang, Y.Y.[Yuan-Yuan], Li, L.[Linyang], Xu, J.Y.[Jia-Yan], Shao, J.[Jie],
COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation,
ICCV21(2188-2197)
IEEE DOI 2203
Visualization, Codes, Computational modeling, Image retrieval, Semantics, Transformers, Vision + language, Representation learning BibRef

Patrick, M.[Mandela], Huang, P.Y.[Po-Yao], Misra, I.[Ishan], Metze, F.[Florian], Vedaldi, A.[Andrea], Asano, Y.M.[Yuki M.], Henriques, J.[João],
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning,
ICCV21(10540-10552)
IEEE DOI 2203
Representation learning, Costs, Codes, Computational modeling, Crops, Image representation, Representation learning, Vision + other modalities BibRef

Lin, M.X.[Ming-Xian], Yang, J.[Jie], Wang, H.[He], Lai, Y.K.[Yu-Kun], Jia, R.[Rongfei], Zhao, B.Q.[Bin-Qiang], Gao, L.[Lin],
Single Image 3D Shape Retrieval via Cross-Modal Instance and Category Contrastive Learning,
ICCV21(11385-11395)
IEEE DOI 2203
Representation learning, Deep learning, Shape, Image color analysis, Pipelines, Gray-scale, 3D from a single image and shape-from-x BibRef

Zhan, X.[Xunlin], Wu, Y.X.[Yang-Xin], Dong, X.[Xiao], Wei, Y.[Yunchao], Lu, M.[Minlong], Zhang, Y.[Yichi], Xu, H.[Hang], Liang, X.D.[Xiao-Dan],
Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-Modal Pretraining,
ICCV21(11762-11771)
IEEE DOI 2203
Industries, Measurement, Codes, Transformers, Solids, Electronic commerce, Image and video retrieval, Vision + language BibRef

Changpinyo, S.[Soravit], Pont-Tuset, J.[Jordi], Ferrari, V.[Vittorio], Soricut, R.[Radu],
Telling the What while Pointing to the Where: Multimodal Queries for Image Retrieval,
ICCV21(12116-12126)
IEEE DOI 2203
Location awareness, Error analysis, Computational modeling, Image retrieval, Natural languages, Mice, Vision + other modalities BibRef

Gabeur, V.[Valentin], Nagrani, A.[Arsha], Sun, C.[Chen], Alahari, K.[Karteek], Schmid, C.[Cordelia],
Masking Modalities for Cross-modal Video Retrieval,
WACV22(2111-2120)
IEEE DOI 2202
Manuals, Benchmark testing, Motion pictures, Natural language processing, Proposals, Speech processing, Scene Understanding BibRef

Galanopoulos, D.[Damianos], Mezaris, V.[Vasileios],
Hard-Negatives or Non-Negatives? A Hard-Negative Selection Strategy for Cross-Modal Retrieval Using the Improved Marginal Ranking Loss,
ViRaL21(2312-2316)
IEEE DOI 2112
Training, Computational modeling, Computer architecture, Network architecture BibRef

Jing, L.L.[Long-Long], Vahdani, E.[Elahe], Tan, J.X.[Jia-Xing], Tian, Y.L.[Ying-Li],
Cross-Modal Center Loss for 3D Cross-Modal Retrieval,
CVPR21(3141-3150)
IEEE DOI 2111
Solid modeling, Computational modeling, Metadata, Feature extraction, Pattern recognition BibRef

Hu, P.[Peng], Peng, X.[Xi], Zhu, H.Y.[Hong-Yuan], Zhen, L.[Liangli], Lin, J.[Jie],
Learning Cross-Modal Retrieval with Noisy Labels,
CVPR21(5399-5409)
IEEE DOI 2111
Costs, Annotations, Interference, Pattern recognition, Noise measurement, Labeling BibRef

Chun, S.[Sanghyuk], Oh, S.J.[Seong Joon], Sampaio de Rezende, R.[Rafael], Kalantidis, Y.[Yannis], Larlus, D.[Diane],
Probabilistic Embeddings for Cross-Modal Retrieval,
CVPR21(8411-8420)
IEEE DOI 2111
Uncertainty, Codes, Databases, Annotations, Tools, Benchmark testing BibRef

Croitoru, I.[Ioana], Bogolin, S.V.[Simion-Vlad], Leordeanu, M.[Marius], Jin, H.L.[Hai-Lin], Zisserman, A.[Andrew], Albanie, S.[Samuel], Liu, Y.[Yang],
TeachText: CrossModal Generalized Distillation for Text-Video Retrieval,
ICCV21(11563-11573)
IEEE DOI 2203
Visualization, Codes, Computational modeling, Noise reduction, Computer architecture, Benchmark testing, Vision + language BibRef

Liu, Y.[Yang], Chen, Q.[Qingchao], Albanie, S.[Samuel],
Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval,
CVPR21(14949-14959)
IEEE DOI 2111
Visualization, Prototypes, Pattern recognition, Task analysis, Mutual information, Videos BibRef

Salvador, A.[Amaia], Gundogdu, E.[Erhan], Bazzani, L.[Loris], Donoser, M.[Michael],
Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning,
CVPR21(15470-15479)
IEEE DOI 2111
Training, Codes, Computational modeling, Semantics, Machine learning, Transformers BibRef

Dzabraev, M.[Maksim], Kalashnikov, M.[Maksim], Komkov, S.[Stepan], Petiushko, A.[Aleksandr],
MDMMT: Multidomain Multimodal Transformer for Video Retrieval,
HVU21(3349-3358)
IEEE DOI 2109
Training, Benchmark testing, Pattern recognition, Task analysis BibRef

Wang, K.[Kai], Herranz, L.[Luis], van de Weijer, J.[Joost],
Continual learning in cross-modal retrieval,
OmniCV21(3623-3633)
IEEE DOI 2109
Training, Visualization, Human intelligence, Focusing, Interference, Tools, Pattern recognition BibRef

Mafla, A.[Andrés], Rezende, R.S.[Rafael S.], Gómez, L.[Lluís], Larlus, D.[Diane], Karatzas, D.[Dimosthenis],
StacMR: Scene-Text Aware Cross-Modal Retrieval,
WACV21(2219-2229)
IEEE DOI 2106
Visualization, Annotations, Computational modeling, Semantics, Task analysis BibRef

Feng, C.T.[Chang-Ting], Li, D.G.[Da-Gang], Zheng, J.W.[Jing-Wei],
Improving Supervised Cross-modal Retrieval with Semantic Graph Embedding,
MMMod21(I:187-199).
Springer DOI 2106
BibRef

Wen, Z.Y.[Zhen-Yu], Feng, A.[Aimin],
Deep Centralized Cross-modal Retrieval,
MMMod21(I:443-455).
Springer DOI 2106
BibRef

Li, Z.X.[Zhi-Xin], Ling, F.[Feng], Xu, C.S.[Chuan-Sheng], Zhang, C.L.[Can-Long], Ma, H.F.[Hui-Fang],
Cross-Media Hash Retrieval Using Multi-Head Attention Network,
ICPR21(1290-1297)
IEEE DOI 2105
Correlation, Semantics, Neural networks, Media, Extraterrestrial measurements, cross-media retrieval BibRef

Jin, C.[Cong], Zhang, T.[Tian], Liu, S.X.[Shou-Xun], Tie, Y.[Yun], Lv, X.[Xin], Li, J.G.[Jian-Guang], Yan, W.C.[Wen-Cai], Yan, M.[Ming], Xu, Q.[Qian], Guan, Y.C.[Yi-Cong], Yang, Z.G.[Zheng-Gougou],
Cross-modal Deep Learning Applications: Audio-visual Retrieval,
MMDLCA20(301-313).
Springer DOI 2103
BibRef

Thomas, C.[Christopher], Kovashka, A.[Adriana],
Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval,
ECCV20(XVIII:317-335).
Springer DOI 2012
BibRef

Wang, Z., Liu, X., Li, H., Sheng, L., Yan, J., Wang, X., Shao, J.,
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval,
ICCV19(5763-5772)
IEEE DOI 2004
entropy, feature extraction, image matching, image retrieval, message passing, natural language processing, text analysis, Task analysis BibRef

Nawaz, S., Janjua, M.K., Gallo, I., Mahmood, A., Calefati, A., Shafait, F.,
Do Cross Modal Systems Leverage Semantic Relationships?,
CroMoL19(4501-4510)
IEEE DOI 2004
image representation, image retrieval, image segmentation, learning (artificial intelligence), Text to Image BibRef

Su, S., Zhong, Z., Zhang, C.,
Deep Joint-Semantics Reconstructing Hashing for Large-Scale Unsupervised Cross-Modal Retrieval,
ICCV19(3027-3035)
IEEE DOI 2004
binary codes, image coding, image retrieval, multimedia computing, neural nets, binary codes, reconstructing framework, DJSRH, Correlation BibRef

Ning, X.C.[Xue-Cheng], Yang, X.S.[Xiao-Shan], Xu, C.S.[Chang-Sheng],
Multi-Hop Interactive Cross-modal Retrieval,
MMMod20(II:681-693).
Springer DOI 2003
BibRef

Cornia, M.[Marcella], Baraldi, L.[Lorenzo], Tavakoli, H.R.[Hamed R.], Cucchiara, R.[Rita],
Towards Cycle-Consistent Models for Text and Image Retrieval,
WiCV-E18(IV:687-691).
Springer DOI 1905
BibRef

Surís, D.[Didac], Duarte, A.[Amanda], Salvador, A.[Amaia], Torres, J.[Jordi], Giró-i-Nieto, X.[Xavier],
Cross-modal Embeddings for Video and Audio Retrieval,
WiCV-E18(IV:711-716).
Springer DOI 1905
BibRef

Liu, C.L.[Chen-Lu], Xu, X.[Xing], Yang, Y.[Yang], Lu, H.M.[Hui-Min], Shen, F.M.[Fu-Min], Ji, Y.L.[Yan-Li],
Domain Invariant Subspace Learning for Cross-Modal Retrieval,
MMMod18(II:94-105).
Springer DOI 1802
BibRef

Yuan, Y.X.[Yu-Xin], Peng, Y.X.[Yu-Xin],
Recursive Pyramid Network with Joint Attention for Cross-Media Retrieval,
MMMod18(I:405-416).
Springer DOI 1802
BibRef

Jia, Y.H.[Yu-Hua], Bai, L.[Liang], Wang, P.[Peng], Guo, J.L.[Jin-Lin], Xie, Y.X.[Yu-Xiang], Yu, T.Y.[Tian-Yuan],
Utilizing Locality-Sensitive Hash Learning for Cross-Media Retrieval,
MMMod17(I: 550-561).
Springer DOI 1701
BibRef

Shang, X.[Xindi], Zhang, H.[Hanwang], Chua, T.S.[Tat-Seng],
Deep Learning Generic Features for Cross-Media Retrieval,
MMMod16(I: 264-275).
Springer DOI 1601
BibRef

Huang, L.[Lei], Peng, Y.X.[Yu-Xin],
Cross-Media Retrieval via Semantic Entity Projection,
MMMod16(I: 276-288).
Springer DOI 1601
BibRef

Gu, Y.[Yun], Xue, H.Y.[Hao-Yang], Yang, J.[Jie], Shi, P.F.[Peng-Fei],
Cross-modality hashing with partial correspondence,
ICIP15(1925-1929)
IEEE DOI 1512
Cross-modality; Hashing; Multimedia Search; Partial Correspondence BibRef

Zhang, H.[Hong], Chen, L.[Li],
Learning optimal data representation for cross-media retrieval,
ICIP12(1925-1928).
IEEE DOI 1302
BibRef

Lin, W.X.[Wan-Xia], Lu, T.[Tong], Su, F.[Feng],
A Novel Multi-modal Integration and Propagation Model for Cross-Media Information Retrieval,
MMMod12(740-749).
Springer DOI 1201
BibRef

Chapter on Implementations and Applications, Databases, QBIC, Video Analysis, Hardware and Software, Inspection continues in
Video Delivery, Video-on-Demand, Indexing, Techniques, Systems .


Last update:May 21, 2022 at 16:37:58