20.4.3.3.6 Image-Text Matching, Image Text Retrieval, Image-Text Retrieval

Chapter Contents (Back)
Image Text Retrieval. Image-Text Matching.

Zhou, N.[Ning], Fan, J.P.[Jian-Ping],
Automatic image-text alignment for large-scale web image indexing and retrieval,
PR(48), No. 1, 2015, pp. 205-219.
Elsevier DOI 1410
Automatic image-text alignment BibRef

Huang, F.R.[Fei-Ran], Zhang, X.M.[Xiao-Ming], Zhao, Z.H.[Zhong-Hua], Li, Z.J.[Zhou-Jun],
Bi-Directional Spatial-Semantic Attention Networks for Image-Text Matching,
IP(28), No. 4, April 2019, pp. 2008-2020.
IEEE DOI 1901
Word to region and visual object to words. image matching, image retrieval, learning (artificial intelligence), spatial-semantic BibRef

Otto, C.[Christian], Springstein, M.[Matthias], Anand, A.[Avishek], Ewerth, R.[Ralph],
Characterization and classification of semantic image-text relations,
MultInfoRetr(9), No. 1, March 2020, pp. 31-45.
Springer DOI 2003
Textual information with the imagery. BibRef

Niu, K.[Kai], Huang, Y.[Yan], Wang, L.[Liang],
Re-ranking image-text matching by adaptive metric fusion,
PR(104), 2020, pp. 107351.
Elsevier DOI 2005
Image-text matching, Re-ranking method, Adaptive metric fusion BibRef

Wen, K.Y.[Ke-Yu], Gu, X.D.[Xiao-Dong], Cheng, Q.R.[Qing-Rong],
Learning Dual Semantic Relations With Graph Attention for Image-Text Matching,
CirSysVideo(31), No. 7, July 2021, pp. 2866-2879.
IEEE DOI 2107
Semantics, Visualization, Task analysis, Feature extraction, Birds, Automobiles, Image retrieval, Cross-modal retrieval, image text matching BibRef

Yang, S.[Song], Li, Q.[Qiang], Li, W.H.[Wen-Hui], Li, X.[Xuanya], Liu, A.A.[An-An],
Dual-Level Representation Enhancement on Characteristic and Context for Image-Text Retrieval,
CirSysVideo(32), No. 11, November 2022, pp. 8037-8050.
IEEE DOI 2211
Semantics, Visualization, Feature extraction, Correlation, Learning systems, Task analysis, Filtration, image-text retrieval BibRef

Jing, Y.[Ya], Wang, W.[Wei], Wang, L.[Liang], Tan, T.N.[Tie-Niu],
Learning Aligned Image-Text Representations Using Graph Attentive Relational Network,
IP(30), 2021, pp. 1840-1852.
IEEE DOI 2101
Graph neural networks, Visualization, Semantics, Task analysis, Feature extraction, Annotations, Recurrent neural networks, graph neural network BibRef

Zhao, F.[Fang], Huang, Y.Z.[Yong-Zhen], Wang, L.[Liang], Tan, T.N.[Tie-Niu],
Deep Semantic Ranking Based Hashing for Multi-Label Image Retrieval,
CVPR15(1556-1564)
IEEE DOI 1510
BibRef

Lan, H.[Hong], Zhang, P.[Pufen],
Learning and Integrating Multi-Level Matching Features for Image-Text Retrieval,
SPLetters(29), 2022, pp. 374-378.
IEEE DOI 2202
Pattern matching, Feature extraction, Visualization, Head, Task analysis, Semantics, Recurrent neural networks, attention mechanism BibRef

Wu, J.[Jie], Wu, C.L.[Chun-Lei], Lu, J.[Jing], Wang, L.Q.[Lei-Quan], Cui, X.R.[Xue-Rong],
Region Reinforcement Network With Topic Constraint for Image-Text Matching,
CirSysVideo(32), No. 1, January 2022, pp. 388-397.
IEEE DOI 2201
Semantics, Visualization, Petroleum, Linear programming, Cameras, Training data, Image-text matching, topic constraint BibRef

Malali, N.[Noam], Keller, Y.[Yosi],
Learning to Embed Semantic Similarity for Joint Image-Text Retrieval,
PAMI(44), No. 12, December 2022, pp. 10252-10260.
IEEE DOI 2212
Semantics, Training, Measurement, Fasteners, Visualization, Correlation, Deep learning, Text and image fusion, deep learning, joint embedding BibRef

Tian, M.X.[Meng-Xiao], Wu, X.X.[Xin-Xiao], Jia, Y.D.[Yun-De],
Adaptive Latent Graph Representation Learning for Image-Text Matching,
IP(32), 2023, pp. 471-482.
IEEE DOI 2301
Visualization, Feature extraction, Representation learning, Image edge detection, Adaptation models, Task analysis, Semantics, graph variational autoencoder BibRef

Li, K.P.[Kun-Peng], Zhang, Y.L.[Yu-Lun], Li, K.[Kai], Li, Y.Y.[Yuan-Yuan], Fu, Y.[Yun],
Image-Text Embedding Learning via Visual and Textual Semantic Reasoning,
PAMI(45), No. 1, January 2023, pp. 641-656.
IEEE DOI 2212
BibRef
Earlier:
Visual Semantic Reasoning for Image-Text Matching,
ICCV19(4653-4661)
IEEE DOI 2004
Semantics, Cognition, Visualization, Correlation, Image representation, Logic gates, Image-text retrieval, deep learning. convolutional neural nets, graph theory, image matching, image retrieval, Image edge detection BibRef

Diao, H.W.[Hai-Wen], Zhang, Y.[Ying], Liu, W.[Wei], Ruan, X.[Xiang], Lu, H.C.[Hu-Chuan],
Plug-and-Play Regulators for Image-Text Matching,
IP(32), 2023, pp. 2322-2334.
IEEE DOI 2305
Regulators, Visualization, Semantics, Regulation, Pattern matching, Feature extraction, Image-text matching, plug-and-play operation BibRef

Tian, Y.M.[Yu-Min], Ding, A.[Aqiang], Wang, D.[Di], Luo, X.M.[Xue-Mei], Wan, B.[Bo], Wang, Y.F.[Yi-Feng],
Bi-Attention enhanced representation learning for image-text matching,
PR(140), 2023, pp. 109548.
Elsevier DOI 2305
Image-text matching, Bi-attention, Polynomial loss BibRef

Zhang, K.[Kun], Mao, Z.D.[Zhen-Dong], Liu, A.A.[An-An], Zhang, Y.D.[Yong-Dong],
Unified Adaptive Relevance Distinguishable Attention Network for Image-Text Matching,
MultMed(25), 2023, pp. 1320-1332.
IEEE DOI 2305
Semantics, Optimization, Visualization, Training, Task analysis, Representation learning, Correlation, Image-text matching, unified adaptive relevance distinguishable learning BibRef

Liu, Z.[Zejun], Chen, F.L.[Fang-Lin], Xu, J.[Jun], Pei, W.J.[Wen-Jie], Lu, G.M.[Guang-Ming],
Image-Text Retrieval With Cross-Modal Semantic Importance Consistency,
CirSysVideo(33), No. 5, May 2023, pp. 2465-2476.
IEEE DOI 2305
Semantics, Visualization, Task analysis, Representation learning, Recurrent neural networks, Glass, Feature extraction, Cross-modal, alignment BibRef

Shang, H.[Heng], Zhao, G.S.[Guo-Shuai], Shi, J.[Jing], Qian, X.M.[Xue-Ming],
A Multiview Text Imagination Network Based on Latent Alignment for Image-Text Matching,
IEEE_Int_Sys(38), No. 3, May 2023, pp. 41-50.
IEEE DOI 2307
Feature extraction, Semantics, Text mining, Intelligent systems, Image representation, Task analysis, Image edge detection BibRef

Liu, C.[Chong], Zhang, Y.Q.[Yu-Qi], Wang, H.[Hongsong], Chen, W.H.[Wei-Hua], Wang, F.[Fan], Huang, Y.[Yan], Shen, Y.D.[Yi-Dong], Wang, L.[Liang],
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training,
IP(32), 2023, pp. 3622-3633.
IEEE DOI 2307
Semantics, Transformers, Task analysis, Training, Feature extraction, Visualization, Image-text retrieval, multimodal transformer, multimodal contrastive training BibRef

Li, W.R.[Wen-Rui], Ma, Z.Y.[Zheng-Yu], Deng, L.J.[Liang-Jian], Fan, X.P.[Xiao-Peng], Tian, Y.H.[Yong-Hong],
Neuron-Based Spiking Transmission and Reasoning Network for Robust Image-Text Retrieval,
CirSysVideo(33), No. 7, July 2023, pp. 3516-3528.
IEEE DOI 2307
Visualization, Feature extraction, Wireless communication, Transformers, Task analysis, Receivers, Semantics, joint source-channel coding BibRef

Li, W.R.[Wen-Rui], Ma, Z.Y.[Zheng-Yu], Shi, J.Q.[Jin-Qiao], Fan, X.P.[Xiao-Peng],
The Style Transformer With Common Knowledge Optimization for Image-Text Retrieval,
SPLetters(30), 2023, pp. 1197-1201.
IEEE DOI 2310
BibRef

Zhu, H.G.[Hong-Guang], Zhang, C.J.[Chun-Jie], Wei, Y.C.[Yun-Chao], Huang, S.[Shujuan], Zhao, Y.[Yao],
ESA: External Space Attention Aggregation for Image-Text Retrieval,
CirSysVideo(33), No. 10, October 2023, pp. 6131-6143.
IEEE DOI Code:
WWW Link. 2310
BibRef

Li, Z.[Zheng], Guo, C.[Caili], Feng, Z.[Zerun], Hwang, J.N.[Jenq-Neng], Du, Z.T.[Zhong-Tian],
Integrating Language Guidance Into Image-Text Matching for Correcting False Negatives,
MultMed(26), 2024, pp. 103-116.
IEEE DOI 2401
BibRef

Zhang, Y.[Yan], Ji, Z.[Zhong], Wang, D.[Di], Pang, Y.W.[Yan-Wei], Li, X.L.[Xue-Long],
USER: Unified Semantic Enhancement With Momentum Contrast for Image-Text Retrieval,
IP(33), 2024, pp. 595-609.
IEEE DOI Code:
WWW Link. 2402
Training, Semantics, Visualization, Feature extraction, Task analysis, Heuristic algorithms, Correlation, dynamic queue BibRef

Zhuang, J.[Jiamin], Yu, J.[Jing], Ding, Y.[Yang], Qu, X.Y.[Xiang-Yan], Hu, Y.[Yue],
Towards Fast and Accurate Image-Text Retrieval With Self-Supervised Fine-Grained Alignment,
MultMed(26), 2024, pp. 1361-1372.
IEEE DOI Code:
WWW Link. 2402
Visualization, Semantics, Image coding, Training, Encoding, Computational modeling, Costs, Fast image-text retrieval, self-supervised learning BibRef

Liu, X.[Xin], He, Y.[Yi], Cheung, Y.M.[Yiu-Ming], Xu, X.[Xing], Wang, N.N.[Nan-Nan],
Learning Relationship-Enhanced Semantic Graph for Fine-Grained Image-Text Matching,
Cyber(54), No. 2, February 2024, pp. 948-961.
IEEE DOI 2402
Semantics, Visualization, Correlation, Technological innovation, Task analysis, Snow, relationship-enhanced graph BibRef

Li, W.H.[Wen-Hui], Yang, S.[Song], Li, Q.[Qiang], Li, X.[Xuanya], Liu, A.A.[An-An],
Commonsense-Guided Semantic and Relational Consistencies for Image-Text Retrieval,
MultMed(26), 2024, pp. 1867-1880.
IEEE DOI 2402
Semantics, Visualization, Correlation, Task analysis, Commonsense reasoning, Oceans, Sea measurements, image-text retrieval BibRef

Wu, D.Q.[Dong-Qing], Li, H.H.[Hui-Hui], Gu, C.[Cang], Liu, H.[Hang], Xu, C.[Cuili], Hou, Y.[Yinxuan], Guo, L.[Lei],
Feature First: Advancing Image-Text Retrieval Through Improved Visual Features,
MultMed(26), 2024, pp. 3827-3841.
IEEE DOI 2402
Visualization, Feature extraction, Semantics, Task analysis, Mouth, Dogs, Aggregates, Graph attention networks, grid feature, region feature BibRef

Yang, R.[Rui], Wang, S.[Shuang], Gu, Y.[Yu], Wang, J.H.[Ji-Hui], Sun, Y.Z.[Ying-Zhi], Zhang, H.[Huan], Liao, Y.[Yu], Jiao, L.C.[Li-Cheng],
Continual Learning for Cross-Modal Image-Text Retrieval Based on Domain-Selective Attention,
PR(149), 2024, pp. 110273.
Elsevier DOI 2403
Cross-modal retrieval, Continual learning, Attention, Weight regularization BibRef


Fu, Z.R.[Zhe-Ren], Mao, Z.D.[Zhen-Dong], Song, Y.[Yan], Zhang, Y.D.[Yong-Dong],
Learning Semantic Relationship among Instances for Image-Text Matching,
CVPR23(15159-15168)
IEEE DOI 2309
BibRef

Zhang, W.[Wei], Xu, X.W.[Xiao-Wei], Tao, Y.[Ye], Wang, X.D.[Xiao-Dong], Wang, C.[Chenglin], Wei, Z.M.[Zhi-Min],
Bi-Directional Image-Text Retrieval With Position Attention and Similarity Filtering,
ICIVC22(635-640)
IEEE DOI 2301
Visualization, Filtering, Interference, Bidirectional control, Information retrieval, Reliability, Task analysis, filtering mechanism BibRef

Li, Z.[Zhao], Nian, X.H.[Xiao-Hong], Pan, C.[Chao], Yang, D.[Dayin], Xiong, H.Y.[Hong-Yun], Wang, H.B.[Hai-Bo],
Relation Graph Reasoning for Image-Text Matching,
ICIVC22(319-324)
IEEE DOI 2301
Visualization, Semantics, Brightness, Feature extraction, Cognition, Data mining, Task analysis, Image-Text Matching, Relation Graph, Visual Reasoning BibRef

Zhang, K.[Kun], Mao, Z.D.[Zhen-Dong], Wang, Q.[Quan], Zhang, Y.D.[Yong-Dong],
Negative-Aware Attention Framework for Image-Text Matching,
CVPR22(15640-15649)
IEEE DOI 2210
Force measurement, Codes, Machine vision, Optimization methods, Benchmark testing, Pattern recognition, Vision+language, Vision applications and systems BibRef

Long, S.[Siqu], Han, S.C.[Soyeon Caren], Wan, X.J.[Xiao-Jun], Poon, J.[Josiah],
GraDual: Graph-based Dual-modal Representation for Image-Text Matching,
WACV22(2463-2472)
IEEE DOI 2202
Visualization, Matched filters, Computational modeling, Semantics, Image retrieval, Vision and Languages BibRef

Biten, A.F.[Ali Furkan], Mafla, A.[Andrés], Gómez, L.[Lluís], Karatzas, D.[Dimosthenis],
Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching,
WACV22(2483-2492)
IEEE DOI 2202
Measurement, Training, Integrated circuits, Annotations, Semantics, Training data, Semisupervised learning, Vision and Languages BibRef

Mithun, N.C.[Niluthpol C.], Pasricha, R.[Ravdeep], Papalexakis, E.[Evangelos], Roy-Chowdhury, A.K.[Amit K.],
Webly Supervised Image-Text Embedding with Noisy Tag Refinement,
ICPR21(7454-7461)
IEEE DOI 2105
utilizing web images in training. Training, Tensors, Correlation, Performance gain, Benchmark testing, Data models BibRef

Chen, J.A.[Jian-An], Zhang, L.[Lu], Wang, Q.[Qiong], Bai, C.[Cong], Kpalma, K.[Kidiyo],
Intra-Modal Constraint Loss for Image-Text Retrieval,
ICIP22(4023-4027)
IEEE DOI 2211
Convolutional codes, Recurrent neural networks, Bidirectional control, Network architecture, Feature extraction, similarity distance BibRef

Liu, Y.[Yang], Wang, H.Q.[Hua-Qiu], Meng, F.Y.[Fan-Yang], Liu, M.Y.[Meng-Yuan], Liu, H.[Hong],
Attend, Correct and Focus: A Bidirectional Correct Attention Network for Image-Text Matching,
ICIP21(2673-2677)
IEEE DOI 2201
Image processing, Semantics, Benchmark testing, Task analysis, Image-text matching, cross modal retrieval, attention mechanism BibRef

Yang, S.T.[Sean T.], Huang, K.H.[Kuan-Hao], Howe, B.[Bill],
JECL: Joint Embedding and Cluster Learning for Image-Text Pairs,
ICPR21(8344-8351)
IEEE DOI 2105
Training, Sensitivity, Computational modeling, Training data, Benchmark testing, Robustness BibRef

Mikriukov, G.[Georgii], Ravanbakhsh, M.[Mahdyar], Demir, B.[Begüm],
An Unsupervised Cross-Modal Hashing Method Robust to Noisy Training Image-Text Correspondences in Remote Sensing,
ICIP22(2556-2560)
IEEE DOI 2211
Training, Codes, Coherence, Feature extraction, Distortion, Sensors, Noise robustness, cross-modal retrieval, caption-noise BibRef

Anwaar, M.U.[Muhammad Umer], Labintcev, E.[Egor], Kleinsteuber, M.[Martin],
Compositional Learning of Image-Text Query for Image Retrieval,
WACV21(1139-1148)
IEEE DOI 2106
Training, Image color analysis, Databases, Image retrieval, Benchmark testing BibRef

Messina, N.[Nicola], Falchi, F.[Fabrizio], Esuli, A.[Andrea], Amato, G.[Giuseppe],
Transformer Reasoning Network for Image-Text Matching and Retrieval,
ICPR21(5222-5229)
IEEE DOI 2105
Measurement, Weight measurement, Training, Visualization, Feature extraction, Cognition BibRef

Zhang, Q.[Qi], Lei, Z.[Zhen], Zhang, Z.X.[Zhao-Xiang], Li, S.Z.[Stan Z.],
Context-Aware Attention Network for Image-Text Retrieval,
CVPR20(3533-3542)
IEEE DOI 2008
Semantics, Correlation, Visualization, Feature extraction, Task analysis, Bidirectional control, Knowledge discovery BibRef

Chen, Y.C.[Yen-Chun], Li, L.J.[Lin-Jie], Yu, L.C.[Li-Cheng], El Kholy, A.[Ahmed], Ahmed, F.[Faisal], Gan, Z.[Zhe], Cheng, Y.[Yu], Liu, J.J.[Jing-Jing],
Uniter: Universal Image-Text Representation Learning,
ECCV20(XXX: 104-120).
Springer DOI 2010
joint visual and textual understanding. BibRef

Wang, H.R.[Hao-Ran], Zhang, Y.[Ying], Ji, Z.[Zhong], Pang, Y.W.[Yan-Wei], Ma, L.[Lin],
Consensus-aware Visual-semantic Embedding for Image-Text Matching,
ECCV20(XXIV:18-34).
Springer DOI 2012
BibRef

Chen, T.L.[Tian-Lang], Deng, J.J.[Jia-Jun], Luo, J.B.[Jie-Bo],
Adaptive Offline Quintuplet Loss for Image-text Matching,
ECCV20(XIII:549-565).
Springer DOI 2011
Code, Retrieval.
WWW Link. BibRef

Lee, K.H.[Kuang-Huei], Chen, X.[Xi], Hua, G.[Gang], Hu, H.D.[Hou-Dong], He, X.D.[Xiao-Dong],
Stacked Cross Attention for Image-Text Matching,
ECCV18(II: 212-228).
Springer DOI 1810
Match text descriptions to the image. BibRef

Zhang, Y.[Ying], Lu, H.C.[Hu-Chuan],
Deep Cross-Modal Projection Learning for Image-Text Matching,
ECCV18(I: 707-723).
Springer DOI 1810
Measure similarity between image and text descriptions. BibRef

Plummer, B.A.[Bryan A.], Kordas, P.[Paige], Kiapour, M.H.[M. Hadi], Zheng, S.[Shuai], Piramuthu, R.[Robinson], Lazebnik, S.[Svetlana],
Conditional Image-Text Embedding Networks,
ECCV18(XII: 258-274).
Springer DOI 1810
Code:
WWW Link. BibRef

Chapter on Implementations and Applications, Databases, QBIC, Video Analysis, Hardware and Software, Inspection continues in
Visual Grounding, Grounding Expressions .


Last update:Mar 16, 2024 at 20:36:19