11.14.3.3 Text to Image, Layout to Image, Image Based Rendering

Image Based Rendering. Stereo Image Based Rendering. Image Synthesis. Text to Image.
See also Adversarial Networks for Image Synthesis.

Zhang, J.[Ji], Mei, K.Z.[Kui-Zhi], Zheng, Y.[Yu], Fan, J.P.[Jian-Ping],
Exploiting Mid-Level Semantics for Large-Scale Complex Video Classification,
MultMed(21), No. 10, October 2019, pp. 2518-2530.
IEEE DOI 1910
computer vision, feature extraction, image classification, image motion analysis, image representation, large-scale video classification BibRef

Zhang, J.[Ji], Mei, K.Z.[Kui-Zhi], Wang, X., Zheng, Y.[Yu], Fan, J.P.[Jian-Ping],
From Text to Video: Exploiting Mid-Level Semantics for Large-Scale Video Classification,
ICPR18(1695-1700)
IEEE DOI 1812
Semantics, Task analysis, Visualization, Streaming media, Detectors, Encoding, Bridges BibRef

Peng, Y.X.[Yu-Xin], Qi, J.W.[Jin-Wei],
Show and Tell in the Loop: Cross-Modal Circular Correlation Learning,
MultMed(21), No. 6, June 2019, pp. 1538-1550.
IEEE DOI 1906
Correlation, Bridges, Logic gates, Semantics, Task analysis, Cognition, Feeds, Circular correlation learning, cross-modal retrieval, text-to-image synthesis BibRef

Zhang, X.W.[Xin-Wei], Wang, J.[Jin], Lu, G.D.[Guo-Dong], Zhang, X.S.[Xu-Sheng],
Pattern understanding and synthesis based on layout tree descriptor,
VC(36), No. 6, June 2020, pp. 1141-1155.
WWW Link. 2005
BibRef

Baraheem, S.S.[Samah S.], Nguyen, T.V.[Tam V.],
Text-to-image via mask anchor points,
PRL(133), 2020, pp. 25-32.
Elsevier DOI 2005
Text-to-image, Mask dataset, Image synthesis, Anchor points BibRef

Chen, Q.[Qi], Wu, Q.[Qi], Chen, J.[Jian], Wu, Q.Y.[Qing-Yao], van den Hengel, A.J.[Anton J.], Tan, M.[Mingkui],
Scripted Video Generation With a Bottom-Up Generative Adversarial Network,
IP(29), 2020, pp. 7454-7467.
IEEE DOI 2007
Generative adversarial networks, video generation, semantic alignment, temporal coherence BibRef

Yang, M.[Min], Liu, J.H.[Jun-Hao], Shen, Y.[Ying], Zhao, Z.[Zhou], Chen, X.J.[Xiao-Jun], Wu, Q.Y.[Qing-Yao], Li, C.M.[Cheng-Ming],
An Ensemble of Generation- and Retrieval-Based Image Captioning With Dual Generator Generative Adversarial Network,
IP(29), 2020, pp. 9627-9640.
IEEE DOI 2011
Generators, Decoding, Generative adversarial networks, Training, Computational modeling, Task analysis, Image captioning, adversarial learning BibRef

Yuan, M., Peng, Y.,
CKD: Cross-Task Knowledge Distillation for Text-to-Image Synthesis,
MultMed(22), No. 8, August 2020, pp. 1955-1968.
IEEE DOI 2007
Semantics, Visualization, Task analysis, Image synthesis, Generative adversarial networks, Neural networks, image semantic understanding BibRef

Osahor, U., Kazemi, H., Dabouei, A., Nasrabadi, N.,
Quality Guided Sketch-to-Photo Image Synthesis,
Biometrics20(3575-3584)
IEEE DOI 2008
Computer vision, Pattern recognition BibRef

Zhao, B.[Bo], Yin, W.D.[Wei-Dong], Meng, L.L.[Li-Li], Sigal, L.[Leonid],
Layout2image: Image Generation from Layout,
IJCV(128), No. 10-11, November 2020, pp. 2418-2435.
Springer DOI 2009
BibRef
Earlier: A1, A3, A2, A4:
Image Generation From Layout,
CVPR19(8576-8585).
IEEE DOI 2002
BibRef

Sheng, L.[Lu], Pan, J.T.[Jun-Ting], Guo, J.M.[Jia-Ming], Shao, J.[Jing], Loy, C.C.[Chen Change],
High-Quality Video Generation from Static Structural Annotations,
IJCV(128), No. 10-11, November 2020, pp. 2552-2569.
Springer DOI 2009
BibRef

Li, K.[Ke], Peng, S.C.[Shi-Chong], Zhang, T.H.[Tian-Hao], Malik, J.[Jitendra],
Multimodal Image Synthesis with Conditional Implicit Maximum Likelihood Estimation,
IJCV(128), No. 10-11, November 2020, pp. 2607-2628.
Springer DOI 2009
BibRef
Earlier: A1, A3, A4, Only:
Diverse Image Synthesis From Semantic Layouts via Conditional IMLE,
ICCV19(4219-4228)
IEEE DOI 2004
image representation, image segmentation, learning (artificial intelligence), Probabilistic logic BibRef

Gao, L.L.[Lian-Li], Chen, D.Y.[Dai-Yuan], Zhao, Z.[Zhou], Shao, J.[Jie], Shen, H.T.[Heng Tao],
Lightweight dynamic conditional GAN with pyramid attention for text-to-image synthesis,
PR(110), 2021, pp. 107384.
Elsevier DOI 2011
Text-to-image synthesis, Conditional generative adversarial network (CGAN), Pyramid attentive fusion BibRef

Dong, Y.[Yanlong], Zhang, Y.[Ying], Ma, L.[Lin], Wang, Z.[Zhi], Luo, J.B.[Jie-Bo],
Unsupervised text-to-image synthesis,
PR(110), 2021, pp. 107573.
Elsevier DOI 2011
Text-to-image synthesis, Generative adversarial network (GAN), Unsupervised training BibRef

Yuan, M., Peng, Y.,
Bridge-GAN: Interpretable Representation Learning for Text-to-Image Synthesis,
CirSysVideo(30), No. 11, November 2020, pp. 4258-4268.
IEEE DOI 2011
Visualization, Mutual information, Image synthesis, Task analysis, Training, Bridge circuits, Semantics, Text-to-image synthesis, Bridge-GAN BibRef

Li, R.F.[Rui-Fan], Wang, N.[Ning], Feng, F.X.[Fang-Xiang], Zhang, G.W.[Guang-Wei], Wang, X.J.[Xiao-Jie],
Exploring Global and Local Linguistic Representations for Text-to-Image Synthesis,
MultMed(22), No. 12, December 2020, pp. 3075-3087.
IEEE DOI 2011
Task analysis, Linguistics, Generators, Generative adversarial networks, Training, Correlation, cross-modal BibRef

Li, C.Y.[Chun-Ye], Kong, L.Y.[Li-Ya], Zhou, Z.P.[Zhi-Ping],
Improved-StoryGAN for sequential images visualization,
JVCIR(73), 2020, pp. 102956.
Elsevier DOI 2012
Story visualization, Weighted Activation Degree (WAD), Dilated Convolution, Gated Convolution BibRef

Tan, H., Liu, X., Liu, M., Yin, B., Li, X.,
KT-GAN: Knowledge-Transfer Generative Adversarial Network for Text-to-Image Synthesis,
IP(30), 2021, pp. 1275-1290.
IEEE DOI 2012
Task analysis, Semantics, Generators, Generative adversarial networks, Knowledge engineering, alternate attention-transfer mechanism BibRef

Wang, M.[Min], Lang, C.Y.[Cong-Yan], Feng, S.H.[Song-He], Wang, T.[Tao], Jin, Y.[Yi], Li, Y.D.[Yi-Dong],
Text to photo-realistic image synthesis via chained deep recurrent generative adversarial network,
JVCIR(74), 2021, pp. 102955.
Elsevier DOI 2101
Text-to-image synthesis, Logic relationships, Computational bottlenecks, Parameters sharing BibRef

Yang, Y., Wang, L., Xie, D., Deng, C., Tao, D.,
Multi-Sentence Auxiliary Adversarial Networks for Fine-Grained Text-to-Image Synthesis,
IP(30), 2021, pp. 2798-2809.
IEEE DOI 2102
Semantics, Task analysis, Visualization, Training, Generative adversarial networks, Correlation, Birds, negative sample learning BibRef

Elu, A.[Aitzol], Azkune, G.[Gorka], de Lacalle, O.L.[Oier Lopez], Arganda-Carreras, I.[Ignacio], Soroa, A.[Aitor], Agirre, E.[Eneko],
Inferring spatial relations from textual descriptions of images,
PR(113), 2021, pp. 107847.
Elsevier DOI 2103
Text-to-image synthesis, Natural language understanding, Spatial relations, Deep learning BibRef

Hu, T.[Tao], Long, C.J.[Cheng-Jiang], Xiao, C.X.[Chun-Xia],
A Novel Visual Representation on Text Using Diverse Conditional GAN for Visual Recognition,
IP(30), 2021, pp. 3499-3512.
IEEE DOI 2103
Use text from social media to train image recognition. Visualization, Feature extraction, Image recognition, Text recognition, Generators, visual recognition BibRef

Yang, C.Y.[Ce-Yuan], Shen, Y.J.[Yu-Jun], Zhou, B.L.[Bo-Lei],
Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis,
IJCV(129), No. 5, May 2021, pp. 1451-1466.
Springer DOI 2105
BibRef

Qi, Z.J.[Zhong-Jian], Fan, C.[Chaogang], Xu, L.[Liangfeng], Li, X.[Xinke], Zhan, S.[Shu],
MRP-GAN: Multi-resolution parallel generative adversarial networks for text-to-image synthesis,
PRL(147), 2021, pp. 1-7.
Elsevier DOI 2106
Text-to-image synthesis, Generative adversarial networks, Image generation BibRef


Long, J.[Jia], Lu, H.T.[Hong-Tao],
Multi-level Gate Feature Aggregation with Spatially Adaptive Batch-instance Normalization for Semantic Image Synthesis,
MMMod21(I:378-390).
Springer DOI 2106
BibRef

Yan, J.W.[Jia-Wei], Lin, C.S.[Ci-Siang], Yang, F.E.[Fu-En], Li, Y.J.[Yu-Jhe], Wang, Y.C.A.F.[Yu-Chi-Ang Frank],
Semantics-Guided Representation Learning with Applications to Visual Synthesis,
ICPR21(7181-7187)
IEEE DOI 2105
Visualization, Interpolation, Computational modeling, Semantics, Data visualization, Semantic interpolation BibRef

Tang, S.C.[Shi-Chang], Zhou, X.[Xu], He, X.M.[Xu-Ming], Ma, Y.[Yi],
Disentangled Representation Learning for Controllable Image Synthesis: An Information-Theoretic Perspective,
ICPR21(10042-10049)
IEEE DOI 2105
Training, Image synthesis, Image color analysis, Mutual information BibRef

Ji, Z., Wang, W., Chen, B., Han, X.,
Text-to-Image Generation via Semi-Supervised Training,
VCIP20(265-268)
IEEE DOI 2102
image classification, learning (artificial intelligence), text analysis, visual databases, text-to-image generation, Pseudo Feature BibRef

Devaranjan, J.[Jeevan], Kar, A.[Amlan], Fidler, S.[Sanja],
Meta-SIM2: Unsupervised Learning of Scene Structure for Synthetic Data Generation,
ECCV20(XVII:715-733).
Springer DOI 2011
WWW Link. BibRef

Song, Y.Z.[Yun-Zhu], Tam, Z.R.[Zhi Rui], Chen, H.J.[Hung-Jen], Lu, H.H.[Huiao-Han], Shuai, H.H.[Hong-Han],
Character-preserving Coherent Story Visualization,
ECCV20(XVII:18-33).
Springer DOI 2011
BibRef

Herzig, R.[Roei], Bar, A.[Amir], Xu, H.J.[Hui-Juan], Chechik, G.[Gal], Darrell, T.J.[Trevor J.], Globerson, A.[Amir],
Learning Canonical Representations for Scene Graph to Image Generation,
ECCV20(XXVI:210-227).
Springer DOI 2011
BibRef

Zheng, H.T.[Hai-Tian], Liao, H.[Haofu], Chen, L.[Lele], Xiong, W.[Wei], Chen, T.L.[Tian-Lang], Luo, J.B.[Jie-Bo],
Example-guided Image Synthesis Using Masked Spatial-channel Attention and Self-supervision,
ECCV20(XIV:422-439).
Springer DOI 2011
BibRef

Mallya, A.[Arun], Wang, T.C.[Ting-Chun], Sapra, K.[Karan], Liu, M.Y.[Ming-Yu],
World-Consistent Video-to-Video Synthesis,
ECCV20(VIII:359-378).
Springer DOI 2011
BibRef

Vo, D.M.[Duc Minh], Sugimoto, A.[Akihiro],
Visual-relation Conscious Image Generation from Structured-text,
ECCV20(XXVIII:290-306).
Springer DOI 2011
BibRef

Burns, A.[Andrea], Kim, D.H.[Dong-Hyun], Wijaya, D.[Derry], Saenko, K.[Kate], Plummer, B.A.[Bryan A.],
Learning to Scale Multilingual Representations for Vision-Language Tasks,
ECCV20(IV:197-213).
Springer DOI 2011
BibRef

Liang, J.D.[Jia-Dong], Pei, W.J.[Wen-Jie], Lu, F.[Feng],
CPGAN: Content-parsing Generative Adversarial Networks for Text-to-image Synthesis,
ECCV20(IV:491-508).
Springer DOI 2011
BibRef

Nawhal, M.[Megha], Zhai, M.Y.[Meng-Yao], Lehrmann, A.[Andreas], Sigal, L.[Leonid], Mori, G.[Greg],
Generating Videos of Zero-shot Compositions of Actions and Objects,
ECCV20(XII:382-401).
Springer DOI 2010
BibRef

Huang, H.P.[Hsin-Ping], Tseng, H.Y.[Hung-Yu], Lee, H.Y.[Hsin-Ying], Huang, J.B.[Jia-Bin],
Semantic View Synthesis,
ECCV20(XII:592-608).
Springer DOI 2010
BibRef

Zhu, Z.[Zhen], Xu, Z.L.[Zhi-Liang], You, A.S.[An-Sheng], Bai, X.[Xiang],
Semantically Multi-Modal Image Synthesis,
CVPR20(5466-5475)
IEEE DOI 2008
Semantics, Task analysis, Convolutional codes, Image generation, Decoding, Generators, Controllability BibRef

Luo, A., Zhang, Z., Wu, J., Tenenbaum, J.B.,
End-to-End Optimization of Scene Layout,
CVPR20(3753-3762)
IEEE DOI 2008
Layout, Semantics, Decoding, Rendering (computer graphics), Solid modeling, Training BibRef

Gao, C., Liu, Q., Xu, Q., Wang, L., Liu, J., Zou, C.,
SketchyCOCO: Image Generation From Freehand Scene Sketches,
CVPR20(5173-5182)
IEEE DOI 2008
Image edge detection, Image generation, Training, Data models, Semantics, Image segmentation BibRef

Chen, Q., Wu, Q., Tang, R., Wang, Y., Wang, S., Tan, M.,
Intelligent Home 3D: Automatic 3D-House Design From Linguistic Descriptions Only,
CVPR20(12622-12631)
IEEE DOI 2008
Layout, Buildings, Linguistics, Task analysis, Solid modeling BibRef

Liu, C., Mao, Z., Zhang, T., Xie, H., Wang, B., Zhang, Y.,
Graph Structured Network for Image-Text Matching,
CVPR20(10918-10927)
IEEE DOI 2008
Visualization, Dogs, Semantics, Sparse matrices, Image edge detection, Learning systems, Feature extraction BibRef

Sarafianos, N., Xu, X., Kakadiaris, I.,
Adversarial Representation Learning for Text-to-Image Matching,
ICCV19(5813-5823)
IEEE DOI 2004
image matching, image representation, learning (artificial intelligence), Adversarial representation, Distance measurement BibRef

Tan, F.[Fuwen], Feng, S.[Song], Ordonez, V.[Vicente],
Text2Scene: Generating Compositional Scenes From Textual Descriptions,
CVPR19(6703-6712).
IEEE DOI 2002
BibRef

Yin, G.J.[Guo-Jun], Liu, B.[Bin], Sheng, L.[Lu], Yu, N.H.[Neng-Hai], Wang, X.G.[Xiao-Gang], Shao, J.[Jing],
Semantics Disentangling for Text-To-Image Generation,
CVPR19(2322-2331).
IEEE DOI 2002
BibRef

Li, W.[Wenbo], Zhang, P.C.[Peng-Chuan], Zhang, L.[Lei], Huang, Q.Y.[Qiu-Yuan], He, X.D.[Xiao-Dong], Lyu, S.W.[Si-Wei], Gao, J.F.[Jian-Feng],
Object-Driven Text-To-Image Synthesis via Adversarial Training,
CVPR19(12166-12174).
IEEE DOI 2002
BibRef

Talavera, A., Tan, D.S., Azcarraga, A., Hua, K.,
Layout and Context Understanding for Image Synthesis with Scene Graphs,
ICIP19(1905-1909)
IEEE DOI 1910
Generative Models, Text-to-Image Synthesis, Scene Graphs BibRef

Joseph, K.J., Pal, A.[Arghya], Rajanala, S.[Sailaja], Balasubramanian, V.N.[Vineeth N.],
C4Synth: Cross-Caption Cycle-Consistent Text-to-Image Synthesis,
WACV19(358-366)
IEEE DOI 1904
image capture, image processing, virtual reality, visual databases, image editing, plausible image, Data models BibRef

Zhang, Z., Xie, Y., Yang, L.,
Photographic Text-to-Image Synthesis with a Hierarchically-Nested Adversarial Network,
CVPR18(6199-6208)
IEEE DOI 1812
Generators, Training, Image resolution, Task analysis, Semantics, Measurement BibRef

Qi, X., Chen, Q., Jia, J., Koltun, V.,
Semi-Parametric Image Synthesis,
CVPR18(8808-8816)
IEEE DOI 1812
Image segmentation, Semantics, Layout, Training, Image generation, Image color analysis, Pipelines BibRef

Hong, S.H.[Seung-Hoon], Yang, D.D.[Ding-Dong], Choi, J.[Jongwook], Lee, H.L.[Hong-Lak],
Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis,
CVPR18(7986-7994)
IEEE DOI 1812
Layout, Generators, Semantics, Shape, Image generation, Task analysis BibRef

Sah, S., Peri, D., Shringi, A., Zhang, C., Dominguez, M., Savakis, A., Ptucha, R.,
Semantically Invariant Text-to-Image Generation,
ICIP18(3783-3787)
IEEE DOI 1809
Measurement, Image generation, Generators, Image quality, Detectors, Visualization, Cost function BibRef

Kong, C.[Chen], Lin, D.[Dahua], Bansal, M.[Mohit], Urtasun, R.[Raquel], Fidler, S.[Sanja],
What Are You Talking About? Text-to-Image Coreference,
CVPR14(3558-3565)
IEEE DOI 1409
3D object detection; Text and images; scene understanding BibRef



Last update: Jun 14, 2021 at 09:20:36