21.3.6.2.7 Audio-Visual Emotion, Audiovisual Emotion Recognition

Chapter Contents (Back)
Emotion Recognition. Audio-Visual.
See also Multi-Modal Emotion, Multimodal Emotion Recognition.

Wang, Y.J.[Yong-Jin], Guan, L.[Ling],
Recognizing Human Emotional State From Audiovisual Signals*,
MultMed(10), No. 5, August 2008, pp. 936-946.
IEEE DOI 0905
BibRef

Wang, Y.J.[Yong-Jin], Guan, L.[Ling],
Recognizing Human Emotional State From Audiovisual Signals,
MultMed(10), No. 4, June 2008, pp. 659-668.
IEEE DOI 0905
BibRef

Mower, E., Mataric, M.J., Narayanan, S.,
Human Perception of Audio-Visual Synthetic Character Emotion Expression in the Presence of Ambiguous and Conflicting Information,
MultMed(11), No. 5, 2009, pp. 843-855.
IEEE DOI 0907
BibRef

Metallinou, A.[Angeliki], Wollmer, M.[Martin], Katsamanis, A.[Athanasios], Eyben, F.[Florian], Schuller, B.[Bjorn], Narayanan, S.[Shrikanth],
Context-Sensitive Learning for Enhanced Audiovisual Emotion Classification,
AffCom(3), No. 2, 2012, pp. 184-198.
IEEE DOI 1208
BibRef

Mariooryad, S.[Soroosh], Busso, C.[Carlos],
Exploring Cross-Modality Affective Reactions for Audiovisual Emotion Recognition,
AffCom(4), No. 2, 2013, pp. 183-196.
IEEE DOI 1307
Entrainment BibRef

Wu, C.H.[Chung-Hsien], Lin, J.C.[Jen-Chun], Wei, W.L.[Wen-Li],
Two-Level Hierarchical Alignment for Semi-Coupled HMM-Based Audiovisual Emotion Recognition With Temporal Course,
MultMed(15), No. 8, December 2013, pp. 1880-1895.
IEEE DOI 1402
audio signal processing BibRef

Wöllmer, M.[Martin], Kaiser, M.[Moritz], Eyben, F.[Florian], Schuller, B.[Björn], Rigoll, G.[Gerhard],
LSTM-Modeling of continuous emotions in an audiovisual affect recognition framework,
IVC(31), No. 2, February 2013, pp. 153-163.
Elsevier DOI 1303
Emotion recognition; Long Short-Term Memory; Facial movement features; Context modeling BibRef

Ringeval, F.[Fabien], Eyben, F.[Florian], Kroupi, E.[Eleni], Yuce, A.[Anil], Thiran, J.P.[Jean-Philippe], Ebrahimi, T.[Touradj], Lalanne, D.[Denis], Schuller, B.[Björn],
Prediction of asynchronous dimensional emotion ratings from audiovisual and physiological data,
PRL(66), No. 1, 2015, pp. 22-30.
Elsevier DOI 1511
Context-learning long short-term memory recurrent neural networks BibRef

Kim, J.C., Clements, M.A.,
Multimodal Affect Classification at Various Temporal Lengths,
AffCom(6), No. 4, October 2015, pp. 371-384.
IEEE DOI 1512
Audio-visual systems BibRef

Georgakis, C.[Christos], Panagakis, Y.[Yannis], Zafeiriou, S.P.[Stefanos P.], Pantic, M.[Maja],
The Conflict Escalation Resolution (CONFER) Database,
IVC(65), No. 1, 2017, pp. 37-48.
Elsevier DOI 1709
BibRef
Earlier: A2, A3, A4, Only:
Audiovisual Conflict Detection in Political Debates,
FacBeh14(306-314).
Springer DOI 1504
Automatic conflict analysis BibRef

Zhang, S., Zhang, S., Huang, T., Gao, W., Tian, Q.,
Learning Affective Features With a Hybrid Deep Model for Audio-Visual Emotion Recognition,
CirSysVideo(28), No. 10, October 2018, pp. 3030-3043.
IEEE DOI 1811
Feature extraction, Emotion recognition, Visualization, Image segmentation, Machine learning, Databases, Convolution, multimodality fusion BibRef

Noroozi, F., Marjanovic, M., Njegus, A., Escalera, S., Anbarjafari, G.,
Audio-Visual Emotion Recognition in Video Clips,
AffCom(10), No. 1, January 2019, pp. 60-75.
IEEE DOI 1903
Emotion recognition, Visualization, Feature extraction, Databases, Face, Neural networks, Mel frequency cepstral coefficient, convolutional neural networks BibRef

Kim, Y., Provost, E.M.,
ISLA: Temporal Segmentation and Labeling for Audio-Visual Emotion Recognition,
AffCom(10), No. 2, April 2019, pp. 196-208.
IEEE DOI 1906
Speech, Emotion recognition, Speech recognition, Correlation, Labeling, Eyebrows, Visualization, Audio-visual, emotion, recognition, speech BibRef

Avots, E.[Egils], Sapinski, T.[Tomasz], Bachmann, M.[Maie], Kaminska, D.[Dorota],
Audiovisual emotion recognition in wild,
MVA(30), No. 5, July 2019, pp. 975-985.
Springer DOI 1907
BibRef

Basnet, R.[Ramesh], Islam, M.T.[Mohammad Tariqul], Howlader, T.[Tamanna], Rahman, S.M.M.[S. M. Mahbubur], Hatzinakos, D.[Dimitrios],
Estimation of affective dimensions using CNN-based features of audiovisual data,
PRL(128), 2019, pp. 290-297.
Elsevier DOI 1912
Convolutional neural network, Affective features, Emotional dimensions BibRef

Hajarolasvadi, N.[Noushin], Demirel, H.[Hasan],
Deep emotion recognition based on audio-visual correlation,
IET-CV(14), No. 7, October 2020, pp. 517-527.
DOI Link 2010
BibRef

Schoneveld, L.[Liam], Othmani, A.[Alice], Abdelkawy, H.[Hazem],
Leveraging recent advances in deep learning for audio-Visual emotion recognition,
PRL(146), 2021, pp. 1-7.
Elsevier DOI 2105
Human behavior recognition, Audiovisual emotion recognition, Affective computing, Video sequences, Deep learning BibRef

Nie, W.Z.[Wei-Zhi], Ren, M.J.[Min-Jie], Nie, J.[Jie], Zhao, S.C.[Si-Cheng],
C-GCN: Correlation Based Graph Convolutional Network for Audio-Video Emotion Recognition,
MultMed(23), 2021, pp. 3793-3804.
IEEE DOI 2110
Emotion recognition, Feature extraction, Correlation, Task analysis, Visualization, Face recognition, Convolution, multiple graphs BibRef

Nie, W.Z.[Wei-Zhi], Chang, R.[Rihao], Ren, M.J.[Min-Jie], Su, Y.T.[Yu-Ting], Liu, A.[Anan],
I-GCN: Incremental Graph Convolution Network for Conversation Emotion Detection,
MultMed(24), 2022, pp. 4471-4481.
IEEE DOI 2212
Correlation, Semantics, Social networking (online), Convolution, Transformers, Task analysis, Emotion recognition, GCN BibRef

Ren, M.J.[Min-Jie], Huang, X.D.[Xiang-Dong], Li, W.H.[Wen-Hui], Song, D.[Dan], Nie, W.Z.[Wei-Zhi],
LR-GCN: Latent Relation-Aware Graph Convolutional Network for Conversational Emotion Recognition,
MultMed(24), 2022, pp. 4422-4432.
IEEE DOI 2212
Correlation, Emotion recognition, Task analysis, Context modeling, Computer architecture, Transformers, Social networking (online), graph convolutional network BibRef

Goncalves, L.[Lucas], Busso, C.[Carlos],
Robust Audiovisual Emotion Recognition: Aligning Modalities, Capturing Temporal Information, and Handling Missing Features,
AffCom(13), No. 4, October 2022, pp. 2156-2170.
IEEE DOI 2212
Visualization, Emotion recognition, Feature extraction, Acoustics, Training, Transformers, Robustness, Multimodal emotion recognition, auxiliary networks BibRef

Kansizoglou, I.[Ioannis], Bampis, L.[Loukas], Gasteratos, A.[Antonios],
An Active Learning Paradigm for Online Audio-Visual Emotion Recognition,
AffCom(13), No. 2, April 2022, pp. 756-768.
IEEE DOI 2206
Feature extraction, Emotion recognition, Computer architecture, Visualization, Robots, Monitoring, Data mining, emotion in human-computer interaction BibRef

Mocanu, B.[Bogdan], Tapu, R.[Ruxandra], Zaharia, T.[Titus],
Multimodal emotion recognition using cross modal audio-video fusion with attention and deep metric learning,
IVC(133), 2023, pp. 104676.
Elsevier DOI 2305
Spatial attention, Channel attention, Temporal attention, Cross-modal fusion, Emotional metric constraint BibRef


Chumachenko, K.[Kateryna], Iosifidis, A.[Alexandros], Gabbouj, M.[Moncef],
Self-attention fusion for audiovisual emotion recognition with incomplete data,
ICPR22(2822-2828)
IEEE DOI 2212
Emotion recognition, Data analysis, Robustness, Data models, Noise measurement, Standards BibRef

Praveen, R.G.[R. Gnana], de Melo, W.C.[Wheidima Carneiro], Ullah, N.[Nasib], Aslam, H.[Haseeb], Zeeshan, O.[Osama], Denorme, T.[Théo], Pedersoli, M.[Marco], Koerich, A.L.[Alessandro L.], Bacon, S.[Simon], Cardinal, P.[Patrick], Granger, E.[Eric],
A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition,
ABAW22(2485-2494)
IEEE DOI 2210
Correlation coefficient, Emotion recognition, Visualization, Correlation, Computational modeling, Predictive models, Feature extraction BibRef

Praveen, R.G.[R. Gnana], Granger, E.[Eric], Cardinal, P.[Patrick],
Cross Attentional Audio-Visual Fusion for Dimensional Emotion Recognition,
FG21(1-8)
IEEE DOI 2303
Emotion recognition, Face recognition, Computational modeling, Gesture recognition, Feature extraction, Fatigue BibRef

Zhang, S.[Su], An, R.[Ruyi], Ding, Y.[Yi], Guan, C.T.[Cun-Tai],
Continuous Emotion Recognition using Visual-audio-linguistic Information: A Technical Report for ABAW3,
ABAW22(2375-2380)
IEEE DOI 2210
Training, Correlation coefficient, Visualization, Emotion recognition, Fuses, Databases, Writing BibRef

Zhang, S.[Su], Ding, Y.[Yi], Wei, Z.[Ziquan], Guan, C.T.[Cun-Tai],
Continuous Emotion Recognition with Audio-visual Leader-follower Attentive Fusion,
ABAW21(3560-3567)
IEEE DOI 2112
Deep learning, Training, Correlation coefficient, Convolutional codes, Visualization, Emotion recognition, Databases BibRef

Antoniadis, P.[Panagiotis], Pikoulis, I.[Ioannis], Filntisis, P.P.[Panagiotis P.], Maragos, P.[Petros],
An audiovisual and contextual approach for categorical and continuous emotion recognition in-the-wild,
ABAW21(3638-3644)
IEEE DOI 2112
Emotion recognition, Image resolution, Lighting, Streaming media, Feature extraction, Task analysis BibRef

Ji, X.[Xinya], Zhou, H.[Hang], Wang, K.[Kaisiyuan], Wu, W.[Wayne], Loy, C.C.[Chen Change], Cao, X.[Xun], Xu, F.[Feng],
Audio-Driven Emotional Video Portraits,
CVPR21(14075-14084)
IEEE DOI 2111
Correlation, Shape, Heuristic algorithms, Mouth, Pattern recognition, Faces BibRef

Ghaleb, E., Niehues, J., Asteriadis, S.,
Multimodal Attention-Mechanism For Temporal Emotion Recognition,
ICIP20(251-255)
IEEE DOI 2011
Emotion recognition, Visualization, Training, Human computer interaction, Faces, Fuses, attention, audiovisual emotion recognition BibRef

Aydin, B., Kindiroglu, A.A., Aran, O., Akarun, L.,
Automatic personality prediction from audiovisual data using random forest regression,
ICPR16(37-42)
IEEE DOI 1705
Correlation, Feature extraction, Social network services, Speech, Standards, Time-frequency analysis, Visualization BibRef

Noroozi, F., Marjanovic, M., Njegus, A., Escalera, S., Anbarjafari, G.,
Fusion of classifier predictions for audio-visual emotion recognition,
ICPR16(61-66)
IEEE DOI 1705
Databases, Emotion recognition, Eyebrows, Face, Feature extraction, Mouth, Visualization BibRef

Araujo, R.[Rodrigo], Kamel, M.S.[Mohamed S.],
Audio-Visual Emotion Analysis Using Semi-Supervised Temporal Clustering with Constraint Propagation,
ICIAR14(II: 3-11).
Springer DOI 1410
BibRef

Lu, K.[Kun], Jia, Y.D.[Yun-De],
Audio-visual emotion recognition with boosted coupled HMM,
ICPR12(1148-1151).
WWW Link. 1302
BibRef
And:
Audio-visual emotion recognition using Boltzmann Zippers,
ICIP12(2589-2592).
IEEE DOI 1302
BibRef

Pitas, I., Kotsia, I., Martin, O., Macq, B.,
The eNTERFACE-05 Audio-Visual Emotion Database,
ICDEW06(8).
IEEE DOI BibRef 0600

Chen, L.S.[Lawrence S.], Huang, T.S.[Thomas S.],
Emotional Expressions In Audiovisual Human Computer Interaction,
ICME00(MP7). 0007
BibRef

Chapter on Face Recognition, Detection, Tracking, Gesture Recognition, Fingerprints, Biometrics continues in
Multi-Modal Emotion, Multimodal Emotion Recognition .


Last update:Jun 1, 2023 at 10:05:03