20.4.5.4.5 Audio-Video Analysis for Indexing and Classification

Chapter Contents (Back)
Audio Video. Image Database. Video Indexing. Not just audio, but associated text.
See also Video Analysis -- Captions, Text, Video Text.

Saraceno, C.[Caterina], Leonardi, R.[Riccardo],
Indexing audiovisual databases through joint audio and video processing,
IJIST(9), No. 5, 1999, pp. 320-331. BibRef 9900
Earlier:
Identification of Successive Correlated Camera Shots Using Audio and Video Information,
ICIP97(III: 166-169).
IEEE DOI BibRef
And:
Audio-visual processing for scene change detection,
CIAP97(II: 124-131).
Springer DOI 9709
BibRef

Li, D.G.[Dong-Ge], Sethi, I.K.[Ishwar K.], Dimitrova, N.[Nevenka], McGee, T.[Tom],
Classification of general audio data for content-based retrieval,
PRL(22), No. 5, April 2001, pp. 533-544.
Elsevier DOI 0105
BibRef

Tsekeridou, S.[Sofia], Pitas, I.[Ioannis],
Content-based video parsing and indexing based on audio-visual interaction,
CirSysVideo(11), No. 4, April 2001, pp. 522-535.
IEEE Top Reference. 0104
BibRef
Earlier:
Speaker dependent video indexing based on audio-visual interaction,
ICIP98(I: 358-362).
IEEE DOI 9810
BibRef

Tsekeridou, S.[Sofia], Krinidis, S.[Stelios], Pitas, I.[Ioannis],
Scene Change Detection Based on Audio-Visual Analysis and Interaction,
WTRCV01(214). 0103
BibRef

Kyperountas, M., Kotropoulos, C., Pitas, I.[Ioannis],
Enhanced Eigen-Audioframes for Audiovisual Scene Change Detection,
MultMed(9), No. 4, 2007, pp. 785-797.
IEEE DOI 0905
BibRef

Gauvain, J.L.[Jean-Luc], Lamel, L.[Lori], Adda, G.[Gilles],
Audio Partitioning and Transcription for Broadcast Data Indexation,
MultToolApp(14), No. 2, June 2001, pp. 187-200. 0106
BibRef

Amir, A.[Arnon], Srinivasan, S.[Savitha], Efrat, A.[Alon],
Search the Audio, Browse the Video: A Generic Paradigm for Video Collections,
JASP(2003), No. 2, February 2003, pp. 209.
WWW Link. 0304
BibRef

Beal, M.J.[Matthew J.], Jojic, N.[Nebojsa], Attias, H.T.[Hagai T.],
A graphical model for audiovisual object tracking,
PAMI(25), No. 7, July 2003, pp. 828-836.
IEEE Abstract. 0307
BibRef
Earlier: A1, A3, A2:
Audio-Video Sensor Fusion with Probabilistic Graphical Models,
ECCV02(I: 736 ff.).
Springer DOI 0205
2 microphones and a camera. Track the moving object with clutter and noise. BibRef

Wu, P.[Peng], Li, Y.[Ying], Tretter, D.[Daniel],
Scalable video summarization,
US_Patent7,047,494, May 16, 2006
WWW Link. BibRef 0605

Gong, Y.H.[Yi-Hong],
Summarizing Audiovisual Contents of a Video Program,
JASP(2003), No. 2, February 2003, pp. 160.
WWW Link. 0304
BibRef

Gong, Y.H.[Yi-Hong], Liu, X.[Xin],
Method and system for segmentation, classification, and summarization of video images,
US_Patent7,016,540, Mar 21, 2006
WWW Link. BibRef 0603
And: US_Patent7,151,852, Dec 19, 2006
WWW Link. BibRef
And:
Creating audio-centric, image-centric, and integrated audio-visual summaries,
US_Patent6,925,455, Aug 2, 2005
WWW Link. BibRef
And:
Video Summarization using Singular Value Decomposition,
CVPR00(II: 174-180).
IEEE DOI 0005
BibRef
And:
Video Shot Segmentation and Classification,
ICPR00(Vol I: 860-863).
IEEE DOI 0009
BibRef

Wang, H.L.[Hua-Lu], Divakaran, A.[Ajay], Vetro, A.[Anthony], Chang, S.F.[Shih-Fu], Sun, H.F.[Hui-Fang],
Survey of compressed-domain features used in audio-visual indexing and analysis,
JVCIR(14), No. 2, June 2003, pp. 150-183.
Elsevier DOI 0306
Survey, Image Retrieval. BibRef

Naphade, M.R.[Milind R.],
On supervision and statistical learning for semantic multimedia analysis,
JVCIR(15), No. 3, September 2004, pp. 348-369.
Elsevier DOI 0711
Factor graphs; Sum product algorithm; Active learning; Hidden Markov models; Dynamic Bayesian networks; Support vector machines BibRef

Naphade, M.R., Kozintsev, I.V., Huang, T.S.,
A factor graph framework for semantic video indexing,
CirSysVideo(12), No. 1, January 2002, pp. 40-52.
IEEE Top Reference. 0202
BibRef

Naphade, M.R., Kozintsev, I.V., Huang, T.S., Ramchandran, K.,
A factor graph framework for semantic indexing and retrieval in video,
CBAIVL00(35-39). 0008
BibRef

Naphade, M.R.[Milind R.], Huang, T.S.[Thomas S.],
Detecting Semantic Concepts Using Context and Audio/Visual Features,
EventVideo01(92-98).
IEEE DOI 0106
BibRef
Earlier:
Recognizing High-level Audio-visual Concepts Using Context,
ICIP01(III: 46-49).
IEEE DOI 0108
BibRef
Earlier:
Semantic Video Indexing Using a Probabilistic Framework,
ICPR00(Vol III: 79-84).
IEEE DOI 0009
BibRef
And:
A Probabilistic Framework for Semantic Indexing and Retrieval in Video,
ICME00(MP9). 0007
BibRef
And:
Inferring Semantic Concepts for Video Indexing and Retrieval,
ICIP00(Vol III: 766-769).
IEEE DOI 0008
BibRef

Naphade, M.R., Kristjansson, T., Frey, B.J., Huang, T.S.,
Probabilistic multimedia objects (multijects): a novel approach to video indexing and retrieval in multimedia systems,
ICIP98(III: 536-540).
IEEE DOI 9810
BibRef

Xie, X., Lu, L., Jia, M., Li, H., Seide, F., Ma, W.Y.,
Mobile Search With Multimodal Queries,
PIEEE(96), No. 4, April 2008, pp. 589-601.
IEEE DOI 0804
Text, image, audio queries. BibRef

Kiranyaz, S., Gabbouj, M.,
Generic content-based audio indexing and retrieval framework,
VISP(153), No. 3, June 2006, pp. 285-297.
DOI Link 0608

See also Novel multimedia retrieval technique: progressive query (why wait?). BibRef

Monaci, G., Jost, P., Vandergheynst, P., Mailhe, B., Lesage, S., Gribonval, R.,
Learning Multimodal Dictionaries,
IP(16), No. 9, September 2007, pp. 2272-2283.
IEEE DOI 0709
Integrating audio-visual info. BibRef

Zhang, T.[Tong],
Using background audio change detection for segmenting video,
US_Patent7,266,287, Sep 4, 2007
WWW Link. BibRef 0709

Kotti, M., Ververidis, D., Evangelopoulos, G., Panagakis, I., Kotropoulos, C., Maragos, P., Pitas, I.,
Audio-Assisted Movie Dialogue Detection,
CirSysVideo(18), No. 11, November 2008, pp. 1618-1627.
IEEE DOI 0811
BibRef

Cristani, M.[Marco], Bicego, M.[Manuele], Murino, V.[Vittorio],
Audio-Visual Event Recognition in Surveillance Video Sequences,
MultMed(9), No. 2, February 2007, pp. 257-267.
IEEE DOI 0905
BibRef
Earlier:
Audio-Visual Foreground Extraction for Event Characterization,
SLAM06(116).
IEEE DOI 0609
BibRef
Earlier:
Audio-Video Integration for Background Modelling,
ECCV04(Vol II: 202-213).
Springer DOI 0405
BibRef

Zeng, Z.H.[Zhi-Hong], Tu, J.L.[Ji-Lin], Liu, M.[Ming], Huang, T.S.[Thomas S.], Pianfetti, B.[Brian], Roth, D.[Dan], Levinson, S.[Stephen],
Audio-Visual Affect Recognition,
MultMed(9), No. 2, February 2007, pp. 424-428.
IEEE DOI 0905
BibRef

Zeng, Z.H.[Zhi-Hong], Tu, J.L.[Ji-Lin], Pianfetti, B.M., Huang, T.S.,
Audio-Visual Affective Expression Recognition Through Multistream Fused HMM,
MultMed(10), No. 4, June 2008, pp. 570-577.
IEEE DOI 0905
BibRef

Zeng, Z.H.[Zhi-Hong], Tu, J.L.[Ji-Lin], Pianfetti, B.[Brian], Liu, M.[Ming], Zhang, T.[Tong], Zhang, Z.Q.[Zhen-Qiu], Huang, T.S.[Thomas S.], Levinson, S.[Stephen],
Audio-Visual Affect Recognition through Multi-Stream Fused HMM for HCI,
CVPR05(II: 967-972).
IEEE DOI 0507
BibRef

Zhang, S.L., Huang, Q.M., Jiang, S., Gao, W., Tian, Q.,
Affective Visualization and Retrieval for Music Video,
MultMed(12), No. 6, 2010, pp. 510-522.
IEEE DOI 1003
BibRef

Zhang, S.L.[Shi-Liang], Tian, Q.[Qi], Hua, G., Huang, Q.M.[Qing-Ming], Gao, W.[Wen],
Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications,
IP(20), No. 9, September 2011, pp. 2664-2677.
IEEE DOI 1109

See also Edge-SIFT: Discriminative Binary Descriptor for Scalable Partial-Duplicate Mobile Search. BibRef

Zhang, S.L.[Shi-Liang], Tian, Q.[Qi], Huang, Q.M.[Qing-Ming], Gao, W.[Wen], Rui, Y.[Yong],
USB: Ultrashort Binary Descriptor for Fast Visual Matching and Retrieval,
IP(23), No. 8, August 2014, pp. 3671-3683.
IEEE DOI 1408
data compression
See also Edge-SIFT: Discriminative Binary Descriptor for Scalable Partial-Duplicate Mobile Search. BibRef

Zhang, S.L.[Shi-Liang], Tian, Q.[Qi], Huang, Q.M.[Qing-Ming], Gao, W.[Wen], Rui, Y.,
Cascade Category-Aware Visual Search,
IP(23), No. 6, June 2014, pp. 2514-2527.
IEEE DOI 1406
Accuracy BibRef

Irie, G., Satou, T., Kojima, A., Yamasaki, T., Aizawa, K.,
Affective Audio-Visual Words and Latent Topic Driving Model for Realizing Movie Affective Scene Classification,
MultMed(12), No. 6, 2010, pp. 523-535.
IEEE DOI 1003
BibRef

Ibrahim, Z.A.[Zein Al_Abidin], Ferrane, I.[Isabelle], Joly, P.[Philippe],
A Similarity-Based Approach for Audiovisual Document Classification Using Temporal Relation Analysis,
JIVP(2011), No. 2011, pp. xx-yy.
DOI Link 1104
BibRef

Philippeau, J.[Jeremy], Pinquier, J.[Julien], Joly, P.[Philippe], Carrive, J.[Jean],
Dynamic organization of audiovisual database using a user-defined similarity measure based on low-level features,
ICIP08(33-36).
IEEE DOI 0810
BibRef

Haidar, S.[Siba], Joly, P.[Philippe], Chebaro, B.[Bilal],
Style Similarity Measure for Video Documents Comparison,
CIVR05(307-317).
Springer DOI 0507
BibRef

Huurnink, B.[Bouke], Snoek, C.G.M.[Cees G. M.], de Rijke, M.[Maarten], Smeulders, A.W.M.[Arnold W. M.],
Content-Based Analysis Improves Audiovisual Archive Retrieval,
MultMed(14), No. 4, 2012, pp. 1166-1178.
IEEE DOI 1208
BibRef
Earlier:
Today's and tomorrow's retrieval practice in the audiovisual archive,
CIVR10(18-25).
DOI Link 1007
BibRef

Huurnink, B.[Bouke], de Rijke, M.[Maarten],
The value of stories for speech-based video search,
CIVR07(266-271).
DOI Link 0707
BibRef

Jhuo, I.H.[I-Hong], Ye, G.N.[Guang-Nan], Gao, S.H.[Sheng-Hua], Liu, D.[Dong], Jiang, Y.G.[Yu-Gang], Lee, D.T., Chang, S.F.[Shih-Fu],
Discovering joint audio-visual codewords for video event detection,
MVA(25), No. 1, January 2014, pp. 33-47.
Springer DOI 1412
BibRef
Earlier: A2, A1, A4, A5, A6, A7, Only:
Joint audio-visual bi-modal codewords for video event detection,
ICMR12(39).
DOI Link 1301
BibRef

Feki, I.[Issam], Ben Ammar, A.[Anis], Alimi, A.M.[Adel M.],
Automatic environmental sound concepts discovery for video retrieval,
MultInfoRetr(5), No. 2, June 2016, pp. 105-115.
WWW Link. 1605
BibRef

Khan, M.U.G.[Muhammad Usman Ghani], Gotoh, Y.[Yoshihiko],
Generating natural language tags for video information management,
MVA(28), No. 3-4, May 2017, pp. 243-265.
WWW Link. 1704
BibRef

Khan, M.U.G.[Muhammad Usman Ghani], Zhang, L.[Lei], Gotoh, Y.[Yoshihiko],
Generating coherent natural language annotations for video streams,
ICIP12(2893-2896).
IEEE DOI 1302
BibRef
Earlier:
Towards coherent natural language description of video streams,
SIG11(664-671).
IEEE DOI 1201
BibRef
Earlier: A2, A1, A3:
Video scene classification based on natural language description,
ARTEMIS11(942-949).
IEEE DOI 1201
From the small amount of natural language description. BibRef


Guo, X.N.[Xiao-Na], Zhong, W.[Wei], Ye, L.[Long], Fang, L.[Li], Heng, Y.[Yan], Zhang, Q.[Qin],
Global Affective Video Content Regression Based on Complementary Audio-visual Features,
MMMod20(II:540-550).
Springer DOI 2003
BibRef

Peri, D.[Dheeraj], Sah, S.[Shagan], Ptucha, R.[Raymond],
Show, Translate and Tell,
ICIP19(295-299)
IEEE DOI 1910
Joint images and captions. BibRef

Chen, K.[Kan], Zhang, C.X.[Chuan-Xi], Fang, C.[Chen], Wang, Z.W.[Zhao-Wen], Bui, T.[Trung], Nevatia, R.[Ram],
Visually Indicated Sound Generation by Perceptually Optimized Classification,
MultLearnApp18(VI:560-574).
Springer DOI 1905
Predict visually consistent sound from the video content. BibRef

Haurilet, M.L., Tapaswi, M., Al-Halah, Z., Stiefelhagen, R.,
Naming TV characters by watching and analyzing dialogs,
WACV16(1-9)
IEEE DOI 1606
Data models BibRef

Numano, S.[Shunsuke], Enami, N.[Naoko], Ariki, Y.[Yasuo],
Task-Driven Saliency Detection on Music Video,
CV4AC14(658-671).
Springer DOI 1504
BibRef

Scott, D.[David], Zhang, Z.X.[Zhen-Xing], Albatal, R.[Rami], McGuinness, K.[Kevin], Acar, E.[Esra], Hopfgartner, F.[Frank], Gurrin, C.[Cathal], O'Connor, N.E.[Noel E.], Smeaton, A.F.[Alan F.],
Audio-Visual Classification Video Browser,
MMMod14(II: 398-401).
Springer DOI 1405
BibRef

Lin, Y.T.[Yin-Tzu], Tsai, T.H.[Tsung-Hung], Hu, M.C.[Min-Chun], Cheng, W.H.[Wen-Huang], Wu, J.L.[Ja-Ling],
Semantic Based Background Music Recommendation for Home Videos,
MMMod14(II: 283-290).
Springer DOI 1405
BibRef

Shamma, D.A.[David A.], Kennedy, L.[Lyndon], Churchill, E.F.[Elizabeth F.],
Watching and talking: media content as social nexus,
ICMR12(12).
DOI Link 1301
BibRef

Nowak, S.[Stefanie], Paduschek, R.[Ronny], Kühhirt, U.[Uwe],
Photo summary: automated selection of representative photos from a digital collection,
ICMR11(75).
DOI Link 1301
Demo. BibRef

Paduschek, R.[Ronny], Nowak, S.[Stefanie], Kühhirt, U.[Uwe],
Automated detection of errors and quality issues in audio-visual content,
ICMR11(74).
DOI Link 1301
automated detection of errors and quality issues in audio-visual content AVInspector. BibRef

Vretos, N.[Nicholas], Nikolaidis, N.[Nikos], Pitas, I.[Ioannis],
The use of Audio-Visual Description Profile in 3D video content description,
3DTV12(1-4).
IEEE DOI 1212
BibRef

Ta, A.P.[Anh-Phuong], Ben, M.[Mathieu], Gravier, G.[Guillaume],
Improving Cluster Selection and Event Modeling in Unsupervised Mining for Automatic Audiovisual Video Structuring,
MMMod12(529-540).
Springer DOI 1201
BibRef

Mühling, M.[Markus], Ewerth, R.[Ralph], Freisleben, B.[Bernd],
Improving Cross-Domain Concept Detection via Object-Based Features,
CAIP15(II:359-370).
Springer DOI 1511
BibRef
Earlier:
On the Spatial Extents of SIFT Descriptors for Visual Concept Detection,
CVS11(71-80).
Springer DOI 1109
BibRef

Mühling, M.[Markus], Ewerth, R.[Ralph], Zhou, J.[Jun], Freisleben, B.[Bernd],
Multimodal Video Concept Detection via Bag of Auditory Words and Multiple Kernel Learning,
MMMod12(40-50).
Springer DOI 1201
BibRef

Valio, F.B.[Felipe Braunger], Pedrini, H.[Helio], Leite, N.J.[Neucimar Jeronimo],
Fast Rotation-Invariant Video Caption Detection Based on Visual Rhythm,
CIARP11(157-164).
Springer DOI 1111
BibRef

Gianni, F.[Frédéric], Pinquier, J.[Julien], Irisa, E.K.[Ewa Kijak],
ACADI showcase: Automatic character indexing in audiovisual document,
CIVR07(109-112).
DOI Link 0707
BibRef

Putthividhy, D.[Duangmanee], Attias, H.T.[Hagai T.], Nagarajan, S.S.[Srikantan S.],
Topic regression multi-modal Latent Dirichlet Allocation for image annotation,
CVPR10(3408-3415).
IEEE DOI 1006
Using annotation texts. BibRef

Jung, K.H.[Kwang-Hee], Choi, S.H.[Sung-Hyun], Kim, H.S.[Hyung-Seok], Hur, N.H.[Nam-Ho], Kim, J.K.[Joong Kyu],
Caption insertion method for 3D broadcasting service,
3DTV10(1-4).
IEEE DOI 1006
BibRef

Pramod, S.K.[Sankar K.], Jawahar, C.V., Zisserman, A.[Andrew],
Subtitle-free Movie to Script Alignment,
BMVC09(xx-yy).
PDF File. 0909
BibRef

Zeng, Z.[Zhi], Liang, W.[Wei], Li, H.P.[He-Ping], Zhang, S.W.[Shu-Wu],
A Novel Video Classification Method Based on Hybrid Generative/Discriminative Models,
SSPR08(705-713).
Springer DOI 0812
Using audio. BibRef

Zhu, Y.Y.[Ying-Ying], Ming, Z.[Zhong], Huang, Q.A.[Qi-Ang],
SVM-Based Audio Classification for Content- Based Multimedia Retrieval,
MCAM07(474-482).
Springer DOI 0706
BibRef

Goldmann, L., Samour, A., Karaman, M., Sikora, T.,
Extracting High Level Semantics by Means of Speech, Audio, and Image Primitives in Surveillance Applications,
ICIP06(2397-2400).
IEEE DOI 0610
BibRef

Luo, J.[Jie], Caputo, B.[Barbara], Zweig, A.[Alon], Bach, J.H.[Jörg-Hendrik], Anemüller, J.[Jörn],
Object Category Detection Using Audio-Visual Cues,
CVS08(xx-yy).
Springer DOI 0805
BibRef

Caputo, B., Wallraven, C., Nilsback, M.E.,
Object categorization via local kernels,
ICPR04(II: 132-135).
IEEE DOI 0409
BibRef

Schauer, C., Gross, H.M.,
A Computational Model of Early Auditory-Visual Integration,
DAGM03(362-369).
Springer DOI 0310
BibRef

Fu, T.Y.[Tie-Yan], Liu, X.X.[Xiao Xing], Liang, L.H.[Lu Hong], Pi, X.B.[Xiao-Bo], Nefian, A.V.,
A audio-visual speaker identification using coupled hidden Markov models,
ICIP03(III: 29-32).
IEEE DOI 0312
BibRef

Yemez, Y.[Yücel], Kanak, A., Erzin, E., Tekalp, A.M.,
Multimodal speaker identification with audio-video processing,
ICIP03(III: 5-8).
IEEE DOI 0312
BibRef

Sugano, M., Isaksson, R., Nakajima, Y., Yanagihara, H.,
Shot genre classification using compressed audio-visual features,
ICIP03(II: 17-20).
IEEE DOI 0312
BibRef

Moncrieff, S., Venkatesh, S., and Dorai, C.,
Horror film genre typing and scene labeling via audio analysis,
ICME03(I: 193-196). BibRef 0300

Moncrieff, S., Dorai, C., Venkatesh, S.,
Affect computing in film through sound energy dynamics,
ACMMM01(525-527). BibRef 0100

Wachsmuth, S., Sagerer, G.,
Integrated analysis of speech and images as a probabilistic decoding process,
ICPR02(II: 588-592).
IEEE DOI 0211
BibRef

Kulesh, V., Petrushin, V.A., Sethi, I.K.,
Video clip recognition using joint audio-visual processing model,
ICPR02(I: 500-503).
IEEE DOI 0211
BibRef

Miyamori, H.,
Improving accuracy in behaviour identification for content-based retrieval by using audio and video information,
ICPR02(II: 826-830).
IEEE DOI 0211
BibRef

de Santo, M., Percannella, G., Sansone, C., Vento, M.,
Classifying audio of movies by a multi-expert system,
CIAP01(386-391).
IEEE DOI 0210
BibRef

Albiol, A., Torres, L., Delp, E.J.,
Video preprocessing for audiovisual indexing,
Southwest02(57-61).
IEEE Top Reference. 0208
BibRef

Bakker, E.M.[Erwin M.], Lew, M.S.[Michael S.],
Semantic Video Retrieval Using Audio Analysis,
CIVR02(271-277).
Springer DOI 0208
BibRef

Kim, K.[Kyungsu], Choi, J.[Junho], Kim, N.[Namjung], Kim, P.K.[Pan-Koo],
Extracting Semantic Information from Basketball Video Based on Audio-Visual Features,
CIVR02(278-288).
Springer DOI 0208
BibRef

Fisher, J.W.[John W.], Darrell, T.J.[Trevor J.],
Probabalistic Models and Informative Subspaces for Audiovisual Correspondence,
ECCV02(III: 592 ff.).
Springer DOI 0205
BibRef

Chu, S.M.[Stephen M.], Huang, T.S.[Thomas S.],
Audio-Visual Speech Fusion Using Coupled Hidden Markov Models,
MSCSAS07(1-2).
IEEE DOI 0706
BibRef

Naphade, M.R.[Milind R.], Garg, A.[Ashutosh], Huang, T.S.[Thomas S.],
Audio-Visual Event Detection using Duration Dependent Input Output Markov Models,
CBAIVL01(30).
IEEE DOI 0110
BibRef

Alatan, A.A.,
Automatic Multi-modal Dialogue Scene Indexing,
ICIP01(III: 374-377).
IEEE DOI 0108
BibRef

Sundaram, H.[Hari], Chang, S.F.[Shih-Fu],
Video Scene Segmentation Using Video and Audio Features,
ICME00(TP10). 0007
BibRef

Smith, J.R.[John R.], Li, C.S.[Chung-Sheng],
Adaptive Synthesis in Progressive Retrieval of Audio-Visual Data,
ICME00(MP5). 0007
BibRef

Toklu, C., Liou, S.P.,
Image and Audio Sequence Visualization and Interaction Mechanisms for Structured Video Browsing and Editing,
ICIP00(Vol II: 263-266).
IEEE DOI 0008
BibRef

Jiang, H.[Hao], Lin, T.[Tong], Zhang, H.J.[Hong-Jiang],
Video Segmentation with the Assistance of Audio Content Analysis,
ICME00(WP5). 0007
BibRef

Pandit, M., Kittler, J.V., Li, Y., Chilton, E.,
A Comparative Study of Different Segmentation Approaches for Audio Track Indexing,
ICPR00(Vol II: 467-470).
IEEE DOI 0009
BibRef

Huang, J.C.[Jin-Cheng], Liu, Z.[Zhu], Yao, W.[Wang],
Integration of audio and visual information for content-based video segmentation,
ICIP98(III: 526-529).
IEEE DOI 9810
BibRef

Saraceno, C., Leonardi, R.,
Identification of story units in audio-visual sequences by joint audio and video processing,
ICIP98(I: 363-367).
IEEE DOI 9810
BibRef

Chapter on Implementations and Applications, Databases, QBIC, Video Analysis, Hardware and Software, Inspection continues in
Survey, Comparison, Evaluation, of Segmentation and Cut Detection, Summarization .


Last update:Mar 16, 2024 at 20:36:19