26.1.12 Speech Recognition, Speech Analysis, Signal Processing

Chapter Contents (Back)
These are mostly included since they are in the full ToC for journals that are taken completely. There is no attempt to have anywhere near complete speech recognition coverage. Speech.
See also Speech Recognition, Neural Networks, CNN.
See also Emotion Recognition, from Other Than Faces.

Dragon Voice,
2005 Speech Recognition
WWW Link. Vendor, Speech Recognition. Developed from the original Dragon speech system.

Hanson, A.R., Riseman, E.M., Fisher, E.,
Context in word recognition,
PR(8), No. 1, January 1976, pp. 35-45.
Elsevier DOI 0309
BibRef

de Mori, R., Laface, P., Makhonine, V.A., Mezzalama, M.,
A syntactic procedure for the recognition of glottal pulses in continuous speech,
PR(9), No. 4, 1977, pp. 181-189.
Elsevier DOI 0309
BibRef

Maroy, J.P., Berthod, M.,
Natural language understanding by a robot: A pattern recognition problem,
PR(10), No. 2, 1978, pp. 63-71.
Elsevier DOI 0309
BibRef

Pal, S.K., Datta, A.K., Majumder, D.D.[D. Dutta],
A self-supervised vowel recognition system,
PR(12), No. 1, 1980, pp. 27-34.
Elsevier DOI 0309
BibRef

Pathak, A.[Amita], Pal, S.K.[Sankar K.],
On the convergence of 'A self-supervised vowel recognition system',
PR(20), No. 2, 1987, pp. 237-244.
Elsevier DOI 0309
BibRef

de Mori, R.[Renato], Giordano, G.[Giovanna],
Algorithms for syllabic hypothesization in continuous speech,
PR(14), No. 1-6, 1981, pp. 245-260.
Elsevier DOI 0309
BibRef

Tanaka, E.[Eiichi], Toyama, T.[Takanori], Kawai, S.[Sachiko],
High speed error correction of phoneme sequences,
PR(19), No. 5, 1986, pp. 407-412.
Elsevier DOI 0309
BibRef

Lee, L.S., Tseng, C.Y., Chen, K.J., Huang, J., Hwang, C.H., Ting, P.Y., Lin, L.J., Chen, C.C.,
A Mandarin dictation machine based upon a hierarchical recognition approach and Chinese natural language analysis,
PAMI(12), No. 7, July 1990, pp. 695-704.
IEEE DOI 0401
BibRef

Kenny, P., Lennig, M., Mermelstein, P.,
Speaker adaptation in a large-vocabulary Gaussian HMM recognizer,
PAMI(12), No. 9, September 1990, pp. 917-920.
IEEE DOI 0401
BibRef

Casacuberta, F.,
Some relations among stochastic finite state networks used in automatic speech recognition,
PAMI(12), No. 7, July 1990, pp. 691-695.
IEEE DOI 0401
BibRef

Yannakoudakis, E.J., Tsomokos, I., Hutton, P.J.,
n-Grams and their implication to natural language understanding,
PR(23), No. 5, 1990, pp. 509-528.
Elsevier DOI 0401
BibRef

Ney, H.[Hermann],
A comparative study of two search strategies for connected word recognition: dynamic programming and heuristic search,
PAMI(14), No. 5, May 1992, pp. 586-595.
IEEE DOI 0401
BibRef

Ney, H.[Hermann],
Stochastic Modelling: From Pattern Classification to Speech Recognition and Translation,
ICPR00(Vol III: 21-28).
IEEE DOI 0009
BibRef

Liu, L.C.[Lih-Cherng], Chiou, D.[Denis], Wang, H.C.[Hsiao-Chuan],
A speech recognition method based on feature distributions,
PR(24), No. 8, 1991, pp. 717-722.
Elsevier DOI 0401
BibRef

Pinkowski, B.[Ben],
Multiscale fourier descriptors for classifying semivowels in spectrograms,
PR(26), No. 10, October 1993, pp. 1593-1602.
Elsevier DOI 0401
BibRef

Pinkowski, B.[Ben],
Principal Component Analysis of Speech Spectrogram Images,
PR(30), No. 5, May 1997, pp. 777-787.
Elsevier DOI 9705
BibRef

Mast, M., Kummert, F., Ehrlich, U., Fink, G.A., Kuhn, T., Niemann, H., Sagerer, G.F.,
A speech understanding and dialog system with a homogeneous linguistic knowledge base,
PAMI(16), No. 2, February 1994, pp. 179-194.
IEEE DOI 0401
BibRef

Huo, Q.A.[Qi-Ang], Chan, C.[Chorkin],
Contextual vector quantization for speech recognition with discrete hidden Markov model,
PR(28), No. 4, April 1995, pp. 513-517.
Elsevier DOI 0401
BibRef

Pham, T.D.[Tuan D.], Wagner, M.[Michael],
A geostatistical model for linear prediction analysis of speech,
PR(31), No. 12, December 1998, pp. 1981-1991.
Elsevier DOI 0401
BibRef

Han, J.Q.[Ji-Qing], Gao, W.[Wen],
Robust telephone speech recognition based on channel compensation,
PR(32), No. 6, June 1999, pp. 1061-1067.
Elsevier DOI 0401
BibRef

Deng, S.[Shiwen], Han, J.Q.[Ji-Qing],
Sparse Decomposition for Signal Periodic Model Over Complex Exponential Dictionary,
SPLetters(23), No. 12, December 2016, pp. 1858-1861.
IEEE DOI 1612
signal representation BibRef
And:
Voice Activity Detection Based on Complex Exponential Atomic Decomposition and Likelihood Ratio Test,
ICPR10(89-92).
IEEE DOI 1008
BibRef

Lewis, M.A.[Michael A.], Ramachandran, R.P.[Ravi P.],
Cochannel speaker count labelling based on the use of cepstral and pitch prediction derived features,
PR(34), No. 2, February 2001, pp. 499-507.
Elsevier DOI 0011
BibRef

Kant, S.[Shri], Verma, N.[Neelam],
An Effective Source Recognition Algorithm: Extraction of Significant Binary Words,
PRL(21), No. 11, October 2000, pp. 981-988. 0010
BibRef

Kwong, S., He, Q.H., Man, K.F., Tang, K.S.,
A maximum model distance approach for HMM-based speech recognition,
PR(31), No. 3, March 1998, pp. 219-229.
Elsevier DOI 0401
BibRef

He, Q.H., Kwong, S., Man, K.F., Tang, K.S.,
An improved maximum model distance approach for HMM-based speech recognition systems,
PR(33), No. 10, October 2000, pp. 1749-1758.
Elsevier DOI 0006
BibRef

Wu, C.H., Chen, Y.J., Yan, G.L.,
Integration of phonetic and prosodic information for robust utterance verification,
VISP(147), No. 1, February 2000, pp. 55. 0005
BibRef

Kim, W.[Wooil], Kang, S.[Sunmee], Ko, H.S.[Han-Seok],
Spectral subtraction based on phonetic dependency and masking effects,
VISP(147), No. 5, October 2000, pp. 423-427. 0101
BibRef

Hussain, A., Campbell, D.R.,
Intelligibility improvements using binaural diverse sub-band processing applied to speech corrupted with automobile noise,
VISP(148), No. 2, April 2001, pp. 127-132. 0106
BibRef

Bohez, E.L.J.[Erik L.J.], Senevirathne, T.R.,
Speech recognition using fractals,
PR(34), No. 11, November 2001, pp. 2227-2243.
Elsevier DOI 0108
BibRef

Chen, S.H., Wang, J.F.,
Application of wavelet transforms for C/V segmentation on Mandarin speech signals,
VISP(148), No. 2, April 2001, pp. 133-139. 0106
BibRef

Mouria-Beji, F.[Fériel],
A hierarchical Bayesian model for continuous speech recognition,
PRL(23), No. 7, May 2002, pp. 773-781.
Elsevier DOI 0203
BibRef

Chen, F.K., Yang, J.F., Yan, Y.L.,
Candidate scheme for fast ACELP search,
VISP(149), No. 1, February 2002, pp. 10-16.
IEEE Top Reference. 0205
Algebraic code excited linear prediction. Speech coding. BibRef

Liu, J.W.[Jing-Wei], Cheng, Q.S.[Qian-Sheng], Zheng, Z.G.[Zhong-Guo], Qian, M.P.[Min-Ping],
A DTW-based probability model for speaker feature analysis and data mining,
PRL(23), No. 11, September 2002, pp. 1271-1276.
Elsevier DOI 0206
BibRef

Huang, C.S.[Chao-Shih], Wang, H.C.[Hsiao-Chuan],
Bandwidth-adjusted LPC analysis for robust speech recognition,
PRL(24), No. 9-10, June 2003, pp. 1583-1587.
Elsevier DOI 0304
BibRef

Juang, Y.T.[Yau-Tarng], Huang, K.C.[Kuo-Chang], Ding, I.J.[Ing-Jr],
Speaker adaptation based on MAP estimation using fuzzy controller,
PRL(24), No. 15, November 2003, pp. 2807-2813.
Elsevier DOI 0308
BibRef

Ding, I.J.[Ing-Jr],
Incremental MLLR speaker adaptation by fuzzy logic control,
PR(40), No. 11, November 2007, pp. 3110-3119.
Elsevier DOI 0707
Speech recognition; Speaker adaptation; Hidden Markov model; Maximum likelihood linear regression; T-S fuzzy logic controller BibRef

Li, T.F.[Tze Fen],
Speech Recognition of Mandarin Monosyllables,
PR(36), No. 11, November 2003, pp. 2713-2721.
Elsevier DOI 0309
BibRef

Farooq, O., Datta, S.,
Wavelet based robust sub-band features for phoneme recognition,
VISP(151), No. 3, June 2004, pp. 187-193.
IEEE Abstract. 0409
BibRef

Ricotti, L.P.,
Multitapering and a wavelet variant of MFCC in speech recognition,
VISP(152), No. 1, February 2005, pp. 29-35.
IEEE Abstract. 0501
BibRef

Chen, K.[Ke],
On the use of different speech representations for speaker modeling,
SMC-C(35), No. 3, August 2005, pp. 301-314.
IEEE DOI 0508
BibRef

Zhong, W., Li, S., Tai, H.M.,
Signal subspace approach for narrowband noise reduction in speech,
VISP(152), No. 6, December 2005, pp. 800-805.
DOI Link 0512
BibRef

Chen, B.[Berlin],
Exploring the use of latent topical information for statistical Chinese spoken document retrieval,
PRL(27), No. 1, 1 January 2006, pp. 9-18.
Elsevier DOI 0512
BibRef

Chen, B.[Berlin], Chen, Y.T.[Yi-Ting],
Extractive spoken document summarization for information retrieval,
PRL(29), No. 4, 1 March 2008, pp. 426-437.
Elsevier DOI 0711
Extractive summarization; Information retrieval; Topical mixture model; Spoken documents; Speech recognition BibRef

Wan, C.[Chunru], Liu, M.C.[Ming-Chun],
Content-based audio retrieval with relevance feedback,
PRL(27), No. 2, 15 January 2006, pp. 85-92.
Elsevier DOI 0512
BibRef

Radhakrishnan, R.[Regunathan], Divakaran, A.[Ajay], Xiong, Z.Y.[Zi-You], Otsuka, I.[Isao],
A Content-Adaptive Analysis and Representation Framework for Audio Event Discovery from 'Unscripted' Multimedia,
JASP(2006), 2006, pp. 1-24.
DOI Link 0603
BibRef

Chu, W.T.[Wei-Ta], Cheng, W.H.[Wen-Huang], Wu, J.L.[Ja-Ling],
Semantic Context Detection Using Audio Event Fusion,
JASP(2006), 2006, pp. 1-12.
WWW Link. 0603
BibRef

Liu, J.W.[Jing-Wei], Wang, Z.Y.[Zuo-Ying], Xiao, X.[Xi],
A hybrid SVM/DDBHMM decision fusion modeling for robust continuous digital speech recognition,
PRL(28), No. 8, 1 June 2007, pp. 912-920.
Elsevier DOI 0704
Speech recognition; Gaussian mixture model; Duration distribution based hidden Markov model (DDBHMM); Support vector machine BibRef

Leavitt, N.,
Two technologies vie for recognition in speech market,
Computer(36), No. 6, June 2003, pp. 13-16.
IEEE DOI 0306
BibRef

Paulson, L.D.,
Speech Recognition Moves from Software to Hardware,
Computer(39), No. 11, November 2006, pp. 15-18.
IEEE DOI 0611
BibRef

Araujo, L.[Lourdes], Serrano, J.I.[J. Ignacio],
Highly accurate error-driven method for noun phrase detection,
PRL(29), No. 4, 1 March 2008, pp. 547-557.
Elsevier DOI 0711
Noun phrase detection; Evolutionary programming; Grammar induction; Information retrieval BibRef

Zhang, Y.X.[Yong-Xin], Scordilis, M.S.[Michael S.],
Effective online unsupervised adaptation of Gaussian mixture models and its application to speech classification,
PRL(29), No. 6, 15 April 2008, pp. 735-744.
Elsevier DOI 0803
Gaussian mixture model; Speech classification; Online adaptation; Unsupervised adaptation BibRef

O'Shaughnessy, D.[Douglas],
Invited paper: Automatic speech recognition: History, methods and challenges,
PR(41), No. 10, October 2008, pp. 2965-2979.
Elsevier DOI 0808
Automatic speech recognition; Hidden Markov models; Adaptation; Compensation; Pattern recognition; Spectral representation BibRef

Zeng, J.[Jia], Xie, L.[Lei], Liu, Z.Q.[Zhi-Qiang],
Type-2 fuzzy Gaussian mixture models,
PR(41), No. 12, December 2008, pp. 3636-3643.
Elsevier DOI 0810
BibRef
Earlier: A1, A3, Only:
Type-2 fuzzy hidden markov models to phoneme recognition,
ICPR04(I: 192-195).
IEEE DOI 0409
Type-2 fuzzy sets; Gaussian mixture models; Hidden Markov models BibRef

Chen, B.[Berlin], Liu, S.H.[Shih-Hung], Chu, F.H.[Fang-Hui],
Training data selection for improving discriminative training of acoustic models,
PRL(30), No. 13, 1 October 2009, pp. 1228-1235.
Elsevier DOI 0909
Continuous speech recognition; Discriminative training; Acoustic models; Data selection; Phone accuracy; Entropy BibRef

Kang, S.W.[Sang-Woo], Kim, H.[Harksoo], Seo, J.Y.[Jung-Yun],
A reliable multidomain model for speech act classification,
PRL(31), No. 1, 1 January 2010, pp. 71-74.
Elsevier DOI 1001
Speech act classification; Dialogue domain detection; Multidomain dialogue BibRef

Kang, S.W.[Sang-Woo], Seo, J.Y.[Jung-Yun],
Two-phase reanalysis model for understanding user intention,
PRL(42), No. 1, 2014, pp. 35-39.
Elsevier DOI 1404
Natural language processing BibRef

Milone, D.H.[Diego H.], di Persia, L.E.[Leandro E.], Torres, M.E.[Maria E.],
Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees,
PR(43), No. 4, April 2010, pp. 1577-1589.
Elsevier DOI 1002
Sequence learning; EM algorithm; Wavelets; Speech recognition BibRef

Lu, Y.[Yong], Wu, H.Y.[Hai-Yang], Zhou, L.[Lin], Wu, Z.Y.[Zhen-Yang],
Multi-environment model adaptation based on vector Taylor series for robust speech recognition,
PR(43), No. 9, September 2010, pp. 3093-3099.
Elsevier DOI 1006
Model adaptation; Vector Taylor series; Multi-environment model; Speech recognition BibRef

Hong, H., Zhao, Z., Wang, X., Tao, Z.,
Detection of Dynamic Structures of Speech Fundamental Frequency in Tonal Languages,
SPLetters(17), No. 10, October 2010, pp. 843-846.
IEEE DOI 1008
BibRef

Heracleous, P.[Panikos], Badin, P.[Pierre], Bailly, G.[Gerard], Hagita, N.[Norihiro],
A pilot study on augmented speech communication based on Electro-Magnetic Articulography,
PRL(32), No. 8, 1 June 2011, pp. 1119-1125.
Elsevier DOI 1101
Augmented speech; Electro-Magnetic Articulography (EMA); Automatic speech recognition; Hidden Markov model (HMMs); Fusion; Noise robustness BibRef

Chen, B.[Berlin], Chen, W.H.[Wei-Hau], Lin, S.H.[Shih-Hsiang], Chu, W.Y.[Wen-Yi],
Robust speech recognition using spatial-temporal feature distribution characteristics,
PRL(32), No. 7, 1 May 2011, pp. 919-926.
Elsevier DOI 1101
Speech recognition, Noise robustness, Histogram equalization, Spatial-temporal distribution characteristics, Aurora-2 BibRef

Zamani, B.[Behzad], Akbari, A.[Ahmad], Nasersharif, B.[Babak], Jalalvand, A.[Azarakhsh],
Optimized discriminative transformations for speech features based on minimum classification error,
PRL(32), No. 7, 1 May 2011, pp. 948-955.
Elsevier DOI 1101
Minimum classification error; Principal Component Analysis; Linear Discriminant Analysis; Feature transformation; Hidden Markov Model BibRef

Lo, H.Y., Wang, J.C., Wang, H.M., Lin, S.D.,
Cost-Sensitive Multi-Label Learning for Audio Tag Annotation and Retrieval,
MultMed(13), No. 3, 2011, pp. 518-529.
IEEE DOI 1106
BibRef

Lu, L., Ghoshal, A., Renals, S.,
Regularized Subspace Gaussian Mixture Models for Speech Recognition,
SPLetters(18), No. 7, July 2011, pp. 419-422.
IEEE DOI 1101
BibRef

Lu, L., Renals, S.,
Probabilistic Linear Discriminant Analysis for Acoustic Modeling,
SPLetters(21), No. 6, June 2014, pp. 702-706.
IEEE DOI 1404
Analytical models BibRef

Remes, U., Palomaki, K.J., Raiko, T., Honkela, A., Kurimo, M.,
Missing-Feature Reconstruction With a Bounded Nonlinear State-Space Model,
SPLetters(18), No. 10, October 2011, pp. 563-566.
IEEE DOI 1109
Speech recognition. BibRef

He, Y., Han, J.,
Gaussian Specific Compensation for Channel Distortion in Speech Recognition,
SPLetters(18), No. 10, October 2011, pp. 599-602.
IEEE DOI 1109
BibRef

Roupakia, Z., Gales, M.,
Kernel Eigenvoices (Revisited) for Large-Vocabulary Speech Recognition,
SPLetters(18), No. 12, December 2011, pp. 709-712.
IEEE DOI 1112
BibRef

Kim, S.[Seonho], Yoon, J.[Juntae], Seo, J.Y.[Jung-Yun], Park, S.[Seog],
Improving Korean verb-verb morphological disambiguation using lexical knowledge from unambiguous unlabeled data and selective web counts,
PRL(33), No. 1, 1 January 2012, pp. 62-70.
Elsevier DOI 1112
POS tagging; Verb-verb morphological disambiguation; Unlabeled corpora; Automatic annotation; Web counts; Hard example-based selective sampling BibRef

Geller, T.[Tom],
Talking to Machines,
CACM(55), No. 4, April 2012, pp. 14-16.
DOI Link 1204
Voice recognition programs like Siri are now capable of understanding spoken commands, recognizing a conversation's context, and answering questions in a personable manner. BibRef

Norrenbrock, C.R., Hinterleitner, F., Heute, U., Moller, S.,
Instrumental Assessment of Prosodic Quality for Text-to-Speech Signals,
SPLetters(19), No. 5, May 2012, pp. 255-258.
IEEE DOI 1204
BibRef

Seon, C.N.[Choong-Nyoung], Kim, H.[Harksoo], Seo, J.Y.[Jung-Yun],
A statistical prediction model of speakers' intentions using multi-level features in a goal-oriented dialog system,
PRL(33), No. 10, 15 July 2012, pp. 1397-1404.
Elsevier DOI 1205
Speech act prediction; Concept sequence prediction; Multi-level feature BibRef

Kang, S.W.[Sang-Woo], Ko, Y.J.[Young-Joong], Seo, J.Y.[Jung-Yun],
Hierarchical speech-act classification for discourse analysis,
PRL(34), No. 10, 15 July 2013, pp. 1119-1124.
Elsevier DOI 1306
Natural language processing; Discourse analysis; Speech act classification; Hierarchical structure; Dialogue system BibRef

Dehzangi, O.[Omid], Ma, B.[Bin], Chng, E.S.[Eng Siong], Li, H.Z.[Hai-Zhou],
Discriminative feature extraction for speech recognition using continuous output codes,
PRL(33), No. 13, 1 October 2012, pp. 1703-1709.
Elsevier DOI 1208
BibRef
Earlier:
Fuzzy rule selection using Iterative Rule Learning for speech data classification,
ICPR08(1-4).
IEEE DOI 0812
Speech recognition; Feature transformation; Generalized discriminant analysis; Output coding BibRef

Schroder, M.[Marc], Bevacqua, E.[Elisabetta], Cowie, R.[Roddy], Eyben, F.[Florian], Gunes, H.[Hatice], Heylen, D.[Dirk], ter Maat, M.[Mark], McKeown, G.[Gary], Pammi, S.[Sathish], Pantic, M.[Maja], Pelachaud, C.[Catherine], Schuller, B.[Bjorn], de Sevin, E.[Etienne], Valstar, M.F.[Michel F.], Wollmer, M.[Martin],
Building Autonomous Sensitive Artificial Listeners,
AffCom(3), No. 2, 2012, pp. 165-183.
IEEE DOI 1208
BibRef

Furui, S., Deng, L., Gales, M., Ney, H., Tokuda, K.,
Fundamental Technologies in Modern Speech Recognition,
SPMag(29), No. 3, 2012, pp. 16-17.
IEEE DOI 1210
From the Guest Editors. Survey of speech recognition, intro to special issue BibRef

Saon, G., Chien, J.T.,
Large-Vocabulary Continuous Speech Recognition Systems: A Look at Some Recent Advances,
SPMag(29), No. 3, 2012, pp. 18-33.
IEEE DOI 1210
Survey, Speech Recognition. BibRef

Wang, H.P.[Hai-Peng], Leung, C.C.[Cheung-Chi], Lee, T.[Tan], Ma, B.[Bin], Li, H.Z.[Hai-Zhou],
Shifted-Delta MLP Features for Spoken Language Recognition,
SPLetters(20), No. 1, January 2013, pp. 15-18.
IEEE DOI 1212
BibRef

Edwards, J.,
Researchers Push Speech Recognition Toward the Mainstream,
SPMag(30), No. 1, 2012, pp. 8-11.
IEEE DOI 1212
[Special Reports] BibRef

Das, B.[Biswajit], Mandal, S.[Sandipan], Mitra, P.[Pabitra], Basu, A.[Anupam],
Aging speech recognition with speaker adaptation techniques: Study on medium vocabulary continuous Bengali speech,
PRL(34), No. 3, 1 February 2013, pp. 335-343.
Elsevier DOI 1301
Aging speech recognition; Vocal tract length normalization (VTLN); Maximum likelihood linear transform (MLLT); Maximum likelihood linear regression (MLLR); Maximum a posteriori (MAP); Maximum mutual information estimation (MMIE) BibRef

Keefer, R., Liu, Y., Bourbakis, N.,
The Development and Evaluation of an Eyes-Free Interaction Model for Mobile Reading Devices,
HMS(43), No. 1, January 2013, pp. 76-91.
IEEE DOI 1301
Voice user interface. BibRef

O'Shaughnessy, D., Deng, L., Li, H.,
Speech Information Processing: Theory and Applications,
PIEEE(100), No. 5, May 2013, pp. 1034-1037.
IEEE DOI 1305
[Scanning the Issue], Introduction to special issue. BibRef

O'Shaughnessy, D.,
Acoustic Analysis for Automatic Speech Recognition,
PIEEE(100), No. 5, May 2013, pp. 1038-1053.
IEEE DOI 1305
BibRef

Fosler-Lussier, E., He, Y., Jyothi, P., Prabhavalkar, R.,
Conditional Random Fields in Speech, Audio, and Language Processing,
PIEEE(100), No. 5, May 2013, pp. 1054-1075.
IEEE DOI 1305
BibRef

Hermansky, H.,
Multistream Recognition of Speech: Dealing With Unknown Unknowns,
PIEEE(100), No. 5, May 2013, pp. 1076-1088.
IEEE DOI 1305
BibRef

Lee, C.H., Siniscalchi, S.M.,
An Information-Extraction Approach to Speech Processing: Analysis, Detection, Verification, and Recognition,
PIEEE(100), No. 5, May 2013, pp. 1089-1115.
IEEE DOI 1305
BibRef

He, X., Deng, L.,
Speech-Centric Information Processing: An Optimization-Oriented Approach,
PIEEE(100), No. 5, May 2013, pp. 1116-1135.
IEEE DOI 1305
BibRef

Young, S., Gasic, M., Thomson, B., Williams, J.D.,
POMDP-Based Statistical Spoken Dialog Systems: A Review,
PIEEE(100), No. 5, May 2013, pp. 1160-1179.
IEEE DOI 1305
Survey, Speech. BibRef

Li, W.F.[Wei-Feng], Zhou, Y.C.[Yi-Cong], Poh, N., Zhou, F.[Fei], Liao, Q.M.[Qing-Min],
Feature Denoising Using Joint Sparse Representation for In-Car Speech Recognition,
SPLetters(20), No. 7, 2013, pp. 681-684.
IEEE DOI cepstral analysis 1307
BibRef

Hermansky, H., Cohen, J.R., Stern, R.M.,
Perceptual Properties of Current Speech Recognition Technology,
PIEEE(101), No. 9, 2013, pp. 1968-1985.
IEEE DOI 1309
Auditory system BibRef

Kolossa, D., Zeiler, S., Saeidi, R., Astudillo, R.F.[R. Fernandez],
Noise-Adaptive LDA: A New Approach for Speech Recognition Under Observation Uncertainty,
SPLetters(20), No. 11, 2013, pp. 1018-1021.
IEEE DOI 1310
speech recognition BibRef

Saeidi, R., Astudillo, R.F., Kolossa, D.,
Uncertain LDA: Including Observation Uncertainties in Discriminative Transforms,
PAMI(38), No. 7, July 2016, pp. 1479-1488.
IEEE DOI 1606
Estimation BibRef

Cho, J.W., Park, H.M.,
An Efficient HMM-Based Feature Enhancement Method With Filter Estimation for Reverberant Speech Recognition,
SPLetters(20), No. 12, 2013, pp. 1199-1202.
IEEE DOI 1311
Bayes methods BibRef

Lee, L.M.[Lee-Min], Jean, F.R.,
Adaptation of Hidden Markov Models for Recognizing Speech of Reduced Frame Rate,
Cyber(43), No. 6, 2013, pp. 2114-2121.
IEEE DOI 1312
hidden Markov models BibRef

Kim, K.T.[Kyung-Tae], Lin, K.H.[Kai-Hsiang], Walther, D.B.[Dirk B.], Hasegawa-Johnson, M.A.[Mark A.], Huang, T.S.[Tomas S.],
Automatic detection of auditory salience with optimized linear filters derived from human annotation,
PRL(38), No. 1, 2014, pp. 78-85.
Elsevier DOI 1402
Auditory salience BibRef

Huang, X.D.[Xue-Dong], Baker, J.[James], Reddy, R.[Raj],
A Historical Perspective of Speech Recognition,
CACM(57), No. 1, January 2014, pp. 94-103.
DOI Link 1402
Survey, Speech Recognition. What do we know now that we did not know 40 years ago? BibRef

Shi, Y.Z.[Yong-Zhe], Zhang, W.Q.[Wei-Qiang], Cai, M.[Meng], Liu, J.[Jia],
Efficient One-Pass Decoding with NNLM for Speech Recognition,
SPLetters(21), No. 4, April 2014, pp. 377-381.
IEEE DOI 1403
decoding BibRef

Zhang, W.B.[Wei-Bin], Fung, P.,
Efficient Sparse Banded Acoustic Models for Speech Recognition,
SPLetters(21), No. 3, March 2014, pp. 280-283.
IEEE DOI 1403
covariance matrices BibRef

Triefenbach, F., Demuynck, K., Martens, J.P.,
Large Vocabulary Continuous Speech Recognition With Reservoir-Based Acoustic Models,
SPLetters(21), No. 3, March 2014, pp. 311-315.
IEEE DOI 1403
error statistics BibRef

Diez, M.[Mireia], Varona, A.[Amparo], Penagarikano, M.[Mikel], Rodriguez-Fuentes, L.J.[Luis Javier], Bordel, G.[German],
On the Complementarity of Phone Posterior Probabilities for Improved Speaker Recognition,
SPLetters(21), No. 6, June 2014, pp. 649-652.
IEEE DOI 1404
BibRef
Earlier: A1, A3, A2, A4, A5:
On the Use of Dot Scoring for Speaker Diarization,
IbPRIA11(612-619).
Springer DOI 1106
audio databases BibRef

Räsänen, O.[Okko], Laine, U.K.[Unto K.],
A method for noise-robust context-aware pattern discovery and recognition from categorical sequences,
PR(45), No. 1, 2012, pp. 606-616.
Elsevier DOI 1410
Speech recognition BibRef

Liu, N.H.[Ning-Han],
Effective Results Ranking for Mobile Query by Singing/Humming Using a Hybrid Recommendation Mechanism,
MultMed(16), No. 5, August 2014, pp. 1407-1420.
IEEE DOI 1410
audio signal processing BibRef

Schneiderman, R.,
Accuracy, Apps Advance Speech Recognition,
SPMag(32), No. 1, January 2015, pp. 12-125.
IEEE DOI 1502
Special Reports. Commercialization BibRef

Ban, S.M., Kim, H.S.,
Weight-Space Viterbi Decoding Based Spectral Subtraction for Reverberant Speech Recognition,
SPLetters(22), No. 9, September 2015, pp. 1424-1428.
IEEE DOI 1503
Decoding BibRef

Sakano, T.[Toshihiro], Kobayashi, Y.[Yosuke], Kondo, K.[Kazuhiro],
A Speech Intelligibility Estimation Method Using a Non-reference Feature Set,
IEICE(E98-D), No. 1, January 2015, pp. 21-28.
WWW Link. 1503
BibRef

Khaldi, K.[Kais], Boudraa, A.O.[Abdel-Ouahab], Torresani, B.[Bruno], Chonavel, T.[Thierry],
HHT-based audio coding,
SIViP(9), No. 1, January 2015, pp. 107-115.
Springer DOI 1503
BibRef

Savchenko, A.V.[Andrey V.], Savchenko, L.V.[Liudmila V.],
Towards the creation of reliable voice control system based on a fuzzy approach,
PRL(65), No. 1, 2015, pp. 145-151.
Elsevier DOI 1511
Signal processing BibRef

Suh, Y.J.[Young-Joo], Kim, H.[Hoirin],
Probabilistic Class Histogram Equalization Based on Posterior Mean Estimation for Robust Speech Recognition,
SPLetters(22), No. 12, December 2015, pp. 2421-2424.
IEEE DOI 1512
maximum likelihood estimation BibRef

Wang, X.Y.[Xiao-Yun], Yamamoto, S.[Seiichi],
Speech Recognition of English by Japanese Using Lexicon Represented by Multiple Reduced Phoneme Sets,
IEICE(E98-D), No. 12, December 2015, pp. 2271-2279.
WWW Link. 1601
BibRef

Tohidypour, H.R.[Hamid Reza], Banitalebi-Dehkordi, A.[Amin],
Speech frame recognition based on less shift sensitive wavelet filter banks,
SIViP(10), No. 4, April 2016, pp. 633-637.
WWW Link. 1604
BibRef

Chung, Y.J.[Yong-Joo],
Vector Taylor series based model adaptation using noisy speech trained hidden Markov models,
PRL(75), No. 1, 2016, pp. 36-40.
Elsevier DOI 1604
Noisy speech recognition BibRef

Ansari, J.A., Sathyamurthy, A., Balasubramanyam, R.,
An Open Voice Command Interface Kit,
HMS(46), No. 3, June 2016, pp. 467-473.
IEEE DOI 1605
Hardware BibRef

Cho, B.J., Kwon, H., Cho, J.W., Kim, C., Stern, R.M., Park, H.M.,
A Subband-Based Stationary-Component Suppression Method Using Harmonics and Power Ratio for Reverberant Speech Recognition,
SPLetters(23), No. 6, June 2016, pp. 780-784.
IEEE DOI 1606
maximum likelihood estimation BibRef

Ren, H., Yan, Y.,
Structural Optimization and Online Evolutionary Learning for Spoken Dialog Management,
SPLetters(23), No. 7, July 2016, pp. 1013-1017.
IEEE DOI 1608
Monte Carlo methods BibRef

Khoubrouy, S.A., Hansen, J.H.L.,
Microphone Array Processing Strategies for Distant-Based Automatic Speech Recognition,
SPLetters(23), No. 10, October 2016, pp. 1344-1348.
IEEE DOI 1610
microphone arrays BibRef

Lamberti, F., Manuri, F., Paravati, G., Piumatti, G., Sanna, A.,
Using Semantics to Automatically Generate Speech Interfaces for Wearable Virtual and Augmented Reality Applications,
HMS(47), No. 1, February 2017, pp. 152-164.
IEEE DOI 1702
augmented reality BibRef

Ganapathy, S.,
Multivariate Autoregressive Spectrogram Modeling for Noisy Speech Recognition,
SPLetters(24), No. 9, September 2017, pp. 1373-1377.
IEEE DOI 1708
Discrete cosine transforms, Estimation, Feature extraction, Noise measurement, Spectrogram, Speech, Speech recognition, Feature extraction, Riesz envelopes, multivariate autoregressive (MAR) models, speech, recognition BibRef

Shahnawazuddin, S., Adiga, N., Kathania, H.K.,
Effect of Prosody Modification on Children's ASR,
SPLetters(24), No. 11, November 2017, pp. 1749-1753.
IEEE DOI 1710
Hidden Markov models, Mel frequency cepstral coefficient, Speech, Speech recognition, Training, Acoustic mismatch, pitch-adaptive features, prosody modification, speech recognition, zero-frequency, filter BibRef

Monroe, D.[Don],
Digital Hearing,
CACM(60), No. 10, October 2017, pp. 18-20.
DOI Link 1710
BibRef

Kim, J., Hahn, M.,
Voice Activity Detection Using an Adaptive Context Attention Model,
SPLetters(25), No. 8, August 2018, pp. 1181-1185.
IEEE DOI 1808
speech recognition, adaptive context attention model, voice activity detection, speech-related applications, voice activity detection (VAD) BibRef

Edwards, J.,
Something to Talk About: Signal Processing in Speech and Audiology Research: Promising Investigations Explore New Opportunities in Human Communication,
SPMag(35), No. 6, November 2018, pp. 8-12.
IEEE DOI 1812
[Special Reports]. Mice, Research and development, Microphones, Acoustics, Time-frequency analysis, Auditory system BibRef

Baltrušaitis, T.[Tadas], Ahuja, C., Morency, L.P.[Louis-Philippe],
Multimodal Machine Learning: A Survey and Taxonomy,
PAMI(41), No. 2, February 2019, pp. 423-443.
IEEE DOI 1901
Speech recognition, Visualization, Media, Speech, Multimedia communication, Streaming media, Hidden Markov models, survey BibRef

Shin, Y., Yoo, K.M., Lee, S.,
Utterance Generation With Variational Auto-Encoder for Slot Filling in Spoken Language Understanding,
SPLetters(26), No. 3, March 2019, pp. 505-509.
IEEE DOI 1903
learning (artificial intelligence), natural language processing, speech processing, travel industry, slot filling BibRef

Yang, B.H.[Bo-Hong], Yao, Z.P.[Ze-Ping], Lu, H.[Hong], Zhou, Y.Q.[Ya-Qian], Xu, J.K.[Jin-Kai],
In-classroom learning analytics based on student behavior, topic and teaching characteristic mining,
PRL(129), 2020, pp. 224-231.
Elsevier DOI 2001
Student behavior analysis, Topic modeling, Audio analysis, Sequential mining BibRef

Chandrakala, S., Jayalakshmi, S.L.,
Generative Model Driven Representation Learning in a Hybrid Framework for Environmental Audio Scene and Sound Event Recognition,
MultMed(22), No. 1, January 2020, pp. 3-14.
IEEE DOI 2001
Sound event recognition, environmental audio scene recognition, audio surveillance, adapted Gaussian mixture model BibRef

Yadav, I.C., Pradhan, G.,
Significance of Pitch-Based Spectral Normalization for Children's Speech Recognition,
SPLetters(26), No. 12, December 2019, pp. 1822-1826.
IEEE DOI 2001
acoustic correlation, feature extraction, fuzzy set theory, speech recognition, pitch-based spectral normalization, DLSTM BibRef

Shahnawazuddin, S., Adiga, N.[Nagaraj], Kathania, H.K.[Hemant Kumar], Sai, B.T.[B. Tarun],
Creating speaker independent ASR system through prosody modification based data augmentation,
PRL(131), 2020, pp. 213-218.
Elsevier DOI 2004
BibRef

Park, T.J., Han, K.J., Kumar, M., Narayanan, S.,
Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap,
SPLetters(27), 2020, pp. 381-385.
IEEE DOI 2004
Auto-Tuning, spectral clustering, Eigengap heuristic, speaker diarization BibRef

Deb, S., Dandapat, S., Krajewski, J.,
Analysis and Classification of Cold Speech Using Variational Mode Decomposition,
AffCom(11), No. 2, April 2020, pp. 296-307.
IEEE DOI 2006
Speech, Databases, Pathology, Speech recognition, Feature extraction, Nose, Mel frequency cepstral coefficient, Cold speech, SVM classifier BibRef

Sánchez-Junquera, J.[Javier], Villaseńor-Pineda, L.[Luis], Montes-y-Gómez, M.[Manuel], Rosso, P.[Paolo], Stamatatos, E.[Efstathios],
Masking domain-specific information for cross-domain deception detection,
PRL(135), 2020, pp. 122-130.
Elsevier DOI 2006
Deception detection, Domain adaptation, Masking information BibRef

Rill-García, R.[Rodrigo], Villaseńor-Pineda, L.[Luis], Reyes-Meza, V.[Verónica], Escalante, H.J.[Hugo Jair],
From Text to Speech: A Multimodal Cross-Domain Approach for Deception Detection,
MIPPSNA18(164-177).
Springer DOI 1901
BibRef

Lim, H., Kim, Y., Kim, H.,
Cross-Informed Domain Adversarial Training for Noise-Robust Wake-Up Word Detection,
SPLetters(27), 2020, pp. 1769-1773.
IEEE DOI 2010
Training, Noise robustness, Encoding, Optimization, Training data, Domain adversarial training, noise robustness, wake-up word detection BibRef

Zhao, L.[Ling], Zhang, A.[Ailian], Liu, Y.[Ying], Fei, H.[Hao],
Encoding multi-granularity structural information for joint Chinese word segmentation and POS tagging,
PRL(138), 2020, pp. 163-169.
Elsevier DOI 2010
Chinese word segmentation, POS tagging, Joint model, Lattice model, Graph model BibRef

Hsiao, R., Can, D., Ng, T., Travadi, R., Ghoshal, A.,
Online Automatic Speech Recognition With Listen, Attend and Spell Model,
SPLetters(27), 2020, pp. 1889-1893.
IEEE DOI 2011
Hidden Markov models, Decoding, Training, Earth Observing System, Computational modeling, Acoustics, Automatic speech recognition, online recognition BibRef

Bang, J.[Jeesoo], Han, S.[Sangdo], Lee, J.H.[Jong-Hyeok],
Listening-oriented response generation by exploiting user responses,
PRL(140), 2020, pp. 230-237.
Elsevier DOI 2012
Natural language processing, Dialogue system, Response generation, Listening-oriented dialogue, Affective computing BibRef

Zhou, J.T.Y.[Joey Tian-Yi], Zhang, H.[Hao], Jin, D.[Di], Peng, X.[Xi],
Dual Adversarial Transfer for Sequence Labeling,
PAMI(43), No. 2, February 2021, pp. 434-446.
IEEE DOI 2101
Labeling, Task analysis, Training, Feature extraction, Tagging, Natural language processing, adversarial training BibRef

Qiu, J.Y.[Jia-Yan], Wang, X.C.[Xin-Chao], Fua, P.[Pascal], Tao, D.C.[Da-Cheng],
Matching Seqlets: An Unsupervised Approach for Locality Preserving Sequence Matching,
PAMI(43), No. 2, February 2021, pp. 745-752.
IEEE DOI 2101
Hidden Markov models, Task analysis, Annotations, Pattern matching, Speech recognition, Optimization, Coherence, Sequence matching, joint optimization BibRef

Chen, N., Watanabe, S., Villalba, J., Zelasko, P., Dehak, N.,
Non-Autoregressive Transformer for Speech Recognition,
SPLetters(28), 2021, pp. 121-125.
IEEE DOI 2101
Training, Computational modeling, Speech recognition, Mathematical model, Predictive models, Iterative decoding, History, non-autoregressive BibRef

Haeb-Umbach, R., Heymann, J., Drude, L., Watanabe, S., Delcroix, M., Nakatani, T.,
Far-Field Automatic Speech Recognition,
PIEEE(109), No. 2, February 2021, pp. 124-148.
IEEE DOI 2101
Speech recognition, Microphones, Speech enhancement, Reverberation, Robustness, Array signal processing, Acoustic systems, speech enhancement BibRef

Fritsch, J., Magimai-Doss, M.,
Utterance Verification-Based Dysarthric Speech Intelligibility Assessment Using Phonetic Posterior Features,
SPLetters(28), 2021, pp. 224-228.
IEEE DOI 2102
Databases, Phonetics, Correlation, Testing, Speech coding, Estimation, Aerospace electronics, Dysarthric speech, utterance verification BibRef

Lu, L.[Liang], Kanda, N.[Naoyuki], Li, J.[Jinyu], Gong, Y.F.[Yi-Fan],
Streaming End-to-End Multi-Talker Speech Recognition,
SPLetters(28), 2021, pp. 803-807.
IEEE DOI 2105
Speech recognition, Training, Heating systems, Computational modeling, Transducers, Delays, Shape, heuristic error assignment training BibRef

Yi, C.[Cheng], Zhou, S.Y.[Shi-Yu], Xu, B.[Bo],
Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-Resource Speech Recognition,
SPLetters(28), 2021, pp. 788-792.
IEEE DOI 2105
Acoustics, Bit error rate, Linguistics, Task analysis, Training, Decoding, Data models, BERT, end-to-end modeling, low-resource ASR, wav2vec BibRef

Xu, P.[Peng], Huang, Y.[Yongye], Yuan, T.[Tongtong], Xiang, T.[Tao], Hospedales, T.M.[Timothy M.], Song, Y.Z.[Yi-Zhe], Wang, L.[Liang],
On Learning Semantic Representations for Large-Scale Abstract Sketches,
CirSysVideo(31), No. 9, September 2021, pp. 3366-3379.
IEEE DOI 2109
Semantics, Visualization, Task analysis, Games, Feature extraction, Quantization (signal), Speech recognition, edge-map dataset BibRef

Kim, J.[Juntae], Lee, Y.[Yoonhan],
Improving End-to-End Contextual Speech Recognition via a Word-Matching Algorithm With Backward Search,
SPLetters(28), 2021, pp. 2087-2091.
IEEE DOI 2112
Sugar, Phonetics, Decoding, Context modeling, Training, Signal processing algorithms, Tagging, Speech recognition, biasing, context BibRef

Zhu, S.[Shirong], Zhang, Y.[Ying], He, K.[Kai], Zhao, L.[Lasheng],
Acoustic Word Embedding Based on Multi-Head Attention Quadruplet Network,
SPLetters(29), 2022, pp. 184-188.
IEEE DOI 2202
Acoustics, Training, Vocabulary, Linear programming, Task analysis, Speech recognition, Phonetics, Acoustic word embedding, attention mechanism BibRef

Tiwari, R.[Rajdev], Sharma, V.[Vidha], Sahoo, R.C.[Ramesh Chandra],
Isolated spoken word recognition using packed-MFCC on padded-voice signal for unscripted languages,
IJCVR(12), No. 2, 2022, pp. 120-140.
DOI Link 2203
BibRef

Tian, Z.K.[Zheng-Kun], Yi, J.[Jiangyan], Tao, J.H.[Jian-Hua], Zhang, S.[Shuai], Wen, Z.Q.[Zheng-Qi],
Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition,
SPLetters(29), 2022, pp. 762-766.
IEEE DOI 2204
Decoding, Transformers, Acoustics, Predictive models, Training, Speech recognition, Linguistics, Autoregressive BibRef

Xiao, F.Y.[Fei-Yang], Guan, J.[Jian], Lan, H.Y.[Hai-Yan], Zhu, Q.[Qiaoxi], Wang, W.W.[Wen-Wu],
Local Information Assisted Attention-Free Decoder for Audio Captioning,
SPLetters(29), 2022, pp. 1604-1608.
IEEE DOI 2208
Decoding, Feature extraction, Wind forecasting, Interference, Convolution, Transformers, Task analysis, attention-free transformer BibRef

de Souza, D.B.[Douglas Baptista], Bakri, K.J.[Khaled Jamal], de Souza Ferreira, F.[Fernanda], Inacio, J.[Juliana],
Multitaper-Mel Spectrograms for Keyword Spotting,
SPLetters(29), 2022, pp. 2028-2032.
IEEE DOI 2210
Spectrogram, Hidden Markov models, Feature extraction, Speech recognition, Internet, Computational modeling, Training, mel spectrograms BibRef

Perochon, S.[Sam],
A Presentation and Short Discussion of rVAD-fast, a Fast Voice Activity Detector,
IPOL(12), 2022, pp. 404-419.
DOI Link 2210
BibRef

Huang, H.J.[Hao-Jing], Huang, P.J.[Pei-Jie], Zhu, Z.B.[Zhan-Biao], Li, J.[Jia], Lin, P.[Piyuan],
CLID: A Chunk-Level Intent Detection Framework for Multiple Intent Spoken Language Understanding,
SPLetters(29), 2022, pp. 2123-2127.
IEEE DOI 2211
Filling, Task analysis, Semantics, Decoding, Training, Predictive models, Testing, Chunk-level, intent detection, spoken language understanding BibRef

Du, X.[Xia], Pun, C.M.[Chi-Man],
Robust Audio Patch Attacks Using Physical Sample Simulation and Adversarial Patch Noise Generation,
MultMed(24), 2022, pp. 4381-4393.
IEEE DOI 2212
Perturbation methods, Speech recognition, Robustness, Signal to noise ratio, Training, Detectors, ensemble method BibRef

Kim, H.[Hoki], Park, J.[Jinseong], Lee, J.W.[Jae-Wook],
Generating Transferable Adversarial Examples for Speech Classification,
PR(137), 2023, pp. 109286.
Elsevier DOI 2302
Speech classification, Adversarial attack, Transferability BibRef

Wei, G.Y.[Guang-Yong], Duan, Z.K.[Zhi-Kui], Li, S.[Shiren], Yu, X.M.[Xin-Mei], Yang, G.G.[Guang-Guang],
LFEformer: Local Feature Enhancement Using Sliding Window With Deformability for Automatic Speech Recognition,
SPLetters(30), 2023, pp. 180-184.
IEEE DOI 2303
Feature extraction, Transformers, Decoding, Mathematical models, Data mining, Acoustics, Data preprocessing, Speech Recognition, Local Feature BibRef

Xiao, F.Y.[Fei-Yang], Guan, J.[Jian], Zhu, Q.[Qiaoxi], Wang, W.W.[Wen-Wu],
Graph Attention for Automated Audio Captioning,
SPLetters(30), 2023, pp. 413-417.
IEEE DOI 2305
Feature extraction, Decoding, Transformers, Semantics, Acoustics, Noise measurement, Matrix converters, Audio modelling, temporal information BibRef

Chang, C.M.[Chun-Min], Lee, C.C.[Chi-Chun],
Learning Enhanced Acoustic Latent Representation for Small Scale Affective Corpus with Adversarial Cross Corpora Integration,
AffCom(14), No. 2, April 2023, pp. 1308-1321.
IEEE DOI 2306
Databases, Emotion recognition, Acoustics, Training, Speech recognition, Transfer learning, Task analysis, cross corpus learning BibRef

Qu, H.L.[Hong-Lin], Su, X.D.[Xiang-Dong], Wang, Y.[Yonghe], Hao, X.[Xiang], Gao, G.L.[Guang-Lai],
Noise-Separated Adaptive Feature Distillation for Robust Speech Recognition,
SPLetters(30), 2023, pp. 763-767.
IEEE DOI 2307
Speech recognition, Noise measurement, Adaptation models, Task analysis, Training, Propagation losses, Knowledge transfer, speech recognition BibRef

Nga, C.H.[Cao Hong], Vu, D.Q.[Duc-Quang], Luong, H.H.[Huong Hoang], Huang, C.L.[Chien-Lin], Wang, J.C.[Jia-Ching],
Cyclic Transfer Learning for Mandarin-English Code-Switching Speech Recognition,
SPLetters(30), 2023, pp. 1387-1391.
IEEE DOI 2310
BibRef

Dong, F.[Fang], Qian, Y.Y.[Yi-Yang], Wang, T.L.[Tian-Lei], Liu, P.[Peng], Cao, J.W.[Jiu-Wen],
A Transformer-Based End-to-End Automatic Speech Recognition Algorithm,
SPLetters(30), 2023, pp. 1592-1596.
IEEE DOI 2311
BibRef

Fan, P.[Peng], Shan, C.[Changhao], Sun, S.[Sining], Yang, Q.[Qing], Zhang, J.W.[Jian-Wei],
Key Frame Mechanism for Efficient Conformer Based End-to-End Speech Recognition,
SPLetters(30), 2023, pp. 1612-1616.
IEEE DOI 2311
BibRef

Mahmoudi, H.[Homeyra], Camboim, S.[Silvana], Brovelli, M.A.[Maria Antonia],
Development of a Voice Virtual Assistant for the Geospatial Data Visualization Application on the Web,
IJGI(12), No. 11, 2023, pp. xx-yy.
DOI Link 2312
BibRef

Vitolo, P.[Paola], Liguori, R.[Rosalba], di Benedetto, L.[Luigi], Rubino, A.[Alfredo], Licciardo, G.D.[Gian Domenico],
Automatic Audio Feature Extraction for Keyword Spotting,
SPLetters(31), 2024, pp. 161-165.
IEEE DOI 2401
BibRef

Li, J.[Junhua], Duan, Z.K.[Zhi-Kui], Li, S.[Shiren], Yu, X.[Xinmei], Yang, G.[Guangguang],
ESAformer: Enhanced Self-Attention for Automatic Speech Recognition,
SPLetters(31), 2024, pp. 471-475.
IEEE DOI 2402
Feature extraction, Transformers, Convolution, Logic gates, Testing, Tensors, Training, Speech recognition, transformer, multi-order interaction BibRef

Nie, W.Z.[Wei-Zhi], Bao, Y.[Yuru], Zhao, Y.[Yue], Liu, A.[Anan],
Long Dialogue Emotion Detection Based on Commonsense Knowledge Graph Guidance,
MultMed(26), 2024, pp. 514-528.
IEEE DOI 2402
Emotion recognition, Commonsense reasoning, Oral communication, Correlation, Transformers, Speech recognition, topic module BibRef

Sun, T.L.[Tian-Li], Chen, H.N.[Hao-Nan], Hu, G.S.[Guo-Sheng], He, L.H.[Liang-Hua], Zhao, C.R.[Cai-Rong],
Explainability of Speech Recognition Transformers via Gradient-Based Attention Visualization,
MultMed(26), 2024, pp. 1395-1406.
IEEE DOI 2402
Transformers, Analytical models, Visualization, Predictive models, Data models, Computational modeling, Training, Explainability, attention visualization BibRef

Jacobs, C.[Christiaan], Kamper, H.[Herman],
Leveraging Multilingual Transfer for Unsupervised Semantic Acoustic Word Embeddings,
SPLetters(31), 2024, pp. 311-315.
IEEE DOI 2402
Semantics, Phonetics, Training, Data models, Task analysis, Acoustics, Decoding, Acoustic word embeddings, query-by-example search, semantic retrieval BibRef

Wang, F.Y.[Fang-Yuan], Xu, B.[Bo], Xu, B.[Bo],
SSCFormer: Push the Limit of Chunk-Wise Conformer for Streaming ASR Using Sequentially Sampled Chunks and Chunked Causal Convolution,
SPLetters(31), 2024, pp. 421-425.
IEEE DOI 2402
Convolution, Complexity theory, Computational modeling, Decoding, Training, Kernel, Transformers, Conformer, streaming ASR, linear complexity BibRef

Fan, R.[Ruchao], Shankar, N.B.[Natarajan Balaji], Alwan, A.[Abeer],
UniEnc-CASSNAT: An Encoder-Only Non-Autoregressive ASR for Speech SSL Models,
SPLetters(31), 2024, pp. 711-715.
IEEE DOI 2403
Decoding, Feature extraction, Acoustics, Iterative decoding, Transformers, Training, Task analysis, Non-autoregressive ASR, speech foundation model BibRef


Ng, H.W.[Han Wei], Guan, C.T.[Cun-Tai],
Efficient Representation Learning for Inner Speech Domain Generalization,
CAIP23(I:131-141).
Springer DOI 2312
BibRef

Oneata, D.[Dan], Cucu, H.[Horia],
Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations,
MULA22(4578-4587)
IEEE DOI 2210
Training, Couplings, Visualization, Image recognition, Keyword search, Speech recognition, Machine learning BibRef

Tapia, L.S.[Luis Sanchez], Gomez, A.[Antonio], Esparza, M.[Mario], Jatla, V.[Venkatesh], Pattichis, M.[Marios], Celedón-Pattichis, S.[Sylvia], López Leiva, C.[Carlos],
Bilingual Speech Recognition by Estimating Speaker Geometry from Video Data,
CAIP21(I:79-89).
Springer DOI 2112
BibRef

Qiao, F.C.[Feng-Chun], Peng, X.[Xi],
Uncertainty-guided Model Generalization to Unseen Domains,
CVPR21(6786-6796)
IEEE DOI 2111
Training, Image segmentation, Uncertainty, Perturbation methods, Text categorization, Semantics, Speech recognition BibRef

Ngantcha, P.[Patricia], Amith, M.[Muhammad], Tao, C.[Cui], Roberts, K.[Kirk],
Patient-Provider Communication Training Models for Interactive Speech Devices,
DHM21(I:250-268).
Springer DOI 2108
BibRef

Wu, Y.C.[Yi-Chieh], Liao, W.H.[Wen-Hung],
Toward Text-independent Cross-lingual Speaker Recognition Using English-Mandarin-Taiwanese Dataset,
ICPR21(8515-8522)
IEEE DOI 2105
Sociology, Speech recognition, Data collection, Acoustics, Data models, Speaker recognition, Speaker recognition, Cross-lingual dataset BibRef

Chen, Y.[Yangbin], Ma, Y.[Yun], Ko, T.[Tom], Wang, J.P.[Jian-Ping], Li, Q.[Qing],
MetaMix: Improved Meta-Learning with Interpolation-based Consistency Regularization,
ICPR21(407-414)
IEEE DOI 2105
Training, Adaptation models, Training data, Speech recognition, Classification algorithms, Task analysis BibRef

Zhou, L.X.[Li-Xia], Zhang, J.[Jun],
From Bottom to Top: A Coordinated Feature Representation Method for Speech Recognition,
MMDLCA20(396-403).
Springer DOI 2103
BibRef

Zhao, J., Parry, C.J., dos Anjos, R., Anslow, C., Rhee, T.,
Voice Interaction for Augmented Reality Navigation Interfaces with Natural Language Understanding,
IVCNZ20(1-6)
IEEE DOI 2012
Productivity, Image recognition, Navigation, Natural languages, Human-robot interaction, Speech recognition, Augmented reality, intelligent interface BibRef

Ezzine, A., Satori, H., Hamidi, M., Satori, K.,
Moroccan Dialect Speech Recognition System Based on CMU SphinxTools,
ISCV20(1-5)
IEEE DOI 2011
feature extraction, Gaussian processes, hidden Markov models, natural language processing, speaker recognition, Artificial intelligence BibRef

ABAKARIM, F., ABENAOU, A.,
Amazigh isolated word speech recognition system using the Adaptive Orthogonal Transform Method.,
ISCV20(1-6)
IEEE DOI 2011
discrete wavelet transforms, feature extraction, principal component analysis, speech recognition, voice signals, DWT BibRef

Pérez, A.F., Sanguineti, V., Morerio, P., Murino, V.,
Audio-Visual Model Distillation Using Acoustic Images,
WACV20(2843-2852)
IEEE DOI 2006
Acoustics, Visualization, Data models, Training, Microphones, Machine learning, Synchronization BibRef

Tapu, R., Mocanu, B., Zaharia, T.,
Dynamic Subtitles: A Multimodal Video Accessibility Enhancement Dedicated to Deaf and Hearing Impaired Users,
ACVR19(2558-2566)
IEEE DOI 2004
audio signal processing, feature extraction, handicapped aids, hearing, speaker recognition, video signal processing, deaf users, active speaker detection BibRef

Roberto, A.[Antonio], Saggese, A.[Alessia], Vento, M.[Mario],
A Challenging Voice Dataset for Robotic Applications in Noisy Environments,
CAIP19(II:354-364).
Springer DOI 1909
BibRef

Naszádi, K.[Kata], Oualil, Y.[Youssef], Klakow, D.[Dietrich],
Image-Sensitive Language Modeling for Automatic Speech Recognition,
VL18(IV:173-179).
Springer DOI 1905
BibRef

Gauvain, J.[Jodie], Lamel, L.[Lori], Le, V.B.[Viet Bac], Despres, J.[Julien], Gauvain, J.L.[Jean-Luc], Messaoudi, A.[Abdel], Vieru, B.[Bianca], Ben Kheder, W.[Waad],
Challenges in Audio Processing of Terrorist-Related Data,
MMMod19(II:80-92).
Springer DOI 1901
BibRef

Jorrín, J.[Jesús], Buera, L.[Luis],
DANTE Speaker Recognition Module. An Efficient and Robust Automatic Speaker Searching Solution for Terrorism-Related Scenarios,
MMMod19(I:704-715).
Springer DOI 1901
BibRef

Galanopoulos, D.[Damianos], Mezaris, V.[Vasileios],
Temporal Lecture Video Fragmentation Using Word Embeddings,
MMMod19(II:254-265).
Springer DOI 1901
BibRef

Shahin, M., Ji, J.X., Ahmed, B.,
One-Class SVMs Based Pronunciation Verification Approach,
ICPR18(2881-2886)
IEEE DOI 1812
Feature extraction, Hidden Markov models, Training, Support vector machines, Error analysis, Lattices, Acoustics, speech attributes BibRef

Mukherjee, H., Obaidullah, S.M., Phadikar, S., Roy, K.,
A Dravidian Language Identification System,
ICPR18(2654-2657)
IEEE DOI 1812
Feature extraction, Speech recognition, Videos, Databases, NIST, Language Identification, Dravidian Language, LSP-G, FURIA BibRef

Galiotou, E.[Eleni], Karanikolas, N.[Nikitas], Ralli, A.[Angela],
Preservation and Management of Greek Dialectal Data,
EuroMed18(I:752-761).
Springer DOI 1811
Text and oral, dialects. BibRef

Li, R., Yu, J.,
Multimodal 3D visible articulation system for syllable based Mandarin Chinese training,
VCIP17(1-4)
IEEE DOI 1804
computer animation, computer based training, data visualisation, linguistics, mean square error methods, speech processing, multimodal human-computer interface BibRef

Le, N., Odobez, J.M.,
Improving Speaker Turn Embedding by Crossmodal Transfer Learning from Face Embedding,
CVAVM17(428-437)
IEEE DOI 1802
Acoustics, Face, Speech, Speech recognition, TV, Training BibRef

Arandjelovic, R.[Relja], Zisserman, A.[Andrew],
Look, Listen and Learn,
ICCV17(609-617)
IEEE DOI 1802
Audio-visual. learning (artificial intelligence), object recognition, video signal processing, audio networks, audio representations, Visualization BibRef

Muniandy, T.[Thagirarani], Alvar, T.A.[Thamilvaani Arvaree], Boon, C.J.[Chong Jiang],
Mandarin Language Learning System for Nasal Voice User,
IVIC17(376-388).
Springer DOI 1711
BibRef

Madhavi, M.C.[Maulik C.], Patil, H.A.[Hemant A.], Bhendawade, N.[Nikhil],
Spoken Keyword Retrieval Using Source and System Features,
PReMI17(333-341).
Springer DOI 1711
BibRef

Addarrazi, I., Satori, H., Satori, K.,
Amazigh audiovisual speech recognition system design,
ISCV17(1-5)
IEEE DOI 1710
Face, Feature extraction, Hidden Markov models, Lips, Mouth, Speech recognition, Visualization, Audio-visual recognition, Automatic Speech Recognition, HMM, lip, reading BibRef

Wu, C., Ng, R.W.M., Torralba, O.S., Hain, T.,
Analysing acoustic model changes for active learning in automatic speech recognition,
WSSIP17(1-5)
IEEE DOI 1707
Acoustics, Adaptation models, Analytical models, Computational modeling, Data models, Hidden Markov models, Measurement, Active learning, confidence measures, data selection, speaker, adaptation BibRef

Kacprzak, S.,
Spoken language clustering in the i-vectors space,
WSSIP17(1-5)
IEEE DOI 1707
Clustering algorithms, Data visualization, Impurities, NIST, Speech, Training, Training data, i-vectors, language clustering, language, recognition BibRef

Pironkov, G., Dupont, S., Dutoit, T.,
Speaker-aware Multi-Task Learning for automatic speech recognition,
ICPR16(2900-2905)
IEEE DOI 1705
Acoustics, Automatic speech recognition, Feature extraction, Machine learning, Speech, Training BibRef

Zhao, Y., Zhao, R.[Rui], Wang, X.Y.[Xiao-Yang], Ji, Q.,
Multilingual articulatory features augmentation learning,
ICPR16(2895-2899)
IEEE DOI 1705
Dictionaries, Encoding, Feature extraction, Mel frequency cepstral coefficient, Semantics, Speech, Speech recognition, latent attribute learning, multilingual articulatory features, phone recognition, sparse coding, speech, attributes BibRef

Ogawa, T., Mallidi, S.H., Dupoux, E., Cohen, J., Feldman, N.H., Hermansky, H.,
A new efficient measure for accuracy prediction and its application to multistream-based unsupervised adaptation,
ICPR16(2222-2227)
IEEE DOI 1705
Estimation, Monitoring, Noise measurement, Reliability, Speech, Time measurement, Training BibRef

Mzah, Y., Ahfir, M., Jaidane, M.,
Late pre-dereverberation for speech intelligibility enhancement in public address systems,
ISIVC16(291-296)
IEEE DOI 1704
Position measurement BibRef

Montalvo, A.[Ana], Calvo, J.R.[José Ramón],
Discriminative Capacity and Phonetic Information of Bottleneck Features in Speech,
CIARP16(134-141).
Springer DOI 1703
BibRef

Asadullah, Shaukat, A., Ali, H., Akram, U.,
Automatic Urdu Speech Recognition using Hidden Markov Model,
ICIVC16(135-139)
IEEE DOI 1610
cepstral analysis BibRef

Ondáš, S., Juhár, J.,
Towards human-machine dialog in Slovak,
WSSIP16(1-4)
IEEE DOI 1608
hidden Markov models BibRef

Conka, D., Viszlay, P., Juhár, J.,
Fuzzy clustering in HMM-based triphone classes of 2DLDA in Slovak LVCSR,
WSSIP16(1-4)
IEEE DOI 1608
fuzzy set theory BibRef

Kacur, J., Kozicka, R., Vargic, R.,
Semi-tight covariance matrices implementation in MASPER HMM training procedure,
WSSIP16(1-4)
IEEE DOI 1608
covariance matrices BibRef

Kacur, J., Trnovsky, T., Vargic, R.,
Discriminative training of HMM using MASPER procedure,
WSSIP15(93-96)
IEEE DOI 1603
hidden Markov models BibRef

Calvo, M.[Marcos], Hurtado, L.F.[Lluís F.], García, F.[Fernando], Sanchis, E.[Emilio],
Combining Several ASR Outputs in a Graph-Based SLU System,
CIARP15(551-558).
Springer DOI 1511
speech BibRef

Rohrbach, A.[Anna], Rohrbach, M.[Marcus], Schiele, B.[Bernt],
The Long-Short Story of Movie Description,
GCPR15(209-221).
Springer DOI 1511
Award, GCPR, HM. BibRef

Rohrbach, A.[Anna], Rohrbach, M.[Marcus], Tandon, N.[Niket], Schiele, B.[Bernt],
A dataset for Movie Description,
CVPR15(3202-3212)
IEEE DOI 1510
BibRef

Zhao, H.Q.[Han-Qing], Qin, Z.C.[Zeng-Chang], Wang, Y.[Yiyu], Wang, Y.X.[Yu-Xiao],
A Bag-of-phonemes Model for Homeplace Classification of Mandarin Speakers,
IbPRIA15(683-690).
Springer DOI 1506
BibRef

Yakubu, M.A.[M. Abukari], Maddage, N.C.[Namunu C.], Atrey, P.K.[Pradeep K.],
Audio Secret Management Scheme Using Shamir's Secret Sharing,
MMMod15(I: 396-407).
Springer DOI 1501
BibRef

Bello, C.[Claudia], Ribas, D.[Dayana], Calvo, J.R.[José R.], Ferrer, C.A.[Carlos A.],
From Speech Quality Measures to Speaker Recognition Performance,
CIARP14(199-206).
Springer DOI 1411
BibRef

Oropeza-Rodríguez, J.L.[José Luis], Suárez-Guerra, S.[Sergio], Jiménez-Hernández, M.[Mario],
The Place Theory as an Alternative Solution in Automatic Speech Recognition Tasks,
CIARP14(167-174).
Springer DOI 1411
BibRef

Diez, M., Varona, A., Penagarikano, M., Rodriguez-Fuentes, L.J., Bordel, G.,
On the Projection of PLLRs for Unbounded Feature Distributions in Spoken Language Recognition,
SPLetters(21), No. 9, September 2014, pp. 1073-1077.
IEEE DOI 1406
Decoding BibRef

Diez, M.[Mireia], Varona, A.[Amparo], Penagarikano, M.[Mike], Rodriguez-Fuentes, L.J.[Luis Javier], Bordel, G.[German],
Optimizing PLLR Features for Spoken Language Recognition,
ICPR14(779-784)
IEEE DOI 1412
Acoustics BibRef

Missaoui, I.[Ibrahim], Lachiri, Z.[Zied],
Gabor Filterbank Features for Robust Speech Recognition,
ICISP14(665-671).
Springer DOI 1406
BibRef

Carletti, V.[Vincenzo], Foggia, P.[Pasquale], Percannella, G.[Gennaro], Saggese, A.[Alessia], Strisciuglio, N.[Nicola], Vento, M.[Mario],
Audio surveillance using a bag of aural words classifier,
AVSS13(81-86)
IEEE DOI 1311
BibRef

Hurtado, L.F.[Lluís F.], Calvo, M.[Marcos], Gómez, J.A.[Jon Ander], García, F.[Fernando], Sanchis, E.[Emilio],
A Phonetic-Based Approach to Query-by-Example Spoken Term Detection,
CIARP13(I:504-511).
Springer DOI 1311
BibRef

Chaloupka, J.[Josef], Nouza, J.[Jan], Kucharova, M.[Michaela],
Using Various Types of Multimedia Resources to Train System for Automatic Transcription of Czech Historical Oral Archives,
MM4CH13(228-237).
Springer DOI 1309
BibRef

Nouza, J.[Jan], Cerva, P.[Petr], Silovsky, J.[Jan],
Dealing with Bilingualism in Automatic Transcription of Historical Archive of Czech Radio,
MM4CH13(238-246).
Springer DOI 1309
BibRef

Chan, K.Y.[Kit Yan], Nordholm, S.E.[Sven E.], Yiu, C.K.F.[Cedric K.F.],
Multichannel filters for speech recognition using a particle swarm optimization,
ICARCV12(937-942).
IEEE DOI 1304
BibRef

Zhao, Y.[Yue], Xu, X.N.[Xiao-Na], Yang, G.S.[Guo-Sheng],
Unsupervised Tibetan speech features Learning based on Dynamic Bayesian Networks,
ICPR12(2319-2322).
WWW Link. 1302
BibRef

Nour-Eddine, L.[Lachachi], Abdelkader, A.[Adla],
Reduced Universal Background Model for Speech Recognition and Identification System,
MCPR12(303-312).
Springer DOI 1208
BibRef

Pérez Maldonado, Y.[Yara], Caballero Morales, S.O.[Santiago Omar], Cruz Ortega, R.O.[Roberto Omar],
GA Approaches to HMM Optimization for Automatic Speech Recognition,
MCPR12(313-322).
Springer DOI 1208
BibRef

Amrous, A.I.[Anissa Imen], Debyeche, M.[Mohamed],
Robust Arabic Multi-stream Speech Recognition System in Noisy Environment,
ICISP12(571-578).
Springer DOI 1208
BibRef

Touazi, A.[Azzedine], Debyeche, M.[Mohamed],
New Encoding Algorithm for Distributed Speech Recognition Based on DTFS Transform,
ICISP12(547-554).
Springer DOI 1208
BibRef

Im, J.H., Lee, S.Y.,
Unified Training of Feature Extractor and HMM Classifier for Speech Recognition,
SPLetters(19), No. 2, February 2012, pp. 111-114.
IEEE DOI 1201
BibRef

Ghigi, F.[Fabrizio], Tamarit, V.[Vicent], Martínez-Hinarejos, C.D.[Carlos D.], Benedí, J.M.[José-Miguel],
Active Learning for Dialogue Act Labelling,
IbPRIA11(652-659).
Springer DOI 1106
BibRef

Swietojanski, P.[Pawel], Wielgat, R.[Robert], Zielinski, T.[Tomasz],
Automatic Selection of Pareto-Optimal Topologies of Hidden Markov Models Using Multicriteria Evolutionary Algorithms,
EvoIASP11(224-233).
Springer DOI 1104
Applied to speech recognition. BibRef

Ravinder, K.[Kumar],
Comparison of HMM and DTW for Isolated Word Recognition System of Punjabi Language,
CIARP10(244-252).
Springer DOI 1011
BibRef

Meng, L.[Lu], Xiang, J.[Jing], Zhao, D.[Dazhe], Zhao, H.[Hong],
A New Application of MEG and DTI on Word Recognition,
ICPR10(2472-2475).
IEEE DOI 1008
BibRef

Duan, Q.S.[Quan-Sheng], Kang, S.Y.[Shi-Yin], Wu, Z.Y.[Zhi-Yong], Cai, L.H.[Lian-Hong], Shuang, Z.W.[Zhi-Wei], Qin, Y.[Yong],
Comparison of Syllable/Phone HMM Based Mandarin TTS,
ICPR10(4496-4499).
IEEE DOI 1008
BibRef

O'Gorman, L.[Lawrence],
Latency in Speech Feature Analysis for Telepresence Event Coding,
ICPR10(4464-4467).
IEEE DOI 1008
BibRef

Zhang, S.L.[Shi-Lei], Shi, Q.[Qin], Qin, Y.[Yong],
Modeling Syllable-Based Pronunciation Variation for Accented Mandarin Speech Recognition,
ICPR10(1606-1609).
IEEE DOI 1008
BibRef

Zhang, S.L.[Shi-Lei], Zhang, S.W.[Shu-Wu], Xu, B.[Bo],
A Two-level Method for Unsupervised Speaker-based Audio Segmentation,
ICPR06(IV: 298-301).
IEEE DOI 0609
BibRef

Krajewski, J.[Jarek], Batliner, A.[Anton], Kessel, S.[Silke],
Comparing Multiple Classifiers for Speech-Based Detection of Self-Confidence: A Pilot Study,
ICPR10(3716-3719).
IEEE DOI 1008
BibRef

Nolazco-Flores, J.A.[Juan A.], Aceves L., R.A.[Roberto A.], Garcia-Perera, L.P.[L. Paola],
Speech Magnitude-Spectrum Information-Entropy (MSIE) for Automatic Speech Recognition in Noisy Environments,
ICPR10(4364-4367).
IEEE DOI 1008
BibRef

Kelly, F.[Finnian], Harte, N.[Naomi],
Auditory Features Revisited for Robust Speech Recognition,
ICPR10(4456-4459).
IEEE DOI 1008
BibRef

Xie, Z.Q.[Zhao-Qiang], Miao, Z.J.[Zhen-Jiang],
Tone Recognition of Isolated Mandarin Syllables,
ICISP10(412-418).
Springer DOI 1006
BibRef

Alotaibi, Y.A.[Yousef Ajami], Alghamdi, M.[Mansour], Alotaiby, F.[Fahad],
Speech Recognition System of Arabic Alphabet Based on a Telephony Arabic Corpus,
ICISP10(122-129).
Springer DOI 1006
BibRef

Lu, G.[Gao], Yu, H.Z.[Hong-Zhi], Li, Y.H.[Yong-Hong], Zhang, R.S.[Rui-Shan],
Study on SAMPA_ST for Lhasa Tibetan and realization of automatic labelling system,
IASP10(133-137).
IEEE DOI 1004
BibRef

Chen, X.Y.[Xiao-Ying], Jin, H.M.[Hui-Min], Yu, H.Z.[Hong-Zhi],
Acoustic research on long and short vowels in Tibetan Lhasa dialect,
IASP10(561-564).
IEEE DOI 1004
BibRef

Sahu, V.P.[Ved Prakash], Mishra, H.K.[Harendra Kumar], Sekhar, C.C.[C. Chandra],
Variational Bayes Adapted GMM Based Models for Audio Clip Classification,
PReMI09(513-518).
Springer DOI 0912
BibRef

Kacur, J., Rozinaj, G.,
Adding Voicing Features into Speech Recognition Based on HMM in Slovak,
WSSIP09(1-4).
IEEE DOI 0906
BibRef

Verteletskaya, E., Sakhnov, K., Simak, B.,
Pitch Detection Algorithms and Voiced/Unvoiced Classification for Noisy Speech,
WSSIP09(1-5).
IEEE DOI 0906
BibRef

Vlaj, D., Kos, M., Grasic, M., Kacic, Z.,
Influence of Hangover and Hangbefore Criteria on Automatic Speech Recognition,
WSSIP09(1-4).
IEEE DOI 0906
BibRef

Hanžl, V.[Václav], Pollák, P.[Petr],
Accuracy Analysis of Generalized Pronunciation Variant Selection in ASR Systems,
COST08(399-408).
Springer DOI 0810
BibRef

Camarena-Ibarrola, A.[Antonio], Chávez, E.[Edgar], Tellez, E.S.[Eric Sadit],
Robust Radio Broadcast Monitoring Using a Multi-Band Spectral Entropy Signature,
CIARP09(587-594).
Springer DOI 0911
BibRef

Mantilla-Caeiros, A.[Alfredo], Miyatake, M.N.[Mariko Nakano], Perez-Meana, H.[Hector],
Isolate Speech Recognition Based on Time-Frequency Analysis Methods,
CIARP09(297-304).
Springer DOI 0911
BibRef

Veronková, J.[Jitka], Palková, Z.[Zdena],
Perception of Czech in Noise: Stability of Vowels,
COST08(149-161).
Springer DOI 0810
BibRef

Skarnitzl, R.[Radek],
Challenges in Segmenting the Czech Lateral Liquid,
COST08(162-172).
Springer DOI 0810
BibRef

Machac, P.[Pavel],
Implications of Acoustic Variation for the Segmentation of the Czech Trill r,
COST08(173-181).
Springer DOI 0810
BibRef

Jorschick, A.B.[Annett B.],
Voicing in Labial Plosives in Czech,
COST08(182-189).
Springer DOI 0810
BibRef

Volín, J.[Jan],
Normalization of the Vocalic Space,
COST08(190-200).
Springer DOI 0810
BibRef

Rajnoha, J.[Josef], Pollák, P.[Petr],
Czech Spontaneous Speech Collection and Annotation: The Database of Technical Lectures,
COST08(377-385).
Springer DOI 0810
BibRef

Janda, J.[Jan],
Quantitative Analysis of the Relative Local Speech Rate,
COST08(368-376).
Springer DOI 0810
BibRef

Zhang, B.[Bo], Zhuang, X.[Xin], Huang, P.[Pan], Feng, C.[Chen], Zhao, J.[Jie],
Application of Uni-Directional Microphone Array for Identifying English Pronunciation Errors,
CISP09(1-5).
IEEE DOI 0910
BibRef

Kuremoto, T., Komoto, T., Kobayashi, K., Obayashi, M.,
A Voice Instruction Learning System Using PL-T-SOM,
CISP09(1-6).
IEEE DOI 0910
BibRef

Espi, M., Takeuchi, Y.,
Substitution of Vocal Folds for Voice Generation by Means of Intra-Oral Pulse Generator,
CISP09(1-5).
IEEE DOI 0910
BibRef

Orhan, Z., Gormez, Z.,
Evaluation of the Concatenative Turkish Text-to-Speech System,
CISP09(1-5).
IEEE DOI 0910
BibRef

Cai, Y.[Yu], Yuan, J.P.[Jian-Ping], Hou, C.H.[Chao-Huan], Yang, J.[Jun], Wu, B.[Bian],
Harmonic Enhancement with Noise Reduction of Speech Signal by Comb Filtering,
CISP09(1-4).
IEEE DOI 0910
BibRef

Li, W.F.[Wei-Feng], Billard, A., Bourlard, H.,
Keyword Detection for Spontaneous Speech,
CISP09(1-5).
IEEE DOI 0910
BibRef

Zhang, X.Y.[Xin-Yi], Yao, J.X.[Jian-Xiao], He, Q.A.[Qi-Ang],
Research of STRAIGHT Spectrogram and Difference Subspace Algorithm for Speech Recognition,
CISP09(1-4).
IEEE DOI 0910
BibRef

Lu, X.[Xugang], Matsuda, S., Unoki, M., Nakamura, S.,
Temporal Modulation Normalization for Robust Speech Feature Extraction and Recognition,
CISP09(1-4).
IEEE DOI 0910
BibRef

Jun, Y.Z.[Yue Zhen], Lei, W.[Wang], Hao, W.[Wang],
A New Parameter of Speech Character Based on the Bloomfield's Model,
CISP09(1-4).
IEEE DOI 0910
BibRef

Qasemi Zadeh, B.[Behrang], Shen, J.L.[Jia-Li], O'Neill, I.[Ian], Miller, P.[Paul], Hanna, P.[Philip], Stewart, D.[Darryl], Wang, H.B.[Hong-Bin],
A Speech Based Approach to Surveillance Video Retrieval,
AVSBS09(336-339).
IEEE DOI 0909
BibRef

Cristani, M., Pesarin, A., Drioli, C., Tavano, A., Perina, A., Murino, V.,
Auditory dialog analysis and understanding by generative modelling of interactional dynamics,
CVPR4HB09(103-109).
IEEE DOI 0906
BibRef

Chen, J.B.[Jin-Biao], Zhang, S.Q.[Shi-Qing],
Manifold learning-based phoneme recognition,
IASP09(308-312).
IEEE DOI 0904
BibRef

Mahdhaoui, A.[Ammar], Chetouani, M.[Mohamed], Zong, C.[Cong],
Motherese detection based on segmental and supra-segmental features,
ICPR08(1-4).
IEEE DOI 0812
parent-infant interactions. BibRef

Zeng, Z.[Zhi], Li, X.[Xin], Ma, X.H.[Xiao-Hong], Ji, Q.A.[Qi-Ang],
Adaptive context recognition based on audio signal,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Luo, L.[Li], Lu, P.F.[Peng-Fei], Wang, Z.F.[Zeng-Fu],
A real-time accompaniment system based on sung voice recognition,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Pesarin, A., Cristani, M., Murino, V., Drioli, C., Perina, A., Tavano, A.,
A statistical signature for automatic dialogue classification,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Choi, H.[Heeyoul], Gutierrez-Osuna, R.[Ricardo], Choi, S.J.[Seung-Jin], Choe, Y.[Yoonsuck],
Kernel oriented discriminant analysis for speaker-independent phoneme spaces,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Terry, L.[Louis], Katsaggelos, A.K.[Aggelos K.],
A phone-viseme dynamic Bayesian network for audio-visual automatic speech recognition,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Krajewski, J.[Jarek], Batliner, A.[Anton], Wieland, R.[Rainer],
Multiple classifier applied on predicting microsleep from speech,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Banerjee, P.[Pratyush], Garg, G.[Gaurav], Mitra, P.[Pabitra], Basu, A.[Anupam],
Application of triphone clustering in acoustic modeling for continuous speech recognition in Bengali,
ICPR08(1-4).
IEEE DOI 0812
BibRef

Bouzid, A.[Aďcha], Ellouze, N.[Noureddine],
Voicing Detection in Noisy Speech Signal,
ICISP08(544-551).
Springer DOI 0807
BibRef

Türkmen, H.I.[H. Irem], Karsligil, M.E.[M. Elif],
Reconstruction of Dysphonic Speech by MELP,
CIARP08(767-774).
Springer DOI 0809
BibRef

Maskeliunas, R.[Rytis], Rudzionis, A.[Algimantas], Rudzionis, V.[Vytautas],
Analysis of the Possibilities to Adapt the Foreign Language Speech Recognition Engines for the Lithuanian Spoken Commands Recognition,
COST08(409-422).
Springer DOI 0810
BibRef

Hain, T.[Thomas], Burget, L.[Lukas], Dines, J.[John], Garau, G.[Giulia], Karafiat, M.[Martin], van Leeuwen, D.[David], Lincoln, M.[Mike], Wan, V.[Vincent],
The 2007 AMI(DA) System for Meeting Transcription,
MTPH07(xx-yy).
Springer DOI 0705
BibRef

Lamel, L., Bilinski, E., Gauvain, J.L., Adda, G., Barras, C., Zhu, X.,
The LIMSI RT07 Lecture Transcription System,
MTPH07(xx-yy).
Springer DOI 0705
BibRef

Fiscus, J.G.[Jonathan G.], Ajot, J.[Jerome], Garofolo, J.S.[John S.],
The Rich Transcription 2007 Meeting Recognition Evaluation,
MTPH07(xx-yy).
Springer DOI 0705
BibRef

Stolcke, A.[Andreas], Anguera, X.[Xavier], Boakye, K.[Kofi], Çetin, Ö.[Özgür], Janin, A.[Adam], Magimai-Doss, M.[Mathew], Wooters, C.[Chuck], Zheng, J.[Jing],
The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System,
MTPH07(xx-yy).
Springer DOI 0705
BibRef

Huang, J.[Jing], Marcheret, E.[Etienne], Visweswariah, K.[Karthik], Libal, V.[Vit], Potamianos, G.[Gerasimos],
The IBM Rich Transcription 2007 Speech-to-Text Systems for Lecture Meetings,
MTPH07(xx-yy).
Springer DOI 0705
BibRef

Wölfel, M.[Matthias], Stüker, S.[Sebastian], Kraft, F.[Florian],
The ISL RT-07 Speech-to-Text System,
MTPH07(xx-yy).
Springer DOI 0705
BibRef

Schuller, B.[Björn], Wöllmer, M.[Martin], Moosmayr, T.[Tobias], Ruske, G.[Günther], Rigoll, G.[Gerhard],
Switching Linear Dynamic Models for Noise Robust In-Car Speech Recognition,
DAGM08(xx-yy).
Springer DOI 0806
BibRef

Kamble, M.R.[Madhu R.], Patil, H.A.[Hemant A.],
Effectiveness of Mel Scale-Based ESA-IFCC Features for Classification of Natural vs. Spoofed Speech,
PReMI17(308-316).
Springer DOI 1711
BibRef

Tak, R.N.[Rishabh N.], Agrawal, D.M.[Dharmesh M.], Patil, H.A.[Hemant A.],
Novel Phase Encoded Mel Filterbank Energies for Environmental Sound Classification,
PReMI17(317-325).
Springer DOI 1711
BibRef

Patil, H.A.[Hemant A.], Basu, T.K.,
Cepstral Domain Teager Energy for Identifying Perceptually Similar Languages,
PReMI07(455-462).
Springer DOI 0712
BibRef

Manwani, N.[Naresh], Mitra, S.K.[Suman K.], Joshi, M.V.,
Spoken Language Identification for Indian Languages Using Split and Merge EM Algorithm,
PReMI07(463-468).
Springer DOI 0712
BibRef

Rao, K.S.[K. Sreenivasa], Laskar, R.H., Koolagudi, S.G.[Shashidhar G.],
Voice Transformation by Mapping the Features at Syllable Level,
PReMI07(479-486).
Springer DOI 0712
BibRef

Oropeza Rodríguez, J.L.[José Luis], Suárez Guerra, S.[Sergio], Sánchez Fernández, L.P.[Luis Pastor],
Using Adaptive Filter to Increase Automatic Speech Recognition Rate in a Digit Corpus,
CIARP07(78-87).
Springer DOI 0711
BibRef

Várallyay, G.[György],
SSM: A Novel Method to Recognize the Fundamental Frequency in Voice Signals,
CIARP07(88-95).
Springer DOI 0711
BibRef

Simőes, C.[Carla], Teixeira, C.[Carlos], Dias, M.[Miguel], Braga, D.[Daniela], Calado, A.[António],
European Portuguese Accent in Acoustic Models for Non-native English Speakers,
CIARP07(734-742).
Springer DOI 0711
BibRef

Smeaton, A.F.[Alan F.], McHugh, M.[Mike],
Towards event detection in an audio-based sensor network,
VSSN05(87-94).
WWW Link. 0511
BibRef

Esposito, A.[Anna], Stejskal, V.[Vojtech], Smékal, Z.[Zdenek], Bourbakis, N.[Nikolaos],
The Significance of Empty Speech Pauses: Cognitive and Algorithmic Issues,
BVAI07(542-554).
Springer DOI 0710
BibRef

Hernández, I.[Igmar], García, P.[Paola], Nolazco, J.[Juan], Buera, L.[Luis], Lleida, E.[Eduardo],
Robust Automatic Speech Recognition Using PD-MEEMLIN,
IbPRIA07(II: 1-8).
Springer DOI 0706
BibRef

Chung, Y.J.[Yong-Joo], Bae, K.S.[Keun-Sung],
Data-Driven Jacobian Adaptation in a Multi-model Structure for Noisy Speech Recognition,
IbPRIA07(II: 452-459).
Springer DOI 0706
BibRef

Cano, S.[Sergio], Suaste, I.[Israel], Escobedo, D.[Daniel], Reyes-García, C.A.[Carlos A.], Ekkel, T.[Taco],
A Combined Classifier of Cry Units with New Acoustic Attributes,
CIARP06(416-425).
Springer DOI 0611
BibRef

Huerta-Hernández, L.D.[Luis D.], Reyes-García, C.A.[Carlos A.],
On the Processing of Fuzzy Patterns for Text Independent Phonetic Speech Segmentation,
CIARP06(437-445).
Springer DOI 0611
BibRef

Alghassi, H., Tafazoli, S., Lawrence, P.,
The Audio Surveillance Eye,
AVSBS06(106-106).
IEEE DOI 0611
BibRef

Yuan, L.C.[Li-Chi], Chen, Z.G.[Zhi-Gang],
A Novel Statistical Model for Speech Recognition and POS Tagging,
AVSBS06(61-61).
IEEE DOI 0611
BibRef

Yin, B.[Bo], Ambikairajah, E.[Eliathamby], Chen, F.[Fang],
Combining Cepstral and Prosodic Features in Language Identification,
ICPR06(IV: 254-257).
IEEE DOI 0609
BibRef

Zouari, L.[Leila], Chollet, G.[Gerard],
Efficient Gaussian Mixture for Speech Recognition,
ICPR06(IV: 294-297).
IEEE DOI 0609
BibRef

Vinciarelli, A.[Alessandro],
Sociometry Based Multiparty Audio Recordings Summarization,
ICPR06(II: 1154-1157).
IEEE DOI 0609
BibRef

Wang, J.C.[Jia-Ching], Wang, J.F.[Jhing-Fa], Lin, C.B.[Cai-Bei], Jian, K.T.[Kun-Ting], Kuok, W.H.[Wai-He],
Content-Based Audio Classification Using Support Vector Machines and Independent Component Analysis,
ICPR06(IV: 157-160).
IEEE DOI 0609
BibRef

Huang, R.Q.[Rong-Qing], Ma, C.X.[Chang-Xue],
Toward A Speaker-Independent Real-Time Affect Detection System,
ICPR06(I: 1204-1207).
IEEE DOI 0609
BibRef

Wang, L.[Liang], Ambikairajah, E.[Eliathamby], Choi, E.H.C.[Eric H.C.],
Multi-lingual Phoneme Recognition and Language Identification Using Phonotactic Information,
ICPR06(IV: 245-248).
IEEE DOI 0609
BibRef

Kruger, S.E.[Sven E.], Schaffoner, M.[Martin], Katz, M.[Marcel], Andelic, E.[Edin], Wendemuth, A.[Andreas],
Mixture of Support Vector Machines for HMM based Speech Recognition,
ICPR06(IV: 326-329).
IEEE DOI 0609
BibRef

Andelic, E.[Edin], Schaffoner, M.[Martin], Katz, M.[Marcel], Kruger, S.E.[Sven E.],
A Hybrid HMM-Based Speech Recognizer Using Kernel-Based Discriminants as Acoustic Models,
ICPR06(II: 1158-1161).
IEEE DOI 0609
BibRef

Halavati, R.[Ramin], Shouraki, S.B.[Saeed Bagheri], Tajik, H.[Hossein], Cholakian, A.[Arpineh], Razaghpour, M.[Mina],
A Novel Approach to Very Fast and Noise Robust, Isolated Word Speech Recognition,
ICPR06(III: 190-193).
IEEE DOI 0609
BibRef

Lin, H.[Hui], Ou, Z.J.[Zhi-Jian],
Switching Auxiliary Chains for Speech Recognition based on Dynamic Bayesian Networks,
ICPR06(IV: 258-261).
IEEE DOI 0609
BibRef

Maier, A.[Andreas], Hacker, C.[Christian], Noth, E.[Elmar], Nkenke, E.[Emeka], Haderlein, T.[Tino], Rosanowski, F.[Frank], Schuster, M.[Maria],
Intelligibility of Children with Cleft Lip and Palate: Evaluation by Speech Recognition Techniques,
ICPR06(IV: 274-277).
IEEE DOI 0609
BibRef

Zioko, B.[Bartosz], Manandhar, S.[Suresh], Wilson, R.C.[Richard C.],
Phoneme segmentation of speech,
ICPR06(IV: 282-285).
IEEE DOI 0609
BibRef

Choi, E.H.C.[Eric H. C.],
A Noise Robust Front-end for Speech Recognition Using Hough Transform and Cumulative Distribution Mapping,
ICPR06(IV: 286-289).
IEEE DOI 0609
BibRef

Liu, M.[Ming], Huang, T.S.[Thomas S.],
A Bayesian Predictive Method for Automatic Speech Segmentation,
ICPR06(IV: 290-293).
IEEE DOI 0609
BibRef

Haas, J.[Jürgen], Gallwitz, F.[Florian], Horndasch, A.[Axel], Huber, R.[Richard], Warnke, V.[Volker],
Telephone-Based Speech Dialog Systems,
DAGM05(125).
Springer DOI 0509
BibRef

Maier, A.[Andreas], Hacker, C.[Christian], Steidl, S.[Stefan], Nöth, E.[Elmar], Niemann, H.[Heinrich],
Robust Parallel Speech Recognition in Multiple Energy Bands,
DAGM05(133).
Springer DOI 0509
BibRef

Hacker, C.[Christian], Cincarek, T.[Tobias], Gruhn, R.[Rainer], Steidl, S.[Stefan], Nöth, E.[Elmar], Niemann, H.[Heinrich],
Pronunciation Feature Extraction,
DAGM05(141).
Springer DOI 0509
BibRef

Ivanecky, J.[Jozef], Fischer, J.[Julia], Mast, M.[Marion], Kunzmann, S.[Siegfried], Ross, T.[Thomas], Fischer, V.[Volker],
Multi-lingual and Multi-modal Speech Processing and Applications,
DAGM05(149).
Springer DOI 0509
BibRef

Dai, H.S.[Hai-Sheng], Zhu, X.Y.[Xiao-Yan], Luo, Y.P.[Yu-Pin], Yang, S.Y.[Shi-Yuan],
An Utterance Verification Algorithm in Keyword Spotting System,
IbPRIA05(II:555).
Springer DOI 0509
BibRef

Rodríguez, L.J.[Luis Javier], Torres, M.I.[M. Inés],
A Clustering Algorithm for the Fast Match of Acoustic Conditions in Continuous Speech Recognition,
IbPRIA05(II:562).
Springer DOI 0509
BibRef

Sánchez, J.A.[Joan Andreu], Benedí, J.M.[José Miguel], Linares, D.[Diego],
Performance of a SCFG-Based Language Model with Training Data Sets of Increasing Size,
IbPRIA05(II:586).
Springer DOI 0509
BibRef

Nolazco-Flores, J.A.[Juan A.], Salgado-Garza, L.R.[Luis R.], Peńa-Díaz, M.[Marco],
Speaker Dependent ASRs for Huastec and Western-Huastec Náhuatl Languages,
IbPRIA05(II:595).
Springer DOI 0509
BibRef

Ribadas, F.J.[Francisco Jose], Vilares, M.[Manuel], Vilares, J.[Jesus],
Semantic Similarity Between Sentences Through Approximate Tree Matching,
IbPRIA05(II:638).
Springer DOI 0509
BibRef

Chen, K.[Ke],
Speaker Modeling with Various Speech Representations,
ICBA04(592-599).
Springer DOI 0505
BibRef

Iurgel, U., Rigoll, G.,
Spoken document classification with SVMs using linguistic unit weighting and probabilistic couplers,
ICPR04(II: 667-670).
IEEE DOI 0409
BibRef

Sit, C.H.[Chin-Hung], Mak, M.W.[Man-Wai], Kung, S.Y.[Sun-Yuan],
Maximum Likelihood and Maximum a Posteriori Adaptation for Distributed Speaker Recognition Systems,
ICBA04(640-647).
Springer DOI 0505
BibRef

Gutkin, A., King, S.,
Structural representation of speech for phonetic classification,
ICPR04(III: 438-441).
IEEE DOI 0409
BibRef

Demirekler, M., Karahan, F., Ciloglu, T.,
Fusing length and voicing information, and HMM decision using a Bayesian causal tree against insufficient training data,
ICPR00(Vol III: 102-105).
IEEE DOI 0403
BibRef

Kashino, K., Kurozumi, T., Murase, H.,
Feature fluctuation absorption for a quick audio retrieval from long recordings,
ICPR00(Vol III: 98-101).
IEEE DOI 0403
BibRef

Gravier, G., Sigelle, M., Chollet, G.,
A Markov random field model for automatic speech recognition,
ICPR00(Vol III: 254-257).
IEEE DOI 0403
BibRef

Ruiz, N., Rosa, M., Lopez, F., Martinez, D., Mata, R.,
New algorithm for searching minimum bit rate wavelet representations with application to multiresolution-based perceptual audio coding,
ICPR00(Vol III: 286-289).
IEEE DOI 0403
BibRef

Steidl, S.[Stefan], Stemmer, G.[Georg], Hacker, C.[Christian], Nöth, E.[Elmar], Niemann, H.[Heinrich],
Improving Children's Speech Recognition by HMM Interpolation with an Adults' Speech Recognizer,
DAGM03(600-607).
Springer DOI 0310
BibRef

Stephenson, T.A., Magimai-Doss, M., Bourlard, H.,
Mixed bayesian networks with auxiliary variables for automatic speech recognition,
ICPR02(IV: 293-296).
IEEE DOI 0211
BibRef

Bourlard, H.,
Some recent advances in speech recognition with potential applications in other statistical pattern recognition areas,
ICPR02(III: 727-727).
IEEE DOI 0211
BibRef

Tanaka, K., Kojima, H., Fujimura, N., Itoh, Y.,
Constructing speech processing systems on universal phonetic codes accompanied with reference acoustic models,
ICPR02(III: 728-731).
IEEE DOI 0211
BibRef

Katz, M., Meier, H.G., Dolfing, H., Klakow, D.,
Robustness of linear discriminant analysis in automatic speech recognition,
ICPR02(III: 371-374).
IEEE DOI 0211
BibRef

Lefevre, S., Maillard, B., Vincent, N.,
A two level classifier process for audio segmentation,
ICPR02(III: 891-894).
IEEE DOI 0211
BibRef

de Stefano, C., Della Cioppa, A., Marcelli, A.,
An investigation on MPEG audio segmentation by evolutionary algorithms,
ICDAR01(952-956).
IEEE DOI 0109
BibRef

Nouza, J.,
Feature selection methods for hidden Markov model-based speech recognition,
ICPR96(II: 186-190).
IEEE DOI 0509
BibRef

Vande Wouwer, G., Scheunders, P., van Dyck, D.,
Wavelet-FILVQ classifier for speech analysis,
ICPR96(IV: 214-218).
IEEE DOI 0509
BibRef

Uma, S., Sridhar, V., Krishna, G.,
Time-normalization techniques for speaker-independent isolated word recognition,
ICPR92(III:537-540).
IEEE DOI 9208
BibRef

Rieck, S., Schukat-Talamazzini, E.G., Niemann, H.,
Speaker adaptation using semi-continuous hidden Markov models,
ICPR92(III:541-544).
IEEE DOI 9208
BibRef

Edmonds, E.A., Pan, L.Y., O'Brien, S.M.,
Automatic feature extraction from spectrograms for acoustic-phonetic analysis,
ICPR92(II:701-704).
IEEE DOI 9208
BibRef

Ishikawa, Y., Nakajima, K.,
A real time connected word recognition system,
ICPR90(II: 215-217).
IEEE DOI 9008
BibRef

Chapter on New Unsorted Entries, and Other Miscellaneous Papers continues in
Speech Recognition, Neural Networks, CNN .


Last update:Mar 16, 2024 at 20:36:19