Analysis of MLP-Based Hierarchical Phoneme Posterior Probability Estimator J Pinto, S Garimella, M Magimai-Doss, H Hermansky, H Bourlard Audio, Speech, and Language Processing, IEEE Transactions on 19 (2), 225-241, 2011 | 104 | 2011 |
Sparse coding for speech recognition GSVS Sivaram, SK Nemala, M Elhilali, TD Tran, H Hermansky 2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010 | 94 | 2010 |
Sparse multilayer perceptron for phoneme recognition GSVS Sivaram, H Hermansky IEEE Transactions on Audio, Speech, and Language Processing 20 (1), 23-29, 2011 | 91 | 2011 |
A design methodology for selection and placement of sensors in multimedia surveillance systems G Sivaram, KR Ramakrishnan, PK Atrey, VK Singh, MS Kankanhalli Proceedings of the 4th ACM international workshop on Video surveillance and …, 2006 | 65 | 2006 |
Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 summer workshop G Zweig, P Nguyen, D Van Compernolle, K Demuynck, L Atlas, P Clark, ... 2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011 | 62 | 2011 |
Robust i-vector based adaptation of DNN acoustic model for speech recognition S Garimella, A Mandal, N Ström, B Hoffmeister, S Matsoukas, ... | 59 | 2015 |
fMLLR based feature-space speaker adaptation of DNN acoustic models SHK Parthasarathi, B Hoffmeister, S Matsoukas, A Mandal, N Ström, ... | 42 | 2015 |
Improving ASR confidence scores for Alexa using acoustic and hypothesis embeddings P Swarup, R Maas, S Garimella, SH Mallidi, B Hoffmeister | 36 | 2019 |
Multilayer perceptron with sparse hidden outputs for phoneme recognition GSVS Sivaram, H Hermansky 2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011 | 22 | 2011 |
Data-driven and feedback based spectro-temporal features for speech recognition GSVS Sivaram, SK Nemala, N Mesgarani, H Hermansky IEEE Signal Processing Letters 17 (11), 957-960, 2010 | 21 | 2010 |
Streaming end-to-end bilingual ASR systems with joint language identification S Punjabi, H Arsikere, Z Raeesy, C Chandak, N Bhave, A Bansal, ... arXiv preprint arXiv:2007.03900, 2020 | 19 | 2020 |
Design of multimedia surveillance systems GSVS Sivaram, MS Kankanhalli, KR Ramakrishnan ACM Transactions on Multimedia Computing, Communications, and Applications …, 2009 | 19 | 2009 |
Mixture of auto-associative neural networks for speaker verification GSVS Sivaram, S Thomas, H Hermansky Twelfth Annual Conference of the International Speech Communication Association, 2011 | 18 | 2011 |
Multi-dialect acoustic modeling using phone mapping and online i-vectors H Arsikere, A Sapru, S Garimella | 17 | 2019 |
Factor analysis of auto-associative neural networks with application in speaker verification S Garimella, H Hermansky IEEE Transactions on Neural Networks and Learning Systems 24 (4), 522-528, 2013 | 16 | 2013 |
Generative modeling of speech using neural networks S Matsoukas, N Ström, A Rastrow, SVSSR Krishna US Patent 9,653,093, 2017 | 15 | 2017 |
Joint ASR and language identification using RNN-T: An efficient approach to dynamic language switching S Punjabi, H Arsikere, Z Raeesy, C Chandak, N Bhave, A Bansal, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 14 | 2021 |
Regularized auto-associative neural networks for speaker verification S Garimella, SH Mallidi, H Hermansky IEEE Signal Processing Letters 19 (12), 841-844, 2012 | 11 | 2012 |
The UMD-JHU 2011 speaker recognition system D Garcia-Romero, X Zhou, D Zotkin, B Srinivasan, Y Luo, S Ganapathy, ... 2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012 | 11 | 2012 |
Discriminant spectrotemporal features for phoneme recognition N Mesgarani, GSVS Sivaram, SK Nemala, M Elhilali, H Hermansky Tenth Annual Conference of the International Speech Communication Association, 2009 | 11 | 2009 |