Speaker adaptation of neural network acoustic models using i-vectors G Saon, H Soltau, D Nahamoo, M Picheny 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 55-59, 2013 | 585 | 2013 |
Boosted MMI for model and feature-space discriminative training D Povey, D Kanevsky, B Kingsbury, B Ramabhadran, G Saon, ... 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 448 | 2008 |
Deep convolutional neural networks for large-scale speech tasks TN Sainath, B Kingsbury, G Saon, H Soltau, A Mohamed, G Dahl, ... Neural Networks 64, 39-48, 2015 | 389 | 2015 |
fMPE: Discriminatively trained features for speech recognition D Povey, B Kingsbury, L Mangu, G Saon, H Soltau, G Zweig Proceedings.(ICASSP'05). IEEE International Conference on Acoustics, Speech …, 2005 | 361 | 2005 |
English conversational telephone speech recognition by humans and machines G Saon, G Kurata, T Sercu, K Audhkhasi, S Thomas, D Dimitriadis, X Cui, ... arXiv preprint arXiv:1703.02136, 2017 | 296 | 2017 |
Maximum likelihood discriminant feature spaces G Saon, M Padmanabhan, R Gopinath, S Chen 2000 IEEE International Conference on Acoustics, Speech, and Signal …, 2000 | 279 | 2000 |
The IBM 2015 English conversational telephone speech recognition system G Saon, HKJ Kuo, S Rennie, M Picheny arXiv preprint arXiv:1505.05899, 2015 | 203 | 2015 |
Improvements to deep convolutional neural networks for LVCSR TN Sainath, B Kingsbury, A Mohamed, GE Dahl, G Saon, H Soltau, ... 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 315-320, 2013 | 202 | 2013 |
The IBM Attila speech recognition toolkit H Soltau, G Saon, B Kingsbury 2010 IEEE Spoken Language Technology Workshop, 97-102, 2010 | 160 | 2010 |
Advances in speech transcription at IBM under the DARPA EARS program SF Chen, B Kingsbury, L Mangu, D Povey, G Saon, H Soltau, G Zweig IEEE Transactions on Audio, Speech, and Language Processing 14 (5), 1596-1608, 2006 | 153 | 2006 |
The IBM 2004 conversational telephony system for rich transcription H Soltau, B Kingsbury, L Mangu, D Povey, G Saon, G Zweig Proceedings.(ICASSP'05). IEEE International Conference on Acoustics, Speech …, 2005 | 142 | 2005 |
Large-vocabulary continuous speech recognition systems: A look at some recent advances G Saon, JT Chien IEEE Signal Processing Magazine 29 (6), 18-33, 2012 | 135 | 2012 |
Direct acoustics-to-word models for English conversational speech recognition K Audhkhasi, B Ramabhadran, G Saon, M Picheny, D Nahamoo arXiv preprint arXiv:1703.07754, 2017 | 122 | 2017 |
Anatomy of an extremely fast LVCSR decoder G Saon, D Povey, G Zweig Ninth European Conference on Speech Communication and Technology, 2005 | 94 | 2005 |
Building competitive direct acoustics-to-word models for English conversational speech recognition K Audhkhasi, B Kingsbury, B Ramabhadran, G Saon, M Picheny 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 91 | 2018 |
Joint training of convolutional and non-convolutional neural networks H Soltau, G Saon, TN Sainath 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 90 | 2014 |
Analyzing convolutional neural networks for speech activity detection in mismatched acoustic conditions S Thomas, S Ganapathy, G Saon, H Soltau 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 89 | 2014 |
Data-driven approach to designing compound words for continuous speech recognition G Saon, M Padmanabhan IEEE Transactions on Speech and Audio Processing 9 (4), 327-332, 2001 | 85 | 2001 |
Feature and model space speaker adaptation with full covariance Gaussians D Povey, G Saon Ninth International Conference on Spoken Language Processing, 2006 | 83 | 2006 |
Exploiting diversity for spoken term detection L Mangu, H Soltau, HK Kuo, B Kingsbury, G Saon 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 68 | 2013 |