Παρακολούθηση
Tomohiro Nakatani
Tomohiro Nakatani
NTT Communication Science Laboratories
Η διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα ieee.org - Αρχική σελίδα
Τίτλος
Παρατίθεται από
Παρατίθεται από
Έτος
The reverb challenge: A common evaluation framework for dereverberation and recognition of reverberant speech
K Kinoshita, M Delcroix, T Yoshioka, T Nakatani, A Sehr, W Kellermann, ...
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE …, 2013
4132013
Speech dereverberation based on variance-normalized delayed linear prediction
T Nakatani, T Yoshioka, K Kinoshita, M Miyoshi, BH Juang
IEEE Transactions on Audio, Speech, and Language Processing 18 (7), 1717-1731, 2010
4012010
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research
K Kinoshita, M Delcroix, S Gannot, EA P Habets, R Haeb-Umbach, ...
EURASIP Journal on Advances in Signal Processing 2016 (1), 1-19, 2016
3582016
Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition
T Yoshioka, A Sehr, M Delcroix, K Kinoshita, R Maas, T Nakatani, ...
IEEE Signal Processing Magazine 29 (6), 114-126, 2012
3112012
Suppression of late reverberation effect on speech signal using long-term multiple-step linear prediction
K Kinoshita, M Delcroix, T Nakatani, M Miyoshi
IEEE transactions on audio, speech, and language processing 17 (4), 534-545, 2009
2482009
Generalization of multi-channel linear prediction methods for blind MIMO impulse response shortening
T Yoshioka, T Nakatani
IEEE Transactions on Audio, Speech, and Language Processing 20 (10), 2707-2720, 2012
2432012
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices
T Yoshioka, N Ito, M Delcroix, A Ogawa, K Kinoshita, M Fujimoto, C Yu, ...
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
2422015
Robust MVDR beamforming using time-frequency masks for online/offline ASR in noise
T Higuchi, N Ito, T Yoshioka, T Nakatani
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
2112016
Blind separation and dereverberation of speech mixtures by joint optimization
T Yoshioka, T Nakatani, M Miyoshi, HG Okuno
IEEE Transactions on Audio, Speech, and Language Processing 19 (1), 69-84, 2010
1752010
Improving transformer-based end-to-end speech recognition with connectionist temporal classification and language model integration
T Nakatani
Proc. Interspeech, 2019
1702019
Blind speech dereverberation with multi-channel linear prediction based on short time Fourier transform representation
T Nakatani, T Yoshioka, K Kinoshita, M Miyoshi, BH Juang
2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008
1692008
Single channel target speaker extraction and recognition with speaker beam
M Delcroix, K Zmolikova, K Kinoshita, A Ogawa, T Nakatani
2018 IEEE international conference on acoustics, speech and signal …, 2018
1482018
LINEAR PREDICTION-BASED DEREVERBERATION WITH ADVANCED SPEECH ENHANCEMENT AND RECOGNITION TECHNOLOGIES FOR THE REVERB CHALLENGE
M Delcroix, T Yoshioka, A Ogawa, Y Kubo, M Fujimoto, N Ito, K Kinoshita, ...
1252014
A multichannel MMSE-based framework for speech source separation and noise reduction
M Souden, S Araki, K Kinoshita, T Nakatani, H Sawada
IEEE Transactions on Audio, Speech, and Language Processing 21 (9), 1913-1928, 2013
1152013
Blind dereverberation of single channel speech signal based on harmonic structure
T Nakatani, M Miyoshi
2003 IEEE International Conference on Acoustics, Speech, and Signal …, 2003
1122003
Exploring multi-channel features for denoising-autoencoder-based speech enhancement
S Araki, T Hayashi, M Delcroix, M Fujimoto, K Takeda, T Nakatani
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
1072015
Speech processing for digital home assistants: Combining signal processing with deep-learning techniques
R Haeb-Umbach, S Watanabe, T Nakatani, M Bacchiani, B Hoffmeister, ...
IEEE Signal processing magazine 36 (6), 111-124, 2019
1062019
Exploiting spectro-temporal locality in deep learning based acoustic event detection
M Espi, M Fujimoto, K Kinoshita, T Nakatani
EURASIP Journal on Audio, Speech, and Music Processing 2015 (1), 1-12, 2015
1062015
Online MVDR beamformer based on complex Gaussian mixture model with spatial prior for noise robust ASR
T Higuchi, N Ito, S Araki, T Yoshioka, M Delcroix, T Nakatani
IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (4), 780-793, 2017
1042017
Robust and accurate fundamental frequency estimation based on dominant harmonic components
T Nakatani, T Irino
The Journal of the Acoustical Society of America 116 (6), 3690-3700, 2004
1042004
Δεν είναι δυνατή η εκτέλεση της ενέργειας από το σύστημα αυτή τη στιγμή. Προσπαθήστε ξανά αργότερα.
Άρθρα 1–20