Παρακολούθηση
Takuya Yoshioka
Takuya Yoshioka
Η διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα microsoft.com - Αρχική σελίδα
Τίτλος
Παρατίθεται από
Παρατίθεται από
Έτος
Dual-path rnn: efficient long sequence modeling for time-domain single-channel speech separation
Y Luo, Z Chen, T Yoshioka
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
4772020
Speech dereverberation based on variance-normalized delayed linear prediction
T Nakatani, T Yoshioka, K Kinoshita, M Miyoshi, BH Juang
IEEE Transactions on Audio, Speech, and Language Processing 18 (7), 1717-1731, 2010
4272010
The REVERB challenge: A common evaluation framework for dereverberation and recognition of reverberant speech
K Kinoshita, M Delcroix, T Yoshioka, T Nakatani, E Habets, ...
2013 IEEE Workshop on Applications of Signal Processing to Audio and …, 2013
4232013
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research
K Kinoshita, M Delcroix, S Gannot, EA P. Habets, R Haeb-Umbach, ...
EURASIP Journal on Advances in Signal Processing 2016, 1-19, 2016
3762016
Wavlm: Large-scale self-supervised pre-training for full stack speech processing
S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022
3612022
Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition
T Yoshioka, A Sehr, M Delcroix, K Kinoshita, R Maas, T Nakatani, ...
IEEE Signal Processing Magazine 29 (6), 114-126, 2012
3212012
Generalization of multi-channel linear prediction methods for blind MIMO impulse response shortening
T Yoshioka, T Nakatani
IEEE Transactions on Audio, Speech, and Language Processing 20 (10), 2707-2720, 2012
2602012
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices
T Yoshioka, N Ito, M Delcroix, A Ogawa, K Kinoshita, M Fujimoto, C Yu, ...
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
2472015
Robust MVDR beamforming using time-frequency masks for online/offline ASR in noise
T Higuchi, N Ito, T Yoshioka, T Nakatani
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
2182016
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings
S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...
arXiv preprint arXiv:2004.09249, 2020
2092020
Blind separation and dereverberation of speech mixtures by joint optimization
T Yoshioka, T Nakatani, M Miyoshi, HG Okuno
IEEE Transactions on Audio, Speech, and Language Processing 19 (1), 69-84, 2010
1822010
Blind speech dereverberation with multi-channel linear prediction based on short time Fourier transform representation
T Nakatani, T Yoshioka, K Kinoshita, M Miyoshi, BH Juang
2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008
1782008
Continuous speech separation: Dataset and analysis
Z Chen, T Yoshioka, L Lu, T Zhou, Z Meng, Y Luo, J Wu, X Xiao, J Li
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1412020
Linear prediction-based dereverberation with advanced speech enhancement and recognition technologies for the REVERB challenge
M Delcroix, T Yoshioka, A Ogawa, Y Kubo, M Fujimoto, N Ito, K Kinoshita, ...
Reverb workshop, 2014
1282014
End-to-end microphone permutation and number invariant multi-channel speech separation
Y Luo, Z Chen, N Mesgarani, T Yoshioka
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1142020
Online MVDR beamformer based on complex Gaussian mixture model with spatial prior for noise robust ASR
T Higuchi, N Ito, S Araki, T Yoshioka, M Delcroix, T Nakatani
IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (4), 780-793, 2017
1132017
Multi-channel overlapped speech recognition with location guided speech extraction network
Z Chen, X Xiao, T Yoshioka, H Erdogan, J Li, Y Gong
2018 IEEE Spoken Language Technology Workshop (SLT), 558-565, 2018
1082018
Multi-microphone neural speech separation for far-field multi-talker speech recognition
T Yoshioka, H Erdogan, Z Chen, F Alleva
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
1082018
Low-latency real-time meeting recognition and understanding using distant microphones and omni-directional camera
T Hori, S Araki, T Yoshioka, M Fujimoto, S Watanabe, T Oba, A Ogawa, ...
IEEE transactions on audio, speech, and language processing 20 (2), 499-513, 2011
1052011
Robust speech dereverberation based on non-negativity and sparse nature of speech spectrograms
H Kameoka, T Nakatani, T Yoshioka
2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009
1032009
Δεν είναι δυνατή η εκτέλεση της ενέργειας από το σύστημα αυτή τη στιγμή. Προσπαθήστε ξανά αργότερα.
Άρθρα 1–20