Follow
Dan Su
Dan Su
Tencent AI Lab
Verified email at tencent.com
Title
Cited by
Cited by
Year
Gigaspeech: An evolving, multi-domain asr corpus with 10,000 hours of transcribed audio
G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...
arXiv preprint arXiv:2106.06909, 2021
2212021
Replay and synthetic speech detection with res2net architecture
X Li, N Li, C Weng, X Liu, D Su, D Yu, H Meng
ICASSP 2021-2021 IEEE international conference on acoustics, speech and …, 2021
1772021
Fastdiff: A fast conditional diffusion model for high-quality speech synthesis
R Huang, MWY Lam, J Wang, D Su, D Yu, Y Ren, Z Zhao
arXiv preprint arXiv:2204.09934, 2022
1632022
Mm-llms: Recent advances in multimodal large language models
D Zhang, Y Yu, J Dong, C Li, D Su, C Chu, D Yu
arXiv preprint arXiv:2401.13601, 2024
1602024
DurIAN: Duration Informed Attention Network for Speech Synthesis.
C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ...
Interspeech, 2027-2031, 2020
1092020
Component fusion: Learning replaceable language model component for end-to-end speech recognition system
C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
1062019
Durian: Duration informed attention network for multimodal synthesis
C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ...
arXiv preprint arXiv:1909.01700, 2019
1042019
Deep extractor network for target speaker recovery from single channel speech mixtures
J Wang, J Chen, D Su, L Chen, M Yu, Y Qian, D Yu
arXiv preprint arXiv:1807.08974, 2018
1042018
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information.
R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu
Interspeech, 4290-4294, 2019
1032019
BDDM: Bilateral denoising diffusion models for fast and high-quality speech synthesis
MWY Lam, J Wang, D Su, D Yu
arXiv preprint arXiv:2203.13508, 2022
1022022
Investigating end-to-end speech recognition for mandarin-english code-switching
C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
912019
End-to-end multi-channel speech separation
R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
arXiv preprint arXiv:1905.06286, 2019
872019
Deep Discriminative Embeddings for Duration Robust Speaker Verification.
N Li, D Tuo, D Su, Z Li, D Yu, A Tencent
Interspeech, 2262-2266, 2018
822018
Enhancing end-to-end multi-channel speech separation via spatial feature learning
R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
662020
Diffgan-tts: High-fidelity and efficient text-to-speech with denoising diffusion gans
S Liu, D Su, D Yu
arXiv preprint arXiv:2201.11972, 2022
632022
Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition.
C Weng, J Cui, G Wang, J Wang, C Yu, D Su, D Yu
Interspeech, 761-765, 2018
632018
Simple attention module based speaker verification with iterative noisy label detection
X Qin, N Li, C Weng, D Su, M Li
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
592022
Speech-XLNet: Unsupervised acoustic model pretraining for self-attention networks
X Song, G Wang, Z Wu, Y Huang, D Su, D Yu, H Meng
arXiv preprint arXiv:1910.10387, 2019
562019
Diffsvc: A diffusion probabilistic model for singing voice conversion
S Liu, Y Cao, D Su, H Meng
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
542021
Sandglasset: A light multi-granularity self-attentive network for time-domain speech separation
MWY Lam, J Wang, D Su, D Yu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
542021
The system can't perform the operation now. Try again later.
Articles 1–20