Dan Su

Cited by

	All	Since 2019
Citations	2861	2810
h-index	31	31
i10-index	67	66

940

470

235

705

20182019202020212022202320249 125 248 509 671 927 319

Public access

View all

28 articles

1 article

available

not available

Based on funding mandates

Co-authors

Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA FellowVerified email at global.tencent.com
Meng YUTencent AI LabVerified email at tencent.com
Jun WangPeking UniversityVerified email at tencent.com
Lianwu CHENKuaishou TechnologyVerified email at kuaishou.com
Shiyin KangXVerse Inc.Verified email at xverse.cn
Zhiyong WU (吴志勇)Associate Professor, Tsinghua UniversityVerified email at sz.tsinghua.edu.cn
Lei XieNorthwestern Polytechnical UniversityVerified email at nwpu.edu.cn
Xunying LiuChinese University of Hong KongVerified email at se.cuhk.edu.hk
Guangsen WangTencent AI LabVerified email at tencent.com
Songxiang LiumiHoYoVerified email at mihoyo.com
Shan YangTencent AI LabVerified email at nwpu-aslp.org
Yong XuPrincipal Researcher, Tencent America, Bellevue, USAVerified email at tencent.com
Shi-Xiong (Austin) ZhangSr. Director | AI Foundations@Capital One | ex-Microsoft, ex-Tencent, Cambridge PhDVerified email at capitalone.com
Rongzhi GuTencent AI LabVerified email at pku.edu.cn
Yuexian ZouPeking University Shenzhen Graduate SchoolVerified email at pku.edu.cn
Jia CuiTencentVerified email at tencent.com
Xihong WuPeking UniversityVerified email at cis.pku.edu.cn
Chao WengIndependent R&D
Max W. Y. LamIndependent Researcher

Dan Su

Tencent AI Lab

Verified email at tencent.com

speech recognition speech synthesis speaker recognition


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gigaspeech: An evolving, multi-domain asr corpus with 10,000 hours of transcribed audio G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ... arXiv preprint arXiv:2106.06909, 2021	160	2021
Replay and synthetic speech detection with res2net architecture X Li, N Li, C Weng, X Liu, D Su, D Yu, H Meng ICASSP 2021-2021 IEEE international conference on acoustics, speech and …, 2021	134	2021
Fastdiff: A fast conditional diffusion model for high-quality speech synthesis R Huang, MWY Lam, J Wang, D Su, D Yu, Y Ren, Z Zhao arXiv preprint arXiv:2204.09934, 2022	118	2022
Deep extractor network for target speaker recovery from single channel speech mixtures J Wang, J Chen, D Su, L Chen, M Yu, Y Qian, D Yu arXiv preprint arXiv:1807.08974, 2018	103	2018
DurIAN: Duration Informed Attention Network for Speech Synthesis. C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ... Interspeech, 2027-2031, 2020	101	2020
Durian: Duration informed attention network for multimodal synthesis C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ... arXiv preprint arXiv:1909.01700, 2019	99	2019
Component fusion: Learning replaceable language model component for end-to-end speech recognition system C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	97	2019
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information. R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu Interspeech, 4290-4294, 2019	91	2019
End-to-end multi-channel speech separation R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu arXiv preprint arXiv:1905.06286, 2019	79	2019
Investigating end-to-end speech recognition for mandarin-english code-switching C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	79	2019
Deep Discriminative Embeddings for Duration Robust Speaker Verification. N Li, D Tuo, D Su, Z Li, D Yu, A Tencent Interspeech, 2262-2266, 2018	77	2018
BDDM: Bilateral denoising diffusion models for fast and high-quality speech synthesis MWY Lam, J Wang, D Su, D Yu arXiv preprint arXiv:2203.13508, 2022	75	2022
Enhancing end-to-end multi-channel speech separation via spatial feature learning R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	62	2020
Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition. C Weng, J Cui, G Wang, J Wang, C Yu, D Su, D Yu Interspeech, 761-765, 2018	61	2018
Speech-XLNet: Unsupervised acoustic model pretraining for self-attention networks X Song, G Wang, Z Wu, Y Huang, D Su, D Yu, H Meng arXiv preprint arXiv:1910.10387, 2019	55	2019
Diffgan-tts: High-fidelity and efficient text-to-speech with denoising diffusion gans S Liu, D Su, D Yu arXiv preprint arXiv:2201.11972, 2022	45	2022
Sandglasset: A light multi-granularity self-attentive network for time-domain speech separation MWY Lam, J Wang, D Su, D Yu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	44	2021
Investigating robustness of adversarial samples detection for automatic speaker verification X Li, N Li, J Zhong, X Wu, X Liu, D Su, D Yu, H Meng arXiv preprint arXiv:2006.06186, 2020	43	2020
Simple attention module based speaker verification with iterative noisy label detection X Qin, N Li, C Weng, D Su, M Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	42	2022
Joint training of complex ratio mask based beamformer and acoustic model for noise robust asr Y Xu, C Weng, L Hui, J Liu, M Yu, D Su, D Yu ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	41	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors