Follow
Patrick Zhang
Patrick Zhang
Verified email at link.cuhk.edu.hk
Title
Cited by
Cited by
Year
Revisiting hidden Markov models for speech emotion recognition
S Mao, D Tao, G Zhang, PC Ching, T Lee
ICASSP 2019-2019 IEEE international conference on acoustics, speech and …, 2019
822019
Adaspeech 3: Adaptive text to speech for spontaneous style
Y Yan, X Tan, B Li, G Zhang, T Qin, S Zhao, Y Shen, WQ Zhang, TY Liu
arXiv preprint arXiv:2107.02530, 2021
342021
Mixed-phoneme bert: Improving bert with mixed phoneme and sup-phoneme representations for text to speech
G Zhang, K Song, X Tan, D Tan, Y Yan, Y Liu, G Wang, W Zhou, T Qin, ...
arXiv preprint arXiv:2203.17190, 2022
202022
iemotts: Toward robust cross-speaker emotion transfer and control for speech synthesis based on disentanglement between prosody and timbre
G Zhang, Y Qin, W Zhang, J Wu, M Li, Y Gai, F Jiang, T Lee
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
152023
Cuhk-ee voice cloning system for icassp 2021 m2voc challenge
D Tan, H Huang, G Zhang, T Lee
arXiv preprint arXiv:2103.04699, 2021
112021
Learning Syllable-Level Discrete Prosodic Representation for Expressive Speech Generation.
G Zhang, Y Qin, T Lee
Interspeech, 3426-3430, 2020
112020
Estimating mutual information in prosody representation for emotional prosody transfer in speech synthesis
G Zhang, S Qiu, Y Qin, T Lee
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
82021
Applying the information bottleneck principle to prosodic representation learning
G Zhang, Y Qin, D Tan, T Lee
arXiv preprint arXiv:2108.02821, 2021
42021
A study on the efficacy of model pre-training in developing neural text-to-speech system
G Zhang, Y Leng, D Tan, Y Qin, K Song, X Tan, S Zhao, T Lee
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
22022
Environment aware text-to-speech synthesis
D Tan, G Zhang, T Lee
arXiv preprint arXiv:2110.03887, 2021
22021
Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models
Y Tian, G Zhang, T Lee
arXiv preprint arXiv:2305.17436, 2023
12023
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech
G Zhang, T Merritt, MS Ribeiro, B Tura-Vecino, K Yanagisawa, K Pokora, ...
arXiv preprint arXiv:2307.16679, 2023
2023
Chinese Nouns are Mass Nouns: An Information-Theoretic Computational Validation
W Zhou, G Zhang, Y Chen
Available at SSRN 4674220, 0
The system can't perform the operation now. Try again later.
Articles 1–13