StyleFusion TTS: Multimodal Style-Control and Enhanced Feature Fusion for Zero-Shot Text-to-Speech Synthesis

Zhiyong Chen, Xinnuo Li*, Zhiqi Ai, Shugong Xu*

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceeding > Conference Proceeding > peer-review

Fingerprint

Computer Science