Publication No.: US20250157457A1
Publication Date: 2025-05-15
Application No.: US19023572
Application Date: 2025-01-16
Inventor: Bin HUANG, Tao SUN, Ce ZHANG, Yongguo KANG, Xiaoyin FU, Lei JIA
IPC: G10L13/027
Abstract: A method of training a deep learning model and a method of synthesizing speech are provided. They relate to the field of artificial intelligence technology, in particular to large models, large language models, generative models, deep learning, and speech processing. The method of training a deep learning model includes: determining a reference speech feature of a sample speech, the reference speech feature being associated with a prosodic feature of the sample speech; retrieving a speech library using a sample text corresponding to the sample speech, so as to obtain a pronunciation expression feature of the sample text; inputting the pronunciation expression feature into the deep learning model to obtain an output speech feature; determining a loss of the deep learning model according to the reference speech feature and the output speech feature; and adjusting a parameter of the deep learning model according to the loss.
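
The training steps listed in the abstract can be illustrated with a minimal sketch. Nothing below comes from the application itself: the dictionary-based speech library, the averaging retrieval, the linear model, the mean-squared-error loss, and every function name are assumptions chosen only to make the sequence of steps concrete (retrieve a pronunciation expression feature, run the model, compare the output with the reference speech feature, adjust parameters).

```python
# Hypothetical sketch of the training steps named in the abstract.
# The toy speech library, linear model, and MSE loss are assumptions,
# not the method disclosed in the application.

def retrieve_pronunciation_features(speech_library, sample_text):
    """Look up a feature vector per text unit and average them (assumed retrieval)."""
    vectors = [speech_library[u] for u in sample_text.split() if u in speech_library]
    if not vectors:
        raise KeyError("no unit of the sample text was found in the speech library")
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]

def forward(weights, features):
    """Toy linear model: output[j] = sum_i weights[j][i] * features[i]."""
    return [sum(w * x for w, x in zip(row, features)) for row in weights]

def mse_loss(output, reference):
    """Mean squared error between output and reference speech features."""
    return sum((o - r) ** 2 for o, r in zip(output, reference)) / len(output)

def train_step(weights, features, reference, lr=0.1):
    """One gradient-descent update; returns (new_weights, loss before the update)."""
    output = forward(weights, features)
    loss = mse_loss(output, reference)
    n = len(output)
    # d(loss)/d(weights[j][i]) = 2 * (output[j] - reference[j]) * features[i] / n
    new_weights = [
        [w - lr * 2.0 * (o - r) * x / n for w, x in zip(row, features)]
        for row, o, r in zip(weights, output, reference)
    ]
    return new_weights, loss

# Toy data: a two-entry library and a made-up reference speech feature.
speech_library = {"hello": [1.0, 0.0], "world": [0.0, 1.0]}
reference_feature = [0.5, -0.5]
weights = [[0.0, 0.0], [0.0, 0.0]]

features = retrieve_pronunciation_features(speech_library, "hello world")
losses = []
for _ in range(50):
    weights, loss = train_step(weights, features, reference_feature)
    losses.append(loss)
```

In this toy setup the recorded loss shrinks at every step, which is all the sketch is meant to show; a real system would presumably use a neural network, an automatic-differentiation framework, and a reference feature extracted from recorded speech.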