METHOD FOR TRAINING MODEL, SPEECH RECOGNITION METHOD, APPARATUS, MEDIUM, AND DEVICE

    Publication No.: US20240105162A1

    Publication date: 2024-03-28

    Application No.: US18257403

    Application date: 2021-11-18

    Inventor: Kang WANG

    IPC classification: G10L15/06 G10L15/00 G10L15/22

    Abstract: The present disclosure relates to a method for training a model, a speech recognition method, an apparatus, a medium, and a device. The method includes: acquiring training data that includes labeled data of at least two languages; ranking the languages in descending order of the quantity of labeled data of each language to obtain a training order; and sequentially acquiring, following the ranking of the languages indicated by the training order, target data corresponding to each language to iteratively train a preset model and obtain a target speech recognition model. The target data for each language is determined from the labeled data of the language(s) ranked from first to the current position in the training order.
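    The ordering and staged training described in the abstract above can be sketched as follows. This is a minimal illustration, not the patented implementation: the names `training_order`, `curriculum_stages`, `train_sequentially`, and the `train_step` callback are hypothetical, and the sketch assumes that the target data for a stage is simply the pooled labeled data of all languages ranked from first to current, which the abstract does not spell out.

```python
from typing import Callable, Dict, Iterator, List, Sequence, Tuple, TypeVar

# Hypothetical sample type: (audio identifier, transcript).
Sample = Tuple[str, str]
Model = TypeVar("Model")


def training_order(labeled: Dict[str, Sequence[Sample]]) -> List[str]:
    """Rank languages by descending quantity of labeled data."""
    return sorted(labeled, key=lambda lang: len(labeled[lang]), reverse=True)


def curriculum_stages(
    labeled: Dict[str, Sequence[Sample]],
) -> Iterator[Tuple[str, List[Sample]]]:
    """Yield (language, target_data) for each stage of the training order.

    Assumption: target data is the union of the labeled data of every
    language ranked from first up to and including the current one.
    """
    pooled: List[Sample] = []
    for lang in training_order(labeled):
        pooled.extend(labeled[lang])
        yield lang, list(pooled)


def train_sequentially(
    model: Model,
    labeled: Dict[str, Sequence[Sample]],
    train_step: Callable[[Model, List[Sample]], Model],
) -> Model:
    """Iteratively train the preset model, one stage per language."""
    for _lang, target_data in curriculum_stages(labeled):
        model = train_step(model, target_data)
    return model
```

    With three languages holding 3, 1, and 2 labeled samples, the stages would see 3, then 5, then 6 samples, so the language with the most data is trained first and lower-resource languages are added on top of it.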

    Electronic device and operating method thereof

    Publication No.: US11942077B2

    Publication date: 2024-03-26

    Application No.: US17949741

    Application date: 2022-09-21

    Abstract: An electronic device for providing a text-to-speech (TTS) service and an operating method thereof are provided. The operating method of the electronic device includes: obtaining target voice data based on an utterance input of a specific speaker; determining a number of learning steps for the target voice data based on data features including a data amount of the target voice data; generating a target model by training, for the determined number of learning steps and with the target voice data as training data, a pre-trained model that was pre-trained to convert text into an audio signal; generating output data by converting input text into an audio signal using the generated target model; and outputting the generated output data.
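    The idea of deriving the number of learning steps from the amount of target voice data can be illustrated with a small sketch. The abstract does not state the actual rule, so everything here is an assumption made for illustration: the function name `num_learning_steps`, the linear scaling, the clamping bounds, and all default values are hypothetical.

```python
def num_learning_steps(
    data_seconds: float,
    steps_per_second: float = 10.0,  # hypothetical scaling factor
    min_steps: int = 100,            # hypothetical lower bound
    max_steps: int = 5000,           # hypothetical upper bound
) -> int:
    """Pick a fine-tuning step count from the amount of target voice data.

    Illustrative rule only: scale linearly with the data amount, then
    clamp to [min_steps, max_steps] so that very small data sets still
    get some training and very large ones do not train indefinitely.
    """
    if data_seconds < 0:
        raise ValueError("data_seconds must be non-negative")
    raw_steps = int(round(data_seconds * steps_per_second))
    return max(min_steps, min(max_steps, raw_steps))
```

    Under these assumed defaults, 60 seconds of speech would map to 600 steps, while 5 seconds would be raised to the 100-step floor. A real system would likely also weigh the other data features the abstract mentions alongside the data amount.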