SPEECH SYNTHESIS DEVICE AND SPEECH SYNTHESIS METHOD

    公开(公告)号:US20230148275A1

    公开(公告)日:2023-05-11

    申请号:US17959050

    申请日:2022-10-03

    CPC classification number: G10L13/047 G10L25/30

    Abstract: Provided is a speech synthetic device capable of outputting a synthetic voice having various speech styles. The speech synthesis device includes a speaker, and a processor to acquire voice feature information through a text and a user input; generate a synthetic voice, by receiving the text and the voice feature information inputs into a decoder supervised-trained to minimize a difference between feature information of a learning text and characteristic information of a learning voice, and output the generated synthetic voice through the speaker.

    SPEECH SYNTHESIS METHOD AND APPARATUS BASED ON EMOTION INFORMATION

    公开(公告)号:US20200035215A1

    公开(公告)日:2020-01-30

    申请号:US16593161

    申请日:2019-10-04

    Abstract: A speech synthesis method and apparatus based on emotion information are disclosed. A speech synthesis method based on emotion information extracts speech synthesis target text from received data and determines whether the received data includes situation explanation information. First metadata corresponding to first emotion information is generated on the basis of the situation explanation information. When the extracted data does not include situation explanation information, second metadata corresponding to second emotion information generated on the basis of semantic analysis and context analysis is generated. One of the first metadata and the second metadata is added to the speech synthesis target text to synthesize speech corresponding to the extracted data. A speech synthesis apparatus of this disclosure may be associated with an artificial intelligence module, drone (unmanned aerial vehicle, UAV), robot, augmented reality (AR) devices, virtual reality (VR) devices, devices related to 5G services, and the like.

    METHOD FOR PROVIDING VOICE SYNTHESIS SERVICE AND SYSTEM THEREFOR

    公开(公告)号:US20250006177A1

    公开(公告)日:2025-01-02

    申请号:US18708348

    申请日:2022-10-20

    Abstract: A method for providing a voice synthesis service and a system therefor are disclosed. A method of providing a voice synthesis service according to at least one of various embodiments of the present disclosure may comprise the steps of: receiving sound source data for synthesizing a voice of a speaker for a plurality of predefined first texts through a voice synthesis service platform that provides a development toolkit; performing tone conversion training on the sound source data of the speaker using a pre-generated tone conversion base model; generating a voice synthesis model for the speaker through the voice conversion training; receiving a second text; generating a voice synthesis model through voice synthesis inference on the basis of the voice synthesis model for the speaker and the second text; and generating a synthesized voice using the voice synthesis model.

Patent Agency Ranking