General speech enhancement method and apparatus using multi-source auxiliary information

    公开(公告)号:US12094484B2

    公开(公告)日:2024-09-17

    申请号:US18360838

    申请日:2023-07-28

    Applicant: ZHEJIANG LAB

    CPC classification number: G10L21/0232 G10L17/02 G10L17/04 G10L25/30

    Abstract: The present disclosure discloses a general speech enhancement method and apparatus using multi-source auxiliary information. The method includes following steps: S1: building a training data set; S2: using the training data set to learn network parameters of a model, and building a speech enhancement model; S3: building a sound source information database in a pre-collection or on-site collection mode; S4: acquiring an input of the speech enhancement model; and S5: taking a noisy original signal as a main input of the speech enhancement model, taking auxiliary sound signals of a target source group and auxiliary sound signals of an interference source group as side inputs of the speech enhancement model for speech enhancement, and obtaining an enhanced speech signal.

Patent Agency Ranking