Patent search ap:("ZHEJIANG LAB") AND inv:"Zhenchuan Zhang" Page 1

1.

Granted invention patent
General speech enhancement method and apparatus using multi-source auxiliary information (in force)

Publication (announcement) No.: US12094484B2

Publication (announcement) date: 2024-09-17

Application No.: US18360838

Filing date: 2023-07-28

Applicant: ZHEJIANG LAB

Inventor: Jingsong Li, Zhenchuan Zhang, Tianshu Zhou, Yu Tian

IPC: G10L21/0232, G10L17/02, G10L17/04, G10L25/30

CPC classification number: G10L21/0232, G10L17/02, G10L17/04, G10L25/30

Abstract: The present disclosure discloses a general speech enhancement method and apparatus using multi-source auxiliary information. The method includes the following steps: S1: building a training data set; S2: using the training data set to learn the network parameters of a model, thereby building a speech enhancement model; S3: building a sound source information database by pre-collection or on-site collection; S4: acquiring the inputs of the speech enhancement model; and S5: taking a noisy original signal as the main input of the speech enhancement model and taking auxiliary sound signals of a target source group and auxiliary sound signals of an interference source group as side inputs of the speech enhancement model for speech enhancement, thereby obtaining an enhanced speech signal.
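
Illustrative note: the data flow of steps S3 to S5 might be sketched roughly as follows. This is not the patented implementation. The abstract does not specify the network, the database layout, or any identifiers, so every name here (SoundSourceDatabase, EnhancementInputs, build_inputs, the source IDs) is a hypothetical stand-in, and the enhancement model itself is deliberately left out. The sketch only shows how a noisy main input could be paired with two groups of auxiliary signals.

```python
from dataclasses import dataclass, field
from typing import Dict, List

Signal = List[float]  # mono audio samples; plain lists keep the sketch dependency-free


@dataclass
class SoundSourceDatabase:
    """Step S3 (assumed layout): auxiliary recordings keyed by a source ID."""
    recordings: Dict[str, Signal] = field(default_factory=dict)

    def register(self, source_id: str, samples: Signal) -> None:
        # Pre-collected or on-site recordings are stored under the same interface.
        self.recordings[source_id] = list(samples)

    def lookup(self, source_ids: List[str]) -> List[Signal]:
        # Unknown IDs are skipped; this is a choice made for the sketch only.
        return [self.recordings[s] for s in source_ids if s in self.recordings]


@dataclass
class EnhancementInputs:
    """Steps S4/S5: one main input plus two groups of side inputs."""
    main: Signal
    target_aux: List[Signal]
    interference_aux: List[Signal]


def build_inputs(db: SoundSourceDatabase, noisy: Signal,
                 target_ids: List[str],
                 interference_ids: List[str]) -> EnhancementInputs:
    # The noisy signal is the main input; auxiliary signals of the target
    # group and the interference group are attached as side inputs.
    return EnhancementInputs(
        main=list(noisy),
        target_aux=db.lookup(target_ids),
        interference_aux=db.lookup(interference_ids),
    )


# Hypothetical usage with made-up sample values.
db = SoundSourceDatabase()
db.register("speaker_a", [0.1, 0.2])
db.register("fan_noise", [0.05, -0.05])
inputs = build_inputs(db, [0.3, 0.1, -0.2],
                      target_ids=["speaker_a"],
                      interference_ids=["fan_noise", "unknown"])
```

A trained model (steps S1 and S2) would consume an object like inputs and return the enhanced signal; how it combines the main and side inputs is not described in the abstract and is not assumed here.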
