MULTI-SPEAKER OVERLAPPING VOICE DETECTION METHOD AND SYSTEM THEREOF

    公开(公告)号:US20250046336A1

    公开(公告)日:2025-02-06

    申请号:US18788092

    申请日:2024-07-29

    Abstract: Disclosed are a multi-speaker overlapping voice detection method and a system. The method includes: obtaining a voice to be detected, and removing silence from the voice to be detected is removed; extracting a feature of the voice to be detected after silence removal to obtain a voice feature of the voice to be detected; and inputting the voice feature into an overlapping voice detection model to obtain an overlapping speaker number corresponding to the voice to be detected output by the overlapping voice detection model. The overlapping voice detection model is obtained by supervised training based on a voice feature of a sample voice and a corresponding label of the overlapping speaker number, extracts an embedding of the voice feature, and classifies the overlapping speaker number to obtain the overlapping speaker number of the voice to be detected based on the extracted speaker embedding.

Patent Agency Ranking