Systems and methods of speaker-independent embedding for identification and verification from audio

发明授权

US11948553B2 Systems and methods of speaker-independent embedding for identification and verification from audio 有权

请登陆查看更多内容

专利标题： Systems and methods of speaker-independent embedding for identification and verification from audio
申请号： US17192464

申请日： 2021-03-04
公开(公告)号： US11948553B2

公开(公告)日： 2024-04-02
发明人: Kedar Phatak , Elie Khoury
申请人： PINDROP SECURITY, INC.
申请人地址： US GA Atlanta
专利权人： Pindrop Security, Inc.
当前专利权人： Pindrop Security, Inc.
当前专利权人地址： US GA Atlanta
代理机构： Foley & Lardner LLP
主分类号： G10L15/06
IPC分类号： G10L15/06 ; G06N3/045 ; G06N20/00 ; G10L15/16 ; G10L25/27

Systems and methods of speaker-independent embedding for identification and verification from audio

摘要：

Embodiments described herein provide for audio processing operations that evaluate characteristics of audio signals that are independent of the speaker's voice. A neural network architecture trains and applies discriminatory neural networks tasked with modeling and classifying speaker-independent characteristics. The task-specific models generate or extract feature vectors from input audio data based on the trained embedding extraction models. The embeddings from the task-specific models are concatenated to form a deep-phoneprint vector for the input audio signal. The DP vector is a low dimensional representation of the each of the speaker-independent characteristics of the audio signal and applied in various downstream operations.

公开/授权文献

US20210280171A1 SYSTEMS AND METHODS OF SPEAKER-INDEPENDENT EMBEDDING FOR IDENTIFICATION AND VERIFICATION FROM AUDIO 公开/授权日：2021-09-09

信息查询

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/06	.创建基准模板；训练语音识别系统，例如对说话者声音特征的适应（G10L15/14优先）