- 专利标题: Systems and methods of speaker-independent embedding for identification and verification from audio
-
申请号: US17192464申请日: 2021-03-04
-
公开(公告)号: US11948553B2公开(公告)日: 2024-04-02
- 发明人: Kedar Phatak , Elie Khoury
- 申请人: PINDROP SECURITY, INC.
- 申请人地址: US GA Atlanta
- 专利权人: Pindrop Security, Inc.
- 当前专利权人: Pindrop Security, Inc.
- 当前专利权人地址: US GA Atlanta
- 代理机构: Foley & Lardner LLP
- 主分类号: G10L15/06
- IPC分类号: G10L15/06 ; G06N3/045 ; G06N20/00 ; G10L15/16 ; G10L25/27
摘要:
Embodiments described herein provide for audio processing operations that evaluate characteristics of audio signals that are independent of the speaker's voice. A neural network architecture trains and applies discriminatory neural networks tasked with modeling and classifying speaker-independent characteristics. The task-specific models generate or extract feature vectors from input audio data based on the trained embedding extraction models. The embeddings from the task-specific models are concatenated to form a deep-phoneprint vector for the input audio signal. The DP vector is a low dimensional representation of the each of the speaker-independent characteristics of the audio signal and applied in various downstream operations.
公开/授权文献
信息查询