Invention Grant
- Patent Title: End-to-end speaker recognition using deep neural network
- Application No.: US15818231
- Application Date: 2017-11-20
- Publication No.: US10381009B2
- Publication Date: 2019-08-13
- Inventor: Elie Khoury, Matthew Garland
- Applicant: PINDROP SECURITY, INC.
- Applicant Address: US GA Atlanta
- Assignee: Pindrop Security, Inc.
- Current Assignee: Pindrop Security, Inc.
- Current Assignee Address: US GA Atlanta
- Agency: Dentons US LLP
- Agent: Eric L. Sophir
- Main IPC: G06N3/04
- IPC: G06N3/04 ; G06N3/08 ; G10L15/16 ; G10L17/02 ; G10L17/04 ; G10L17/08 ; G10L17/18 ; G10L17/22

Abstract:
The present invention is directed to a deep neural network (DNN) having a triplet network architecture, which is suitable to perform speaker recognition. In particular, the DNN includes three feed-forward neural networks, which are trained according to a batch process utilizing a cohort set of negative training samples. After each batch of training samples is processed, the DNN may be trained according to a loss function, e.g., utilizing a cosine measure of similarity between respective samples, along with positive and negative margins, to provide a robust representation of voiceprints.
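The loss described in the abstract can be illustrated with a minimal sketch. This is not the patent's claimed formula: the hinge form, the choice of the hardest (most similar) cohort negative, the function names `cosine` and `triplet_cohort_loss`, and the margin values 0.8 and 0.5 are all assumptions made for illustration. The embedding vectors stand in for the outputs of the three feed-forward networks.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def triplet_cohort_loss(anchor, positive, cohort, pos_margin=0.8, neg_margin=0.5):
    """Illustrative hinge-style loss with separate positive and negative margins.

    ASSUMPTION: this form and the default margins are hypothetical, not taken
    from the patent. `cohort` is a list of negative-speaker embeddings.
    """
    # Similarity between the anchor and the same-speaker (positive) sample.
    s_pos = cosine(anchor, positive)
    # Similarity to the hardest negative: the cohort sample closest to the anchor.
    s_neg = max(cosine(anchor, negative) for negative in cohort)
    # Penalize a positive pair that falls below pos_margin and a negative
    # pair that rises above neg_margin.
    return max(0.0, pos_margin - s_pos) + max(0.0, s_neg - neg_margin)


# Well-separated case: identical positive, dissimilar cohort, so no penalty.
easy = triplet_cohort_loss([1.0, 0.0], [1.0, 0.0], [[0.0, 1.0], [-1.0, 0.0]])
# Confused case: orthogonal positive, cohort sample identical to the anchor.
hard = triplet_cohort_loss([1.0, 0.0], [0.0, 1.0], [[1.0, 0.0]])
```

In this sketch the loss is zero when the positive pair is similar enough and every cohort negative is dissimilar enough, and it grows as either condition is violated. Using two margins, rather than a single margin on the difference, constrains both similarities individually.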
Public/Granted literature
- US20180075849A1 END-TO-END SPEAKER RECOGNITION USING DEEP NEURAL NETWORK Public/Granted day: 2018-03-15