- Patent title: System and method for speaker change detection
- Application number: US15727498
- Filing date: 2017-10-06
- Publication number: US10535000B2
- Publication date: 2020-01-14
- Inventors: Zhenhao Ge, Ananth Nagaraja Iyer, Srinath Cheluvaraja, Aravind Ganapathiraju
- Applicant: INTERACTIVE INTELLIGENCE GROUP, INC.
- Main classification: G06N3/08
- IPC classifications: G06N3/08; G10L17/04; G10L17/00; G10L17/18; G10L15/02
Abstract:
A method for training a neural network of a neural network based speaker classifier for use in speaker change detection. The method comprises: a) preprocessing input speech data; b) extracting a plurality of feature frames from the preprocessed input speech data; c) normalizing the extracted feature frames of each speaker within the preprocessed input speech data with each speaker's mean and variance; d) concatenating the normalized feature frames to form overlapped longer frames having a frame length and a hop size; e) inputting the overlapped longer frames to the neural network based speaker classifier; and f) training the neural network through forward-backward propagation.
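Steps c) and d) of the abstract can be illustrated with a minimal NumPy sketch. This is an illustration only, not the patented implementation: the function names `normalize_per_speaker` and `concatenate_frames`, the small `eps` stabilizer, and the choice to flatten each window of frames into a single vector are all assumptions introduced here; the abstract does not specify feature dimensions, frame length, or hop size.

```python
import numpy as np


def normalize_per_speaker(features, speaker_ids, eps=1e-8):
    """Normalize each speaker's feature frames with that speaker's own mean and variance.

    features:    array of shape (n_frames, n_dims)
    speaker_ids: array of shape (n_frames,), one speaker label per frame
    eps:         hypothetical stabilizer to avoid division by zero
    """
    features = np.asarray(features, dtype=np.float64)
    speaker_ids = np.asarray(speaker_ids)
    normalized = np.empty_like(features)
    for speaker in np.unique(speaker_ids):
        mask = speaker_ids == speaker
        mean = features[mask].mean(axis=0)
        std = features[mask].std(axis=0)
        normalized[mask] = (features[mask] - mean) / (std + eps)
    return normalized


def concatenate_frames(frames, frame_length, hop_size):
    """Build overlapped longer frames by stacking `frame_length` consecutive
    frames into one vector and advancing the window by `hop_size` frames."""
    frames = np.asarray(frames)
    n_frames, n_dims = frames.shape
    if n_frames < frame_length:
        # Not enough frames for even one longer frame.
        return np.empty((0, frame_length * n_dims), dtype=frames.dtype)
    starts = range(0, n_frames - frame_length + 1, hop_size)
    return np.stack([frames[s:s + frame_length].reshape(-1) for s in starts])
```

With a hop size smaller than the frame length, consecutive longer frames share some of their underlying feature frames, which is the overlap the abstract refers to. In this sketch the concatenation would be applied to one speaker's normalized frames at a time, so that a longer frame does not mix speakers; that choice is also an assumption.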
Published/granted documents:
- US20180039888A1 SYSTEM AND METHOD FOR SPEAKER CHANGE DETECTION (publication/grant date: 2018-02-08)