- Patent title: Neural network voice activity detection employing running range normalization
- Application number: US14866824
- Filing date: 2015-09-25
- Publication number: US09953661B2
- Publication date: 2018-04-24
- Inventor: Earl Vickers
- Applicant: Cirrus Logic Inc.
- Applicant address: US TX Austin
- Assignee: CIRRUS LOGIC INC.
- Current assignee: CIRRUS LOGIC INC.
- Current assignee address: US TX Austin
- Agency: Dorius Law P.C.
- Agent: Kirk Dorius
- Main classification: G10L21/02
- IPC classifications: G10L21/02; G10L21/0264; G10L21/0224; G10L25/60; G10L25/84; G10L25/78; G10L25/30; G10L15/06
Abstract:
A “running range normalization” method includes computing running estimates of the range of values of features useful for voice activity detection (VAD) and normalizing the features by mapping them to a desired range. Running range normalization includes computation of running estimates of the minimum and maximum values of VAD features and normalizing the feature values by mapping the original range to a desired range. Smoothing coefficients are optionally selected to directionally bias a rate of change of at least one of the running estimates of the minimum and maximum values. The normalized VAD feature parameters are used to train a machine learning algorithm to detect voice activity and to use the trained machine learning algorithm to isolate or enhance the speech component of the audio data.
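The normalization step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the class name `RunningRangeNormalizer`, the coefficient values `fast` and `slow`, the target range of [-1, 1], and the clipping step are all assumptions chosen for illustration. The running minimum and maximum are tracked with first-order smoothing, and the smoothing coefficient depends on the direction of change, which is one possible reading of "directionally bias a rate of change".

```python
class RunningRangeNormalizer:
    """Hypothetical sketch: track a running min/max of one VAD feature
    and map each new value into a desired range [lo, hi]."""

    def __init__(self, lo=-1.0, hi=1.0, fast=0.5, slow=0.001, eps=1e-9):
        # fast/slow are assumed smoothing coefficients with 0 <= slow <= fast <= 1.
        self.lo, self.hi = lo, hi
        self.fast, self.slow = fast, slow
        self.eps = eps
        self.min_est = None  # running estimate of the feature minimum
        self.max_est = None  # running estimate of the feature maximum

    def update(self, x):
        """Update the running range with feature value x and return x normalized."""
        if self.min_est is None:
            # First frame: both estimates start at the observed value.
            self.min_est = self.max_est = x
        else:
            # The minimum follows new lows quickly and drifts upward slowly.
            a_min = self.fast if x < self.min_est else self.slow
            self.min_est += a_min * (x - self.min_est)
            # The maximum follows new highs quickly and drifts downward slowly.
            a_max = self.fast if x > self.max_est else self.slow
            self.max_est += a_max * (x - self.max_est)

        span = self.max_est - self.min_est
        if span < self.eps:
            # Range not yet established: return the middle of the target range.
            return 0.5 * (self.lo + self.hi)

        # Map [min_est, max_est] onto [lo, hi]; clip values outside the estimate.
        unit = (x - self.min_est) / span
        unit = min(1.0, max(0.0, unit))
        return self.lo + unit * (self.hi - self.lo)


# Usage: normalize a short stream of made-up feature values, one per frame.
normalizer = RunningRangeNormalizer()
normalized = [normalizer.update(v) for v in [0.0, 10.0, 0.0, 10.0, 5.0]]
```

In a full system, one such normalizer would presumably run per feature, and the normalized values would be fed to the detector during both training and inference, so that the model sees inputs on a consistent scale regardless of recording level.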