- 专利标题: Native language identification with time delay deep neural networks trained separately on native and non-native english corpora
-
申请号: US16221980申请日: 2018-12-17
-
公开(公告)号: US10783873B1公开(公告)日: 2020-09-22
- 发明人: Yao Qian , Keelan Evanini , Patrick Lange , Robert A. Pugh , Rutuja Ubale
- 申请人: Educational Testing Service
- 申请人地址: US NJ Princeton
- 专利权人: Educational Testing Service
- 当前专利权人: Educational Testing Service
- 当前专利权人地址: US NJ Princeton
- 代理机构: Jones Day
- 主分类号: G10L15/00
- IPC分类号: G10L15/00 ; G06N3/04 ; G10L15/16 ; G10L25/78 ; G06N3/08 ; G09B19/04
摘要:
Systems and methods for identifying a person's native language, are presented. A native language identification system, comprising a plurality of artificial neural networks, such as time delay deep neural networks, is provided. Respective artificial neural networks of the plurality of artificial neural networks are trained as universal background models, using separate native language and non-native language corpora. The artificial neural networks may be used to perform voice activity detection and to extract sufficient statistics from the respective language corpora. The artificial neural networks may use the sufficient statistics to estimate respective T-matrices, which may in turn be used to extract respective i-vectors. The artificial neural networks may use i-vectors to generate a multilayer perceptron model, which may be used to identify a person's native language, based on an utterance by the person in his or her non-native language.
信息查询