发明申请
- 专利标题: ACCOUSTIC MODEL LEARNING APPARATUS, ACCOUSTIC MODEL LEARNING METHOD, AND PROGRAM
-
申请号: US17428274申请日: 2020-01-23
-
公开(公告)号: US20220122626A1公开(公告)日: 2022-04-21
- 发明人: Kiyoaki MATSUI , Takafumi MORIYA , Takaaki FUKUTOMI , Yusuke SHINOHARA , Yoshikazu YAMAGUCHI , Manabu OKAMOTO
- 申请人: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
- 申请人地址: JP Tokyo
- 专利权人: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
- 当前专利权人: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
- 当前专利权人地址: JP Tokyo
- 优先权: JP2019-018478 20190205
- 国际申请: PCT/JP2020/002207 WO 20200123
- 主分类号: G10L25/30
- IPC分类号: G10L25/30 ; G10L25/78 ; G06N3/08
摘要:
Provided is a technology of learning an acoustic model with a certain degree of accuracy of sound recognition within a short calculation period. An acoustic model learning device includes: a loss calculation unit configured to calculate a loss of sound data which is an element of the corpus Cj for learning by using an acoustic model; a curriculum corpus generation unit configured to generate a curriculum corpus being a union of subsets of the corpuses Cj for learning, the corpuses Cj including, as elements, sound data for which the loss falls within a predetermined range indicating a small value; an acoustic model update unit configured to update the acoustic model by using the curriculum corpus; and a first end condition determination unit configured to output the acoustic model when a predetermined end condition is satisfied, or transfer execution control to the loss calculation unit when the predetermined end condition is not satisfied, and the acoustic model update unit is configured to update the acoustic model by giving a weight to a gradient for sound data which is an element of the curriculum corpus using such a weight for sound data as to have a smaller value as a number of times the sound data has been selected as an element of the curriculum corpus becomes larger.
公开/授权文献
信息查询