ACCOUSTIC MODEL LEARNING APPARATUS, ACCOUSTIC MODEL LEARNING METHOD, AND PROGRAM

发明申请

US20220122626A1 ACCOUSTIC MODEL LEARNING APPARATUS, ACCOUSTIC MODEL LEARNING METHOD, AND PROGRAM 有权

请登陆查看更多内容

专利标题： ACCOUSTIC MODEL LEARNING APPARATUS, ACCOUSTIC MODEL LEARNING METHOD, AND PROGRAM
申请号： US17428274

申请日： 2020-01-23
公开(公告)号： US20220122626A1

公开(公告)日： 2022-04-21
发明人: Kiyoaki MATSUI , Takafumi MORIYA , Takaaki FUKUTOMI , Yusuke SHINOHARA , Yoshikazu YAMAGUCHI , Manabu OKAMOTO
申请人： NIPPON TELEGRAPH AND TELEPHONE CORPORATION
申请人地址： JP Tokyo
专利权人： NIPPON TELEGRAPH AND TELEPHONE CORPORATION
当前专利权人： NIPPON TELEGRAPH AND TELEPHONE CORPORATION
当前专利权人地址： JP Tokyo
优先权： JP2019-018478 20190205
国际申请： PCT/JP2020/002207 WO 20200123
主分类号： G10L25/30
IPC分类号： G10L25/30 ; G10L25/78 ; G06N3/08

ACCOUSTIC MODEL LEARNING APPARATUS, ACCOUSTIC MODEL LEARNING METHOD, AND PROGRAM

摘要：

Provided is a technology of learning an acoustic model with a certain degree of accuracy of sound recognition within a short calculation period. An acoustic model learning device includes: a loss calculation unit configured to calculate a loss of sound data which is an element of the corpus Cj for learning by using an acoustic model; a curriculum corpus generation unit configured to generate a curriculum corpus being a union of subsets of the corpuses Cj for learning, the corpuses Cj including, as elements, sound data for which the loss falls within a predetermined range indicating a small value; an acoustic model update unit configured to update the acoustic model by using the curriculum corpus; and a first end condition determination unit configured to output the acoustic model when a predetermined end condition is satisfied, or transfer execution control to the loss calculation unit when the predetermined end condition is not satisfied, and the acoustic model update unit is configured to update the acoustic model by giving a weight to a gradient for sound data which is an element of the curriculum corpus using such a weight for sound data as to have a smaller value as a number of times the sound data has been selected as an element of the curriculum corpus becomes larger.

公开/授权文献

US12033658B2 Acoustic model learning apparatus, acoustic model learning method, and program 公开/授权日：2024-07-09

信息查询

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L25/00	不限于组G10L 15/00-G10L 21/00的语言或者声音分析技术(当利用语音检测器来感知一些信号特殊特征的基于半导体的静噪放大器，如无信号时的感知入H03G3/34)
G10L25/27	.以分析方法为特征的
G10L25/30	..利用神经网络