Determining input data for speech processing

Granted invention patent

US11417317B2 Determining input data for speech processing (in force)

Patent title: Determining input data for speech processing
Application number: US16796506

Filing date: 2020-02-20
Publication number: US11417317B2

Publication date: 2022-08-16
Inventors: Christopher Larson, Tarek Aziz Lahlou, Diana Mingels, Zachary Kulis, Erik T. Mueller
Applicant: Capital One Services, LLC
Applicant address: US VA McLean
Patentee: Capital One Services, LLC
Current patentee: Capital One Services, LLC
Current patentee address: US VA McLean
Agency: Banner & Witcoff, Ltd.
Main classification: G10L15/06
IPC classifications: G10L15/06; G10L15/26; G06N3/08; G06N20/00; G10L15/07

Abstract:

Aspects described herein may relate to the determination of data that is indicative of a greater range of speech properties than input text data. The determined data may be used as input to one or more speech processing tasks, such as model training, model validation, model testing, or classification. For example, after a model is trained based on the determined data, the model's performance may exhibit more resilience to a wider range of speech properties. The determined data may include one or more modified versions of the input text data. The one or more modified versions may be associated with one or more speakers or accents and/or may be associated with one or more levels of semantic similarity in relation to the input text data. The one or more modified versions may be determined based on one or more machine learning algorithms.
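
As a rough illustration of the idea of "modified versions of the input text data", the sketch below expands one input sentence into variants and tags each with a crude distance level. It is not the patented method: the abstract says the variants are determined by machine learning algorithms, whereas this toy uses a fixed lookup table. The names `SYNONYMS`, `modified_versions`, and `augment` are hypothetical, and the substitution count is only an assumed stand-in for a level of semantic similarity.

```python
from itertools import combinations

# Hypothetical synonym table (an assumption for illustration only).
# A real system would derive variations from learned models, not a fixed dict.
SYNONYMS = {
    "check": "view",
    "balance": "funds",
    "account": "profile",
}


def modified_versions(text, max_substitutions=2):
    """Return (variant, substitutions) pairs for the input text.

    The number of substituted words is used as a crude proxy for how far
    the variant has drifted from the original wording.
    """
    words = text.split()
    # Positions of words that have an entry in the synonym table.
    replaceable = [i for i, w in enumerate(words) if w.lower() in SYNONYMS]
    variants = []
    for k in range(1, max_substitutions + 1):
        # Try every way of choosing k replaceable positions.
        for positions in combinations(replaceable, k):
            new_words = list(words)
            for i in positions:
                new_words[i] = SYNONYMS[new_words[i].lower()]
            variants.append((" ".join(new_words), k))
    return variants


def augment(text, max_substitutions=2):
    """Return the original text followed by its modified versions."""
    return [text] + [v for v, _ in modified_versions(text, max_substitutions)]


if __name__ == "__main__":
    for variant, level in modified_versions("check my balance"):
        print(level, variant)
```

The output of `augment` could then be fed to a training, validation, or testing step in place of the single original sentence, which is the use the abstract describes for the determined data.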

Publication/grant documents

US20200320982A1 Determining Input Data for Speech Processing Publication/grant date: 2020-10-08

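IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/06	.Creation of reference templates; training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice (G10L15/14 takes precedence)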