SYSTEMS AND METHODS FOR TRAINING VOICE QUERY MODELS

Invention Application

US20240428779A1 SYSTEMS AND METHODS FOR TRAINING VOICE QUERY MODELS (Active)

Patent Title: SYSTEMS AND METHODS FOR TRAINING VOICE QUERY MODELS
Application No.: US18823383

Application Date: 2024-09-03
Publication No.: US20240428779A1

Publication Date: 2024-12-26
Inventor: WENYAN LI, FERHAN TURE, JOSE CASILLAS, GEORGE THOMAS DES JARDINS
Applicant: Comcast Cable Communications, LLC
Applicant Address: US PA Philadelphia
Assignee: Comcast Cable Communications, LLC
Current Assignee: Comcast Cable Communications, LLC
Current Assignee Address: US PA Philadelphia
Main IPC: G10L15/06
IPC: G10L15/06; G06F40/169; G06N20/00; G10L15/19; G10L15/22; G10L25/63

Abstract:

Methods for automatically evaluating automatic speech recognition (ASR) outputs and providing annotations, including corrections, on the transcriptions in order to improve recognition may be based on an analysis of sessions of user voice queries, utilizing time-ordered ASR transcriptions of user voice queries (i.e., user utterances). This utterance-based approach may involve extracting both session-level and query-level characteristics from voice query sessions and identifying patterns of query reformulation in order to detect erroneous transcriptions and automatically determine an appropriate correction. Alternatively, or in addition, ASR outputs may be evaluated based on user behavior. The outcomes may be classified as positive or negative. An ASR transcription may be labeled using the description of the outcome. The labeled transcription may be used as training data to train a model to output improved transcriptions of voice queries.
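
The session-based labeling described in the abstract can be illustrated with a minimal Python sketch. It is not the method claimed in the application: the `Query` record, the `label_session` function, the string-similarity test from the standard-library `difflib` module, and the `max_gap` and `min_sim` thresholds are all assumptions chosen for illustration. The sketch treats a negative-outcome query that is followed shortly afterwards by a similar, positive-outcome query as a likely reformulation, and proposes the later transcription as a candidate correction.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass
class Query:
    timestamp: float     # seconds since the session started
    transcription: str   # ASR output for the utterance
    outcome: str         # "positive" or "negative" (hypothetical behavior signal)


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1]; a stand-in for a real reformulation detector."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def label_session(session, max_gap=30.0, min_sim=0.6):
    """Return (transcription, label, candidate_correction) triples for one session.

    max_gap and min_sim are illustrative thresholds, not values from the source.
    """
    queries = sorted(session, key=lambda q: q.timestamp)  # time-ordered transcriptions
    labeled = []
    for i, query in enumerate(queries):
        correction = None
        if query.outcome == "negative":
            # Look for a later, similar but not identical query with a positive outcome.
            for later in queries[i + 1:]:
                if later.timestamp - query.timestamp > max_gap:
                    break
                sim = similarity(query.transcription, later.transcription)
                if later.outcome == "positive" and min_sim <= sim < 1.0:
                    correction = later.transcription
                    break
        labeled.append((query.transcription, query.outcome, correction))
    return labeled


# Made-up example session: a misrecognized query, its reformulation, and an unrelated query.
example_session = [
    Query(0.0, "watch pepper pig", "negative"),
    Query(8.0, "watch peppa pig", "positive"),
    Query(120.0, "weather", "positive"),
]

for row in label_session(example_session):
    print(row)
```

In this made-up session, the first transcription is labeled negative and paired with the second transcription as its candidate correction, while the other two queries receive no correction. Triples of this kind are one possible shape for the labeled training data that the abstract mentions.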

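IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/06	.Creation of reference templates; training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice (G10L15/14 takes precedence)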