Invention Application
- Patent Title: SYSTEMS AND METHODS FOR TRAINING VOICE QUERY MODELS
- Application No.: US18823383
- Application Date: 2024-09-03
- Publication No.: US20240428779A1
- Publication Date: 2024-12-26
- Inventor: WENYAN LI, FERHAN TURE, JOSE CASILLAS, GEORGE THOMAS DES JARDINS
- Applicant: Comcast Cable Communications, LLC
- Applicant Address: US PA Philadelphia
- Assignee: Comcast Cable Communications, LLC
- Current Assignee: Comcast Cable Communications, LLC
- Current Assignee Address: US PA Philadelphia
- Main IPC: G10L15/06
- IPC: G10L15/06 ; G06F40/169 ; G06N20/00 ; G10L15/19 ; G10L15/22 ; G10L25/63

Abstract:
Methods for automatically evaluating ASR outputs and annotating the transcriptions, including with corrections, in order to improve recognition may be based on an analysis of sessions of user voice queries, utilizing time-ordered ASR transcriptions of the user voice queries (i.e., user utterances). This utterance-based approach may involve extracting both session-level and query-level characteristics from a voice query session and identifying patterns of query reformulation in order to detect erroneous transcriptions and automatically determine an appropriate correction. Alternatively, or in addition, ASR outputs may be evaluated based on user behavior. The resulting outcomes may be classified as positive or negative. An ASR transcription may be labeled using the description of the outcome. The labeled transcription may be used as training data to train a model to output improved transcriptions of voice queries.
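The reformulation-based labeling described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed method: the abstract does not specify how reformulations are detected, so the string-similarity measure (`difflib.SequenceMatcher`), the `max_gap` and `min_similarity` thresholds, and the `Query`, `LabeledQuery`, and `label_session` names are all assumptions made for illustration. A query is treated as a suspected misrecognition when the user repeats a similar but not identical query shortly afterwards, and the last query in that chain is taken as the candidate correction. Evaluation based on user behavior is not modeled here.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List, Optional


@dataclass
class Query:
    timestamp: float      # seconds since the session started (assumed unit)
    transcription: str    # ASR output for one user utterance


@dataclass
class LabeledQuery:
    transcription: str
    label: str                    # "positive" or "negative"
    correction: Optional[str]     # candidate correction for negative labels


def label_session(session: List[Query],
                  max_gap: float = 30.0,
                  min_similarity: float = 0.6) -> List[LabeledQuery]:
    """Label each query in a time-ordered session.

    Hypothetical heuristic: a query is "negative" if the next query follows
    within max_gap seconds and is similar but not identical to it.
    """
    labeled: List[Optional[LabeledQuery]] = [None] * len(session)
    # Walk backwards so a chain A -> B -> C assigns C as the correction for A.
    for i in range(len(session) - 1, -1, -1):
        query = session[i]
        correction = None
        if i + 1 < len(session):
            nxt = session[i + 1]
            gap = nxt.timestamp - query.timestamp
            similarity = SequenceMatcher(
                None, query.transcription.lower(), nxt.transcription.lower()
            ).ratio()
            if gap <= max_gap and min_similarity <= similarity < 1.0:
                # Reuse the successor's correction if it was reformulated too.
                correction = labeled[i + 1].correction or nxt.transcription
        label = "negative" if correction is not None else "positive"
        labeled[i] = LabeledQuery(query.transcription, label, correction)
    return labeled


if __name__ == "__main__":
    example = [
        Query(0.0, "watch hairy potter"),
        Query(8.0, "watch harry potter"),
        Query(120.0, "weather"),
    ]
    for item in label_session(example):
        print(item)
```

The `(transcription, correction)` pairs carrying a negative label are the kind of labeled data the abstract says may be used to train a model; a real system would presumably combine this with the session-level and query-level characteristics the abstract mentions rather than rely on string similarity alone.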