-
公开(公告)号:US12136413B1
公开(公告)日:2024-11-05
申请号:US17710762
申请日:2022-03-31
Applicant: Amazon Technologies, Inc.
Inventor: Saket Dingliwal , Sravan Babu Bodapati , Katrin Kirchhoff , Ankur Gandhe , Anubhav Mishra , John Baker , Ashish Vishwanath Shenoy , Ravi Teja Gadde
IPC: G06F40/40 , G10L15/06 , G10L15/183
Abstract: Domain-specific parameters may be used for tuning speech processing. A pre-trained transformer-based language model may train domain-specific parameters using domain-specific unlabeled text data. This domain-specific parameters can then be appended to candidate texts produced by a speech model on received speech data and input to the transformer-based language model to score the candidate texts. The scores of the candidate texts determined using the pre-trained transformer-based language model can then be used to select a candidate text for further speech processing.
-
公开(公告)号:US11688394B1
公开(公告)日:2023-06-27
申请号:US17007810
申请日:2020-08-31
Applicant: Amazon Technologies, Inc.
Inventor: Denis Filimonov , Ravi Teja Gadde , Ariya Rastrow
IPC: G10L15/187 , G10L15/16 , G06N3/049 , G10L15/02
CPC classification number: G10L15/187 , G06N3/049 , G10L15/16 , G10L2015/025
Abstract: This disclosure proposes systems and methods for leveraging entity-related language models in speech processing. A system can receive audio data corresponding to an utterance and perform automatic speech recognition (ASR) on a first portion of the audio data using a general language model. Based on the results, the system may identify a specific language model for processing a second portion of the audio data. The specific language model may include entities belonging to a common subject or class. The specific language model may, in some cases, provide better results than the general language model. While the general language model may describe a whole sentence, the specific language model may describe only a portion of a sentence. Thus, a top-level model may “activate” the specific language model when it may provide useful results. The resulting data may include results from both the general language model and the specific language model.
-