-
Publication No.: US11763819B1
Publication date: 2023-09-19
Application No.: US17350451
Filing date: 2021-06-17
Applicant: Amazon Technologies, Inc.
Inventor: Benjamin Charles Eagan, Maciej Makowski, Zack Shahaf Matorin
IPC: G10L19/018, G10L15/26, H04L9/40, G10L15/22
CPC classification number: G10L15/26, G10L15/22, G10L19/018, H04L63/0428, G10L2015/223
Abstract: A speech interface device is configured to defer encryption of audio data on-device until a time when the encryption operation is not competing with other computationally-intensive operations for responding to the audio data. For example, audio data based on sound captured in an environment of the speech interface device can be stored in volatile memory of the speech interface device, without encrypting it, until a set of processing operations (e.g., ASR processing, NLU processing, audio event processing, etc.) performed based on the audio data have stopped. Based on a determination that these processing operations for responding to the audio data have stopped, the logic may encrypt the audio data to generate encrypted data, and the encrypted data can be stored in non-volatile memory of the speech interface device for uploading to a remote system when a connection is available.
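The deferral logic in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class `DeferredEncryptor`, its method names, and the dictionaries standing in for volatile and non-volatile memory are all hypothetical, and `toy_encrypt` is a placeholder XOR routine used only so the example runs without third-party libraries (it is not secure).

```python
from typing import Callable, Dict, Iterable, List, Set


def toy_encrypt(data: bytes, key: bytes = b"demo-key") -> bytes:
    """Placeholder XOR cipher for illustration only. NOT real encryption."""
    return bytes(b ^ key[i % len(key)] for i, b in enumerate(data))


class DeferredEncryptor:
    """Holds plaintext audio in 'volatile' storage until processing stops."""

    def __init__(self, encrypt: Callable[[bytes], bytes] = toy_encrypt) -> None:
        self.encrypt = encrypt
        self.volatile: Dict[str, bytes] = {}      # stands in for RAM
        self.non_volatile: Dict[str, bytes] = {}  # stands in for flash/disk
        self.active_ops: Dict[str, Set[str]] = {}

    def capture(self, utterance_id: str, audio: bytes,
                operations: Iterable[str]) -> None:
        # Store audio unencrypted while ASR/NLU/etc. are still running.
        self.volatile[utterance_id] = audio
        self.active_ops[utterance_id] = set(operations)

    def operation_stopped(self, utterance_id: str, operation: str) -> None:
        pending = self.active_ops[utterance_id]
        pending.discard(operation)
        if not pending:
            # All processing operations have stopped: encrypt now.
            audio = self.volatile.pop(utterance_id)
            self.non_volatile[utterance_id] = self.encrypt(audio)
            del self.active_ops[utterance_id]

    def upload_pending(self, send: Callable[[str, bytes], None]) -> List[str]:
        # Called when a connection to the remote system is available.
        uploaded = []
        for utterance_id in list(self.non_volatile):
            send(utterance_id, self.non_volatile.pop(utterance_id))
            uploaded.append(utterance_id)
        return uploaded
```

In this sketch, encryption is triggered by the last processing operation reporting that it has stopped, so the encryption cost never overlaps with the work of responding to the audio.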
-
Publication No.: US11328713B1
Publication date: 2022-05-10
Application No.: US16876958
Filing date: 2020-05-18
Applicant: Amazon Technologies, Inc.
IPC: G10L15/22, G10L15/26, G10L15/187, G10L15/193, G10L15/02, G06F40/295
Abstract: Techniques for performing spoken language understanding (SLU) processing locally on a user device are described. When a user device is about to present content on a display, the user device may generate one or more SLU models (e.g., one or more ASR models and/or one or more NLU models) specific to the content to be presented. When the user device receives a spoken input while the content is being presented on the display, the user device performs SLU processing on the spoken input using the display-specific SLU model(s).
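As a rough illustration of the idea, the sketch below builds a tiny display-specific interpreter from the items about to be shown and uses it to resolve a transcribed utterance. It assumes the spoken input has already been converted to text; the class `DisplayModel`, the `SelectItem` intent name, and the simple phrase matching are hypothetical stand-ins for the ASR and NLU models the abstract refers to.

```python
import re
from typing import Dict, List, Optional


def normalize(text: str) -> str:
    """Lowercase and drop punctuation so spoken and displayed forms compare."""
    return re.sub(r"[^a-z0-9 ]+", "", text.lower()).strip()


class DisplayModel:
    """Hypothetical per-screen model, generated just before content is shown."""

    def __init__(self, items: List[str]) -> None:
        self.by_title: Dict[str, str] = {normalize(t): t for t in items}
        self.by_position: Dict[int, str] = dict(enumerate(items, start=1))

    def interpret(self, utterance: str) -> Optional[Dict[str, str]]:
        text = normalize(utterance)
        # Ordinal references such as "select number 2".
        match = re.fullmatch(
            r"(?:select|play|open) (?:number |item )?(\d+)", text)
        if match and int(match.group(1)) in self.by_position:
            item = self.by_position[int(match.group(1))]
            return {"intent": "SelectItem", "item": item}
        # References by on-screen title, e.g. "play blade runner".
        for phrase, title in self.by_title.items():
            if phrase and phrase in text:
                return {"intent": "SelectItem", "item": title}
        return None  # Not answerable from the displayed content.


# Build the model when the screen is about to be presented.
model = DisplayModel(["The Matrix", "Blade Runner"])
result = model.interpret("play Blade Runner")
```

Because the model only knows the current screen's items, an utterance unrelated to the display returns `None` and would need to be handled by some other component.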
-
Publication No.: US11373645B1
Publication date: 2022-06-28
Application No.: US16011238
Filing date: 2018-06-18
Applicant: Amazon Technologies, Inc.
Inventor: Zach Mathew, Benjamin Charles Eagan, Fabian Andreas Bumberger, Alejandro Jose Ramirez Sanabria
IPC: G10L15/22, G10L15/18, G10L15/30, G10L15/19, G06F40/295
Abstract: A speech interface device is configured to update personalized data in local memory for responding to user speech. The speech interface device may receive compiled personalized data from a remote system, or raw personalized data from local devices in the environment of the speech interface device, the raw personalized data being compiled locally on the speech interface device. The compiled personalized data is received by an artifact manager of the speech interface device for storage in the memory of the speech interface device. A local speech processing component of the speech interface device may use the compiled personalized data when processing user speech in order to understand a spoken form of a word or phrase that is associated with a user. This allows for responding to personal user speech, even when the speech interface device is unable to communicate with a remote speech processing system over a wide area network.
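The two update paths described here (compiled data received from a remote system, raw data compiled locally) might be sketched as follows. Only the term "artifact manager" comes from the abstract; the `ArtifactManager` interface, the `spoken_form` normalization, and the dictionary used as the compiled format are assumptions made for illustration.

```python
import re
from typing import Dict, Iterable, Mapping, Optional


def spoken_form(text: str) -> str:
    """Approximate how a name is spoken: lowercase, no punctuation."""
    cleaned = re.sub(r"[^a-z0-9 ]+", "", text.lower())
    return " ".join(cleaned.split())


class ArtifactManager:
    """Hypothetical store for compiled personalized data in local memory."""

    def __init__(self) -> None:
        self._artifacts: Dict[str, Dict[str, str]] = {}

    def store_compiled(self, domain: str, compiled: Mapping[str, str]) -> None:
        # Path 1: data arrives already compiled from the remote system.
        self._artifacts[domain] = dict(compiled)

    def compile_and_store(self, domain: str, raw: Iterable[str]) -> None:
        # Path 2: raw data from local devices is compiled on-device.
        self.store_compiled(domain, {spoken_form(e): e for e in raw})

    def lookup(self, domain: str, spoken: str) -> Optional[str]:
        # Used by local speech processing to resolve a personal word/phrase.
        return self._artifacts.get(domain, {}).get(spoken_form(spoken))


manager = ArtifactManager()
manager.store_compiled("contacts", {"mom": "Jane Doe"})
manager.compile_and_store("devices", ["Kitchen Light", "Bob's Lamp"])
resolved = manager.lookup("devices", "bobs lamp")
```

Since every lookup is served from local memory, this kind of resolution keeps working when the wide area network connection is unavailable.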
-
Publication No.: US11043222B1
Publication date: 2021-06-22
Application No.: US16208474
Filing date: 2018-12-03
Applicant: Amazon Technologies, Inc.
Inventor: Benjamin Charles Eagan, Maciej Makowski, Zack Shahaf Matorin
IPC: G10L19/018, G10L15/26, H04L29/06, G10L15/22
Abstract: A speech interface device is configured to defer encryption of audio data on-device until a time when the encryption operation is not competing with other computationally-intensive operations for responding to the audio data. For example, audio data based on sound captured in an environment of the speech interface device can be stored in volatile memory of the speech interface device, without encrypting it, until a set of processing operations (e.g., ASR processing, NLU processing, audio event processing, etc.) performed based on the audio data have stopped. Based on a determination that these processing operations for responding to the audio data have stopped, the logic may encrypt the audio data to generate encrypted data, and the encrypted data can be stored in non-volatile memory of the speech interface device for uploading to a remote system when a connection is available.