-
公开(公告)号:US12100396B2
公开(公告)日:2024-09-24
申请号:US17583672
申请日:2022-01-25
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
IPC: G10L15/22 , G06F3/16 , G06F40/35 , G10L15/30 , H04L51/10 , H04L51/224 , H04L67/306 , G06V40/10 , G10L13/00 , G10L15/08
CPC classification number: G10L15/22 , G06F3/167 , G06F40/35 , G10L15/30 , H04L51/10 , H04L51/224 , H04L67/306 , G06V40/10 , G10L13/00 , G10L15/08 , G10L2015/088 , G10L2015/223
Abstract: Systems, methods, and devices for outputting indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to recipient information. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server determines a second speech-controlled device of the recipient and sends a signal to the recipient's second speech-controlled device representing a message is forthcoming. The recipient's second speech-controlled device outputs and indication representing a message is forthcoming.
-
公开(公告)号:US11264030B2
公开(公告)日:2022-03-01
申请号:US16732943
申请日:2020-01-02
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
IPC: G10L21/00 , G10L15/22 , H04L51/224 , H04L51/10 , G10L15/30 , H04L67/306 , G06F3/16 , G06F40/35 , G10L15/08 , G10L13/00 , G06K9/00
Abstract: Systems, methods, and devices for outputting indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to recipient information. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server determines a second speech-controlled device of the recipient and sends a signal to the recipient's second speech-controlled device representing a message is forthcoming. The recipient's second speech-controlled device outputs and indication representing a message is forthcoming.
-
公开(公告)号:US10140973B1
公开(公告)日:2018-11-27
申请号:US15266116
申请日:2016-09-15
Applicant: Amazon Technologies, Inc.
Inventor: Manish Kumar Dalmia , Rafal Kuklinski
Abstract: Systems, methods, and devices for generating text-to-speech output using previously captured speech are described. Spoken audio is obtained and undergoes speech processing to create text. The resulting text is stored with the spoken audio, with both the text and the spoken audio being associated with the individual that spoke the audio. Various spoken audio and corresponding text are stored over time to create a library of speech units. When the individual sends a text message to a recipient, the text message is processed to determine portions of text, and the portions of text are compared to the library of text associated with the individual. When text in the library is identified, the system selects the spoken audio units associated with the identified stored text. The selected spoken audio units are then used to generate output audio data corresponding to the original text message, with the output audio data being sent to a device of the message recipient.
-
公开(公告)号:US20180061402A1
公开(公告)日:2018-03-01
申请号:US15254359
申请日:2016-09-01
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
CPC classification number: G10L15/22 , G06F17/2765 , G06F17/278 , G10L13/00 , G10L15/1822 , G10L17/22 , G10L17/24 , G10L2015/225 , G10L2015/227 , H04L67/306
Abstract: Systems, methods, and devices for escalating voice-based interactions via speech-controlled devices are described. Speech-controlled devices capture audio, including wakeword portions and payload portions, for sending to a server to relay messages between speech-controlled devices. In response to determining the occurrence of an escalation event, such as repeated messages between the same two devices, the system may automatically change a mode of a speech-controlled device, such as no longer requiring a wakeword, no longer requiring an indication of a desired recipient, or automatically connecting two speech-controlled devices in a voice-chat mode. In response to determining the occurrence of further escalation events, the system may initiate a real-time call between the speech-controlled devices.
-
公开(公告)号:US20220165268A1
公开(公告)日:2022-05-26
申请号:US17583672
申请日:2022-01-25
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
IPC: G10L15/22 , H04L51/224 , H04L51/10 , G10L15/30 , H04L67/306 , G06F3/16 , G06F40/35
Abstract: Systems, methods, and devices for outputting indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to recipient information. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server determines a second speech-controlled device of the recipient and sends a signal to the recipient's second speech-controlled device representing a message is forthcoming. The recipient's second speech-controlled device outputs and indication representing a message is forthcoming.
-
公开(公告)号:US10580404B2
公开(公告)日:2020-03-03
申请号:US15254600
申请日:2016-09-01
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
IPC: G10L21/00 , G10L15/22 , H04L12/58 , G10L15/30 , H04L29/08 , G06F17/27 , G06F3/16 , G10L15/08 , G10L13/00 , G06K9/00
Abstract: Systems, methods, and devices for outputting indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to recipient information. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server determines a second speech-controlled device of the recipient and sends a signal to the recipient's second speech-controlled device representing a message is forthcoming. The recipient's second speech-controlled device outputs and indication representing a message is forthcoming.
-
公开(公告)号:US20180061404A1
公开(公告)日:2018-03-01
申请号:US15254600
申请日:2016-09-01
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciabotariu , Sandra Lemon
Abstract: Systems, methods, and devices for outputting indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to recipient information. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server determines a second speech-controlled device of the recipient and sends a signal to the recipient's second speech-controlled device representing a message is forthcoming. The recipient's second speech-controlled device outputs and indication representing a message is forthcoming.
-
公开(公告)号:US10803507B1
公开(公告)日:2020-10-13
申请号:US14949718
申请日:2015-11-23
Applicant: AMAZON TECHNOLOGIES, INC.
Inventor: Karan Verma , David Charles Couvrette , Manish Kumar Dalmia , Alberto Milan Gutierrez , Benjamin McDougall Hartung , Rohit Jain , Raymond Solyne Matthieu, III , Elsie Nallipogu , Teng Yi
IPC: G06Q30/00 , G06Q30/06 , G06F3/0484
Abstract: Described are techniques for automatically generating tables or other types of output for comparing attributes of similar items. For each attribute of a particular item, a system may determine whether that attribute is generic to most items or specific to an item or category, whether data for that attribute is available for a sufficient number of items, and whether the value of that attribute differs among a sufficient number of items to facilitate a useful comparison. Responsive to user input, item data and one or more tables or other types of output may be provided to a user device. User input interacting with a table(s) or other output may be used to modify the stored table or output or to navigate to other tables, outputs, or item data.
-
公开(公告)号:US10453449B2
公开(公告)日:2019-10-22
申请号:US15254458
申请日:2016-09-01
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
Abstract: Systems, methods, and devices for outputting visual indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to a voice message to a second speech-controlled device. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server performs speech processing on the audio data to determine a recipient and message content. The server then determines a second speech-controlled device associated with the recipient and sends the message content to the recipient's second speech-controlled device. Thereafter, the server receives an indication from the recipient's speech-controlled device that the second device is detecting speech, presumably in response to the original message. The server then causes a visual indication to be output by the first speech-controlled device, with the visual indication representing the recipient-speech controlled device is detecting speech.
-
公开(公告)号:US10074369B2
公开(公告)日:2018-09-11
申请号:US15254359
申请日:2016-09-01
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
CPC classification number: G10L15/22 , G06F17/2765 , G06F17/278 , G10L13/00 , G10L15/1822 , G10L17/22 , G10L17/24 , G10L2015/225 , G10L2015/227 , H04L67/306
Abstract: Systems, methods, and devices for escalating voice-based interactions via speech-controlled devices are described. Speech-controlled devices capture audio, including wakeword portions and payload portions, for sending to a server to relay messages between speech-controlled devices. In response to determining the occurrence of an escalation event, such as repeated messages between the same two devices, the system may automatically change a mode of a speech-controlled device, such as no longer requiring a wakeword, no longer requiring an indication of a desired recipient, or automatically connecting two speech-controlled devices in a voice-chat mode. In response to determining the occurrence of further escalation events, the system may initiate a real-time call between the speech-controlled devices.
-
-
-
-
-
-
-
-
-