-
Publication No.: US20210027798A1
Publication date: 2021-01-28
Application No.: US17022197
Filing date: 2020-09-16
Applicant: Amazon Technologies, Inc.
Inventor: Shiva Kumar Sundaram, Chao Wang, Shiv Naga Prasad Vitaladevuni, Spyridon Matsoukas, Arindam Mandal
Abstract: A speech-capture device can capture audio data during wakeword monitoring and use the audio data to determine if a user is present nearby the device, even if no wakeword is spoken. Audio such as speech, human originating sounds (e.g., coughing, sneezing), or other human related noises (e.g., footsteps, doors closing) can be used to detect audio. Audio frames are individually scored as to whether a human presence is detected in the particular audio frames. The scores are then smoothed relative to nearby frames to create a decision for a particular frame. Presence information can then be sent according to a periodic schedule to a remote device to create a presence “heartbeat” that regularly identifies whether a user is detected proximate to a speech-capture device.
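The scoring, smoothing, and heartbeat steps named in the abstract can be illustrated with a minimal sketch. The abstract does not specify the smoothing method, window size, or decision threshold, so the centered moving average, `half_window`, `threshold`, and all function names below are assumptions made purely for illustration, not the patented method.

```python
from typing import List


def smooth_scores(scores: List[float], half_window: int = 2) -> List[float]:
    """Average each frame score with its neighbours (centered moving average).

    Assumption: the abstract only says scores are "smoothed relative to
    nearby frames"; a moving average is one simple way to do that.
    """
    smoothed = []
    for i in range(len(scores)):
        lo = max(0, i - half_window)
        hi = min(len(scores), i + half_window + 1)
        window = scores[lo:hi]
        smoothed.append(sum(window) / len(window))
    return smoothed


def frame_decisions(scores: List[float], half_window: int = 2,
                    threshold: float = 0.5) -> List[bool]:
    """Turn smoothed per-frame scores into per-frame presence decisions."""
    return [s >= threshold for s in smooth_scores(scores, half_window)]


def heartbeat(decisions: List[bool]) -> bool:
    """Presence flag for one reporting period (hypothetical rule):
    report presence if any frame in the period was decided as present."""
    return any(decisions)


if __name__ == "__main__":
    # A single spurious high score is smoothed away.
    print(frame_decisions([0.0, 0.0, 1.0, 0.0, 0.0], half_window=1))
    # A single dip inside sustained activity is bridged.
    print(frame_decisions([0.9, 0.8, 0.1, 0.9, 0.9], half_window=1))
```

In this sketch, smoothing keeps one outlier frame from flipping the decision in either direction, which is the apparent purpose of deciding each frame relative to its neighbours.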
-
Publication No.: US10796716B1
Publication date: 2020-10-06
Application No.: US16157319
Filing date: 2018-10-11
Applicant: Amazon Technologies, Inc.
Inventor: Shiva Kumar Sundaram, Chao Wang, Shiv Naga Prasad Vitaladevuni, Spyridon Matsoukas, Arindam Mandal
Abstract: A speech-capture device can capture audio data during wakeword monitoring and use the audio data to determine if a user is present nearby the device, even if no wakeword is spoken. Audio such as speech, human originating sounds (e.g., coughing, sneezing), or other human related noises (e.g., footsteps, doors closing) can be used to detect audio. Audio frames are individually scored as to whether a human presence is detected in the particular audio frames. The scores are then smoothed relative to nearby frames to create a decision for a particular frame. Presence information can then be sent according to a periodic schedule to a remote device to create a presence “heartbeat” that regularly identifies whether a user is detected proximate to a speech-capture device.
-
Publication No.: US10732258B1
Publication date: 2020-08-04
Application No.: US15276278
Filing date: 2016-09-26
Applicant: Amazon Technologies, Inc.
Inventor: Shiva Kumar Sundaram, Rui Wang
Abstract: A system capable of detecting human presence based on output from a model-free detector and model-based detector(s). For example, the model-free detector may identify acoustic events and the model-based detectors can determine specific types of acoustic events and whether the acoustic events are associated with human activity. Using output from the model-based detectors, a device may confirm that an acoustic event identified by the model-free detector is associated with human activity or may determine that the acoustic event is associated with non-human activity and can be ignored. Thus, the device may detect human presence based on a wide variety of noises while reducing a number of false positives associated with the model-free detector.
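The interplay between the two detector types can be sketched as follows. The abstract does not describe how either detector works internally, so the energy-ratio test standing in for the model-free detector, the label sets, and every name below are hypothetical. The sketch also assumes that an event is ignored only when a model-based detector labels it as non-human.

```python
from typing import Callable

# Hypothetical label set; the abstract does not list event types.
NON_HUMAN_EVENTS = {"appliance", "pet", "traffic"}


def model_free_detect(frame_energy: float, background_energy: float,
                      ratio: float = 2.0) -> bool:
    """Flag an acoustic event when energy rises well above the background.

    Assumption: a simple energy-ratio test stands in for the model-free
    detector, which needs no knowledge of what produced the sound.
    """
    return frame_energy > ratio * background_energy


def detect_presence(frame_energy: float, background_energy: float,
                    classify: Callable[[], str]) -> bool:
    """Combine both detectors into one presence decision.

    `classify` stands in for the model-based detector(s) and returns a
    label for the event. Events labelled as non-human are ignored, which
    removes false positives raised by the model-free detector.
    """
    if not model_free_detect(frame_energy, background_energy):
        return False  # no acoustic event at all
    return classify() not in NON_HUMAN_EVENTS


if __name__ == "__main__":
    print(detect_presence(5.0, 1.0, lambda: "footsteps"))  # event, human
    print(detect_presence(5.0, 1.0, lambda: "appliance"))  # event, ignored
    print(detect_presence(1.2, 1.0, lambda: "speech"))     # no event
```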
-
Publication No.: US10026399B2
Publication date: 2018-07-17
Application No.: US14852022
Filing date: 2015-09-11
Applicant: Amazon Technologies, Inc.
Inventor: Ramya Gopalan, Shiva Kumar Sundaram
IPC: G10L21/00, G10L15/22, G06F3/16, G10L21/0216
Abstract: Architectures and techniques for selecting a voice-enabled device to handle audio input that is detected by multiple voice-enabled devices are described herein. In some instances, multiple voice-enabled devices may detect audio input from a user at substantially the same time, due to the voice-enabled devices being located within proximity to the user. The architectures and techniques may analyze a variety of audio signal metric values for the voice-enabled devices to designate a voice-enabled device to handle the audio input.
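Device selection from audio signal metric values can be sketched as a weighted comparison. The abstract does not name the metrics or say how they are combined, so the metric names (`snr_db`, `energy`), the weights, and the function name below are assumptions chosen only to make the idea concrete.

```python
from typing import Dict


def select_device(metrics: Dict[str, Dict[str, float]],
                  weights: Dict[str, float]) -> str:
    """Pick the device whose weighted sum of metric values is highest.

    `metrics` maps a device id to its metric values for one utterance.
    Metrics without a weight contribute nothing. Ties are broken by
    device id so the result is deterministic.
    """
    def score(device_id: str) -> float:
        return sum(weights.get(name, 0.0) * value
                   for name, value in metrics[device_id].items())

    return max(sorted(metrics), key=score)


if __name__ == "__main__":
    heard = {
        "kitchen": {"snr_db": 12.0, "energy": 0.4},
        "living_room": {"snr_db": 20.0, "energy": 0.3},
    }
    print(select_device(heard, {"snr_db": 1.0, "energy": 10.0}))
```

Here both devices detect the same utterance, and the one with the stronger combined signal is designated to handle it.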
-