-
Publication number: US12190883B2
Publication date: 2025-01-07
Application number: US18329635
Filing date: 2023-06-06
Applicant: Amazon Technologies, Inc.
Inventor: Zeya Chen
Abstract: Techniques for generating, from first speaker recognition data corresponding to at least a first word, second speaker recognition data corresponding to at least a second word are described. During a speaker recognition enrollment process, a device receives audio data corresponding to one or more prompted spoken inputs comprising the at least first word. Using the prompted spoken input(s), the first speaker recognition data (specific to the at least first word) is generated. Sometime thereafter, a user may indicate that speaker recognition processing is to be performed using at least a second word. Rather than have the user go through the speaker recognition enrollment process a second time, the device (or a system) may apply a transformation model to the first speaker recognition data to generate second speaker recognition data specific to the at least second word.
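Illustrative sketch (not part of the patent record): the abstract does not say what form the transformation model takes. The Python below assumes, purely for illustration, that the speaker recognition data is a fixed-length embedding vector and that the transformation model is a linear map with a bias. The function name `apply_transformation` and all numeric values are hypothetical.

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def apply_transformation(embedding: Vector, weights: Matrix, bias: Vector) -> Vector:
    """Map a first-word speaker embedding to a second-word embedding.

    Assumption: the transformation model is linear (weights @ embedding + bias).
    The patent abstract does not specify the model type.
    """
    if any(len(row) != len(embedding) for row in weights):
        raise ValueError("each weight row must match the embedding length")
    if len(bias) != len(weights):
        raise ValueError("bias length must match the number of weight rows")
    return [
        sum(w * x for w, x in zip(row, embedding)) + b
        for row, b in zip(weights, bias)
    ]


# Hypothetical enrollment embedding for the first word, and made-up model parameters.
first_word_embedding = [0.5, -1.0, 2.0]
weights = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 0.5]]
bias = [0.0, 1.0, 0.0]

# The user never re-enrolls; the second-word data is derived from the first.
second_word_embedding = apply_transformation(first_word_embedding, weights, bias)
```

In practice such a model would be learned from speakers who have uttered both words; the sketch only shows the inference step that replaces a second enrollment.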
-
Publication number: US11763806B1
Publication date: 2023-09-19
Application number: US16912119
Filing date: 2020-06-25
Applicant: Amazon Technologies, Inc.
Inventor: Zeya Chen
CPC classification number: G10L15/22 , G10L15/08 , G10L2015/088 , G10L2015/223
Abstract: Techniques for generating, from first speaker recognition data corresponding to at least a first word, second speaker recognition data corresponding to at least a second word are described. During a speaker recognition enrollment process, a device receives audio data corresponding to one or more prompted spoken inputs comprising the at least first word. Using the prompted spoken input(s), the first speaker recognition data (specific to the at least first word) is generated. Sometime thereafter, a user may indicate that speaker recognition processing is to be performed using at least a second word. Rather than have the user go through the speaker recognition enrollment process a second time, the device (or a system) may apply a transformation model to the first speaker recognition data to generate second speaker recognition data specific to the at least second word.
-
Publication number: US11580955B1
Publication date: 2023-02-14
Application number: US17218740
Filing date: 2021-03-31
Applicant: Amazon Technologies, Inc.
Inventor: Yixiong Meng , Roberto Barra Chicote , Grzegorz Beringer , Zeya Chen , Jie Liang , James Garnet Droppo , Chia-Hao Chang , Oguz Hasan Elibol
IPC: G10L13/08 , G10L13/027 , G10L15/06 , G10L13/033 , G10L19/008 , G10L13/047
Abstract: A speech-processing system receives input data representing text. A first encoder processes segments of the text to determine embedding data representing the text, and a second encoder processes corresponding audio data to determine prosodic data corresponding to the text. The embedding and prosodic data are processed to create output data including a representation of speech corresponding to the text and prosody.
-
Publication number: US11067718B1
Publication date: 2021-07-20
Application number: US16008554
Filing date: 2018-06-14
Applicant: Amazon Technologies, Inc.
Inventor: Charles Edwin Ashton Brett , Aniruddha Basak , Zeya Chen , Sara Parker Hillenmeyer , Lizhen Peng , Yunfeng Jiang , William Evan Welbourne , Jay Patel , Sven Eberhardt
Abstract: Described are systems, methods, and apparatus that gather environment condition data from different sensors at various locations within an area, aggregate the environment condition data to produce aggregated environment condition scores for the area, and provide the aggregated environment condition scores to different locations within the area. While sensor data from a single sensor/device, such as a camera, may provide low quality environment information, by collecting and aggregating information from multiple sensors and/or locations in the area, highly accurate aggregated environment condition scores for environment conditions may be realized. The aggregated environment condition scores may be provided to various locations within the area as representative of the environment condition at that point in time within the area, regardless of whether those locations have sensors. The aggregated environment condition scores may be used by other devices at those locations to automate one or more actions, such as adjusting lighting conditions, closing garage doors, adjusting window blind positions, etc.
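Illustrative sketch (not part of the patent record): the abstract does not state how the per-sensor data is aggregated. The Python below assumes a confidence-weighted average of per-sensor scores as one plausible aggregation. The `Reading` type, its `confidence` field, and the sample values are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Reading:
    location: str
    score: float       # environment condition score, e.g. brightness in [0, 1]
    confidence: float  # hypothetical per-sensor reliability weight


def aggregate_score(readings: List[Reading]) -> float:
    """Combine per-sensor scores into one area-wide score.

    Assumption: a confidence-weighted mean. The patent abstract does not
    specify the aggregation method.
    """
    total_weight = sum(r.confidence for r in readings)
    if total_weight <= 0:
        raise ValueError("at least one reading with positive confidence is required")
    return sum(r.score * r.confidence for r in readings) / total_weight


# A low-confidence camera estimate and a higher-confidence light sensor.
readings = [
    Reading("porch camera", 0.2, 1.0),
    Reading("garage light sensor", 0.4, 3.0),
]

# The single area-wide score can then be shared with locations that have no sensor.
area_score = aggregate_score(readings)
```

Weighting by confidence lets a noisy source such as a camera contribute without dominating the result, which matches the abstract's point that a single sensor alone may be low quality.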
-
Publication number: US20240013784A1
Publication date: 2024-01-11
Application number: US18329635
Filing date: 2023-06-06
Applicant: Amazon Technologies, Inc.
Inventor: Zeya Chen
CPC classification number: G10L15/22 , G10L15/08 , G10L2015/223 , G10L2015/088
Abstract: Techniques for generating, from first speaker recognition data corresponding to at least a first word, second speaker recognition data corresponding to at least a second word are described. During a speaker recognition enrollment process, a device receives audio data corresponding to one or more prompted spoken inputs comprising the at least first word. Using the prompted spoken input(s), the first speaker recognition data (specific to the at least first word) is generated. Sometime thereafter, a user may indicate that speaker recognition processing is to be performed using at least a second word. Rather than have the user go through the speaker recognition enrollment process a second time, the device (or a system) may apply a transformation model to the first speaker recognition data to generate second speaker recognition data specific to the at least second word.
-
Publication number: US11373640B1
Publication date: 2022-06-28
Application number: US16052546
Filing date: 2018-08-01
Applicant: Amazon Technologies, Inc.
Inventor: Zeya Chen , Charles Edwin Ashton Brett , Jay Patel , Lizhen Peng , Aniruddha Basak , Hongyang Wang , Sara Hillenmeyer , Yunfeng Jiang , Sven Eberhardt , Akshay Kumar , William Evan Welbourne
IPC: G10L15/22 , G10L15/18 , H04L41/0893 , G06F3/16 , G06N7/00 , G06F40/30 , H04L67/306
Abstract: Systems and methods for intelligent device grouping are disclosed. An environment, such as a home, may have a number of voice-enabled devices and accessory devices that may be controlled by the voice-enabled devices. One or more models, such as linguistics model(s) and/or device affinity models, may be utilized to determine which accessory devices are candidates for inclusion in a device group, and a recommendation for grouping the devices may be provided. Device-group naming recommendations may also be generated and may be sent to users.
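Illustrative sketch (not part of the patent record): the abstract mentions linguistics models for finding grouping candidates but gives no details. The Python below stands in a simple token-overlap (Jaccard) similarity on device names for such a model. The function names, the 0.3 threshold, and the device names are all hypothetical.

```python
from itertools import combinations
from typing import List, Tuple


def name_similarity(a: str, b: str) -> float:
    """Jaccard similarity between the word sets of two device names.

    Assumption: a stand-in for the 'linguistics model' in the abstract,
    which the patent record does not describe.
    """
    tokens_a, tokens_b = set(a.lower().split()), set(b.lower().split())
    if not tokens_a or not tokens_b:
        return 0.0
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def candidate_pairs(names: List[str], threshold: float = 0.3) -> List[Tuple[str, str]]:
    """Return device-name pairs similar enough to recommend grouping."""
    return [
        (a, b)
        for a, b in combinations(names, 2)
        if name_similarity(a, b) >= threshold
    ]


devices = ["kitchen light", "kitchen plug", "bedroom lamp"]
pairs = candidate_pairs(devices)
```

A real system would likely combine such a name signal with usage-based affinity (for example, devices that are often operated together), as the abstract's mention of device affinity models suggests.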