Patent search: ap:("Amazon Technologies Inc.") AND inv:"Shuang Wu"

1.

Granted invention patent
Object recognition with attribute-based cells (in force)

Publication number: US10402704B1

Publication date: 2019-09-03

Application number: US14788272

Application date: 2015-06-30

Applicant: Amazon Technologies, Inc.

Inventor: Shuang Wu

IPC: G06K9/62, G06K9/72

Abstract: Various examples are directed to methods and systems for object recognition in an image. A computer vision system may receive a patch comprising a plurality of pixels arranged in a grid. The computer vision system may determine a plurality of columns and a plurality of rows in the patch. The plurality of columns may be based at least in part on a column target sum and the plurality of rows may be based at least in part on a row target sum.
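
The abstract does not say how the column and row target sums are applied. As a minimal sketch, assuming each pixel carries a single numeric attribute value and that a boundary is placed whenever the running total of per-column (or per-row) attribute sums reaches the target, the partition might look like the following. The function names `split_by_target_sum` and `partition_patch` are hypothetical and are not taken from the patent.

```python
def split_by_target_sum(totals, target_sum):
    """Group consecutive indices into half-open (start, end) ranges.

    A range is closed as soon as its running total reaches target_sum.
    Any leftover indices at the end form a final, smaller range.
    """
    ranges = []
    running = 0.0
    start = 0
    for i, value in enumerate(totals):
        running += value
        if running >= target_sum:
            ranges.append((start, i + 1))
            start = i + 1
            running = 0.0
    if start < len(totals):
        ranges.append((start, len(totals)))
    return ranges


def partition_patch(patch, column_target_sum, row_target_sum):
    """Split a 2-D grid of attribute values into column and row ranges.

    patch is a list of equal-length rows (an assumed representation).
    """
    column_totals = [sum(row[c] for row in patch) for c in range(len(patch[0]))]
    row_totals = [sum(row) for row in patch]
    columns = split_by_target_sum(column_totals, column_target_sum)
    rows = split_by_target_sum(row_totals, row_target_sum)
    return columns, rows


if __name__ == "__main__":
    patch = [[1, 1, 1, 1],
             [1, 1, 1, 1]]
    print(partition_patch(patch, column_target_sum=4, row_target_sum=4))
```

Under this assumption, regions with larger attribute values yield narrower cells, so each cell holds a comparable total.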

2.

Granted invention patent
Dynamic wakeword detection (in force)

Publication number: US10510340B1

Publication date: 2019-12-17

Application number: US15832331

Application date: 2017-12-05

Applicant: Amazon Technologies, Inc.

Inventor: Gengshen Fu, Shiv Naga Prasad Vitaladevuni, Paul McIntyre, Shuang Wu

IPC: G10L15/26, G10L15/18, G10L15/22, G10L15/16, G10L15/08

Abstract: Techniques for using a dynamic wakeword detection threshold are described. A server(s) may receive audio data corresponding to an utterance from a device in response to the device detecting a wakeword using a wakeword detection threshold. The server(s) may then determine the device should use a lower wakeword detection threshold for a duration of time. In addition to sending the device output data responsive to the utterance, the server(s) may send the device an instruction to use the lower wakeword detection threshold for the duration of time. Alternatively, the server(s) may train a machine learning model to determine when the device should use a lower wakeword detection threshold. The server(s) may send the trained machine learned model to the device for use at runtime.

3.

Granted invention patent
Dynamic wakeword detection (in force)

Publication number: US11699433B2

Publication date: 2023-07-11

Application number: US16936952

Application date: 2020-07-23

Applicant: Amazon Technologies, Inc.

Inventor: Gengshen Fu, Shiv Naga Prasad Vitaladevuni, Paul McIntyre, Shuang Wu

IPC: G10L15/18, G10L15/30, G10L15/22, G10L15/08

CPC classification number: G10L15/18, G10L15/22, G10L15/30, G10L2015/088

Abstract: Techniques for using a dynamic wakeword detection threshold are described. A device detects a wakeword in audio data using a first wakeword detection threshold value. Thereafter, the device receives audio including speech. If the device receives the audio within a predetermined duration of time after detecting the previous wakeword, the device attempts to detect a wakeword in second audio data, corresponding to the audio including the speech, using a second, lower wakeword detection threshold value.
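
The two-threshold behavior in this abstract can be illustrated with a short sketch. It assumes the detector produces a confidence score per audio segment and that timestamps are passed in explicitly. The class name, the numeric thresholds, and the 10-second window are hypothetical values chosen for illustration, and whether a detection at the lower threshold restarts the window is also an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DynamicThresholdDetector:
    first_threshold: float = 0.8    # normal threshold (illustrative value)
    second_threshold: float = 0.6   # lower threshold used shortly after a detection
    window_seconds: float = 10.0    # the "predetermined duration" (illustrative value)
    last_detection_time: Optional[float] = None

    def current_threshold(self, now: float) -> float:
        """Return the lower threshold while inside the window after a detection."""
        if (self.last_detection_time is not None
                and now - self.last_detection_time <= self.window_seconds):
            return self.second_threshold
        return self.first_threshold

    def process(self, score: float, now: float) -> bool:
        """Decide whether a wakeword score counts as a detection at time now."""
        detected = score >= self.current_threshold(now)
        if detected:
            # Assumption: every detection restarts the window.
            self.last_detection_time = now
        return detected


if __name__ == "__main__":
    detector = DynamicThresholdDetector()
    for score, now in [(0.7, 0.0), (0.9, 1.0), (0.7, 5.0), (0.7, 20.0)]:
        print(now, score, detector.process(score, now))
```

The same score of 0.7 is rejected before any detection, accepted inside the window, and rejected again once the window has lapsed.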

4.

Granted invention patent
Text detection using features associated with neighboring glyph pairs (in force)

Publication number: US09367736B1

Publication date: 2016-06-14

Application number: US14842125

Application date: 2015-09-01

Applicant: Amazon Technologies, Inc.

Inventor: Thibaud Senechal, Quan Wang, Daniel Makoto Willenson, Shuang Wu, Yue Liu, Shiv Naga Prasad Vitaladevuni, David Paul Ramos, Qingfeng Yu

IPC: G06K9/46, G06K9/00, G06K9/34

CPC classification number: G06K9/00463, G06K9/00442, G06K9/00456, G06K9/344, G06K9/348, G06K9/4638, G06K9/4652, G06K2209/01

Abstract: A multi-orientation text detection method and associated system is disclosed that utilizes orientation-variant glyph features to determine a text line in an image regardless of an orientation of the text line. Glyph features are determined for each glyph in an image with respect to a neighboring glyph. The glyph features are provided to a learned classifier that outputs a glyph pair score for each neighboring glyph pair. Each glyph pair score indicates a likelihood that the corresponding pair of neighboring glyphs form part of a same text line. The glyph pair scores are used to identify candidate text lines, which are then ranked to select a final set of text lines in the image.
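
The step from glyph pair scores to ranked candidate text lines can be sketched as follows. This is an illustration only: the learned classifier is replaced by precomputed scores, and both the grouping method (union-find over pairs that pass a cutoff) and the ranking criterion (mean pair score) are assumptions rather than the patented method. The function name and the `min_score` parameter are hypothetical.

```python
from collections import defaultdict


def candidate_text_lines(num_glyphs, pair_scores, min_score=0.5):
    """Group glyphs into candidate lines and rank the lines by mean pair score.

    pair_scores maps (glyph_a, glyph_b) index pairs to a score in [0, 1],
    standing in for the classifier output described in the abstract.
    """
    parent = list(range(num_glyphs))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Join glyphs whose pair score passes the cutoff.
    for (a, b), score in pair_scores.items():
        if score >= min_score:
            parent[find(a)] = find(b)

    groups = defaultdict(list)
    for glyph in range(num_glyphs):
        groups[find(glyph)].append(glyph)

    lines = []
    for members in groups.values():
        if len(members) < 2:
            continue  # a lone glyph is not treated as a line in this sketch
        member_set = set(members)
        scores = [s for (a, b), s in pair_scores.items()
                  if a in member_set and b in member_set and s >= min_score]
        lines.append((sum(scores) / len(scores), sorted(members)))

    # Highest mean score first.
    lines.sort(key=lambda item: item[0], reverse=True)
    return lines


if __name__ == "__main__":
    scores = {(0, 1): 0.9, (1, 2): 0.8, (2, 3): 0.1, (3, 4): 0.7}
    for mean_score, glyphs in candidate_text_lines(5, scores):
        print(glyphs, round(mean_score, 2))
```

Because the pair score depends only on the two glyphs involved, nothing in this grouping step assumes a horizontal text line.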

5.

Granted invention patent
Dynamic wakeword detection (in force)

Publication number: US10777189B1

Publication date: 2020-09-15

Application number: US15832259

Application date: 2017-12-05

Applicant: Amazon Technologies, Inc.

Inventor: Gengshen Fu, Shiv Naga Prasad Vitaladevuni, Paul McIntyre, Shuang Wu

IPC: G10L15/00, G10L15/18, G10L15/30, G10L15/22, G10L15/08

Abstract: Techniques for using a dynamic wakeword detection threshold are described. A device detects a wakeword in audio data using a first wakeword detection threshold value. Thereafter, the device receives audio including speech. If the device receives the audio within a predetermined duration of time after detecting the previous wakeword, the device attempts to detect a wakeword in second audio data, corresponding to the audio including the speech, using a second, lower wakeword detection threshold value.
