Wakeword and acoustic event detection

    公开(公告)号:US11132990B1

    公开(公告)日:2021-09-28

    申请号:US16453063

    申请日:2019-06-26

    Abstract: A system processes audio data to detect when it includes a representation of a wakeword or of an acoustic event. The system may receive or determine acoustic features for the audio data, such as log-filterbank energy (LFBE). The acoustic features may be used by a first, wakeword-detection model to detect the wakeword; the output of this model may be further processed using a softmax function, to smooth it, and to detect spikes. The same acoustic features may be also be used by a second, acoustic-event-detection model to detect the acoustic event; the output of this model may be further processed using a sigmoid function and a classifier. Another model may be used to extract additional features from the LFBE data; these additional features may be used by the other models.

    Wakeword and acoustic event detection

    公开(公告)号:US11043218B1

    公开(公告)日:2021-06-22

    申请号:US16452964

    申请日:2019-06-26

    Abstract: A system processes audio data to detect when it includes a representation of a wakeword or of an acoustic event. The system may receive or determine acoustic features for the audio data, such as log-filterbank energy (LFBE). The acoustic features may be used by a first, wakeword-detection model to detect the wakeword; the output of this model may be further processed using a softmax function, to smooth it, and to detect spikes. The same acoustic features may be also be used by a second, acoustic-event-detection model to detect the acoustic event; the output of this model may be further processed using a sigmoid function and a classifier. Another model may be used to extract additional features from the LFBE data; these additional features may be used by the other models.

    Text detection using features associated with neighboring glyph pairs
    5.
    发明授权
    Text detection using features associated with neighboring glyph pairs 有权
    使用与相邻字形对相关联的功能的文本检测

    公开(公告)号:US09367736B1

    公开(公告)日:2016-06-14

    申请号:US14842125

    申请日:2015-09-01

    Abstract: A multi-orientation text detection method and associated system is disclosed that utilizes orientation-variant glyph features to determine a text line in an image regardless of an orientation of the text line. Glyph features are determined for each glyph in an image with respect to a neighboring glyph. The glyph features are provided to a learned classifier that outputs a glyph pair score for each neighboring glyph pair. Each glyph pair score indicates a likelihood that the corresponding pair of neighboring glyphs form part of a same text line. The glyph pair scores are used to identify candidate text lines, which are then ranked to select a final set of text lines in the image.

    Abstract translation: 公开了一种多方向文本检测方法和相关系统,其利用取向变体字形特征来确定图像中的文本行,而不管文本行的取向如何。 为相对于相邻字形的图像中的每个字形确定字形特征。 字形特征被提供给学习的分类器,其为每个相邻字形对输出字形对分数。 每个字形对得分表示对应的相邻字形对形成相同文本行的一部分的可能性。 字形对分数用于识别候选文本行,然后将其排序以选择图像中的最后一组文本行。

    WAKEWORD AND ACOUSTIC EVENT DETECTION

    公开(公告)号:US20210358497A1

    公开(公告)日:2021-11-18

    申请号:US17321999

    申请日:2021-05-17

    Abstract: A system processes audio data to detect when it includes a representation of a wakeword or of an acoustic event. The system may receive or determine acoustic features for the audio data, such as log-filterbank energy (LFBE). The acoustic features may be used by a first, wakeword-detection model to detect the wakeword; the output of this model may be further processed using a softmax function, to smooth it, and to detect spikes. The same acoustic features may be also be used by a second, acoustic-event-detection model to detect the acoustic event; the output of this model may be further processed using a sigmoid function and a classifier. Another model may be used to extract additional features from the LFBE data; these additional features may be used by the other models.

Patent Agency Ranking