Content collection search with robust content matching

    Publication No.: US10007680B2

    Publication date: 2018-06-26

    Application No.: US14605669

    Filing date: 2015-01-26

    Applicant: A9.com, Inc.

    IPC classification: G06F17/30 G06K9/46 G06K9/62

    Abstract: Systems and approaches for searching a content collection corresponding to query content are provided. In particular, false positive match rates between the query content and the content collection may be reduced with a minimum content region test and/or a minimum features per scale test. For example, by correlating content descriptors of a content piece in the content collection with query descriptors of the query content, the content piece can be determined to match the query content when a particular region of the content piece and/or a particular region of a query descriptor have a proportionate size meeting or exceeding a specified minimum. Alternatively, or in addition, the false positive match rate between query content and a content piece can be reduced by comparing content descriptors and query descriptors of features at a plurality of scales. A content piece can be determined to match the query content according to descriptor proportion quotas for the plurality of scales.
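The two tests named in this abstract can be illustrated with a minimal sketch. It is not the patented method: the match record layout (`query_xy`, `content_xy`, `scale`), the use of an axis-aligned bounding box as the "region", the default minimum of 0.1, and the choice to require both regions to pass (the abstract allows "and/or") are all assumptions made for illustration.

```python
from collections import Counter


def bounding_box_fraction(points, width, height):
    """Fraction of a width x height image covered by the bounding box of points."""
    if not points:
        return 0.0
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    box_area = (max(xs) - min(xs)) * (max(ys) - min(ys))
    return box_area / float(width * height)


def passes_min_region(matches, query_size, content_size, min_fraction=0.1):
    """Minimum content region test (assumed form): the matched features must
    span at least min_fraction of both the query and the content piece."""
    q = bounding_box_fraction([m["query_xy"] for m in matches], *query_size)
    c = bounding_box_fraction([m["content_xy"] for m in matches], *content_size)
    return q >= min_fraction and c >= min_fraction


def passes_scale_quotas(matches, quotas):
    """Minimum features per scale test (assumed form): quotas maps each scale
    to the minimum proportion of all matches that must come from that scale."""
    if not matches:
        return False
    counts = Counter(m["scale"] for m in matches)
    total = len(matches)
    return all(counts.get(s, 0) / total >= q for s, q in quotas.items())
```

A candidate would be accepted only if `passes_min_region(...)` and `passes_scale_quotas(...)` both return `True`; matches clustered in a tiny corner or concentrated at a single scale are rejected as likely false positives.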

    Method and system for detecting and recognizing text in images
    Document type: granted invention patent; legal status: in force

    Publication No.: US08977072B1

    Publication date: 2015-03-10

    Application No.: US13713776

    Filing date: 2012-12-13

    Applicant: A9.com, Inc.

    Abstract: Various embodiments of the present invention relate to a method, system and computer program product for detecting and recognizing text in images captured by cameras and scanners. First, a series of image-processing techniques is applied to detect text regions in the image. Subsequently, the detected text regions pass through different processing stages that reduce blurring and the negative effects of variable lighting. This results in the creation of multiple images that are versions of the same text region. Some of these multiple versions are sent to a character-recognition system. The texts returned by the character-recognition system for each version are then combined into a single result, which is the detected text.
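The "several versions, one result" idea can be sketched as follows. The abstract does not say how the versions are produced or how the recognized texts are combined, so everything here is an assumption: the versions are the raw region, a contrast-stretched copy and a binarized copy; the combination rule is a plain majority vote over whole strings; and `recognize` is a hypothetical callable standing in for a character-recognition system.

```python
from collections import Counter


def contrast_stretch(region):
    """Rescale grayscale values to 0..255 to reduce the effect of lighting."""
    flat = [p for row in region for p in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        return [row[:] for row in region]
    return [[(p - lo) * 255 // (hi - lo) for p in row] for row in region]


def binarize(region, threshold=128):
    """Map each pixel to 0 or 255 around a fixed threshold."""
    return [[255 if p >= threshold else 0 for p in row] for row in region]


def combine_ocr_results(candidates):
    """Return the most frequent non-empty text; ties go to the earliest one."""
    cleaned = [c.strip() for c in candidates if c and c.strip()]
    if not cleaned:
        return ""
    counts = Counter(cleaned)
    best = max(counts.values())
    for text in cleaned:
        if counts[text] == best:
            return text


def recognize_region(region, recognize):
    """Run the (hypothetical) recognizer on each version and merge the texts."""
    stretched = contrast_stretch(region)
    versions = [region, stretched, binarize(stretched)]
    return combine_ocr_results([recognize(v) for v in versions])
```

A real system might instead vote per character or weight each version by recognizer confidence; whole-string voting is simply the shortest rule that shows the combining step.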

    Activation layers for deep learning networks

    Publication No.: US10366313B2

    Publication date: 2019-07-30

    Application No.: US15894867

    Filing date: 2018-02-12

    Applicant: A9.com, Inc.

    IPC classification: G06K9/66 G06N3/08 G06K9/62

    Abstract: Tasks such as object classification from image data can take advantage of a deep learning process using convolutional neural networks. These networks can include a convolutional layer followed by an activation layer, or activation unit, among other potential layers. Improved accuracy can be obtained by using a generalized linear unit (GLU) as an activation unit in such a network, where a GLU is linear for both positive and negative inputs, and is defined by a positive slope, a negative slope, and a bias. These parameters can be learned for each channel or a block of channels, and stacking those types of activation units can further improve accuracy.
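A generalized linear unit as described here can be sketched in a few lines. The abstract names the three parameters but not the exact formula, so the form below (slope times input plus a shared bias, with the slope chosen by the sign of the input) is an assumption, as are the function names. The parameters are treated as fixed numbers; in a network they would be learned.

```python
def glu(x, pos_slope, neg_slope, bias):
    """Piecewise-linear activation: one slope for x >= 0, another for x < 0."""
    slope = pos_slope if x >= 0 else neg_slope
    return slope * x + bias


def glu_layer(feature_map, params):
    """Apply a GLU per channel.

    feature_map: list of channels, each a flat list of activations.
    params: one (pos_slope, neg_slope, bias) tuple per channel.
    """
    if len(feature_map) != len(params):
        raise ValueError("need one parameter tuple per channel")
    return [[glu(x, *p) for x in channel] for channel, p in zip(feature_map, params)]


def stacked_glu(feature_map, layers):
    """Apply several GLU layers in sequence (the 'stacking' in the abstract)."""
    out = feature_map
    for params in layers:
        out = glu_layer(out, params)
    return out
```

Under this assumed form, `glu(x, 1, 0, 0)` behaves like a ReLU and `glu(x, 1, a, 0)` like a leaky or parametric ReLU, which is one way to read "generalized".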

    ACTIVATION LAYERS FOR DEEP LEARNING NETWORKS

    Publication No.: US20180197049A1

    Publication date: 2018-07-12

    Application No.: US15894867

    Filing date: 2018-02-12

    Applicant: A9.com, Inc.

    IPC classification: G06K9/66 G06K9/62 G06N3/08

    Abstract: Tasks such as object classification from image data can take advantage of a deep learning process using convolutional neural networks. These networks can include a convolutional layer followed by an activation layer, or activation unit, among other potential layers. Improved accuracy can be obtained by using a generalized linear unit (GLU) as an activation unit in such a network, where a GLU is linear for both positive and negative inputs, and is defined by a positive slope, a negative slope, and a bias. These parameters can be learned for each channel or a block of channels, and stacking those types of activation units can further improve accuracy.

    Method and system for matching an image using normalized feature vectors

    Publication No.: US09721182B2

    Publication date: 2017-08-01

    Application No.: US15387314

    Filing date: 2016-12-21

    Applicant: A9.com, Inc.

    IPC classification: G06K9/46 G06K9/62

    Abstract: A method, system and computer program product for encoding an image are provided. The image to be encoded is represented as a Gaussian pyramid, which is a scale-space representation of the image and includes several pyramid images. The feature points in the pyramid images are identified and a specified number of feature points are selected. The orientations of the selected feature points are obtained by using a set of orientation calculating algorithms. A patch is extracted around each feature point in the pyramid images based on the orientation of the feature point and the sampling factor of the pyramid image. The boundary patches in the pyramid images are extracted by padding the pyramid images with extra pixels. The feature vectors of the extracted patches are defined. These feature vectors are normalized so that the components in the feature vectors are less than a threshold.
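The final normalization step can be sketched as below. The abstract only states the goal (components below a threshold), so the procedure is an assumption: scale to unit length, clip large components, rescale, and repeat a bounded number of times, with a last clip so the cap always holds. This resembles the clip-and-renormalize step used by some well-known local descriptors, but it is not claimed to be the patent's procedure.

```python
import math


def l2_normalize(v):
    """Scale v to unit Euclidean length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v] if norm > 0 else list(v)


def normalize_with_cap(v, threshold, max_rounds=10):
    """Normalize v so that no component exceeds threshold in magnitude."""
    out = l2_normalize(v)
    for _ in range(max_rounds):
        if all(abs(c) <= threshold for c in out):
            break
        # Clip dominant components, then restore unit length.
        out = [max(-threshold, min(threshold, c)) for c in out]
        out = l2_normalize(out)
    # Final clip guarantees the cap even if the loop did not converge.
    return [max(-threshold, min(threshold, c)) for c in out]
```

Capping limits the influence of a single very strong gradient or intensity component, so two patches of the same scene under different lighting end up with closer vectors.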

    Method and system for searching for information on a network in response to an image query sent by a user from a mobile communications device
    Document type: granted invention patent; legal status: in force

    Publication No.: US09104700B1

    Publication date: 2015-08-11

    Application No.: US14164904

    Filing date: 2014-01-27

    Applicant: A9.com, Inc.

    IPC classification: G06K9/60 G06F17/30

    Abstract: The present invention relates to a method and system for automatically searching for information on a network in response to an image query sent by a user. The image query includes an image that is captured by using a mobile communications device with a camera. The image is processed to detect the text present in it. The detected text is then recognized using OCR. Subsequently, the text is searched for matches in the corresponding domain database, selected from the various domain databases present in the network. Thereafter, selected matches and additional related information are sent to the user.
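The search step after OCR can be sketched as follows. The abstract does not describe how the domain database is chosen or how matches are ranked, so this sketch assumes both are done by keyword overlap; the in-memory `domains` dictionary and all function names are hypothetical stand-ins for databases on a network.

```python
def tokenize(text):
    """Lowercase the text and split it on whitespace."""
    return [t for t in text.lower().split() if t]


def select_domain(text, domains):
    """Pick the domain whose keyword set overlaps the recognized text most.

    domains: {name: {"keywords": set of str, "records": list of str}}
    """
    tokens = set(tokenize(text))
    return max(domains, key=lambda name: len(tokens & domains[name]["keywords"]))


def search(text, domains, limit=3):
    """Return (domain name, best-matching records) for the recognized text."""
    name = select_domain(text, domains)
    tokens = set(tokenize(text))
    scored = [(len(tokens & set(tokenize(r))), r) for r in domains[name]["records"]]
    ranked = sorted(scored, key=lambda pair: -pair[0])
    return name, [record for score, record in ranked if score > 0][:limit]
```

For example, text recognized from a photographed book cover would be routed to a book catalogue rather than a wine catalogue, and only records sharing at least one word with the text are returned.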

    METHOD AND SYSTEM FOR MATCHING AN IMAGE USING NORMALIZED FEATURE VECTORS

    Publication No.: US20170103282A1

    Publication date: 2017-04-13

    Application No.: US15387314

    Filing date: 2016-12-21

    Applicant: A9.com, Inc.

    IPC classification: G06K9/46 G06K9/62

    Abstract: A method, system and computer program product for encoding an image are provided. The image to be encoded is represented as a Gaussian pyramid, which is a scale-space representation of the image and includes several pyramid images. The feature points in the pyramid images are identified and a specified number of feature points are selected. The orientations of the selected feature points are obtained by using a set of orientation calculating algorithms. A patch is extracted around each feature point in the pyramid images based on the orientation of the feature point and the sampling factor of the pyramid image. The boundary patches in the pyramid images are extracted by padding the pyramid images with extra pixels. The feature vectors of the extracted patches are defined. These feature vectors are normalized so that the components in the feature vectors are less than a threshold.
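The Gaussian pyramid mentioned at the start of this abstract can be sketched in plain Python. The smoothing kernel and sampling factor are not given in the abstract, so a separable [1, 2, 1]/4 binomial filter (a small approximation to a Gaussian) and a factor of 2 are assumed here, and images are plain lists of rows.

```python
def blur_1d(row):
    """Smooth one row with a [1, 2, 1]/4 kernel, replicating the edge pixels."""
    n = len(row)
    return [(row[max(i - 1, 0)] + 2 * row[i] + row[min(i + 1, n - 1)]) / 4.0
            for i in range(n)]


def blur(image):
    """Separable blur: filter every row, then every column."""
    rows = [blur_1d(r) for r in image]
    cols = [blur_1d(list(c)) for c in zip(*rows)]
    return [list(r) for r in zip(*cols)]


def gaussian_pyramid(image, levels, factor=2):
    """Return up to `levels` images, each blurred and subsampled from the last."""
    pyramid = [[[float(p) for p in row] for row in image]]
    for _ in range(levels - 1):
        prev = pyramid[-1]
        if len(prev) < factor or len(prev[0]) < factor:
            break  # too small to subsample again
        smoothed = blur(prev)
        pyramid.append([row[::factor] for row in smoothed[::factor]])
    return pyramid
```

Feature points would then be detected in each pyramid image, so that the same physical detail can be found whether it appears large or small in the query.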

    Method and system for matching an image using image patches
    Document type: granted invention patent; legal status: in force

    Publication No.: US08958629B2

    Publication date: 2015-02-17

    Application No.: US14259002

    Filing date: 2014-04-22

    Applicant: A9.com, Inc.

    Abstract: A method, system and computer program product for encoding an image are provided. The image to be encoded is represented as a Gaussian pyramid, which is a scale-space representation of the image and includes several pyramid images. The feature points in the pyramid images are identified and a specified number of feature points are selected. The orientations of the selected feature points are obtained by using a set of orientation calculating algorithms. A patch is extracted around each feature point in the pyramid images based on the orientation of the feature point and the sampling factor of the pyramid image. The boundary patches in the pyramid images are extracted by padding the pyramid images with extra pixels. The feature vectors of the extracted patches are defined. These feature vectors are normalized so that the components in the feature vectors are less than a threshold.
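Extracting a patch around a feature point near the image border can be sketched as below. The abstract says only that the pyramid images are padded "with extra pixels", so edge replication is an assumption; the sketch also ignores the feature point's orientation, which the abstract says is used when the patch is extracted.

```python
def pad_image(image, pad):
    """Surround the image with `pad` replicated edge pixels on every side."""
    rows = [[r[0]] * pad + list(r) + [r[-1]] * pad for r in image]
    top = [rows[0][:] for _ in range(pad)]
    bottom = [rows[-1][:] for _ in range(pad)]
    return top + rows + bottom


def extract_patch(image, x, y, half):
    """Return the (2*half+1)-square patch centred on column x, row y.

    Padding by `half` shifts (x, y) to (x + half, y + half), so the slice
    starting at (x, y) in the padded image is centred on the feature point.
    """
    padded = pad_image(image, half)
    size = 2 * half + 1
    return [row[x:x + size] for row in padded[y:y + size]]
```

With padding in place, a feature point on the very first row or column still yields a full-size patch, so its feature vector has the same length as every other one.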