Patent search cpc:"G06K9/726" Page 1

1.

发明申请
SYSTEM AND METHOD FOR SENTENCE DIRECTED VIDEO OBJECT CODETECTION 审中-公开

公开(公告)号：US20190220668A1

公开(公告)日：2019-07-18

申请号：US16323179

申请日：2017-06-06

Applicant: Purdue Research Foundation

Inventor： Jeffrey Mark Siskind , Haonan Yu

IPC: G06K9/00 , G06T7/70 , G06T7/20 , G06K9/62 , G06K9/72

CPC classification number: G06K9/00718 , G06K9/00751 , G06K9/6215 , G06K9/6296 , G06K9/726 , G06K2009/00738 , G06T7/20 , G06T7/70 , G06T2207/10016 , G06T2207/10024

Abstract: A system and method for determining the locations and types of objects in a plurality of videos. The method comprises pairing each video with one or more sentences describing the activity or activities in which those objects participate in the associated video, wherein no use is made of a pretrained object detector. The object locations are specified as rectangles, the object types are specified as nouns, and sentences describe the relative positions and motions of the objects in the videos referred to by the nouns in the sentences. The relative positions and motions of the objects in the video are described by a conjunction of predicates constructed to represent the activity described by the sentences associated with the videos.

2.

发明申请
HIGH-SPEED OCR DECODE USING DEPLETED CENTERLINES 审中-公开

公开(公告)号：US20180336441A1

公开(公告)日：2018-11-22

申请号：US15599600

申请日：2017-05-19

Applicant: Hand Held Products, Inc.

Inventor： Edward Hatton , H. Sprague Ackley

IPC: G06K9/78 , G06K9/32 , G06K9/72

CPC classification number: G06K9/78 , G06K9/3216 , G06K9/3283 , G06K9/44 , G06K9/6211 , G06K9/6215 , G06K9/726 , G06K2209/01

Abstract: A method for template matching can include iteratively selecting a template set of points to project over a centerline of a candidate symbol; conducting a template matching analysis; assigning a score to each template set; and selecting a template set with a highest assigned score. For example, the score can depend on proximity of the template points to a center and/or boundaries of a principal tracing path of the symbol. Additionally, one or more template sets having a top rank can be selected for a secondary analysis of proximity of the template points to a boundary of a printing of the symbol. The method can further include using the template with the highest score to interpret the candidate symbol.

3.

发明申请
LANDMARKS FROM DIGITAL PHOTO COLLECTIONS 审中-公开

公开(公告)号：US20180211134A1

公开(公告)日：2018-07-26

申请号：US15663796

申请日：2017-07-30

Applicant: Google Inc.

Inventor： Hartwig ADAM , Li Zhang

IPC: G06K9/62 , G06K9/00 , G06F17/30 , G06K9/46

CPC classification number: G06K9/6217 , G06F17/30244 , G06F17/30247 , G06F17/30265 , G06K9/00624 , G06K9/00664 , G06K9/46 , G06K9/726

Abstract: Methods and systems for automatic detection of landmarks in digital images and annotation of those images are disclosed. A method for detecting and annotating landmarks in digital images includes the steps of automatically assigning a tag descriptive of a landmark to one or more images in a plurality of text-associated digital images to generate a set of landmark-tagged images, learning an appearance model for the landmark from the set of landmark-tagged images, and detecting the landmark in a new digital image using the appearance model. The method can also include a step of annotating the new image with the tag descriptive of the landmark.

4.

发明申请
MEDIA CAPTURE AND PROCESS SYSTEM 审中-公开

公开(公告)号：US20180182379A1

公开(公告)日：2018-06-28

申请号：US15388683

申请日：2016-12-22

Applicant: FUJITSU LIMITED

Inventor： James MONTANTES

IPC: G10L15/18 , G10L15/30 , H04R1/40 , H04R3/04 , H03G5/16 , G06K9/00 , G06F17/30 , G06N5/04 , G06N99/00

CPC classification number: G10L15/1815 , G06F17/2785 , G06F17/30684 , G06F17/30696 , G06F17/30746 , G06K9/00684 , G06K9/00718 , G06K9/726 , G06N5/022 , G06N5/04 , G06N99/005 , G10L15/22 , G10L15/30 , H03G5/165 , H04R1/326 , H04R1/406 , H04R3/005 , H04R3/04 , H04R2499/11

Abstract: A user media device may include a microphone array and a communication interface. The microphone array may include an omnidirectional microphone and a directional microphone. The microphone array may be selectively switchable. The communication interface may communicatively couple the user media device with a computer and may transmit audio captured by the microphone array to the computer for transfer to a remote service. The remote service may generate text of the processed audio via natural language processing. The remote service may further perform semantic reasoning of the processed audio via a semantic reasoning engine. The remote service may also generate content based at least in part on the semantic reasoning performed on the processed audio. The curated content may include a report having results of the semantic reasoning organized to demonstrate the results in a meaningful way with respect to the processed audio.

5.

发明授权
Automatic translation of digital graphic novels 有权

公开(公告)号：US09881003B2

公开(公告)日：2018-01-30

申请号：US14863394

申请日：2015-09-23

Applicant: Google Inc.

Inventor： Greg Don Hartrell , Debajit Ghosh , Matthew William Vaughan-Vail , John Michael Rivlin , Garth Conboy , Xinxing Gu , Alexander Toshkov Toshev

IPC: G06F17/00 , G09G5/00 , G06F17/28 , G06F17/24 , G06F17/21 , G06F17/22 , G06N3/02 , G06K9/00 , G06K9/72

CPC classification number: G06F17/2836 , G06F17/212 , G06F17/2229 , G06F17/243 , G06F17/289 , G06K9/00476 , G06K9/726 , G06N3/02 , G06N3/0454 , G06N3/084 , G06F17/00 , H04N7/00

Abstract: Digital graphic novel content is received and features of the graphic novel content are identified. At least one of the identified features includes text. Contextual information corresponding to the feature or features that include text is generated based on the identified features. The contextual information is used to aid translation of the text included in the feature or features that include text.

6.

发明授权
Techniques for providing user image capture feedback for improved machine language translation 有权

公开(公告)号：US09836456B2

公开(公告)日：2017-12-05

申请号：US14594238

申请日：2015-01-12

Applicant: Google Inc.

Inventor： Alexander Jay Cuthbert , Macduff Richard Hughes

IPC: G06F17/28 , G06F17/21 , G06K9/20 , G06K9/72 , G06K9/00 , G06K9/03

CPC classification number: G06F17/28 , G06F17/21 , G06F17/2854 , G06F17/289 , G06K9/00671 , G06K9/033 , G06K9/20 , G06K9/726 , G06K2209/01

Abstract: A computer-implemented technique includes techniques are presented for user image capture feedback for improved machine language translation. When machine language translation of OCR text obtained from an initial image has a low degree of likelihood of being an appropriate translation, these techniques provide for user image capture feedback to obtain additional images to obtain a modified OCR text, which can result in improved machine language translation results. Instead of user image capture feedback, the techniques may obtain the modified OCR text by selecting another possible OCR text from the initial OCR operation. In addition to additional image capturing, light source intensity and/or a quantity/number of light source flashes can be adjusted. After obtaining the modified OCR text, another machine language translation can be obtained and, if it has a high enough degree of likelihood, it can then be output to a user.

7.

发明申请
Method for Structural Analysis and Recongnigiton of Handwritten Mathematical Formula in Natural Scene Image 审中-公开

公开(公告)号：US20170337423A1

公开(公告)日：2017-11-23

申请号：US15531088

申请日：2015-08-26

Applicant: Beijing Lejent Technology Co., Ltd

Inventor： Lijiang Chen , Ning Liu , Hui Liu

IPC: G06K9/00 , G06K9/62 , G06N3/04 , G06F17/12

CPC classification number: G06K9/00429 , G06F17/12 , G06K9/38 , G06K9/4638 , G06K9/6212 , G06K9/6269 , G06K9/726 , G06K2209/01 , G06N3/02 , G06N3/04

Abstract: A method for structural analysis and recognition of a handwritten mathematical formula in a natural scene image, including: transforming a gray matrix of a natural scene image into a local contrast matrix, and performing a binary division to the obtained local contrast matrix using an Otsu method, thereby obtaining a binary matrix; performing a connected domain analysis to the binary matrix, eliminating non-character connected domains to obtain character connected domains; performing a detection of elements of a special structure of a formula to the character connected domains using a correlation coefficient method, and separately annotating all the detected elements of the special structure: dividing rows of the binary matrix by means of horizontal projection; recognizing each character connected domain by means of a convolutional neural network; defining an output sequence, and outputting the results of recognition in a corresponding sequence according to a typesetting format of latex.

8.

发明授权
Methods for mobile image capture of vehicle identification numbers in a non-document 有权

公开(公告)号：US09773186B2

公开(公告)日：2017-09-26

申请号：US14217361

申请日：2014-03-17

Applicant: MITEK SYSTEMS, INC.

Inventor： Grigori Nepomniachtchi , Nikolay Kotovich

IPC: G06K9/00 , G06K9/34 , G06K9/46 , G06K9/03 , G06K9/20 , G06K9/38 , G06K9/72

CPC classification number: G06K9/344 , G06K9/00442 , G06K9/03 , G06K9/2054 , G06K9/38 , G06K9/4652 , G06K9/726

Abstract: Various embodiments disclosed herein are directed to methods of capturing Vehicle Identification Numbers (VIN) from images captured by a mobile device. Capturing VIN data can be useful in several applications, for example, insurance data capture applications. There are at least two types of images supported by this technology: (1) images of documents and (2) images of non-documents.

9.

发明申请
METHOD AND APPARATUS FOR OBTAINING SEMANTIC LABEL OF DIGITAL IMAGE 审中-公开

公开(公告)号：US20170220907A1

公开(公告)日：2017-08-03

申请号：US15246413

申请日：2016-08-24

Applicant: Baidu Online Network Technology (Beijing) Co., Ltd.

Inventor： Xiao LIU , Tian XIA , Jiang WANG

IPC: G06K9/72 , G06K9/62 , G06K9/66

CPC classification number: G06K9/726 , G06K9/00624 , G06K9/6267 , G06K9/6269 , G06K9/66

Abstract: The present application discloses a method and apparatus for obtaining a semantic label of a digital image. An implementation of the method includes: obtaining the digital image; looking up a semantic label model corresponding to the digital image, the semantic label model being used for representing correlation between digital images and semantic labels, and a semantic label being used for literally describing a digital image; and introducing the digital image into the semantic label model to obtain full-image recognition information and local recognition information corresponding to the digital image, and combining the full-image recognition information and the local recognition information to form a semantic label, the full-image recognition information being a summarized description of the digital image, and the local recognition information being a detailed description of the digital image. According to the implementation, the digital image is obtained first, then a semantic label model corresponding to the digital image is looked up, and a semantic label is obtained by using the semantic label model, which may improve the accuracy of obtaining the semantic label corresponding to the digital image.

10.

发明申请
SYSTEM, METHOD AND COMPUTER PROGRAM PRODUCT FOR TRAINING A THREE DIMENSIONAL OBJECT INDENTIFICATION SYSTEM AND IDENTIFYING THREE DIMENSIONAL OBJECTS USING SEMANTIC SEGMENTS 有权

公开(公告)号：US20170193340A1

公开(公告)日：2017-07-06

申请号：US14983834

申请日：2015-12-30

Applicant: International Business Machines Corporation

Inventor： Joseph Shtok , Asaf Tzadok , Yochay Tzur

IPC: G06K9/72 , G06K9/52 , G06T7/60 , G06K9/62

CPC classification number: G06K9/726 , G06K9/00201 , G06K9/52 , G06K9/6256 , G06T7/60 , G06T7/73 , G06T7/75 , G06T2200/04 , G06T2207/10024 , G06T2207/20061 , G06T2207/20081

Abstract: A method of training an object identification system and identifying three dimensional objects using semantic segments includes receiving, into a non-volatile memory, an input file containing a geometric description of a three dimensional object having one or more semantic segments and one or more annotations for each of the one or more semantic segments, receiving, into the non-volatile memory one or more training images of the three dimensional object, identifying, through a processor, the one or more segments in the one or more training images, computing, through a training module, one or more descriptors to the one or more segments, and generating an output file representing a machine vision of the three dimensional object.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification