-
公开(公告)号:US20190220668A1
公开(公告)日:2019-07-18
申请号:US16323179
申请日:2017-06-06
Applicant: Purdue Research Foundation
Inventor: Jeffrey Mark Siskind , Haonan Yu
CPC classification number: G06K9/00718 , G06K9/00751 , G06K9/6215 , G06K9/6296 , G06K9/726 , G06K2009/00738 , G06T7/20 , G06T7/70 , G06T2207/10016 , G06T2207/10024
Abstract: A system and method for determining the locations and types of objects in a plurality of videos. The method comprises pairing each video with one or more sentences describing the activity or activities in which those objects participate in the associated video, wherein no use is made of a pretrained object detector. The object locations are specified as rectangles, the object types are specified as nouns, and sentences describe the relative positions and motions of the objects in the videos referred to by the nouns in the sentences. The relative positions and motions of the objects in the video are described by a conjunction of predicates constructed to represent the activity described by the sentences associated with the videos.
-
公开(公告)号:US20180336441A1
公开(公告)日:2018-11-22
申请号:US15599600
申请日:2017-05-19
Applicant: Hand Held Products, Inc.
Inventor: Edward Hatton , H. Sprague Ackley
CPC classification number: G06K9/78 , G06K9/3216 , G06K9/3283 , G06K9/44 , G06K9/6211 , G06K9/6215 , G06K9/726 , G06K2209/01
Abstract: A method for template matching can include iteratively selecting a template set of points to project over a centerline of a candidate symbol; conducting a template matching analysis; assigning a score to each template set; and selecting a template set with a highest assigned score. For example, the score can depend on proximity of the template points to a center and/or boundaries of a principal tracing path of the symbol. Additionally, one or more template sets having a top rank can be selected for a secondary analysis of proximity of the template points to a boundary of a printing of the symbol. The method can further include using the template with the highest score to interpret the candidate symbol.
-
公开(公告)号:US20180211134A1
公开(公告)日:2018-07-26
申请号:US15663796
申请日:2017-07-30
Applicant: Google Inc.
Inventor: Hartwig ADAM , Li Zhang
CPC classification number: G06K9/6217 , G06F17/30244 , G06F17/30247 , G06F17/30265 , G06K9/00624 , G06K9/00664 , G06K9/46 , G06K9/726
Abstract: Methods and systems for automatic detection of landmarks in digital images and annotation of those images are disclosed. A method for detecting and annotating landmarks in digital images includes the steps of automatically assigning a tag descriptive of a landmark to one or more images in a plurality of text-associated digital images to generate a set of landmark-tagged images, learning an appearance model for the landmark from the set of landmark-tagged images, and detecting the landmark in a new digital image using the appearance model. The method can also include a step of annotating the new image with the tag descriptive of the landmark.
-
公开(公告)号:US20180182379A1
公开(公告)日:2018-06-28
申请号:US15388683
申请日:2016-12-22
Applicant: FUJITSU LIMITED
Inventor: James MONTANTES
IPC: G10L15/18 , G10L15/30 , H04R1/40 , H04R3/04 , H03G5/16 , G06K9/00 , G06F17/30 , G06N5/04 , G06N99/00
CPC classification number: G10L15/1815 , G06F17/2785 , G06F17/30684 , G06F17/30696 , G06F17/30746 , G06K9/00684 , G06K9/00718 , G06K9/726 , G06N5/022 , G06N5/04 , G06N99/005 , G10L15/22 , G10L15/30 , H03G5/165 , H04R1/326 , H04R1/406 , H04R3/005 , H04R3/04 , H04R2499/11
Abstract: A user media device may include a microphone array and a communication interface. The microphone array may include an omnidirectional microphone and a directional microphone. The microphone array may be selectively switchable. The communication interface may communicatively couple the user media device with a computer and may transmit audio captured by the microphone array to the computer for transfer to a remote service. The remote service may generate text of the processed audio via natural language processing. The remote service may further perform semantic reasoning of the processed audio via a semantic reasoning engine. The remote service may also generate content based at least in part on the semantic reasoning performed on the processed audio. The curated content may include a report having results of the semantic reasoning organized to demonstrate the results in a meaningful way with respect to the processed audio.
-
公开(公告)号:US09881003B2
公开(公告)日:2018-01-30
申请号:US14863394
申请日:2015-09-23
Applicant: Google Inc.
Inventor: Greg Don Hartrell , Debajit Ghosh , Matthew William Vaughan-Vail , John Michael Rivlin , Garth Conboy , Xinxing Gu , Alexander Toshkov Toshev
IPC: G06F17/00 , G09G5/00 , G06F17/28 , G06F17/24 , G06F17/21 , G06F17/22 , G06N3/02 , G06K9/00 , G06K9/72
CPC classification number: G06F17/2836 , G06F17/212 , G06F17/2229 , G06F17/243 , G06F17/289 , G06K9/00476 , G06K9/726 , G06N3/02 , G06N3/0454 , G06N3/084 , G06F17/00 , H04N7/00
Abstract: Digital graphic novel content is received and features of the graphic novel content are identified. At least one of the identified features includes text. Contextual information corresponding to the feature or features that include text is generated based on the identified features. The contextual information is used to aid translation of the text included in the feature or features that include text.
-
6.
公开(公告)号:US09836456B2
公开(公告)日:2017-12-05
申请号:US14594238
申请日:2015-01-12
Applicant: Google Inc.
Inventor: Alexander Jay Cuthbert , Macduff Richard Hughes
CPC classification number: G06F17/28 , G06F17/21 , G06F17/2854 , G06F17/289 , G06K9/00671 , G06K9/033 , G06K9/20 , G06K9/726 , G06K2209/01
Abstract: A computer-implemented technique includes techniques are presented for user image capture feedback for improved machine language translation. When machine language translation of OCR text obtained from an initial image has a low degree of likelihood of being an appropriate translation, these techniques provide for user image capture feedback to obtain additional images to obtain a modified OCR text, which can result in improved machine language translation results. Instead of user image capture feedback, the techniques may obtain the modified OCR text by selecting another possible OCR text from the initial OCR operation. In addition to additional image capturing, light source intensity and/or a quantity/number of light source flashes can be adjusted. After obtaining the modified OCR text, another machine language translation can be obtained and, if it has a high enough degree of likelihood, it can then be output to a user.
-
7.
公开(公告)号:US20170337423A1
公开(公告)日:2017-11-23
申请号:US15531088
申请日:2015-08-26
Applicant: Beijing Lejent Technology Co., Ltd
Inventor: Lijiang Chen , Ning Liu , Hui Liu
CPC classification number: G06K9/00429 , G06F17/12 , G06K9/38 , G06K9/4638 , G06K9/6212 , G06K9/6269 , G06K9/726 , G06K2209/01 , G06N3/02 , G06N3/04
Abstract: A method for structural analysis and recognition of a handwritten mathematical formula in a natural scene image, including: transforming a gray matrix of a natural scene image into a local contrast matrix, and performing a binary division to the obtained local contrast matrix using an Otsu method, thereby obtaining a binary matrix; performing a connected domain analysis to the binary matrix, eliminating non-character connected domains to obtain character connected domains; performing a detection of elements of a special structure of a formula to the character connected domains using a correlation coefficient method, and separately annotating all the detected elements of the special structure: dividing rows of the binary matrix by means of horizontal projection; recognizing each character connected domain by means of a convolutional neural network; defining an output sequence, and outputting the results of recognition in a corresponding sequence according to a typesetting format of latex.
-
公开(公告)号:US09773186B2
公开(公告)日:2017-09-26
申请号:US14217361
申请日:2014-03-17
Applicant: MITEK SYSTEMS, INC.
Inventor: Grigori Nepomniachtchi , Nikolay Kotovich
CPC classification number: G06K9/344 , G06K9/00442 , G06K9/03 , G06K9/2054 , G06K9/38 , G06K9/4652 , G06K9/726
Abstract: Various embodiments disclosed herein are directed to methods of capturing Vehicle Identification Numbers (VIN) from images captured by a mobile device. Capturing VIN data can be useful in several applications, for example, insurance data capture applications. There are at least two types of images supported by this technology: (1) images of documents and (2) images of non-documents.
-
公开(公告)号:US20170220907A1
公开(公告)日:2017-08-03
申请号:US15246413
申请日:2016-08-24
Inventor: Xiao LIU , Tian XIA , Jiang WANG
CPC classification number: G06K9/726 , G06K9/00624 , G06K9/6267 , G06K9/6269 , G06K9/66
Abstract: The present application discloses a method and apparatus for obtaining a semantic label of a digital image. An implementation of the method includes: obtaining the digital image; looking up a semantic label model corresponding to the digital image, the semantic label model being used for representing correlation between digital images and semantic labels, and a semantic label being used for literally describing a digital image; and introducing the digital image into the semantic label model to obtain full-image recognition information and local recognition information corresponding to the digital image, and combining the full-image recognition information and the local recognition information to form a semantic label, the full-image recognition information being a summarized description of the digital image, and the local recognition information being a detailed description of the digital image. According to the implementation, the digital image is obtained first, then a semantic label model corresponding to the digital image is looked up, and a semantic label is obtained by using the semantic label model, which may improve the accuracy of obtaining the semantic label corresponding to the digital image.
-
公开(公告)号:US20170193340A1
公开(公告)日:2017-07-06
申请号:US14983834
申请日:2015-12-30
Applicant: International Business Machines Corporation
Inventor: Joseph Shtok , Asaf Tzadok , Yochay Tzur
CPC classification number: G06K9/726 , G06K9/00201 , G06K9/52 , G06K9/6256 , G06T7/60 , G06T7/73 , G06T7/75 , G06T2200/04 , G06T2207/10024 , G06T2207/20061 , G06T2207/20081
Abstract: A method of training an object identification system and identifying three dimensional objects using semantic segments includes receiving, into a non-volatile memory, an input file containing a geometric description of a three dimensional object having one or more semantic segments and one or more annotations for each of the one or more semantic segments, receiving, into the non-volatile memory one or more training images of the three dimensional object, identifying, through a processor, the one or more segments in the one or more training images, computing, through a training module, one or more descriptors to the one or more segments, and generating an output file representing a machine vision of the three dimensional object.
-
-
-
-
-
-
-
-
-