-
公开(公告)号:US12087067B2
公开(公告)日:2024-09-10
申请号:US17697954
申请日:2022-03-18
Applicant: RAKUTEN GROUP, INC.
Inventor: Yeongnam Chae , Preetham Prakasha
IPC: G06V30/148 , G06V30/146 , G06V30/18 , G06V30/19 , G06V30/242
CPC classification number: G06V30/153 , G06V30/1478 , G06V30/18105 , G06V30/242 , G06V30/18095 , G06V30/19107
Abstract: The information processing device obtains a character string image which includes a plurality of characters, and which includes the characters arranged in an arrangement direction, obtains a probability image representing a probability of an existence of a character in each of the pixel included in the character string image, obtains a plurality of character regions in which the characters are estimated to respectively exist in the character string image based on the probability image, obtains an additional character region which is located in the character string image, and which does not overlap the plurality of character regions based on a determination result on whether or not a pixel of a non-background color exists in a direction perpendicular to the arrangement direction at every position in the arrangement direction in the character string image, and recognizes the plurality of characters from the character regions and the additional character region.
-
公开(公告)号:US12045580B2
公开(公告)日:2024-07-23
申请号:US17423413
申请日:2020-07-28
Applicant: BOE TECHNOLOGY GROUP CO., LTD.
Inventor: Jibo Zhao , Xingqun Jiang
IPC: G06K9/36 , G06F40/284 , G06F40/40 , G06F40/47 , G06F40/58 , G06V10/10 , G06V10/22 , G06V30/14 , G06V30/142 , G06V30/148 , G06V30/242
CPC classification number: G06F40/58 , G06F40/284 , G06F40/40 , G06F40/47 , G06V10/17 , G06V10/22 , G06V10/235 , G06V30/142 , G06V30/1444 , G06V30/1456 , G06V30/148 , G06V30/158 , G06V30/242
Abstract: A translation pen includes: a pen body, an indication component, an image collector and a first processor. The pen body has a pen tip end. The indication component is arranged on the pen tip end. The image collector is arranged on the pen body, and the image collector is configured to: collect an image including a text to be translated according to a position indicated by the indication component, and send the image collected. The first processor is arranged in the pen body and electrically connected to the image collector, and the first processor is configured to: receive the image sent by the image collector, and recognize the text to be translated in the image.
-
公开(公告)号:US11842524B2
公开(公告)日:2023-12-12
申请号:US17245349
申请日:2021-04-30
Applicant: International Business Machines Corporation
Inventor: Rajesh M. Desai , Ayush Utkarsh , Nazrul Islam , Praveen Vyas
IPC: G06K9/03 , G06V10/40 , G06F40/126 , G06F40/109 , G06N20/00 , G06F40/232 , G06V30/10 , G06F18/214 , G06V30/19 , G06V30/12 , G06V10/82 , G06V30/26 , G06F18/213 , G06N3/0464 , G06N3/0442 , G06N3/0455 , G06V10/98 , G06F40/279 , G06V30/242
CPC classification number: G06V10/40 , G06F18/213 , G06F18/214 , G06F40/109 , G06F40/126 , G06F40/232 , G06F40/279 , G06N3/0442 , G06N3/0455 , G06N3/0464 , G06N20/00 , G06V10/82 , G06V10/98 , G06V30/10 , G06V30/12 , G06V30/19 , G06V30/242 , G06V30/26
Abstract: A mechanism is provided for implementing an optical character recognition (OCR) error correction mechanism for correcting OCR errors. Responsive to receiving a document in which OCR has been performed, the mechanism assesses the document to identify a set of OCR errors generated by an OCR engine that performed the OCR using a set of visual embeddings. Responsive to identifying the set of OCR errors, the mechanism analyzes each character of a plurality of sentences within the document to generate a high-dimensional embedding for the characters of the plurality of sentences within the document. The mechanism then linguistically corrects each OCR error in the set of OCR error. The mechanism utilizes ground truth information and the set of visual embeddings to verify that character stream is linguistically correct. Responsive to verifying that the character stream is linguistically correct, the mechanism outputs an OCR error corrected document to a user.
-
公开(公告)号:US11908122B2
公开(公告)日:2024-02-20
申请号:US16939939
申请日:2020-07-27
Applicant: Sensors Incorporated
Inventor: David J. Kotula
IPC: G06T7/00 , G06F18/22 , G06K7/10 , G06K7/14 , G06V10/75 , G06V20/64 , G06V20/52 , G06V30/242 , G06V20/68
CPC classification number: G06T7/0004 , G06F18/22 , G06K7/10445 , G06K7/10722 , G06K7/10861 , G06K7/1413 , G06T7/0002 , G06V10/75 , G06V20/52 , G06V20/64 , G06V30/242 , G06T2207/10024 , G06T2207/30128 , G06T2207/30164 , G06T2207/30168 , G06V20/68
Abstract: In an illustrative embodiment, a system for identifying products on a production line includes image capturing devices that acquire images of containers moving along a production line at an inspection location. The system also includes a rejection device and a controller that configures the image capturing devices for image acquisition based on properties of the containers, identifies a product associated with each of the containers based on a portion of a product identification code and a portion of additional features detected in the images, and determines whether the identified product matches predetermined properties or characteristics, resulting in a pass result, otherwise a non-pass result occurs. When a non-pass result occurs, the controller outputs a signal to actuate the rejection device that removes the container from the production line.
-
公开(公告)号:US11743426B2
公开(公告)日:2023-08-29
申请号:US16992968
申请日:2020-08-13
Applicant: Snap Inc.
Inventor: Lidiia Bogdanovych , William Brendel , Samuel Edward Hare , Fedir Poliakov , Guohui Wang , Xuehan Xiong , Jianchao Yang , Linjie Yang
IPC: G06T7/194 , G06V10/82 , H04N7/14 , G06T7/11 , G06N3/08 , G06N3/04 , G06V30/242 , G06F18/214 , G06F18/24 , G06V30/19 , H04N5/445 , H04N5/76
CPC classification number: H04N7/147 , G06F18/214 , G06F18/24765 , G06N3/04 , G06N3/08 , G06T7/11 , G06T7/194 , G06V10/82 , G06V30/19173 , G06V30/242 , G06T2207/10016 , G06T2207/10024 , G06T2207/20024 , G06T2207/20081 , G06T2207/20084 , G06T2207/20221 , G06T2207/30201 , H04N5/44504 , H04N5/76 , H04N7/141
Abstract: A machine learning system can generate an image mask (e.g., a pixel mask) comprising pixel assignments for pixels. The pixels can he assigned to classes, including, for example, face, clothes, body skin, or hair. The machine learning system can be implemented. using a convolutional neural network that is configured to execute efficiently on computing devices having limited resources, such as mobile phones. The pixel mask can be used to more accurately display video effects interacting with a user or subject depicted in the image.
-
6.
公开(公告)号:US11722584B2
公开(公告)日:2023-08-08
申请号:US17111325
申请日:2020-12-03
Applicant: Elliot Berookhim , Pejman Yedidsion
Inventor: Elliot Berookhim , Pejman Yedidsion
IPC: H04L29/06 , H04L69/00 , H04L67/06 , H04L69/06 , H04W4/80 , G06V30/242 , G06V40/16 , H04W4/02 , H04W8/18 , H04L9/40 , H04L51/52
CPC classification number: H04L69/02 , G06V30/242 , G06V40/172 , H04L67/06 , H04L69/06 , H04W4/023 , H04W4/80 , H04W8/18 , G06V2201/10 , H04L51/52 , H04L63/107
Abstract: Methods, systems, and devices for determining a subset of user devices from among a complete set of user devices based on a set of received information, i.e., attributes associated with a photograph or user device that transmitted the photograph and attributes, where the disposition of the information may be used to determine the subset and then perform facial recognition on the subset of user associated photographs in order to accurately identify each user or users present in the photograph.
-
公开(公告)号:US11651626B2
公开(公告)日:2023-05-16
申请号:US17324347
申请日:2021-05-19
Applicant: Robert Bosch GmbH
Inventor: Holger Behrens , Joerg Staudigel
IPC: G06V40/20 , G06V30/242 , G06F18/2413 , G06V10/75 , G06V10/764
CPC classification number: G06V40/20 , G06F18/2413 , G06V10/75 , G06V10/764 , G06V30/242
Abstract: A method for detecting comparison persons 7 to a search person 4, wherein a plurality of classification persons 3 is classified by extracting values W1,W2,W3 for classification features K1,K2,K3 from classification images 2 of the classification persons 3, the classification being ambiguous in such a way that the classification does not enable a unique identification of any of the classification persons 3, wherein during a search for a search person 4 using a search image 5 by a comparison of values of search features from the search image 5 with values W1,W2,W3 of classification features K1,K2,K3, at least two classification persons 3 are output as comparison persons 7.
-
公开(公告)号:US20240071116A1
公开(公告)日:2024-02-29
申请号:US18038763
申请日:2021-12-02
Applicant: Semiconductor Energy Laboratory Co., Ltd.
Inventor: Junpei MOMO , Shoko SAITO
IPC: G06V30/19 , G06V10/82 , G06V30/242
CPC classification number: G06V30/19093 , G06V10/82 , G06V30/19147 , G06V30/242
Abstract: A proofreading system that allows a user to easily judge whether or not there is an error in writing or the like. A proofreading method using a comparison image group obtained by dividing a sentence included in a comparison document group into a plurality of first terms and converting the first terms into images is provided. Specifically, first, a sentence included in a designated document is divided into a plurality of second terms, and the appearance frequency in the comparison document group of the plurality of second terms are obtained. Next, the second term with the appearance frequency lower than or equal to a threshold value of the plurality of second terms are imaged to obtain a verification image. After that, similarity degrees between the verification image and comparison images included in the comparison image group are obtained, and the first term represented by the comparison image with the highest similarity degree of the comparison images is presented. The presentation is performed by displaying that the second term represented by the verification image can be an error in writing of the first term represented by the comparison image having a high similarity degree with the verification image.
-
9.
公开(公告)号:US11917037B2
公开(公告)日:2024-02-27
申请号:US17848739
申请日:2022-06-24
Applicant: Elliot Berookhim , Pejman Yedidsion
Inventor: Elliot Berookhim , Pejman Yedidsion
IPC: H04L69/00 , H04L67/06 , H04L69/06 , H04W4/80 , G06V30/242 , G06V40/16 , H04W4/02 , H04W8/18 , H04L9/40 , H04L51/52
CPC classification number: H04L69/02 , G06V30/242 , G06V40/172 , H04L67/06 , H04L69/06 , H04W4/023 , H04W4/80 , H04W8/18 , G06V2201/10 , H04L51/52 , H04L63/107
Abstract: Methods, systems, and devices for determining a subset of users from among a set of users based on a set of received information associated with a photograph, where the disposition of the information is used to first determine the subset and then perform facial recognition on the subset of photographs for each user in order to accurately identify each user or users present in the photograph.
-
10.
公开(公告)号:US20240054802A1
公开(公告)日:2024-02-15
申请号:US18493676
申请日:2023-10-24
Applicant: INTUIT INC.
Inventor: Tharathorn RIMCHALA
IPC: G06V30/40 , G06N20/00 , G06F40/149 , G06F40/284 , G06N3/02
CPC classification number: G06V30/40 , G06N20/00 , G06F40/149 , G06F40/284 , G06N3/02 , G06V30/242
Abstract: A system and method for extracting data from a piece of content using spatial information about the piece of content. The system and method may use a conditional random fields process or a bidirectional long short term memory and conditional random fields process to extract structured data using the spatial information.
-
-
-
-
-
-
-
-
-