专利检索 cpc:"G06V30/19127" 第 1 页

1.

发明授权
System and method for identifying non-standard user interface object 有权

公开(公告)号：US12112513B2

公开(公告)日：2024-10-08

申请号：US17511711

申请日：2021-10-27

申请人： SAMSUNG SDS CO., LTD.

发明人： Hyo Young Kim , Koo Hyun Park , Keun Taek Park

IPC分类号： G06V10/20 , G06F16/22 , G06F18/22 , G06V10/75 , G06V30/10 , G06V30/19

CPC分类号： G06V10/255 , G06F16/2282 , G06F18/22 , G06V10/751 , G06V30/1912 , G06V30/19127 , G06V30/10

摘要： A non-standard user interface object identification system includes an object candidate extractior that extracts one or more objects from an image, a first similarity analyzer that determines object type candidates of the one or more objects in accordance with similarities between the one or more objects and a standard user interface (UI) element, a second similarity analyzer that selects object type-specific weight values in accordance with layout characteristics of the one or more objects and determines object types of the one or more objects using the object type candidates and the object type-specific weight values, and an object identifier that receives type and characteristic information of a search target object and identifies the search target object in accordance with characteristic information and the object types of the one or more objects.

2.

发明授权
Dynamic detection and recognition of media subjects 有权

公开(公告)号：US12020483B2

公开(公告)日：2024-06-25

申请号：US17896666

申请日：2022-08-26

申请人： Microsoft Technology Licensing, LLC

发明人： Yonit Hoffman , Irit Ofer , Avner Levi , Haim Sabo , Reut Amior

IPC分类号： G06V20/40 , G06F16/71 , G06F18/22 , G06F18/24 , G06V10/764 , G06V10/77 , G06V30/19

CPC分类号： G06V20/46 , G06F16/71 , G06F18/22 , G06F18/24 , G06V10/764 , G06V10/7715 , G06V20/41 , G06V30/19127

摘要： A system for indexing animated content receives detections extracted from a media file, where each one of the detections includes an image extracted from a corresponding frame of the media file that corresponds to a detected instance of an animated character. The system determines, for each of the received detections, an embedding defining a set of characteristics for the detected instance. The embedding associated with each detection is provided to a grouping engine that is configured to dynamically configure at least one grouping parameter based on a total number of the detections received. The grouping engine is also configured to sort the detections into groups using the grouping parameter and the embedding for each detection. A character ID is assigned to each one of the groups of detections, and the system indexes the groups of detections in a database in association with the character ID assigned to each group.

3.

发明公开
IMAGE-BASED INFORMATION EXTRACTION MODEL, METHOD, AND APPARATUS, DEVICE, AND STORAGE MEDIUM 审中-公开

公开(公告)号：US20240021000A1

公开(公告)日：2024-01-18

申请号：US18113178

申请日：2023-02-23

申请人： BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

发明人： Xiameng QIN , Yulin LI , Xiaoqiang ZHANG , Ju HUANG , Qunyi XIE , Kun YAO

IPC分类号： G06V30/19 , G06V30/148

CPC分类号： G06V30/1918 , G06V30/15 , G06V30/19127 , G06V30/19147

摘要： There is provided an image-based information extraction model, method, and apparatus, a device, and a storage medium, which relates to the field of artificial intelligence (AI) technologies, specifically to fields of deep learning, image processing, computer vision technologies, and is applicable to optical character recognition (OCR) and other scenarios. A specific implementation solution involves: acquiring a to-be-extracted first image and a category of to-be-extracted information; and inputting the first image and the category into a pre-trained information extraction model to perform information extraction on the first image to obtain text information corresponding to the category.

4.

发明公开
END TO END TRAINABLE DOCUMENT EXTRACTION 审中-公开

公开(公告)号：US20230394862A1

公开(公告)日：2023-12-07

申请号：US18454032

申请日：2023-08-22

申请人： INTUIT INC.

发明人： Dominic Miguel ROSSI , Xiao Xiao

IPC分类号： G06V30/19 , G06T7/194 , G06V30/146 , G06V30/18 , G06V30/414 , G06V30/14

CPC分类号： G06V30/19173 , G06T7/194 , G06V30/19127 , G06V30/146 , G06V30/18 , G06V30/19147 , G06T2207/30176 , G06V30/414 , G06V30/1448 , G06T2207/20021 , G06T2207/20072 , G06T2207/20081 , G06T2207/20084 , G06V30/1916

摘要： A processor may receive an image and identify a plurality of characters in the image using a machine learning (ML) model. The processor may generate at least one word-level bounding box indicating one or more words including at least a subset of the plurality of characters and/or may generate at least one field-level bounding box indicating at least one field including at least a subset of the one or more words. The processor may overlay the at least one word-level bounding box and the at least one field-level bounding box on the image to form a masked image including a plurality of optically-recognized characters and one or more predicted fields for at least a subset of the plurality of optically-recognized characters.

5.

发明公开
METHOD AND APPARATUS FOR EDITING AN IMAGE AND METHOD AND APPARATUS FOR TRAINING AN IMAGE EDITING MODEL, DEVICE AND MEDIUM 审中-公开

公开(公告)号：US20230377225A1

公开(公告)日：2023-11-23

申请号：US18121444

申请日：2023-03-14

申请人： Beijing Baidu Netcom Science and Technology Co., Ltd.

发明人： Chengquan ZHANG , Yuechen YU , Liang WU

IPC分类号： G06T11/60 , G06V20/62 , G06V10/82 , G06V30/19 , G06V30/14

CPC分类号： G06T11/60 , G06V20/62 , G06V10/82 , G06V30/19127 , G06V30/1918 , G06V30/1444 , G06V30/19147 , G06V40/10

摘要： A method for training an image editing model includes steps described below. Covering processing is performed on a region of interest determined in an original image so that a background image sample is formed, and content corresponding to the region of interest is determined as a sample of content of interest; the background image sample and the sample of the content of interest are input into an image editing model; fusion processing is performed on a background image feature and a feature of the region of interest by using the image editing model so that a fusion feature is formed; an image reconstruction operation is performed according to the fusion feature by using the image editing model so that a reconstructed image is output; and optimization training is performed on the image editing model according to a loss relationship between the reconstructed image and the original image.

6.

发明授权
End to end trainable document extraction 有权

公开(公告)号：US12087068B2

公开(公告)日：2024-09-10

申请号：US18454032

申请日：2023-08-22

申请人： INTUIT INC.

发明人： Dominic Miguel Rossi , Xiao Xiao

IPC分类号： G06V30/412 , G06T7/194 , G06V30/14 , G06V30/146 , G06V30/18 , G06V30/19 , G06V30/414

CPC分类号： G06V30/19173 , G06T7/194 , G06V30/1448 , G06V30/146 , G06V30/18 , G06V30/19127 , G06V30/19147 , G06V30/1916 , G06V30/414 , G06T2207/20021 , G06T2207/20072 , G06T2207/20081 , G06T2207/20084 , G06T2207/30176

摘要： A processor may receive an image and identify a plurality of characters in the image using a machine learning (ML) model. The processor may generate at least one word-level bounding box indicating one or more words including at least a subset of the plurality of characters and/or may generate at least one field-level bounding box indicating at least one field including at least a subset of the one or more words. The processor may overlay the at least one word-level bounding box and the at least one field-level bounding box on the image to form a masked image including a plurality of optically-recognized characters and one or more predicted fields for at least a subset of the plurality of optically-recognized characters.

7.

发明公开
MULTI-TASK SELF-TRAINING FOR CHARACTER GENDER IDENTIFICATION 审中-公开

公开(公告)号：US20240281608A1

公开(公告)日：2024-08-22

申请号：US18172018

申请日：2023-02-21

申请人： Tencent America LLC

发明人： Dian YU , Linfeng SONG , Dong YU

IPC分类号： G06F40/284 , G06V30/19 , G06V30/416

CPC分类号： G06F40/284 , G06V30/19127 , G06V30/416

摘要： A method and apparatus that identifies one or more characters within a text; determines one or more informative sections within the text, the one or more informative sections providing information regarding a gender of the one or more characters within the text; selects a most informative section from the one or more informative sections; extracts unlabeled instances corresponding to the gender of the one or more characters from the most informative section; iteratively trains a multi-task model using unlabeled corpora, the multi-task model performing both speaker identification and gender identification; and labels the gender of the one or more characters based on the extracted unlabeled instances and the multi-task model.

8.

发明公开
MAPPER COMPONENT FOR A NEURO-LINGUISTIC BEHAVIOR RECOGNITION SYSTEM 审中-公开

公开(公告)号：US20240071037A1

公开(公告)日：2024-02-29

申请号：US18203185

申请日：2023-05-30

申请人： Intellective Ai, Inc.

发明人： Ming-Jung SEOW , Gang XU , Tao YANG , Wesley Kenneth COBB

IPC分类号： G06V10/32 , G06F18/2137 , G06F18/23 , G06F18/28 , G06N7/01 , G06V10/762 , G06V30/19 , G06V30/262 , H01B1/02

CPC分类号： G06V10/32 , G06F18/2137 , G06F18/23 , G06F18/28 , G06N7/01 , G06V10/762 , G06V30/19127 , G06V30/1914 , G06V30/268 , H01B1/02

摘要： Techniques are disclosed for generating a sequence of symbols based on input data for a neuro-linguistic model. The model may be used by a behavior recognition system to analyze the input data. A mapper component of a neuro-linguistic module in the behavior recognition system receives one or more normalized vectors generated from the input data. The mapper component generates one or more clusters based on a statistical distribution of the normalized vectors. The mapper component evaluates statistics and identifies statistically relevant clusters. The mapper component assigns a distinct symbol to each of the identified clusters.

9.

发明授权
Device anti-surveillance system 有权

公开(公告)号：US11756296B2

公开(公告)日：2023-09-12

申请号：US17225236

申请日：2021-04-08

申请人： Dell Products L.P.

发明人： Dhilip S. Kumar , Jaganathan Subramanian

IPC分类号： G06V20/40 , G06N3/08 , G06N3/04 , G06T7/70 , G06V10/22 , G06F18/21 , G06F18/24 , H04N23/60 , G06V10/44 , G06V30/18 , G06V30/19 , G08B5/22

CPC分类号： G06V20/41 , G06F18/21 , G06F18/24 , G06N3/04 , G06N3/08 , G06T7/70 , G06V10/22 , G06V10/454 , G06V30/18057 , G06V30/19127 , H04N23/60 , G06T2207/10016 , G08B5/22

摘要： A method comprises receiving one or more inputs captured by a camera of a device, and determining, using one or more machine learning models, whether the one or more inputs depict at least one object configured to capture a visual representation of a screen of the device. A recommendation is generated responsive to an affirmative determination, the recommendation comprising at least one action to prevent the capture of the visual representation of the screen of the device.

10.

发明授权
Automatic container loading and unloading apparatus and method 有权

公开(公告)号：US11748891B2

公开(公告)日：2023-09-05

申请号：US17799010

申请日：2021-02-26

申请人： Shanghai Master Matrix Information Technology Co., Ltd.

发明人： Junming Hong , Huan Chen

IPC分类号： G06T7/12 , G06T7/521 , B65G69/24 , B65G69/26 , B66C13/18 , G06T7/10 , G06V30/19 , G06T7/543 , G06T7/564 , B65G67/04

CPC分类号： G06T7/12 , B65G69/24 , B65G69/26 , B66C13/18 , G06T7/10 , G06T7/521 , G06T7/543 , G06T7/564 , G06V30/19127 , B65G67/04

摘要： The present invention provides an automatic container loading and unloading apparatus and method. The apparatus comprises: a data acquisition module, used for scanning a container truck panel to obtain laser point cloud data; a data preprocessing module, used for segmenting a laser point cloud on a surface of the container truck panel from the laser point cloud data; a key point extraction module, used for performing edge extraction on the laser point cloud on the surface of the container truck panel to obtain discrete points on edges of the keel of the container truck panel; and a straight line fitting module, used for performing random sample consensus straight line fitting on the discrete points on the edges of the keel of the container truck panel to obtain spatial straight lines of the edges of the keel of the truck panel. The automatic container loading and unloading apparatus and method provided by the present invention using spatial straight lines on the edges of the keel of the container truck panel for computing processing, thereby achieving stronger robustness and higher accuracy, so that a container is loaded onto the container truck panel with higher precision and lower calculation amount.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类