-
公开(公告)号:US12112513B2
公开(公告)日:2024-10-08
申请号:US17511711
申请日:2021-10-27
发明人: Hyo Young Kim , Koo Hyun Park , Keun Taek Park
CPC分类号: G06V10/255 , G06F16/2282 , G06F18/22 , G06V10/751 , G06V30/1912 , G06V30/19127 , G06V30/10
摘要: A non-standard user interface object identification system includes an object candidate extractior that extracts one or more objects from an image, a first similarity analyzer that determines object type candidates of the one or more objects in accordance with similarities between the one or more objects and a standard user interface (UI) element, a second similarity analyzer that selects object type-specific weight values in accordance with layout characteristics of the one or more objects and determines object types of the one or more objects using the object type candidates and the object type-specific weight values, and an object identifier that receives type and characteristic information of a search target object and identifies the search target object in accordance with characteristic information and the object types of the one or more objects.
-
公开(公告)号:US12020483B2
公开(公告)日:2024-06-25
申请号:US17896666
申请日:2022-08-26
发明人: Yonit Hoffman , Irit Ofer , Avner Levi , Haim Sabo , Reut Amior
CPC分类号: G06V20/46 , G06F16/71 , G06F18/22 , G06F18/24 , G06V10/764 , G06V10/7715 , G06V20/41 , G06V30/19127
摘要: A system for indexing animated content receives detections extracted from a media file, where each one of the detections includes an image extracted from a corresponding frame of the media file that corresponds to a detected instance of an animated character. The system determines, for each of the received detections, an embedding defining a set of characteristics for the detected instance. The embedding associated with each detection is provided to a grouping engine that is configured to dynamically configure at least one grouping parameter based on a total number of the detections received. The grouping engine is also configured to sort the detections into groups using the grouping parameter and the embedding for each detection. A character ID is assigned to each one of the groups of detections, and the system indexes the groups of detections in a database in association with the character ID assigned to each group.
-
3.
公开(公告)号:US20240021000A1
公开(公告)日:2024-01-18
申请号:US18113178
申请日:2023-02-23
发明人: Xiameng QIN , Yulin LI , Xiaoqiang ZHANG , Ju HUANG , Qunyi XIE , Kun YAO
IPC分类号: G06V30/19 , G06V30/148
CPC分类号: G06V30/1918 , G06V30/15 , G06V30/19127 , G06V30/19147
摘要: There is provided an image-based information extraction model, method, and apparatus, a device, and a storage medium, which relates to the field of artificial intelligence (AI) technologies, specifically to fields of deep learning, image processing, computer vision technologies, and is applicable to optical character recognition (OCR) and other scenarios. A specific implementation solution involves: acquiring a to-be-extracted first image and a category of to-be-extracted information; and inputting the first image and the category into a pre-trained information extraction model to perform information extraction on the first image to obtain text information corresponding to the category.
-
公开(公告)号:US20230394862A1
公开(公告)日:2023-12-07
申请号:US18454032
申请日:2023-08-22
申请人: INTUIT INC.
发明人: Dominic Miguel ROSSI , Xiao Xiao
IPC分类号: G06V30/19 , G06T7/194 , G06V30/146 , G06V30/18 , G06V30/414 , G06V30/14
CPC分类号: G06V30/19173 , G06T7/194 , G06V30/19127 , G06V30/146 , G06V30/18 , G06V30/19147 , G06T2207/30176 , G06V30/414 , G06V30/1448 , G06T2207/20021 , G06T2207/20072 , G06T2207/20081 , G06T2207/20084 , G06V30/1916
摘要: A processor may receive an image and identify a plurality of characters in the image using a machine learning (ML) model. The processor may generate at least one word-level bounding box indicating one or more words including at least a subset of the plurality of characters and/or may generate at least one field-level bounding box indicating at least one field including at least a subset of the one or more words. The processor may overlay the at least one word-level bounding box and the at least one field-level bounding box on the image to form a masked image including a plurality of optically-recognized characters and one or more predicted fields for at least a subset of the plurality of optically-recognized characters.
-
5.
公开(公告)号:US20230377225A1
公开(公告)日:2023-11-23
申请号:US18121444
申请日:2023-03-14
发明人: Chengquan ZHANG , Yuechen YU , Liang WU
CPC分类号: G06T11/60 , G06V20/62 , G06V10/82 , G06V30/19127 , G06V30/1918 , G06V30/1444 , G06V30/19147 , G06V40/10
摘要: A method for training an image editing model includes steps described below. Covering processing is performed on a region of interest determined in an original image so that a background image sample is formed, and content corresponding to the region of interest is determined as a sample of content of interest; the background image sample and the sample of the content of interest are input into an image editing model; fusion processing is performed on a background image feature and a feature of the region of interest by using the image editing model so that a fusion feature is formed; an image reconstruction operation is performed according to the fusion feature by using the image editing model so that a reconstructed image is output; and optimization training is performed on the image editing model according to a loss relationship between the reconstructed image and the original image.
-
公开(公告)号:US12087068B2
公开(公告)日:2024-09-10
申请号:US18454032
申请日:2023-08-22
申请人: INTUIT INC.
发明人: Dominic Miguel Rossi , Xiao Xiao
IPC分类号: G06V30/412 , G06T7/194 , G06V30/14 , G06V30/146 , G06V30/18 , G06V30/19 , G06V30/414
CPC分类号: G06V30/19173 , G06T7/194 , G06V30/1448 , G06V30/146 , G06V30/18 , G06V30/19127 , G06V30/19147 , G06V30/1916 , G06V30/414 , G06T2207/20021 , G06T2207/20072 , G06T2207/20081 , G06T2207/20084 , G06T2207/30176
摘要: A processor may receive an image and identify a plurality of characters in the image using a machine learning (ML) model. The processor may generate at least one word-level bounding box indicating one or more words including at least a subset of the plurality of characters and/or may generate at least one field-level bounding box indicating at least one field including at least a subset of the one or more words. The processor may overlay the at least one word-level bounding box and the at least one field-level bounding box on the image to form a masked image including a plurality of optically-recognized characters and one or more predicted fields for at least a subset of the plurality of optically-recognized characters.
-
公开(公告)号:US20240281608A1
公开(公告)日:2024-08-22
申请号:US18172018
申请日:2023-02-21
申请人: Tencent America LLC
发明人: Dian YU , Linfeng SONG , Dong YU
IPC分类号: G06F40/284 , G06V30/19 , G06V30/416
CPC分类号: G06F40/284 , G06V30/19127 , G06V30/416
摘要: A method and apparatus that identifies one or more characters within a text; determines one or more informative sections within the text, the one or more informative sections providing information regarding a gender of the one or more characters within the text; selects a most informative section from the one or more informative sections; extracts unlabeled instances corresponding to the gender of the one or more characters from the most informative section; iteratively trains a multi-task model using unlabeled corpora, the multi-task model performing both speaker identification and gender identification; and labels the gender of the one or more characters based on the extracted unlabeled instances and the multi-task model.
-
公开(公告)号:US20240071037A1
公开(公告)日:2024-02-29
申请号:US18203185
申请日:2023-05-30
发明人: Ming-Jung SEOW , Gang XU , Tao YANG , Wesley Kenneth COBB
IPC分类号: G06V10/32 , G06F18/2137 , G06F18/23 , G06F18/28 , G06N7/01 , G06V10/762 , G06V30/19 , G06V30/262 , H01B1/02
CPC分类号: G06V10/32 , G06F18/2137 , G06F18/23 , G06F18/28 , G06N7/01 , G06V10/762 , G06V30/19127 , G06V30/1914 , G06V30/268 , H01B1/02
摘要: Techniques are disclosed for generating a sequence of symbols based on input data for a neuro-linguistic model. The model may be used by a behavior recognition system to analyze the input data. A mapper component of a neuro-linguistic module in the behavior recognition system receives one or more normalized vectors generated from the input data. The mapper component generates one or more clusters based on a statistical distribution of the normalized vectors. The mapper component evaluates statistics and identifies statistically relevant clusters. The mapper component assigns a distinct symbol to each of the identified clusters.
-
公开(公告)号:US11756296B2
公开(公告)日:2023-09-12
申请号:US17225236
申请日:2021-04-08
申请人: Dell Products L.P.
IPC分类号: G06V20/40 , G06N3/08 , G06N3/04 , G06T7/70 , G06V10/22 , G06F18/21 , G06F18/24 , H04N23/60 , G06V10/44 , G06V30/18 , G06V30/19 , G08B5/22
CPC分类号: G06V20/41 , G06F18/21 , G06F18/24 , G06N3/04 , G06N3/08 , G06T7/70 , G06V10/22 , G06V10/454 , G06V30/18057 , G06V30/19127 , H04N23/60 , G06T2207/10016 , G08B5/22
摘要: A method comprises receiving one or more inputs captured by a camera of a device, and determining, using one or more machine learning models, whether the one or more inputs depict at least one object configured to capture a visual representation of a screen of the device. A recommendation is generated responsive to an affirmative determination, the recommendation comprising at least one action to prevent the capture of the visual representation of the screen of the device.
-
公开(公告)号:US11748891B2
公开(公告)日:2023-09-05
申请号:US17799010
申请日:2021-02-26
发明人: Junming Hong , Huan Chen
IPC分类号: G06T7/12 , G06T7/521 , B65G69/24 , B65G69/26 , B66C13/18 , G06T7/10 , G06V30/19 , G06T7/543 , G06T7/564 , B65G67/04
CPC分类号: G06T7/12 , B65G69/24 , B65G69/26 , B66C13/18 , G06T7/10 , G06T7/521 , G06T7/543 , G06T7/564 , G06V30/19127 , B65G67/04
摘要: The present invention provides an automatic container loading and unloading apparatus and method. The apparatus comprises: a data acquisition module, used for scanning a container truck panel to obtain laser point cloud data; a data preprocessing module, used for segmenting a laser point cloud on a surface of the container truck panel from the laser point cloud data; a key point extraction module, used for performing edge extraction on the laser point cloud on the surface of the container truck panel to obtain discrete points on edges of the keel of the container truck panel; and a straight line fitting module, used for performing random sample consensus straight line fitting on the discrete points on the edges of the keel of the container truck panel to obtain spatial straight lines of the edges of the keel of the truck panel. The automatic container loading and unloading apparatus and method provided by the present invention using spatial straight lines on the edges of the keel of the container truck panel for computing processing, thereby achieving stronger robustness and higher accuracy, so that a container is loaded onto the container truck panel with higher precision and lower calculation amount.
-
-
-
-
-
-
-
-
-