-
公开(公告)号:US20240312488A1
公开(公告)日:2024-09-19
申请号:US18618407
申请日:2024-03-27
Applicant: Rovi Guides, Inc.
Inventor: Jeffry Copps Robert Jose , Mithun Umesh , Sindhuja Chonat Sri
IPC: G11B27/34 , G06F18/214 , G06V20/40 , G10L25/78 , G11B27/031 , G11B27/06 , H04N21/2668 , H04N21/845 , H04N21/8549
CPC classification number: G11B27/34 , G06F18/214 , G06V20/46 , G06V20/49 , G10L25/78 , G11B27/031 , G11B27/06 , G06V2201/10 , H04N21/2668 , H04N21/8456 , H04N21/8549
Abstract: Systems and methods for generating individualized content trailers. Content such as a video is divided into segments each representing a set of common features. With reference to a set of stored user preferences, certain segments are selected as aligning with the user's interests. Each selected segment may then be assigned a label corresponding to the plot portion or element to which it belongs. A coherent trailer may then be assembled from the selected segments, ordered according to their plot elements. This allows a user to see not only segments containing subject matter that aligns with their interests, but also a set of such segments arranged to give the user an idea of the plot, and a sense of drama, increasing the likelihood of engagement with the content.
-
公开(公告)号:US20240304003A1
公开(公告)日:2024-09-12
申请号:US18666598
申请日:2024-05-16
Applicant: Tesla, Inc.
Inventor: Ashok Kumar Elluswamy , Matthew Bauch , Christopher Payne , Andrej Karpathy , Dhaval Shroff , Arvind Ramanandan , James Robert Howard Hakewill
IPC: G06V20/56 , G06F18/214 , G06N20/00 , G06T7/20 , G06V10/764 , G06V10/82
CPC classification number: G06V20/588 , G06F18/214 , G06N20/00 , G06T7/20 , G06V10/764 , G06V10/82 , G06T2207/20076 , G06T2207/20081 , G06T2207/20084 , G06T2207/30241 , G06T2207/30256 , G06V2201/10
Abstract: A processor coupled to memory is configured to receive image data based on an image captured by a camera of a vehicle. The image data is used as a basis of an input to a trained machine learning model trained to predict a three-dimensional trajectory of a machine learning feature. The three-dimensional trajectory of the machine learning feature is provided for automatically controlling the vehicle.
-
公开(公告)号:US20240303747A1
公开(公告)日:2024-09-12
申请号:US18597575
申请日:2024-03-06
Inventor: Rick Lovings , Jody A. Thoele , Erik Skyten , Joann C. Yant , Joshua Sutter , Miguel A. Garcia-Peguero , Shawn R. Harbaugh , Tishauna Wilson
CPC classification number: G06Q40/08 , G06V10/761 , G06V2201/10
Abstract: A computer system may include at least one memory and at least one processor in communication with the at least one memory. The processor may be programmed to: (1) receive photographic data including one or more images of a structure; (2) in response to receiving the photographic data, apply the photographic data to a structure assessment model configured to determine a structural status of the structure, wherein the structure assessment model is trained using historical photographic data including a plurality of historical images of structures; (3) receive an output from the structure assessment model, wherein the output at least one determined characteristic of the structure; and/or (4) based upon the output, transmit a message to a user computing device associated with the structure that causes display of the determined characteristic.
-
公开(公告)号:US20240296225A1
公开(公告)日:2024-09-05
申请号:US18177707
申请日:2023-03-02
Applicant: THE BOEING COMPANY
Inventor: Amir AFRASIABI
IPC: G06F21/56 , G06V10/774 , G06V10/86
CPC classification number: G06F21/566 , G06V10/774 , G06V10/86 , G06F2221/034 , G06V2201/10
Abstract: Techniques for adversarial attack avoidance for machine learning (ML) are disclosed. These techniques include receiving one or more images at a trained ML model and receiving attack data at the ML model. The techniques further include predicting an object depicted in the one or more images using the ML model, based on the one or more images, metadata relating to the one or more images, and the attack data. The ML model uses the metadata to prevent the attack data from changing a result of the predicting.
-
公开(公告)号:US12062026B2
公开(公告)日:2024-08-13
申请号:US18353426
申请日:2023-07-17
Applicant: Painted Dog, Inc.
Inventor: Jared Max Browarnik , Ken Aizawa
IPC: G06Q20/12 , G06F9/54 , G06F16/22 , G06F16/71 , G06F16/78 , G06V10/764 , G06V10/77 , G06V20/40 , H04N21/472 , H04N21/478
CPC classification number: G06Q20/123 , G06F9/547 , G06F16/2255 , G06F16/71 , G06F16/7867 , G06V10/764 , G06V10/7715 , G06V20/40 , H04N21/47217 , H04N21/47815 , G06V20/48 , G06V2201/10
Abstract: Shoppable video enables a viewer to identify and buy items appearing in a video. To retrieve information about the items in a frame of the video, the playback device generates a perceptual hash of that frame and uses that hash to query a first database storing perceptual hashes of different version of the video. The database query returns an identifier for the frame, which is then used to query a second database that store the item information. The results of this query are returned to the playback device, which shows them to the user, enabling the viewer to learn more about and possibly purchase the item. Using queries based on perceptual hashes of different versions of the video increases the likelihood of returning a match, despite formatting differences. And using separate hash and metadata databases makes it possible to update the metadata without changing the hashes.
-
公开(公告)号:US12061989B2
公开(公告)日:2024-08-13
申请号:US17187322
申请日:2021-02-26
Applicant: Capital One Services, LLC
Inventor: Sunil Subrahmanyam Vasisht , Geoffrey Dagley , Qiaochu Tang , Sean Reddy , Jason Richard Hoover , Stephen Michael Wylie , Micah Price
IPC: G06K9/62 , G06F16/583 , G06F18/214 , G06F18/24 , G06N3/02 , G06N3/045 , G06N3/048 , G06N3/084 , G06N20/00 , G06V10/10 , G06V10/44 , G06V10/764 , G06V10/774 , G06V20/20 , G06F18/23 , G06N5/01 , G06N20/20
CPC classification number: G06N3/084 , G06F16/583 , G06F18/214 , G06F18/24 , G06N3/02 , G06N3/045 , G06N3/048 , G06N20/00 , G06V10/17 , G06V10/454 , G06V10/764 , G06V10/774 , G06V20/20 , G06F18/23 , G06N5/01 , G06N20/20 , G06V2201/10
Abstract: An artificial intelligence system for identifying attributes in an image. The system may include a processor in communication with a client device; and a storage medium. The storage medium may store instructions that, when executed, configure the processor to perform operations including: extracting first features; categorizing the first images in a first group or a second group; modifying first metadata associated with each image in the first images to include a binary label; calculating a classification function; classifying a second plurality of images using the classification function; extracting second features from the second images classified in the first group; categorizing the second images in the first group by attribute; calculating an attribute identification function that identifies attributes of the second images; and identifying at least one attribute associated with a client image using the attribute identification function, the client image being received from the client device.
-
公开(公告)号:US20240267491A1
公开(公告)日:2024-08-08
申请号:US18434025
申请日:2024-02-06
Applicant: MILESTONE SYSTEMS A/S
Inventor: Oleksii ZATVORNYTSKYI , Peter Posselt VERGMANN
IPC: H04N7/18 , G06F16/735 , G06F16/738 , G06V10/94 , G06V20/40 , G06V20/52
CPC classification number: H04N7/181 , G06F16/735 , G06F16/738 , G06V10/945 , G06V20/40 , G06V20/52 , G06V2201/10
Abstract: The present disclosure generally relates to video surveillance systems and optionally computer-implemented video management methods for video surveillance systems. A video management system may be configured to display search results of a plurality of video streams as respective thumbnails on a geo-map at respective positions of the search results within a surveillance area.
-
公开(公告)号:US20240265724A1
公开(公告)日:2024-08-08
申请号:US18641155
申请日:2024-04-19
Applicant: Leigh M. Rothschild
Inventor: Leigh M. Rothschild
CPC classification number: G06V40/10 , G06V10/74 , G06V40/50 , H04W76/14 , G06V2201/10
Abstract: A system and a method for identifying and tagging individuals present in an image are disclosed. The method comprises detecting a second device present in proximity of a first device, for establishing a connection. The connection may be established while the first device enters in a camera mode. Immediately after the first device captured an image, the first device may receive identity information of individuals from the second device. The identity information of individuals may comprise at least one of images and personal details of the individuals. Based on the received identity information, the first device may identify the individuals present in the image. The identified individuals present in the image may be tagged using their corresponding identities. Such tagging information may be stored in metadata of the image for a later usage.
-
公开(公告)号:US12056945B2
公开(公告)日:2024-08-06
申请号:US17098902
申请日:2020-11-16
Applicant: KYOCERA DOCUMENT SOLUTIONS INC.
Inventor: Andrii Matiukhov
IPC: G06V30/19 , G06F18/20 , G06F18/21 , G06F18/214 , G06F18/40 , G06N20/00 , G06V10/40 , G06V30/416
CPC classification number: G06V30/19 , G06F18/214 , G06F18/217 , G06F18/285 , G06F18/40 , G06N20/00 , G06V10/40 , G06V30/416 , G06V2201/10
Abstract: A method performed by a computing system includes receiving, by a document data extraction system (DDES), image data associated with a document. The DDES extracts, via optical character recognition (OCR) logic of the DDES, metadata from the image data. The metadata specifies sequences of text content items and text content item features associated with each text content item of the sequences of text content items. A machine learning logic (MLL) module of the DDES determines, based on the sequences of text content items and the text content item features, one or more text content items associated with a key. The DDES communicates information that specifies the key and a corresponding value that is associated with the one or more text content items that are associated with the key to a terminal.
-
公开(公告)号:US12051209B2
公开(公告)日:2024-07-30
申请号:US17233986
申请日:2021-04-19
Applicant: Chooch Intelligence Technologies Co.
Inventor: Hakan Robert Gultekin , Emrah Gultekin
IPC: G06T7/00 , G06F16/78 , G06F16/783 , G06F18/214 , G06N20/00 , G06T7/20 , G06T7/215 , G06V10/82 , G06V20/40
CPC classification number: G06T7/20 , G06F16/7837 , G06F16/7867 , G06F18/214 , G06N20/00 , G06T7/215 , G06V10/82 , G06V20/41 , G06T2207/10016 , G06T2207/20081 , G06V2201/10
Abstract: Embodiments of the present invention train multiple Perception models to predict contextual metadata (tags) with respect to target content items. By extracting context from content items, and generating associations among the Perception models, individual Perceptions trigger one another based on the extracted context to generate a more robust set of contextual metadata. A Perception Identifier predicts core tags that make coarse distinctions among content items at relatively higher levels of abstraction, while also triggering other Perception models to predict additional perception tags at lower levels of abstraction. A Dense Classifier identifies sub-content items at various levels of abstraction, and facilitates the iterative generation of additional dense tags across integrated Perceptions. Class-specific thresholds are generated with respect to individual classes of each Perception to address the inherent sampling bias that results from the varying number and quality of training samples (across different classes of content items) available to train each Perception.
-
-
-
-
-
-
-
-
-