Patent search cpc:"G06V2201/10" Page 17

161.

发明授权
Context-aware event based annotation system for media asset 有权

公开(公告)号：US11776261B2

公开(公告)日：2023-10-03

申请号：US17246456

申请日：2021-04-30

Applicant: Spherex, Inc.

Inventor： Teresa Ann Phillips , Pranav Anand Joshi , Kira Michelle McStay

IPC: G06V20/40 , H04N21/472 , H04N21/466 , G06T7/00 , G06F18/40 , G06F18/214 , G06F18/241

CPC classification number: G06V20/41 , G06F18/214 , G06F18/241 , G06F18/40 , G06T7/0002 , H04N21/4662 , H04N21/472 , G06T2207/10016 , G06V20/44 , G06V2201/10

Abstract: Various embodiments described herein support or provide for annotation of a media asset, such as an audio asset or a video asset, based on one or more events identified within content of the media asset. In particular, some embodiments can determine one or more of the following details with respect to content of a given media asset, which can represent annotations that enable determination of contextual information for the given media asset: events; event classification labels for events; subclassifications labels for events; scenes comprising events; attributes of scenes; themes presented by the content; and title-level attributes of the given media asset.

162.

发明授权
Systems and methods for automated document image orientation correction 有权

公开(公告)号：US11776248B2

公开(公告)日：2023-10-03

申请号：US18047045

申请日：2022-10-17

Applicant: Optum, Inc.

Inventor： Rahul Bhaskar , Daryl Seiichi Furuyama , Daniel William James

IPC: G06V10/98 , G06N20/00 , G06V30/40 , G06V10/24 , G06F18/21 , G06F18/28 , G06V30/10

CPC classification number: G06V10/98 , G06F18/21 , G06F18/217 , G06F18/28 , G06N20/00 , G06V10/242 , G06V30/40 , G06V30/10 , G06V2201/10

Abstract: Systems and methods are configured for correcting the orientation of an image data object subject to optical character recognition (OCR) by receiving an original image data object, generating initial machine readable text for the original image data object via OCR, generating an initial quality score for the initial machine readable text via machine-learning models, determining whether the initial quality score satisfies quality criteria, upon determining that the initial quality score does not satisfy the quality criteria, generating a plurality of rotated image data objects each comprising the original image data object rotated to a different rotational position, generating a rotated machine readable text data object for each of the plurality of rotated image data objects and generating a rotated quality score for each of the plurality of rotated machine readable text data objects, and determining that one of the plurality of rotated quality scores satisfies the quality criteria.

163.

发明公开
CURATION OF CUSTOM WORKFLOWS USING MULTIPLE CAMERAS, WITH AI TO PROVIDE AWARENESS OF SITUATIONS 审中-公开

公开(公告)号：US20230305903A1

公开(公告)日：2023-09-28

申请号：US18203535

申请日：2023-05-30

Applicant: Scenera, Inc.

Inventor： David D. Lee , Andrew Augustine Wajs

IPC: G06F9/50 , G06F9/54 , G06N20/00 , G06V10/94 , G06F18/20 , G06F18/25 , H04N23/80 , H04N23/90 , G06V10/764 , G06V10/82 , G06V10/44 , G06V20/52

CPC classification number: G06F9/5072 , G06F9/542 , G06N20/00 , G06V10/94 , G06F18/285 , G06F18/251 , H04N23/80 , H04N23/90 , G06V10/764 , G06V10/82 , G06V10/454 , G06V20/52 , G06V2201/10 , G06V40/161

Abstract: A multi-layer technology stack includes a sensor layer including image sensors, a device layer, and a cloud layer, with interfaces between the layers. A method to curate different custom workflows for multiple applications include the following. Requirements for custom sets of data packages for the applications is received. The custom set of data packages include sensor data packages (e.g., SceneData) and contextual metadata packages that contextualize the sensor data packages (e.g., SceneMarks). Based on the received requirements and capabilities of components in the technology stack, the custom workflow for that application is deployed. This includes a selection, configuration and linking of components from the technology stack. The custom workflow is implemented in the components of the technology stack by transmitting workflow control packages directly and/or indirectly via the interfaces to the different layers.

164.

发明公开
OBJECT DETECTION DEVICE AND METHOD 审中-公开

公开(公告)号：US20230290104A1

公开(公告)日：2023-09-14

申请号：US18172765

申请日：2023-02-22

Applicant: Fujitsu Limited

Inventor： Moyuru YAMADA

IPC: G06V10/22 , G06V20/50 , G06V10/75

CPC classification number: G06V10/235 , G06V20/50 , G06V10/751 , G06V2201/07 , G06V2201/10

Abstract: An object detection device includes a processor that executes a procedure. The procedure includes: converting an input image into a first vector such that information related to an area of an object in the image is contained in the first vector; converting input text into a second vector such that information related to an order of appearance in the text of one or more word strings each indicating a detection target object included in the text is contained in the second vector; generating a third vector in which the first vector and the second vector have been reflected in a vector of initial values corresponding to detection target objects; and estimating whether or not a feature indicated by the third vector corresponds to a detection target object that appears at which number place in the text, and estimating a position of the detection target object in the image.

165.

发明公开
SYSTEMS AND METHODS TO ENHANCE SEGMENT DURING TRICK PLAY 审中-公开

公开(公告)号：US20230276110A1

公开(公告)日：2023-08-31

申请号：US18195558

申请日：2023-05-10

Applicant: Rovi Guides, Inc.

Inventor： Vishwas Sharadanagar Panchaksharaiah , Harshith Kumar Gejjegondanahally Sreekanth , Pawan Nagdeve , Anjum Makkar , Reda Harb

IPC: H04N21/81 , G06V20/40 , H04N21/435 , H04N21/442 , H04N21/45 , H04N21/472 , H04N21/4725

CPC classification number: H04N21/8133 , G06V20/49 , G06V20/41 , H04N21/435 , H04N21/44204 , H04N21/4532 , H04N21/47217 , H04N21/4725 , G06V2201/10

Abstract: Systems and methods are provided herein for including supplemental content with segments based on the complexity of the segment. This may be accomplished by a system receiving complexity information related to a media asset and user information related to a viewer to determine if one or more segments of the media asset is complex for the user. If the system receives a trick play command, from the user, during a segment categorized as complex, the system can use the complexity information and user information to generate supplemental content, facilitating better user understanding of the complex segment.

166.

发明授权
Video identification and analytical recognition system 有权

公开(公告)号：US11743431B2

公开(公告)日：2023-08-29

申请号：US16823710

申请日：2020-03-19

Applicant: James Carey

Inventor： James Carey

IPC: H04N7/18 , G06T7/246 , G06T7/20 , H04W8/00 , G06V20/52 , G06V40/10 , G06V40/16 , G08B13/196 , G06V40/20 , G06F18/25 , H04N23/45

CPC classification number: H04N7/181 , G06F18/251 , G06T7/20 , G06T7/246 , G06V20/52 , G06V20/53 , G06V40/10 , G06V40/161 , G06V40/23 , G08B13/19608 , G08B13/19613 , H04N7/185 , H04N23/45 , H04W8/005 , G06T2207/10016 , G06T2207/30232 , G06V40/172 , G06V40/25 , G06V2201/10

Abstract: An analytical recognition system is includes a video camera configured to capture video data of a subject and an antenna configured to capture mobile communication device data relating to a mobile communication device of the subject. The system further includes a data analytics module configured to: analyze the video data to determine at least one of a physical attribute or a movement attribute of the subject and generate; generate a first certainty match value based on the at least one of the physical attribute or the movement attribute of the subject; and perform a facial recognition analysis of the subject to obtain facial recognition data. The data analytics module is further configured to generate a second certainty match value based on the facial recognition data; generate a third certainty match value based on the mobile communication device data; and generate a combined certainty match value based on the first certainty match value, the second certainty match value, and the third certainty match value.

167.

发明公开
EVENT-DRIVEN STREAMING MEDIA INTERACTIVITY 审中-公开

公开(公告)号：US20230262296A1

公开(公告)日：2023-08-17

申请号：US18140236

申请日：2023-04-27

Applicant: Videokawa, Inc.

Inventor： Steven Selfors

IPC: H04N21/485 , H04N21/44 , H04N21/858 , H04N21/472 , H04N21/84 , H04N21/488 , G06F16/78 , G06V20/40 , G06V10/70

CPC classification number: H04N21/4856 , G06F16/7867 , G06V10/768 , G06V20/49 , H04N21/44 , H04N21/47217 , H04N21/4882 , H04N21/84 , H04N21/8586 , G06V2201/10

Abstract: Aspects described herein may provide systems, methods, and device for facilitating language learning using videos. Subtitles may be displayed in a first, target language or a second, native language during display of the video. On a pause event, both the target language subtitle and the native language subtitle may be displayed simultaneously to facilitate understanding. While paused, a user may select an option to be provided with additional contextual information indicating usage and context associated with one or more words of the target language subtitle. The user may navigate through previous and next subtitles with additional contextual information while the video is paused. Other aspects may allow users to create auto-continuous video loops of definable duration, and may allow users to generate video segments by searching an entire database of subtitle text, and may allow users create, save, share, and search video loops.

168.

发明公开
HDMI CUSTOMIZED AD INSERTION 审中-公开

公开(公告)号：US20230262289A1

公开(公告)日：2023-08-17

申请号：US17674339

申请日：2022-02-17

Applicant: Roku, Inc.

Inventor： Purushottam NARAYANA , Andre GODDARD ROSA

IPC: H04N21/458 , G06V20/40 , H04N21/81 , H04N21/4363 , H04N21/44 , H04N21/431

CPC classification number: H04N21/458 , G06V20/44 , G06V20/48 , H04N21/812 , H04N21/43635 , H04N21/44008 , H04N21/4312 , G06V2201/10

Abstract: Disclosed herein are system, apparatus, article of manufacture, method and/or computer program product embodiments, and/or combinations and sub-combinations thereof, for ad insertion by a display device coupled to a media device via a high-definition media interface (HDMI) connection, where the media device provides media content and/or a control signal. When the media device pauses the media content, the display device can determine that a pause event has occurred and insert an ad shown on the display device. Further, some embodiments include determining the context and/or content of the media content that is paused, and determining an ad that is customized to the determined context and/or content to be displayed on the display device. In some embodiments, the display device can determine additional information from the control signal that may also be used to determine the ad to be displayed on the display device.

169.

发明公开
IMAGE PROCESSING APPARATUS, IMAGE PROCESSING METHOD, IMAGE CAPTURING APPARATUS, AND STORAGE MEDIUM 审中-公开

公开(公告)号：US20230260299A1

公开(公告)日：2023-08-17

申请号：US18165992

申请日：2023-02-08

Applicant: Canon Kabushiki Kaisha

Inventor： Rui Nabeshima

IPC: G06V20/70 , H04N5/92 , G06F16/58

CPC classification number: G06V20/70 , G06F16/58 , H04N5/9201 , G06V2201/10

Abstract: An image processing apparatus comprises a generation unit configured to generate an image file of captured image data, the generation unit generating the image file with estimation results related to the image data added thereto as metadata, wherein the generation unit generates the metadata so that a first estimation result and a second estimation result are distinguishable from each other, the first estimation result being based on data that is included in the image file, the second estimation result being based on data that is not included in the image file.

170.

发明授权
Identifying and retrieving video metadata with perceptual frame hashing 有权

公开(公告)号：US11727375B2

公开(公告)日：2023-08-15

申请号：US17714533

申请日：2022-04-06

Applicant: Painted Dog, Inc.

Inventor： Jared Max Browarnik , Ken Aizawa

IPC: G06Q20/12 , G06F16/71 , G06F16/22 , G06F16/78 , G06F9/54 , H04N21/472 , H04N21/478 , G06V10/764 , G06V10/77 , G06V20/40

CPC classification number: G06Q20/123 , G06F9/547 , G06F16/2255 , G06F16/71 , G06F16/7867 , G06V10/764 , G06V10/7715 , G06V20/40 , H04N21/47217 , H04N21/47815 , G06V20/48 , G06V2201/10

Abstract: Shoppable video enables a viewer to identify and buy items appearing in a video. To retrieve information about the items in a frame of the video, the playback device generates a perceptual hash of that frame and uses that hash to query a first database storing perceptual hashes of different version of the video. The database query returns an identifier for the frame, which is then used to query a second database that store the item information. The results of this query are returned to the playback device, which shows them to the user, enabling the viewer to learn more about and possibly purchase the item. Using queries based on perceptual hashes of different versions of the video increases the likelihood of returning a match, despite formatting differences. And using separate hash and metadata databases makes it possible to update the metadata without changing the hashes.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification