-
Publication number: US11776261B2
Publication date: 2023-10-03
Application number: US17246456
Application date: 2021-04-30
Applicant: Spherex, Inc.
Inventor: Teresa Ann Phillips, Pranav Anand Joshi, Kira Michelle McStay
IPC: G06V20/40, H04N21/472, H04N21/466, G06T7/00, G06F18/40, G06F18/214, G06F18/241
CPC classification number: G06V20/41, G06F18/214, G06F18/241, G06F18/40, G06T7/0002, H04N21/4662, H04N21/472, G06T2207/10016, G06V20/44, G06V2201/10
Abstract: Various embodiments described herein support or provide for annotation of a media asset, such as an audio asset or a video asset, based on one or more events identified within content of the media asset. In particular, some embodiments can determine one or more of the following details with respect to content of a given media asset, which can represent annotations that enable determination of contextual information for the given media asset: events; event classification labels for events; subclassification labels for events; scenes comprising events; attributes of scenes; themes presented by the content; and title-level attributes of the given media asset.
-
Publication number: US11776248B2
Publication date: 2023-10-03
Application number: US18047045
Application date: 2022-10-17
Applicant: Optum, Inc.
Inventor: Rahul Bhaskar, Daryl Seiichi Furuyama, Daniel William James
CPC classification number: G06V10/98, G06F18/21, G06F18/217, G06F18/28, G06N20/00, G06V10/242, G06V30/40, G06V30/10, G06V2201/10
Abstract: Systems and methods are configured for correcting the orientation of an image data object subject to optical character recognition (OCR) by receiving an original image data object, generating initial machine readable text for the original image data object via OCR, generating an initial quality score for the initial machine readable text via machine-learning models, determining whether the initial quality score satisfies quality criteria, upon determining that the initial quality score does not satisfy the quality criteria, generating a plurality of rotated image data objects each comprising the original image data object rotated to a different rotational position, generating a rotated machine readable text data object for each of the plurality of rotated image data objects and generating a rotated quality score for each of the plurality of rotated machine readable text data objects, and determining that one of the plurality of rotated quality scores satisfies the quality criteria.
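The score-then-rotate loop in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the `ocr` and `score` callables are hypothetical stand-ins for an OCR engine and a machine-learning quality model, the image is a toy grid of characters, and the choice of three quarter-turn rotations is an assumption (the abstract only says "a plurality" of rotational positions).

```python
from typing import Callable, List, Optional, Tuple

Image = List[List[str]]  # toy stand-in for an image data object


def rotate90(image: Image) -> Image:
    """Rotate a row-major grid 90 degrees clockwise."""
    return [list(row) for row in zip(*image[::-1])]


def correct_orientation(
    image: Image,
    ocr: Callable[[Image], str],    # hypothetical OCR engine
    score: Callable[[str], float],  # hypothetical quality model
    threshold: float,               # assumed form of the "quality criteria"
) -> Optional[Tuple[int, str]]:
    """Return (clockwise rotation in degrees, text), or None if nothing passes."""
    # Initial OCR pass and quality check on the original image.
    text = ocr(image)
    if score(text) >= threshold:
        return 0, text

    # Quality check failed: OCR and score each rotated copy.
    candidates = []
    rotated = image
    for angle in (90, 180, 270):
        rotated = rotate90(rotated)
        rotated_text = ocr(rotated)
        candidates.append((score(rotated_text), angle, rotated_text))

    # Keep the best-scoring rotation only if it satisfies the criteria.
    best_score, best_angle, best_text = max(candidates, key=lambda c: c[0])
    if best_score >= threshold:
        return best_angle, best_text
    return None
```

Passing the OCR engine and the scorer in as callables keeps the sketch independent of any particular library; a real system would substitute its own components and its own quality criteria.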
-
Publication number: US20230305903A1
Publication date: 2023-09-28
Application number: US18203535
Application date: 2023-05-30
Applicant: Scenera, Inc.
Inventor: David D. Lee, Andrew Augustine Wajs
IPC: G06F9/50, G06F9/54, G06N20/00, G06V10/94, G06F18/20, G06F18/25, H04N23/80, H04N23/90, G06V10/764, G06V10/82, G06V10/44, G06V20/52
CPC classification number: G06F9/5072, G06F9/542, G06N20/00, G06V10/94, G06F18/285, G06F18/251, H04N23/80, H04N23/90, G06V10/764, G06V10/82, G06V10/454, G06V20/52, G06V2201/10, G06V40/161
Abstract: A multi-layer technology stack includes a sensor layer including image sensors, a device layer, and a cloud layer, with interfaces between the layers. A method to curate different custom workflows for multiple applications includes the following. Requirements for custom sets of data packages for the applications are received. The custom sets of data packages include sensor data packages (e.g., SceneData) and contextual metadata packages that contextualize the sensor data packages (e.g., SceneMarks). Based on the received requirements and capabilities of components in the technology stack, the custom workflow for that application is deployed. This includes a selection, configuration and linking of components from the technology stack. The custom workflow is implemented in the components of the technology stack by transmitting workflow control packages directly and/or indirectly via the interfaces to the different layers.
-
Publication number: US20230290104A1
Publication date: 2023-09-14
Application number: US18172765
Application date: 2023-02-22
Applicant: Fujitsu Limited
Inventor: Moyuru YAMADA
CPC classification number: G06V10/235, G06V20/50, G06V10/751, G06V2201/07, G06V2201/10
Abstract: An object detection device includes a processor that executes a procedure. The procedure includes: converting an input image into a first vector such that information related to an area of an object in the image is contained in the first vector; converting input text into a second vector such that information related to an order of appearance in the text of one or more word strings each indicating a detection target object included in the text is contained in the second vector; generating a third vector in which the first vector and the second vector have been reflected in a vector of initial values corresponding to detection target objects; and estimating, for a feature indicated by the third vector, the ordinal position in the text at which the corresponding detection target object appears, and estimating a position of the detection target object in the image.
-
Publication number: US20230276110A1
Publication date: 2023-08-31
Application number: US18195558
Application date: 2023-05-10
Applicant: Rovi Guides, Inc.
Inventor: Vishwas Sharadanagar Panchaksharaiah, Harshith Kumar Gejjegondanahally Sreekanth, Pawan Nagdeve, Anjum Makkar, Reda Harb
IPC: H04N21/81, G06V20/40, H04N21/435, H04N21/442, H04N21/45, H04N21/472, H04N21/4725
CPC classification number: H04N21/8133, G06V20/49, G06V20/41, H04N21/435, H04N21/44204, H04N21/4532, H04N21/47217, H04N21/4725, G06V2201/10
Abstract: Systems and methods are provided herein for including supplemental content with segments based on the complexity of the segment. This may be accomplished by a system receiving complexity information related to a media asset and user information related to a viewer to determine if one or more segments of the media asset are complex for the user. If the system receives a trick play command from the user during a segment categorized as complex, the system can use the complexity information and user information to generate supplemental content, facilitating better user understanding of the complex segment.
-
Publication number: US11743431B2
Publication date: 2023-08-29
Application number: US16823710
Application date: 2020-03-19
Applicant: James Carey
Inventor: James Carey
IPC: H04N7/18, G06T7/246, G06T7/20, H04W8/00, G06V20/52, G06V40/10, G06V40/16, G08B13/196, G06V40/20, G06F18/25, H04N23/45
CPC classification number: H04N7/181, G06F18/251, G06T7/20, G06T7/246, G06V20/52, G06V20/53, G06V40/10, G06V40/161, G06V40/23, G08B13/19608, G08B13/19613, H04N7/185, H04N23/45, H04W8/005, G06T2207/10016, G06T2207/30232, G06V40/172, G06V40/25, G06V2201/10
Abstract: An analytical recognition system includes a video camera configured to capture video data of a subject and an antenna configured to capture mobile communication device data relating to a mobile communication device of the subject. The system further includes a data analytics module configured to: analyze the video data to determine at least one of a physical attribute or a movement attribute of the subject; generate a first certainty match value based on the at least one of the physical attribute or the movement attribute of the subject; and perform a facial recognition analysis of the subject to obtain facial recognition data. The data analytics module is further configured to generate a second certainty match value based on the facial recognition data; generate a third certainty match value based on the mobile communication device data; and generate a combined certainty match value based on the first certainty match value, the second certainty match value, and the third certainty match value.
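The abstract does not say how the three certainty match values are combined. One plausible reading, shown here purely as an assumption, is a weighted average of values normalized to the range 0 to 1; the function name, the weights, and the value range are all hypothetical.

```python
from typing import Sequence


def combined_certainty(
    first: float,   # from physical or movement attributes
    second: float,  # from facial recognition data
    third: float,   # from mobile communication device data
    weights: Sequence[float] = (1.0, 1.0, 1.0),  # assumed; not from the source
) -> float:
    """Weighted average of three certainty match values in [0, 1]."""
    values = (first, second, third)
    if len(weights) != 3:
        raise ValueError("exactly three weights are required")
    if any(not 0.0 <= v <= 1.0 for v in values):
        raise ValueError("certainty values must lie in [0, 1]")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    return sum(w * v for w, v in zip(weights, values)) / total
```

With equal weights this reduces to a plain mean; raising one weight lets a more trusted signal dominate the combined value.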
-
Publication number: US20230262296A1
Publication date: 2023-08-17
Application number: US18140236
Application date: 2023-04-27
Applicant: Videokawa, Inc.
Inventor: Steven Selfors
IPC: H04N21/485, H04N21/44, H04N21/858, H04N21/472, H04N21/84, H04N21/488, G06F16/78, G06V20/40, G06V10/70
CPC classification number: H04N21/4856, G06F16/7867, G06V10/768, G06V20/49, H04N21/44, H04N21/47217, H04N21/4882, H04N21/84, H04N21/8586, G06V2201/10
Abstract: Aspects described herein may provide systems, methods, and devices for facilitating language learning using videos. Subtitles may be displayed in a first, target language or a second, native language during display of the video. On a pause event, both the target language subtitle and the native language subtitle may be displayed simultaneously to facilitate understanding. While paused, a user may select an option to be provided with additional contextual information indicating usage and context associated with one or more words of the target language subtitle. The user may navigate through previous and next subtitles with additional contextual information while the video is paused. Other aspects may allow users to create auto-continuous video loops of definable duration, and may allow users to generate video segments by searching an entire database of subtitle text, and may allow users to create, save, share, and search video loops.
-
Publication number: US20230262289A1
Publication date: 2023-08-17
Application number: US17674339
Application date: 2022-02-17
Applicant: Roku, Inc.
Inventor: Purushottam NARAYANA, Andre GODDARD ROSA
IPC: H04N21/458, G06V20/40, H04N21/81, H04N21/4363, H04N21/44, H04N21/431
CPC classification number: H04N21/458, G06V20/44, G06V20/48, H04N21/812, H04N21/43635, H04N21/44008, H04N21/4312, G06V2201/10
Abstract: Disclosed herein are system, apparatus, article of manufacture, method and/or computer program product embodiments, and/or combinations and sub-combinations thereof, for ad insertion by a display device coupled to a media device via a high-definition media interface (HDMI) connection, where the media device provides media content and/or a control signal. When the media device pauses the media content, the display device can determine that a pause event has occurred and insert an ad shown on the display device. Further, some embodiments include determining the context and/or content of the media content that is paused, and determining an ad that is customized to the determined context and/or content to be displayed on the display device. In some embodiments, the display device can determine additional information from the control signal that may also be used to determine the ad to be displayed on the display device.
-
Publication number: US20230260299A1
Publication date: 2023-08-17
Application number: US18165992
Application date: 2023-02-08
Applicant: Canon Kabushiki Kaisha
Inventor: Rui Nabeshima
CPC classification number: G06V20/70, G06F16/58, H04N5/9201, G06V2201/10
Abstract: An image processing apparatus comprises a generation unit configured to generate an image file of captured image data, the generation unit generating the image file with estimation results related to the image data added thereto as metadata, wherein the generation unit generates the metadata so that a first estimation result and a second estimation result are distinguishable from each other, the first estimation result being based on data that is included in the image file, the second estimation result being based on data that is not included in the image file.
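One way to keep the two kinds of estimation result distinguishable is to tag each result with the basis it was derived from. The sketch below is only an illustration of that idea: the field names, the `"in_file"` and `"external"` tags, and the JSON encoding are assumptions and are not taken from the patent.

```python
import json
from dataclasses import asdict, dataclass
from typing import List

# Hypothetical tags for where the estimation's input data lives.
VALID_BASES = ("in_file", "external")


@dataclass
class EstimationResult:
    label: str         # e.g. a detected subject class
    confidence: float  # estimator confidence
    basis: str         # "in_file": based on data stored in the image file
                       # "external": based on data not stored in the file


def build_metadata(results: List[EstimationResult]) -> str:
    """Serialize estimation results so that each one records its basis."""
    for result in results:
        if result.basis not in VALID_BASES:
            raise ValueError(f"unknown basis: {result.basis!r}")
    return json.dumps({"estimations": [asdict(r) for r in results]}, sort_keys=True)
```

A reader of the file can then filter on `basis` to tell which estimates could be reproduced from the file's own contents and which depended on data that was not saved with it.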
-
Publication number: US11727375B2
Publication date: 2023-08-15
Application number: US17714533
Application date: 2022-04-06
Applicant: Painted Dog, Inc.
Inventor: Jared Max Browarnik, Ken Aizawa
IPC: G06Q20/12, G06F16/71, G06F16/22, G06F16/78, G06F9/54, H04N21/472, H04N21/478, G06V10/764, G06V10/77, G06V20/40
CPC classification number: G06Q20/123, G06F9/547, G06F16/2255, G06F16/71, G06F16/7867, G06V10/764, G06V10/7715, G06V20/40, H04N21/47217, H04N21/47815, G06V20/48, G06V2201/10
Abstract: Shoppable video enables a viewer to identify and buy items appearing in a video. To retrieve information about the items in a frame of the video, the playback device generates a perceptual hash of that frame and uses that hash to query a first database storing perceptual hashes of different versions of the video. The database query returns an identifier for the frame, which is then used to query a second database that stores the item information. The results of this query are returned to the playback device, which shows them to the user, enabling the viewer to learn more about and possibly purchase the item. Using queries based on perceptual hashes of different versions of the video increases the likelihood of returning a match, despite formatting differences. And using separate hash and metadata databases makes it possible to update the metadata without changing the hashes.
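The two-stage lookup can be sketched with a simple average hash over a toy grayscale frame. The abstract does not name a hashing algorithm, so the average hash, the Hamming-distance tolerance, and the in-memory dictionaries standing in for the two databases are all assumptions made for illustration.

```python
from typing import Dict, List

Frame = List[List[int]]  # toy grayscale frame: rows of pixel intensities


def average_hash(frame: Frame) -> int:
    """One bit per pixel: 1 if the pixel is brighter than the frame mean."""
    pixels = [p for row in frame for p in row]
    mean = sum(pixels) / len(pixels)
    bits = 0
    for p in pixels:
        bits = (bits << 1) | (1 if p > mean else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two hashes."""
    return bin(a ^ b).count("1")


def lookup_items(
    frame: Frame,
    hash_db: Dict[int, str],        # first store: perceptual hash -> frame id
    item_db: Dict[str, List[str]],  # second store: frame id -> item info
    max_distance: int = 2,          # assumed match tolerance
) -> List[str]:
    """Hash the frame, find the nearest stored hash, then fetch its items."""
    query = average_hash(frame)
    nearest = min(hash_db, key=lambda h: hamming(query, h), default=None)
    if nearest is None or hamming(query, nearest) > max_distance:
        return []
    return item_db.get(hash_db[nearest], [])
```

Because the item information lives in `item_db`, it can be edited without recomputing any entry in `hash_db`, which mirrors the separation the abstract describes.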
-