-
公开(公告)号:US11722702B2
公开(公告)日:2023-08-08
申请号:US17473734
申请日:2021-09-13
发明人: Mandayam Narasimhan
IPC分类号: H04N19/70 , H04N19/30 , H04N19/127 , H04N19/146 , H04N19/187 , H04N19/423 , H04N7/52 , H04N21/2343 , H04N21/44 , H04N21/845 , H04N21/434 , H04N19/44 , H04N19/152 , H04N19/61
CPC分类号: H04N19/70 , H04N7/52 , H04N19/127 , H04N19/146 , H04N19/152 , H04N19/187 , H04N19/30 , H04N19/423 , H04N19/44 , H04N21/234327 , H04N21/4343 , H04N21/44004 , H04N21/8451 , H04N19/61
摘要: A method of decoding an enhanced video stream composed of base layer video access units and enhancement layer video access units, each access unit comprising a plurality of syntax structures, includes passing the syntax structures of the base layer access units to a base layer buffer, passing syntax structures of the enhancement layer access units to an enhancement layer buffer, outputting the syntax structures passed to the base layer buffer in a predetermined sequence, outputting the syntax structures passed to the enhancement layer buffer in a predetermined sequence, and recombining the sequences of syntax structures output by the base layer buffer and the enhancement layer buffer respectively to form a complete enhanced access unit, composed of base layer syntax structures and enhancement layer syntax structures in a predetermined sequence.
-
公开(公告)号:US20230071845A1
公开(公告)日:2023-03-09
申请号:US17987462
申请日:2022-11-15
发明人: Isselmou Ould Dellahy , Shivajit Mohapatra , Anthony J. Braskich , Faisal Ishtiaq , Renxiang Li
IPC分类号: H04N21/234 , H04N21/482 , H04N21/488 , H04N21/81 , H04N21/235 , H04N21/478 , H04N21/4725 , H04N21/266 , H04N21/8545 , G06V20/62
摘要: Systems, methods, and devices for an interactive viewing experience by detecting on-screen data are disclosed. One or more frames of video data are analyzed to detect regions in the visual video content that contain text. A character recognition operation can be performed on the regions to generate textual data. Based on the textual data and the regions, a graphical user interface (GUI) definition to can be generated. The GUI definition can be used to generate a corresponding GUI superimposed onto the visual video content to present users with controls and functionality with which to interact with the text or enhance the video content. Context metadata can be determined from external sources or by analyzing the continuity of audio and visual aspects of the video data. The context metadata can then be used to improve the character recognition or inform the generation of the GUI.
-