Publication No.: US20230071845A1
Publication Date: 2023-03-09
Application No.: US17987462
Filing Date: 2022-11-15
Inventors: Isselmou Ould Dellahy, Shivajit Mohapatra, Anthony J. Braskich, Faisal Ishtiaq, Renxiang Li
IPC Classification: H04N21/234, H04N21/482, H04N21/488, H04N21/81, H04N21/235, H04N21/478, H04N21/4725, H04N21/266, H04N21/8545, G06V20/62
Abstract: Systems, methods, and devices for an interactive viewing experience by detecting on-screen data are disclosed. One or more frames of video data are analyzed to detect regions in the visual video content that contain text. A character recognition operation can be performed on the regions to generate textual data. Based on the textual data and the regions, a graphical user interface (GUI) definition can be generated. The GUI definition can be used to generate a corresponding GUI superimposed onto the visual video content, presenting users with controls and functionality for interacting with the text or enhancing the video content. Context metadata can be determined from external sources or by analyzing the continuity of audio and visual aspects of the video data. The context metadata can then be used to improve the character recognition or to inform the generation of the GUI.
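
The step from detected text regions to a GUI definition can be illustrated with a minimal Python sketch. It is not the method claimed in the application: the `TextRegion` structure, the `build_gui_definition` function, the `"search"` action, and the layout of the resulting dictionary are all hypothetical names chosen for illustration. Text detection and character recognition are assumed to have already run, so each region arrives with its recognized text.

```python
import json
from dataclasses import dataclass


@dataclass
class TextRegion:
    """A detected on-screen text region (hypothetical structure)."""
    x: int        # left edge in pixels
    y: int        # top edge in pixels
    width: int    # region width in pixels
    height: int   # region height in pixels
    text: str     # assumed output of the character recognition step


def build_gui_definition(regions, frame_width, frame_height):
    """Map recognized text regions to overlay controls.

    Bounds are stored as fractions of the frame size so that a client
    could scale the overlay to any display resolution.
    """
    controls = []
    for index, region in enumerate(regions):
        text = region.text.strip()
        if not text:
            continue  # skip regions where recognition produced no text
        controls.append({
            "id": f"region-{index}",
            "label": text,
            "bounds": {
                "left": region.x / frame_width,
                "top": region.y / frame_height,
                "width": region.width / frame_width,
                "height": region.height / frame_height,
            },
            "action": "search",  # hypothetical default interaction
        })
    return {"version": 1, "controls": controls}


if __name__ == "__main__":
    detected = [
        TextRegion(100, 50, 200, 40, "NEWS"),
        TextRegion(0, 0, 10, 10, "   "),  # empty recognition result
    ]
    print(json.dumps(build_gui_definition(detected, 1000, 500), indent=2))
```

Serializing the definition as JSON is one possible design choice: it keeps the description of the overlay separate from its rendering, so the device that superimposes the GUI on the video need not be the one that analyzed the frames.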