Publication number: US20230377363A1
Publication date: 2023-11-23
Application number: US17663785
Application date: 2022-05-17
Applicant: ADOBE INC.
Inventor: Tong SUN, Nicholas Sergei REWKOWSKI, Nedim LIPKA, Jennifer Anne HEALEY, Curtis Michael WIGINGTON, Anshul MALIK
CPC classification number: G06V30/41, G06V40/107, G06T7/13, G06T2207/30176
Abstract: Systems and methods for machine learning based multipage scanning are provided. In one embodiment, one or more processing devices perform operations that include receiving a video stream that includes image frames that capture a plurality of pages of a document. The operations further include detecting, via a machine learning model that is trained to infer events from the video stream, a new page event. Detection of the new page event indicates that a page of the plurality of pages available for scanning has changed from a first page to a second page. Based on the detection of the new page event, the one or more processing devices capture an image frame of the page from the video stream. In some embodiments, the machine learning model detects events based on a weighted use of video data, inertial data, audio samples, image depth information, image statistics and/or other information.
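The flow in the abstract (weight several signals, detect a new page event, then capture a frame) can be illustrated with a minimal sketch. It is not the patented method. The trained machine learning model is replaced here by a plain weighted average of per-modality scores, and `Frame`, `fused_score`, `scan_pages`, the modality names, the weights, and the threshold are all hypothetical. The sketch also assumes that the frame to capture is the first one after the fused score falls back below the threshold, that is, once the page turn appears to have finished.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List


@dataclass
class Frame:
    index: int
    # Hypothetical per-modality "new page" scores in [0, 1],
    # e.g. {"video": 0.9, "audio": 0.8}. A modality may be missing.
    signals: Dict[str, float]


def fused_score(signals: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted average over the modalities present in both mappings.

    Stand-in for the trained model: an assumption, not the patent's model.
    """
    total = sum(weights[name] for name in signals if name in weights)
    if total == 0:
        return 0.0
    weighted = sum(weights[name] * value
                   for name, value in signals.items() if name in weights)
    return weighted / total


def scan_pages(frames: Iterable[Frame], weights: Dict[str, float],
               threshold: float = 0.5) -> List[int]:
    """Return indices of frames captured after each detected new page event."""
    captured: List[int] = []
    in_event = False
    for frame in frames:
        is_event = fused_score(frame.signals, weights) >= threshold
        # Assumption: capture on the falling edge, when the event has ended
        # and the new page is presumed to be at rest.
        if in_event and not is_event:
            captured.append(frame.index)
        in_event = is_event
    return captured
```

Normalizing by the weights of the modalities actually present lets a frame without, say, an audio sample still produce a comparable score. A real system would replace `fused_score` with the output of a trained model.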