Predicting video edits from text-based conversations using neural networks

    Publication number: US12238451B2

    Publication date: 2025-02-25

    Application number: US18055301

    Filing date: 2022-11-14

    Applicant: Adobe Inc.

    Abstract: Embodiments are disclosed for predicting, using neural networks, editing operations for application to a video sequence based on processing conversational messages by a video editing system. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving an input including a video sequence and text sentences, the text sentences describing a modification to the video sequence, mapping, by a first neural network, content of the text sentences describing the modification to the video sequence to a candidate editing operation, processing, by a second neural network, the video sequence to predict parameter values for the candidate editing operation, and generating a modified video sequence by applying the candidate editing operation with the predicted parameter values to the video sequence.
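The data flow described in this abstract (text to candidate operation, video to parameter values, then application of the edit) can be sketched as follows. This is only an illustration of the three stages, not the patented method: both "networks" are replaced by trivial hand-written stand-ins, and every name here (`map_text_to_operation`, `predict_parameters`, `apply_edit`, the `brightness` operation and its `gain` parameter) is hypothetical and does not come from the patent.

```python
from dataclasses import dataclass
from typing import Dict, List

# A frame is a grid of grayscale pixel values in [0, 1]; a video is a list of frames.
Frame = List[List[float]]
Video = List[Frame]


@dataclass
class Edit:
    operation: str
    params: Dict[str, float]


def map_text_to_operation(sentences: List[str]) -> str:
    """Stand-in for the first neural network (assumption: keyword matching)."""
    text = " ".join(sentences).lower()
    if "bright" in text or "dark" in text:
        return "brightness"
    return "identity"


def predict_parameters(operation: str, video: Video) -> Dict[str, float]:
    """Stand-in for the second neural network (assumption: a fixed heuristic).

    For the hypothetical "brightness" operation, pick the gain that moves the
    mean pixel value to 0.5.
    """
    if operation != "brightness":
        return {}
    pixels = [p for frame in video for row in frame for p in row]
    mean = sum(pixels) / len(pixels)
    return {"gain": 0.5 / mean if mean > 0 else 1.0}


def apply_edit(video: Video, edit: Edit) -> Video:
    """Apply the chosen operation with the predicted parameters to every frame."""
    gain = edit.params.get("gain", 1.0)
    return [[[min(1.0, p * gain) for p in row] for row in frame] for frame in video]


def edit_video(video: Video, sentences: List[str]) -> Video:
    """Run the three stages in order: text -> operation -> parameters -> edit."""
    operation = map_text_to_operation(sentences)
    edit = Edit(operation, predict_parameters(operation, video))
    return apply_edit(video, edit)
```

Keeping operation selection separate from parameter prediction mirrors the split in the abstract: the text decides what kind of edit to make, and the video content decides how strongly to apply it.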

    VIDEO FRAME TRANSPORT
    Invention publication

    Publication number: US20240314267A1

    Publication date: 2024-09-19

    Application number: US18122315

    Filing date: 2023-03-16

    Inventor: Wing-Chi Chow

    CPC classification number: H04N7/007 H04N7/0122 H04N7/025

    Abstract: In response to a video aspect ratio of a frame of video not matching an aspect ratio of a display panel of a display device, a source device of a processing system transmits only the frame to the display device and metadata indicating that the display device is to generate bars for letterboxing or pillarboxing. By generating the bars for letterboxing or pillarboxing at the display device instead of transmitting the bars from the source device to the display device or storing the bars at a frame buffer of the display device, the processing system conserves power and bandwidth.
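The metadata mentioned in this abstract could, for example, carry the bar orientation and thickness so the display can draw the bars itself. The sketch below shows only the underlying arithmetic, assuming the frame is scaled to fit the panel while preserving its aspect ratio. The `BarMetadata` structure and `bar_metadata` function are hypothetical and are not taken from the patent application.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BarMetadata:
    mode: str      # "none", "letterbox" (bars top/bottom), or "pillarbox" (bars left/right)
    bar_px: int    # thickness of each bar, in panel pixels
    scaled_w: int  # width of the frame after scaling to fit the panel
    scaled_h: int  # height of the frame after scaling to fit the panel


def bar_metadata(frame_w: int, frame_h: int, panel_w: int, panel_h: int) -> BarMetadata:
    """Decide which bars the display should generate for a given frame and panel."""
    # Compare aspect ratios by cross-multiplication to stay in exact integers.
    lhs = frame_w * panel_h
    rhs = panel_w * frame_h
    if lhs == rhs:
        # Aspect ratios match: the frame fills the panel, no bars needed.
        return BarMetadata("none", 0, panel_w, panel_h)
    if lhs > rhs:
        # Frame is wider than the panel: fit the width, bars go above and below.
        scaled_h = panel_w * frame_h // frame_w
        return BarMetadata("letterbox", (panel_h - scaled_h) // 2, panel_w, scaled_h)
    # Frame is narrower than the panel: fit the height, bars go left and right.
    scaled_w = panel_h * frame_w // frame_h
    return BarMetadata("pillarbox", (panel_w - scaled_w) // 2, scaled_w, panel_h)
```

For instance, a 2560x1080 frame on a 1920x1080 panel is scaled to 1920x810, leaving a 135-pixel bar above and below. Under this sketch the source would send only the frame plus a few integers rather than the bar pixels themselves.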

    PREDICTING VIDEO EDITS FROM TEXT-BASED CONVERSATIONS USING NEURAL NETWORKS

    Publication number: US20240163393A1

    Publication date: 2024-05-16

    Application number: US18055301

    Filing date: 2022-11-14

    Applicant: Adobe Inc.

    CPC classification number: H04N7/002 G06T11/60

    Abstract: Embodiments are disclosed for predicting, using neural networks, editing operations for application to a video sequence based on processing conversational messages by a video editing system. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving an input including a video sequence and text sentences, the text sentences describing a modification to the video sequence, mapping, by a first neural network, content of the text sentences describing the modification to the video sequence to a candidate editing operation, processing, by a second neural network, the video sequence to predict parameter values for the candidate editing operation, and generating a modified video sequence by applying the candidate editing operation with the predicted parameter values to the video sequence.
