FEATURE-BASED VIDEO ANNOTATION
    2.
    发明申请

    公开(公告)号:US20220207873A1

    公开(公告)日:2022-06-30

    申请号:US17548859

    申请日:2021-12-13

    Applicant: Google LLC

    Abstract: A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method identifies an entity from a plurality of entities identifying characteristics of video items. The computer-implemented method selects a set of features correlated with the entity based on a value of a feature of a plurality of features, determines a classifier for the entity using the set of features, and determines an aggregation calibration function for the entity based on the set of features. The computer-implemented method selects a video frame from a video item, where the video frame having associated features, and determines a probability of existence of the entity based on the associated features using the classifier and the aggregation calibration function.

    IMAGE MANIPULATION BY TEXT INSTRUCTION
    5.
    发明公开

    公开(公告)号:US20240212246A1

    公开(公告)日:2024-06-27

    申请号:US18400629

    申请日:2023-12-29

    Applicant: Google LLC

    Abstract: A method for generating an output image from an input image and an input text instruction that specifies a location and a modification of an edit applied to the input image using a neural network is described. The neural network includes an image encoder, an image decoder, and an instruction attention network. The method includes receiving the input image and the input text instruction; extracting, from the input image, an input image feature that represents features of the input image using the image encoder; generating a spatial feature and a modification feature from the input text instruction using the instruction attention network; generating an edited image feature from the input image feature, the spatial feature and the modification feature; and generating the output image from the edited image feature using the image decoder.

    SUMMARIZING VIDEO CONTENT
    6.
    发明申请

    公开(公告)号:US20210312186A1

    公开(公告)日:2021-10-07

    申请号:US17352067

    申请日:2021-06-18

    Applicant: Google LLC

    Abstract: Systems and methods of automatically extracting summaries of video content are described herein. A data processing system can access, from a video database, a first video content element including a first plurality of frame. The data processing system can select an intervallic subset of the first plurality of frames of the first video content element. The data processing system can calculate, for each of a plurality of further subsets comprising a predetermined number of frames from the intervallic subset, a score for the further subset. The data processing system can identify, from the plurality of further subsets, a further subset having a highest score. The data processing system can select a portion of the first video content element comprising the frames of the further subset having the highest score. The data processing system can generate a second video content element comprising the selected portion of the first video content element.

    Image manipulation by text instruction

    公开(公告)号:US11562518B2

    公开(公告)日:2023-01-24

    申请号:US17340671

    申请日:2021-06-07

    Applicant: Google LLC

    Abstract: A method for generating an output image from an input image and an input text instruction that specifies a location and a modification of an edit applied to the input image using a neural network is described. The neural network includes an image encoder, an image decoder, and an instruction attention network. The method includes receiving the input image and the input text instruction; extracting, from the input image, an input image feature that represents features of the input image using the image encoder; generating a spatial feature and a modification feature from the input text instruction using the instruction attention network; generating an edited image feature from the input image feature, the spatial feature and the modification feature; and generating the output image from the edited image feature using the image decoder.

    IMAGE MANIPULATION BY TEXT INSTRUCTION

    公开(公告)号:US20210383584A1

    公开(公告)日:2021-12-09

    申请号:US17340671

    申请日:2021-06-07

    Applicant: Google LLC

    Abstract: A method for generating an output image from an input image and an input text instruction that specifies a location and a modification of an edit applied to the input image using a neural network is described. The neural network includes an image encoder, an image decoder, and an instruction attention network. The method includes receiving the input image and the input text instruction; extracting, from the input image, an input image feature that represents features of the input image using the image encoder; generating a spatial feature and a modification feature from the input text instruction using the instruction attention network; generating an edited image feature from the input image feature, the spatial feature and the modification feature; and generating the output image from the edited image feature using the image decoder.

    FEATURE-BASED VIDEO ANNOTATION
    9.
    发明申请

    公开(公告)号:US20200082173A1

    公开(公告)日:2020-03-12

    申请号:US16687118

    申请日:2019-11-18

    Applicant: Google LLC

    Abstract: A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method identifies an entity from a plurality of entities identifying characteristics of video items. The computer-implemented method selects a set of features correlated with the entity based on a value of a feature of a plurality of features, determines a classifier for the entity using the set of features, and determines an aggregation calibration function for the entity based on the set of features. The computer-implemented method selects a video frame from a video item, where the video frame having associated features, and determines a probability of existence of the entity based on the associated features using the classifier and the aggregation calibration function.

Patent Agency Ranking