Efficient video processing via dynamic knowledge propagation

    公开(公告)号:US12067777B2

    公开(公告)日:2024-08-20

    申请号:US17654986

    申请日:2022-03-15

    IPC分类号: G06V20/40 G06V10/82 H04L67/04

    CPC分类号: G06V20/46 G06V10/82 H04L67/04

    摘要: Certain aspects of the present disclosure provide a method of processing video data. In one example, the method includes receiving input video data; sampling a first subset of clips from the input video data; providing the first subset of clips to a first component of a machine learning model to generate first output; sampling a second subset of clips from the input video data, wherein the second subset of clips comprises fewer clips than the first subset of clips; providing the second subset of clips to a second component of the machine learning model to generate a second output; aggregating the first output from the first component of the machine learning model with the second output from the second component of the machine learning model to generate aggregated output; and determining a characteristic of the input video data based on the aggregated output.

    EFFICIENT VIDEO PROCESSING VIA DYNAMIC KNOWLEDGE PROPAGATION

    公开(公告)号:US20220301310A1

    公开(公告)日:2022-09-22

    申请号:US17654986

    申请日:2022-03-15

    IPC分类号: G06V20/40 G06V10/82 H04L67/04

    摘要: Certain aspects of the present disclosure provide a method of processing video data. In one example, the method includes receiving input video data; sampling a first subset of clips from the input video data; providing the first subset of clips to a first component of a machine learning model to generate first output; sampling a second subset of clips from the input video data, wherein the second subset of clips comprises fewer clips than the first subset of clips; providing the second subset of clips to a second component of the machine learning model to generate a second output; aggregating the first output from the first component of the machine learning model with the second output from the second component of the machine learning model to generate aggregated output; and determining a characteristic of the input video data based on the aggregated output.