Machine Learning Model-Based Video Compression

    公开(公告)号:US20220329876A1

    公开(公告)日:2022-10-13

    申请号:US17704692

    申请日:2022-03-25

    IPC分类号: H04N19/89 G06N3/08

    摘要: A system processing hard e executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content. The ML model-based video compression encoder further encodes the latent space representation of the image space residual to produce an encoded latent residual, encodes, using the trained image compression ML model, the latent space representation of the motion compensated video content to produce an encoded latent video content, and generates, using the encoded latent residual and the encoded latent video content, a compressed video content corresponding to the uncompressed video content.

    Segment quality-guided adaptive stream creation

    公开(公告)号:US11595716B2

    公开(公告)日:2023-02-28

    申请号:US17506489

    申请日:2021-10-20

    摘要: Embodiments provide for improved stream generation. A target average bitrate (TAB) segment is generated by encoding a first segment, of a plurality of segments in a video, using a first maximum average bitrate (MAB) of a plurality of MABs specified in an encoding ladder. An intermediate average bitrate (IAB) segment is generated by encoding the first segment using a first intermediate bitrate, wherein the first intermediate bitrate is lower than the first MAB. Upon receiving a request for the first segment at the first MAB, the IAB segment is output based at least in part on determining that a first quality score of the IAB segment is within a predefined tolerance of a second quality score of the TAB segment.

    Microdosing For Low Bitrate Video Compression

    公开(公告)号:US20220337852A1

    公开(公告)日:2022-10-20

    申请号:US17704722

    申请日:2022-03-25

    IPC分类号: H04N19/42

    摘要: A system includes a machine learning (ML) model-based video encoder configured to receive an uncompressed video sequence including multiple video frames, determine, from among the multiple video frames, a first video frame subset and a second video frame subset, encode the first video frame subset to produce a first compressed video frame subset, and identify a first decompression data for the first compressed video frame subset. The ML model-based video encoder is further configured to encode the second video frame subset to produce a second compressed video frame subset, and identify a second decompression data for the second compressed video frame subset. The first decompression data is specific to decoding the first compressed video frame subset but not the second compressed video frame subset, and the second decompression data is specific to decoding the second compressed video frame subset but not the first compressed video frame subset.

    Image Compression Using Normalizing Flows

    公开(公告)号:US20210142524A1

    公开(公告)日:2021-05-13

    申请号:US16811219

    申请日:2020-03-06

    摘要: According to one implementation, an image compression system includes a computing platform having a hardware processor and a system memory storing a software code. The hardware processor executes the software code to receive an input image, transform the input image to a latent space representation of the input image, and quantize the latent space representation of the input image to produce multiple quantized latents. The hardware processor further executes the software code to encode the quantized latents using a probability density function of the latent space representation of the input image, to generate a bitstream, and convert the bitstream into an output image corresponding to the input image. The probability density function of the latent space representation of the input image is obtained based on a normalizing flow mapping of one of the input image or the latent space representation of the input image.