Fast multi-rate encoding for adaptive HTTP streaming

    公开(公告)号:US11546401B2

    公开(公告)日:2023-01-03

    申请号:US17090802

    申请日:2020-11-05

    Applicant: BITMOVIN, INC.

    Abstract: According to embodiments of the disclosure, information of higher and lower quality encoded video segments is used to limit Rate-Distortion Optimization (RDO) for each Coding Unit Tree (CTU). A method first encodes the highest bit-rate segment and consequently uses it to encode the lowest bit-rate video segment. Block structure and selected reference frame of both highest and lowest bit-rate video segments are used to predict and shorten RDO process for each CTU in middle bit-rates. The method delays just one frame using parallel processing. This approach provides time-complexity reduction compared to the reference software for middle bit-rates while degradation is negligible.

    CHUNK-BASED PREDICTION ADAPTATION LOGIC

    公开(公告)号:US20210289013A1

    公开(公告)日:2021-09-16

    申请号:US17336009

    申请日:2021-06-01

    Applicant: BITMOVIN, INC.

    Abstract: A multimedia player downloads chunks (parts of the segment file) during the download of a segment of a stream of segments of a low-latency stream. The first chunks of a segment may be downloaded while the segment is still being written to the CDN server. A chunk-based prediction adaptation logic uses throughput measurements on a chunk instead of a segment and specifically looks at bursts in a sliding window. This data is used to build a prediction of future throughput by applying linear adaptive filter algorithms that may rely on recursive least squares. This adaptation logic leads to very accurate bandwidth predictions and as a consequence, better user experience, compared to existing adaptation algorithms.

    Scalable Per-Title Encoding
    3.
    发明申请

    公开(公告)号:US20240406418A1

    公开(公告)日:2024-12-05

    申请号:US18734842

    申请日:2024-06-05

    Applicant: BITMOVIN, INC.

    Abstract: A scalable per-title encoding technique may include detecting scene cuts in an input video received by an encoding network or system, generating segments of the input video, performing per-title encoding of a segment of the input video, training a deep neural network (DNN) for each representation of the segment, thereby generating a trained DNN, compressing the trained DNN, thereby generating a compressed trained DNN, and generating an enhanced bitrate ladder including metadata comprising the compressed trained DNN. In some embodiments, the method also may include generating a base layer bitrate ladder for CPU devices, and providing the enhanced bitrate ladder for GPU-available devices.

    OPTIMIZED FAST MULTIPASS VIDEO TRANSCODING
    5.
    发明公开

    公开(公告)号:US20230269386A1

    公开(公告)日:2023-08-24

    申请号:US18016577

    申请日:2021-05-27

    Applicant: BITMOVIN, INC.

    CPC classification number: H04N19/40 H04N19/12 H04N21/234309

    Abstract: A computer-implemented method and system for transcoding input video content is provided. The method includes decoding the input video content from a first format to a first set of raw video data. Encoding the first set of raw video data into an intermediate format and storing the video data in the second intermediate format. Also encoding the first set of raw video data into a third desired output format to extract video parameters and determining optimized encoding parameters for encoding the video content into the final output video. The method includes decoding the stored video data encoded into the intermediate format into a second set of raw video data and encoding the second set of raw video data into the third desired output format using the optimized encoding parameters to generate the final output video.

    Scalable Per-Title Encoding
    7.
    发明申请

    公开(公告)号:US20230118010A1

    公开(公告)日:2023-04-20

    申请号:US17965124

    申请日:2022-10-13

    Applicant: BITMOVIN, INC.

    Abstract: A scalable per-title encoding technique may include detecting scene cuts in an input video received by an encoding network or system, generating segments of the input video, performing per-title encoding of a segment of the input video, training a deep neural network (DNN) for each representation of the segment, thereby generating a trained DNN, compressing the trained DNN, thereby generating a compressed trained DNN, and generating an enhanced bitrate ladder including metadata comprising the compressed trained DNN. In some embodiments, the method also may include generating a base layer bitrate ladder for CPU devices, and providing the enhanced bitrate ladder for GPU-available devices.

    Per-Title Encoding Using Spatial and Temporal Resolution Downscaling

    公开(公告)号:US20230007298A1

    公开(公告)日:2023-01-05

    申请号:US17853549

    申请日:2022-06-29

    Applicant: BITMOVIN, INC.

    Abstract: Techniques relating to per-title encoding using spatial and temporal resolution downscaling is disclosed. A method for per-title encoding includes receiving a video input comprised of video segments, spatially downscaling the video input, temporally downscaling the video input, encoding the video input to generate an encoded video, then temporally and spatially upscaling the encoded video. Spatially downscaling may include reducing a resolution of the video input, and temporally downscaling may include reducing a framerate of the video input. Objective metrics for the upscaled encoded video show improved quality over conventional methods.

    Fast Multi-Rate Encoding for Adaptive Streaming Using Machine Learming

    公开(公告)号:US20220094928A1

    公开(公告)日:2022-03-24

    申请号:US17540643

    申请日:2021-12-02

    Applicant: BITMOVIN, INC.

    Abstract: According to embodiments of the disclosure, fast multi-rate encoding may be performed using machine learning by encoding a lowest quality representation to determine encoding parameters, processing raw data of the video using a neural network to obtain an intermediate output comprising encoding features, augmenting the intermediate output with additional encoding features to form a final tensor, and processing the final tensor with another neural network to obtain a classification output comprising a split or not split decision for an image data block. The classification output may be used to encode a highest quality representation, and then other representations of the video.

    Video Encoding Based on Customized Bitrate Table

    公开(公告)号:US20210329255A1

    公开(公告)日:2021-10-21

    申请号:US17359346

    申请日:2021-06-25

    Applicant: BITMOVIN, INC.

    Inventor: Martin Smole

    Abstract: An original input content is subjected to multiple constant quality probe encodes for a defined set of resolutions. In one embodiment, probe encodes encode a few parts of the original source video, for example, 30 seconds from 5 different positions. Each probe encode delivers an average bitrate that is required to achieve the configured constant quality. The mean value of the average bitrate is taken per resolution. This results in a list of bitrates that map to a resolution that would achieve the best quality, a custom bitrate: table. Based on the custom bitrate table, an optimized bitrate ladder is computed. The process starts with a configurable minimum bitrate and steps up by a bitrate step size that is between a configurable min and max bitrate step size until the bitrate of the highest resolution is reached.

Patent Agency Ranking