-
Publication No.: US11546401B2
Publication date: 2023-01-03
Application No.: US17090802
Filing date: 2020-11-05
Applicant: BITMOVIN, INC.
Inventor: Hadi Amirpour , Ekrem Çetinkaya , Christian Timmerer
IPC: H04L65/70 , H04N19/146 , H04N19/176 , H04L65/65
Abstract: According to embodiments of the disclosure, information of higher and lower quality encoded video segments is used to limit Rate-Distortion Optimization (RDO) for each Coding Tree Unit (CTU). A method first encodes the highest bit-rate segment and then uses it to encode the lowest bit-rate video segment. The block structure and selected reference frame of both the highest and lowest bit-rate video segments are used to predict and shorten the RDO process for each CTU in the middle bit-rates. Using parallel processing, the method introduces a delay of just one frame. This approach reduces time complexity for the middle bit-rates compared to the reference software, while degradation is negligible.
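
The following Python sketch illustrates one plausible reading of how the two reference encodes could bound the RDO search for a middle bit-rate. It is not taken from the patent: the function names, the `max_depth` default, and the rule of skipping the reference-frame search only when both reference encodes agree are all assumptions made for illustration.

```python
from typing import List, Sequence


def depth_search_range(depth_high: int, depth_low: int, max_depth: int = 3) -> range:
    """Return the CTU split depths worth testing for a middle bit-rate.

    depth_high / depth_low are the depths chosen for the co-located CTU in the
    highest and lowest bit-rate encodes (hypothetical inputs). The search is
    limited to the interval spanned by those two depths.
    """
    lo = max(0, min(depth_high, depth_low))
    hi = min(max_depth, max(depth_high, depth_low))
    return range(lo, hi + 1)


def candidate_ref_frames(ref_high: int, ref_low: int, all_refs: Sequence[int]) -> List[int]:
    """Assumed rule: if both reference encodes picked the same reference
    frame, test only that frame; otherwise fall back to the full list."""
    if ref_high == ref_low:
        return [ref_high]
    return list(all_refs)
```

For example, if the highest bit-rate encode split a CTU to depth 3 and the lowest stopped at depth 1, only depths 1 to 3 would be evaluated, and depth 0 would be skipped.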
-
Publication No.: US20210289013A1
Publication date: 2021-09-16
Application No.: US17336009
Filing date: 2021-06-01
Applicant: BITMOVIN, INC.
Inventor: Abdelhak BENTALEB , Martin FILLAFER , Daniel WEINBERGER
Abstract: While downloading a segment of a low-latency stream, a multimedia player downloads chunks (parts of the segment file). The first chunks of a segment may be downloaded while the segment is still being written to the CDN server. A chunk-based prediction adaptation logic uses throughput measurements on a chunk instead of a segment and specifically looks at bursts in a sliding window. This data is used to build a prediction of future throughput by applying linear adaptive filter algorithms that may rely on recursive least squares. This adaptation logic leads to very accurate bandwidth predictions and, as a consequence, a better user experience compared to existing adaptation algorithms.
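
As a rough illustration of the recursive least squares (RLS) idea mentioned above, the Python sketch below predicts the next chunk throughput as a linear combination of the most recent samples in a sliding window. It is a textbook RLS filter, not the patented adaptation logic: the class name, the filter order, the forgetting factor, and the initial `delta` are assumptions, and burst detection is not modeled.

```python
from collections import deque


class RLSPredictor:
    """Textbook RLS linear predictor over a sliding window of samples."""

    def __init__(self, order: int = 3, forgetting: float = 0.98, delta: float = 100.0):
        self.order = order
        self.lam = forgetting
        self.w = [0.0] * order  # filter weights
        # Inverse correlation matrix estimate, initialised to delta * I.
        self.P = [[delta if i == j else 0.0 for j in range(order)] for i in range(order)]
        self.window = deque(maxlen=order)  # most recent throughput samples

    def predict(self):
        """Predict the next sample, or None until the window is full."""
        if len(self.window) < self.order:
            return None
        return sum(wi * xi for wi, xi in zip(self.w, self.window))

    def update(self, sample: float) -> None:
        """Adapt the weights with a newly measured chunk throughput."""
        n = self.order
        if len(self.window) == n:
            x = list(self.window)
            Px = [sum(self.P[i][j] * x[j] for j in range(n)) for i in range(n)]
            denom = self.lam + sum(x[i] * Px[i] for i in range(n))
            k = [v / denom for v in Px]  # gain vector
            err = sample - sum(wi * xi for wi, xi in zip(self.w, x))
            self.w = [wi + ki * err for wi, ki in zip(self.w, k)]
            # P <- (P - k x^T P) / lambda; x^T P equals Px because P is symmetric.
            self.P = [[(self.P[i][j] - k[i] * Px[j]) / self.lam for j in range(n)]
                      for i in range(n)]
        self.window.append(sample)
```

A player would call `update()` after each downloaded chunk (for example with the measured throughput in Mbit/s) and `predict()` before choosing the next representation.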
-
Publication No.: US20240406418A1
Publication date: 2024-12-05
Application No.: US18734842
Filing date: 2024-06-05
Applicant: BITMOVIN, INC.
Inventor: Hadi Amirpour , Christian Timmerer
IPC: H04N19/179 , H04N19/29
Abstract: A scalable per-title encoding technique may include detecting scene cuts in an input video received by an encoding network or system, generating segments of the input video, performing per-title encoding of a segment of the input video, training a deep neural network (DNN) for each representation of the segment, thereby generating a trained DNN, compressing the trained DNN, thereby generating a compressed trained DNN, and generating an enhanced bitrate ladder including metadata comprising the compressed trained DNN. In some embodiments, the method also may include generating a base layer bitrate ladder for CPU devices, and providing the enhanced bitrate ladder for GPU-available devices.
-
Publication No.: US12003763B2
Publication date: 2024-06-04
Application No.: US17853549
Filing date: 2022-06-29
Applicant: BITMOVIN, INC.
Inventor: Hadi Amirpour , Christian Timmerer
Abstract: Techniques relating to per-title encoding using spatial and temporal resolution downscaling are disclosed. A method for per-title encoding includes receiving a video input comprising video segments, spatially downscaling the video input, temporally downscaling the video input, encoding the video input to generate an encoded video, then temporally and spatially upscaling the encoded video. Spatially downscaling may include reducing a resolution of the video input, and temporally downscaling may include reducing a framerate of the video input. Objective metrics for the upscaled encoded video show improved quality over conventional methods.
-
Publication No.: US20230269386A1
Publication date: 2023-08-24
Application No.: US18016577
Filing date: 2021-05-27
Applicant: BITMOVIN, INC.
Inventor: Adithyan Ilangovan , Gerald Götzenbrucker , Riccardo Ressi
IPC: H04N19/40 , H04N19/12 , H04N21/2343
CPC classification number: H04N19/40 , H04N19/12 , H04N21/234309
Abstract: A computer-implemented method and system for transcoding input video content are provided. The method includes decoding the input video content from a first format into a first set of raw video data, encoding the first set of raw video data into a second, intermediate format, and storing the video data in the intermediate format. The first set of raw video data is also encoded into a third, desired output format to extract video parameters and determine optimized encoding parameters for encoding the video content into the final output video. The method further includes decoding the stored intermediate-format video data into a second set of raw video data and encoding the second set of raw video data into the desired output format using the optimized encoding parameters to generate the final output video.
-
Publication No.: US20230179800A1
Publication date: 2023-06-08
Application No.: US17544986
Filing date: 2021-12-08
Applicant: BITMOVIN, INC.
Inventor: Vignesh V. Menon , Hadi Amirpour , Christian Timmerer
IPC: H04N19/61 , H04N19/119 , H04N19/30 , H04N19/423 , H04N19/625 , H04N19/115 , H04N19/172 , H04N19/196
CPC classification number: H04N19/61 , H04N19/30 , H04N19/115 , H04N19/119 , H04N19/172 , H04N19/197 , H04N19/423 , H04N19/625
Abstract: The technology described herein relates to online per-title encoding. A method for online per-title encoding includes receiving a video input, generating segments of the video input, extracting a spatial feature and a temporal feature using a discrete cosine transform (DCT)-based energy function, predicting bitrate-resolution pairs based on the spatial feature and the temporal feature, and per-title encoding segments of the video input for the predicted bitrate-resolution pairs. A system for online per-title encoding may include memory for storing a set of bitrates, a set of resolutions, and a machine learning module configured to predict bitrate-resolution pairs based on low-complexity spatial and temporal features.
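
To make the notion of a DCT-based energy feature more concrete, the Python sketch below computes a simple texture energy for a square luma block and a temporal feature as the change in that energy between co-located blocks of consecutive frames. The abstract does not give the actual energy function, so the choice here (the sum of absolute AC coefficients of an orthonormal DCT-II) and all function names are assumptions for illustration only.

```python
import math
from typing import List

Block = List[List[float]]


def dct2(block: Block) -> Block:
    """Orthonormal 2-D DCT-II of a square block (direct O(n^4) form)."""
    n = len(block)

    def alpha(k: int) -> float:
        return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)

    out = [[0.0] * n for _ in range(n)]
    for u in range(n):
        for v in range(n):
            s = 0.0
            for x in range(n):
                for y in range(n):
                    s += (block[x][y]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * n)))
            out[u][v] = alpha(u) * alpha(v) * s
    return out


def block_energy(block: Block) -> float:
    """Assumed spatial feature: sum of absolute AC coefficients (DC excluded)."""
    coeffs = dct2(block)
    n = len(block)
    return sum(abs(coeffs[u][v]) for u in range(n) for v in range(n) if (u, v) != (0, 0))


def temporal_feature(prev_block: Block, cur_block: Block) -> float:
    """Assumed temporal feature: change in block energy between two frames."""
    return abs(block_energy(cur_block) - block_energy(prev_block))
```

A flat block yields an energy of approximately zero, while a textured block such as a checkerboard yields a clearly positive value. Per-frame features would then be averages over all blocks and could feed a predictor of bitrate-resolution pairs.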
-
Publication No.: US20230118010A1
Publication date: 2023-04-20
Application No.: US17965124
Filing date: 2022-10-13
Applicant: BITMOVIN, INC.
Inventor: Hadi Amirpour , Christian Timmerer
IPC: H04N19/179 , H04N19/29
Abstract: A scalable per-title encoding technique may include detecting scene cuts in an input video received by an encoding network or system, generating segments of the input video, performing per-title encoding of a segment of the input video, training a deep neural network (DNN) for each representation of the segment, thereby generating a trained DNN, compressing the trained DNN, thereby generating a compressed trained DNN, and generating an enhanced bitrate ladder including metadata comprising the compressed trained DNN. In some embodiments, the method also may include generating a base layer bitrate ladder for CPU devices, and providing the enhanced bitrate ladder for GPU-available devices.
-
Publication No.: US20230007298A1
Publication date: 2023-01-05
Application No.: US17853549
Filing date: 2022-06-29
Applicant: BITMOVIN, INC.
Inventor: Hadi Amirpour , Christian Timmerer
Abstract: Techniques relating to per-title encoding using spatial and temporal resolution downscaling are disclosed. A method for per-title encoding includes receiving a video input comprising video segments, spatially downscaling the video input, temporally downscaling the video input, encoding the video input to generate an encoded video, then temporally and spatially upscaling the encoded video. Spatially downscaling may include reducing a resolution of the video input, and temporally downscaling may include reducing a framerate of the video input. Objective metrics for the upscaled encoded video show improved quality over conventional methods.
-
Publication No.: US20220094928A1
Publication date: 2022-03-24
Application No.: US17540643
Filing date: 2021-12-02
Applicant: BITMOVIN, INC.
Inventor: Hadi Amirpour , Ekrem Çetinkaya , Christian Timmerer
IPC: H04N19/12 , H04N19/136 , H04N19/176 , H04N19/436 , G06N3/04
Abstract: According to embodiments of the disclosure, fast multi-rate encoding may be performed using machine learning by encoding a lowest quality representation to determine encoding parameters, processing raw data of the video using a neural network to obtain an intermediate output comprising encoding features, augmenting the intermediate output with additional encoding features to form a final tensor, and processing the final tensor with another neural network to obtain a classification output comprising a split or not split decision for an image data block. The classification output may be used to encode a highest quality representation, and then other representations of the video.
-
Publication No.: US20210329255A1
Publication date: 2021-10-21
Application No.: US17359346
Filing date: 2021-06-25
Applicant: BITMOVIN, INC.
Inventor: Martin Smole
IPC: H04N19/146 , H04N19/31 , H04N19/132 , H04N21/234
Abstract: An original input content is subjected to multiple constant-quality probe encodes for a defined set of resolutions. In one embodiment, probe encodes encode a few parts of the original source video, for example, 30 seconds from 5 different positions. Each probe encode delivers the average bitrate that is required to achieve the configured constant quality. The mean value of the average bitrate is taken per resolution. This results in a custom bitrate table: a list of bitrates, each mapped to the resolution that would achieve the best quality. Based on the custom bitrate table, an optimized bitrate ladder is computed. The process starts with a configurable minimum bitrate and steps up by a bitrate step size that is between a configurable min and max bitrate step size until the bitrate of the highest resolution is reached.
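
The ladder construction described above can be sketched in Python as follows. This is an illustrative reading, not the patented procedure: the function names, the use of a multiplicative step clamped to `[min_step, max_step]`, and the rule of assigning each rung the highest resolution whose table bitrate does not exceed the rung bitrate are all assumptions.

```python
from statistics import mean
from typing import Dict, List, Tuple


def bitrate_table(probe_bitrates: Dict[int, List[float]]) -> Dict[int, float]:
    """Average the probe-encode bitrates (kbit/s) per resolution (frame height)."""
    return {res: mean(rates) for res, rates in probe_bitrates.items()}


def build_ladder(table: Dict[int, float], min_bitrate: float, step: float = 1.5,
                 min_step: float = 1.3, max_step: float = 2.0) -> List[Tuple[int, int]]:
    """Build (bitrate, resolution) rungs from min_bitrate up to the bitrate
    of the highest resolution in the table."""
    step = min(max(step, min_step), max_step)  # clamp the step factor
    top = table[max(table)]                    # bitrate of the highest resolution
    by_bitrate = sorted(table.items(), key=lambda kv: kv[1])
    ladder = []
    bitrate = min_bitrate
    while True:
        bitrate = min(bitrate, top)
        # Highest resolution whose probe bitrate fits; else the lowest one.
        res = by_bitrate[0][0]
        for r, b in by_bitrate:
            if b <= bitrate:
                res = r
        ladder.append((round(bitrate), res))
        if bitrate >= top:
            break
        bitrate *= step
    return ladder
```

With hypothetical probe results of 500, 1500, and 3000 kbit/s for 360p, 720p, and 1080p, a minimum bitrate of 500 kbit/s, and a step factor of 2.0, the sketch yields rungs at 500 and 1000 kbit/s (360p), 2000 kbit/s (720p), and 3000 kbit/s (1080p).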