Abstract:
Coding syntaxes in compliance with same or different VDR specifications may be signaled by upstream coding devices such as VDR encoders to downstream coding devices such as VDR decoders in a common vehicle in the form of RPU data units. VDR coding operations and operational parameters may be specified as sequence level, frame level, or partition level syntax elements in a coding syntax. Syntax elements in a coding syntax may be coded directly in one or more current RPU data units under a current RPU ID, predicted from other partitions/segments/ranges previously sent with the same current RPU ID, or predicted from other frame level or sequence level syntax elements previously sent with a previous RPU ID. A downstream device may perform decoding operations on multi-layered input image data based on received coding syntaxes to construct VDR images.
Abstract:
Inter-color image prediction is based on multi-channel multiple regression (MMR) models. Image prediction is applied to the efficient coding of images and video signals of high dynamic range. MMR models may include first order parameters, second order parameters, and cross-pixel parameters. MMR models using extension parameters incorporating neighbor pixel relations are also presented. Using minimum means-square error criteria, closed form solutions for the prediction parameters are presented for a variety of MMR models.
Abstract:
Inter-color image prediction is based on multi-channel multiple regression (MMR) models. Image prediction is applied to the efficient coding of images and video signals of high dynamic range. MMR models may include first order parameters, second order parameters, and crosspixel parameters. MMR models using extension parameters incorporating neighbor pixel relations are also presented. Using minimum means-square error criteria, closed form solutions for the prediction parameters are presented for a variety of MMR models.
Abstract:
An encoder receives one or more input pictures of enhanced dynamic range (EDR) to be encoded in a coded bit stream comprising a base layer and one or more enhancement layers. To encode the chroma pixels, the encoder generates a luma mask and a corresponding chroma mask. Based on generated high-clipping and low-clipping thresholds, the encoder determines the appropriate parameters to encode the chroma values in the base and enhancement layers.
Abstract:
An encoder receives a target image in a standard dynamic range and a guide image in a high dynamic range, wherein both the target image and the guide image represent the same scene. A color transient improvement (CTI) filter is selected to predict a chroma component of a decoded version of the target image based on both the luma and chroma components of the target image and the guide image. The filtering coefficients for the CTI filter are computed by minimizing an error measurement (e.g., MSE) between pixel values of the decoded image and the guide image. The computed set of filtering coefficients is signaled to a receiver (e.g., as metadata). A decoder receives the coded image and the metadata, and applies the same CTI filter to the decoded image to generate an output image.
Abstract:
Novel methods and systems for decoding and displaying enhanced dynamic range (EDR) video signals are disclosed. To accommodate legacy digital media players with constrained computational resources, compositing and display management (DM) operations are moved from a digital media player to its attached EDR display. On a video receiver, base and enhancement video layers are decoded and multiplexed together with overlay graphics into an interleaved stream. The video and graphics signals are all converted to a common format which allows metadata to be embedded in the interleaved signal as part of the least significant bits in the chroma channels. On the display, the video and the graphics are de-interleaved. After compositing and display management operations guided by the received metadata, the received graphics data are blended with the output of the DM process and the final video output is displayed on the display's panel.
Abstract:
In an embodiment, a control map of false contour filtering is generated for a predicted image. The predicted image is predicted from a low dynamic range image mapped from the wide dynamic range image. Based at least in part on the control map of false contour filtering and the predicted image, one or more filter parameters for a sparse finite-impulse-response (FIR) filter are determined. The sparse FIR filter is applied to filter pixel values in a portion of the predicted image based at least in part on the control map of false contour filtering. The control map of false contour filtering is encoded into a part of a multi-layer video signal that includes the low dynamic range image.
Abstract:
A sequence of visual dynamic range (VDR) images may be encoded using a standard dynamic range (SDR) base layer and one or more enhancement layers. A prediction image is generated by using piecewise cross-color channel prediction (PCCC), wherein a color channel in the SDR input may be segmented into two or more color channel segments and each segment is assigned its own cross-color channel predictor to derive a predicted output VDR image. PCCC prediction models may include first order, second order, or higher order parameters. Using a minimum mean-square error criterion, a closed form solution is presented for the prediction parameters for a second-order PCCC model. Algorithms for segmenting the color channels into multiple color channel segments are also presented.
Abstract:
Inter-color image prediction is based on color grading modeling. Prediction is applied to the efficient coding of images and video signals of high dynamic range. Prediction models may include a color transformation matrix that models hue and saturation color changes and a non-linear function modeling color correction changes. Under the assumption that the color grading process uses a slope, offset, and power (SOP) operations, an example non linear prediction model is presented.
Abstract:
Tensor-Product B-splines (TPB) have been shown to improve video quality when used to represent reshaping functions to map reshaped standard dynamic range content into high dynamic range (HDR) content; however, TPB prediction is computationally intensive and may not be supported by legacy devices. Methods and systems for backwards-compatible signaling of TPB-related metadata and a fast TPB prediction method are presented to overcome both of these limitations. Computation overhead for a TPB-based 3D look-up table is reduced by using temporary two-dimensional arrays. A remapping of the most significant bits of a legacy bit-depth parameter allows for backwards compatibility.