Abstract:
In layered Visual Dynamic range (VDR) coding, inter-layer prediction requires several color-format transformations between the input VDR and Standard Dynamic Range (SDR) signals. Coding and decoding architectures are presented wherein inter-layer prediction is performed in the SDR-based color format, thus reducing computational complexity in both the encoder and the decoder, without compromising coding efficiency or coding quality.
Abstract:
Methods and systems for generating and applying scene-stable metadata for a video data stream are disclosed herein. A video data stream is divided or partitioned into scenes and a first set of metadata may be generated for a given scene of video data. The first set of metadata may be any known metadata as a desired function of video content (e.g., luminance). The first set of metadata may be generated on a frame-by-frame basis. In one example, scene-stable metadata may be generated that may be different from the first set of metadata for the scene. The scene-stable metadata may be generated by monitoring a desired feature with the scene and may be used to keep the desired feature within an acceptable range of values. This may help to avoid noticeable and possibly objectionably visual artifacts upon rendering the video data.
Abstract:
Inter-color image prediction is based on multi-channel multiple regression (MMR) models. Image prediction is applied to the efficient coding of images and video signals of high dynamic range. MMR models may include first order parameters, second order parameters, and cross-pixel parameters. MMR models using extension parameters incorporating neighbor pixel relations are also presented. Using minimum means-square error criteria, closed form solutions for the prediction parameters are presented for a variety of MMR models.
Abstract:
In a decoder, a processor extracts a control map of false contour filtering from a part of a multi-layer video signal that includes a low dynamic range image mapped from an original high-dynamic range (HDR) image. It determines one or more filter parameters for a sparse finite-impulse-response (FIR) filter, where the one or more filter parameters relate to at least in part on the control map of false contour filtering and a predicted image predicted from the low dynamic range image. It applies the sparse FIR filter to filter pixel values in a portion of the predicted image based at least in part on the control map of false contour filtering, and it reconstructs a version of the original HDR image based at least in part on the portion of the predicted image as filtered by the FIR filter.
Abstract:
In a decoder, a processor extracts a control map of false contour filtering from a part of a multi-layer video signal that includes a low dynamic range image mapped from an original high-dynamic range (HDR) image. It determines one or more filter parameters for a sparse finite-impulse-response (FIR) filter, where the one or more filter parameters relate to at least in part on the control map of false contour filtering and a predicted image predicted from the low dynamic range image. It applies the sparse FIR filter to filter pixel values in a portion of the predicted image based at least in part on the control map of false contour filtering, and it reconstructs a version of the original HDR image based at least in part on the portion of the predicted image as filtered by the FIR filter.
Abstract:
Methods and systems for generating and applying scene-stable metadata for a video data stream are disclosed herein. A video data stream is divided or partitioned into scenes and a first set of metadata may be generated for a given scene of video data. The first set of metadata may be any known metadata as a desired function of video content (e.g., luminance). The first set of metadata may be generated on a frame-by-frame basis. In one example, scene-stable metadata may be generated that may be different from the first set of metadata for the scene. The scene-stable metadata may be generated by monitoring a desired feature with the scene and may be used to keep the desired feature within an acceptable range of values. This may help to avoid noticeable and possibly objectionably visual artifacts upon rendering the video data.
Abstract:
Novel methods and systems for decoding and displaying enhanced dynamic range (EDR) video signals are disclosed. To accommodate legacy digital media players with constrained computational resources, compositing and display management (DM) operations are moved from a digital media player to its attached EDR display. On a video receiver, base and enhancement video layers are decoded and multiplexed together with overlay graphics into an interleaved stream. The video and graphics signals are all converted to a common format which allows metadata to be embedded in the interleaved signal as part of the least significant bits in the chroma channels. On the display, the video and the graphics are de-interleaved. After compositing and display management operations guided by the received metadata, the received graphics data are blended with the output of the DM process and the final video output is displayed on the display's panel.
Abstract:
A visual dynamic range (VDR) coding system creates a sequence of VDR prediction images using corresponding standard dynamic range (SDR) images and a prediction function. For each prediction image, an encoder identifies one or more areas within the prediction image suitable for post-prediction filtering. For each identified post-prediction area, a post-prediction filtering mode is selected among one or more post-prediction filtering modes. The selected post-prediction filtering mode is applied to output a filtered prediction image. Information related to the post-prediction filtering areas and the selected corresponding post-prediction filtering modes may be communicated to a receiver (e.g., as metadata) for guided post-prediction filtering. Example post-prediction filtering modes that use low-pass averaging filtering or adaptive linear interpolation are also described.
Abstract:
An encoder receives a sequence of images in extended or visual dynamic range (VDR). For each image, a dynamic range compression function and associated parameters are selected to convert the input image into a second image with a lower dynamic range. Using the input image and the second image, a residual image is computed. The input VDR image sequence is coded using a layered codec that uses the second image as a base layer and a residual image that is derived from the input and second images as one or more residual layers. Using the residual image, a false contour detection method (FCD) estimates the number of potential perceptually visible false contours in the decoded VDR image and iteratively adjusts the dynamic range compression parameters to prevent or reduce the number of false contours. Examples that use a uniform dynamic range compression function are also described.
Abstract:
An encoder receives one or more input pictures of enhanced dynamic range (EDR) to be encoded in a coded bit stream comprising a base layer and one or more enhancement layer. The encoder comprises a base layer quantizer (BLQ) and an enhancement layer quantizer (ELQ) and selects parameters of the BLQ and the ELQ by a joint BLQ-ELQ adaptation method which given a plurality of candidate sets of parameters for the BLQ, for each candidate set, computes a joint BLQ-ELQ distortion value based on a BLQ distortion function, an ELQ distortion function, and at least in part on the number of input pixels to be quantized by the ELQ. The encoder selects as the output BLQ parameter set the candidate set for which the computed joint BLQ-ELQ distortion value is the smallest. Example ELQ, BLQ, and joint BLQ-ELQ distortion functions are provided.