Abstract:
Systems and methods for overlaying second image/video data onto first image/video data are described herein. The first image/video data may be intended to be rendered on a display with certain characteristics, e.g., HDR, EDR, VDR, or UHD capabilities. The second image/video data may comprise graphics, closed captioning, text, advertisements, or any other data that may be desired to be overlaid and/or composited onto the first image/video data. The second image/video data may be appearance mapped according to the image statistics and/or characteristics of the first image/video data. In addition, such appearance mapping may be performed according to the characteristics of the display on which the composite data is to be rendered. Such appearance mapping is intended to produce composite data that is visually pleasing to a viewer when rendered on the desired display.
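As a rough illustration of such appearance mapping, the Python sketch below rescales an overlay's luminance toward the statistics of the underlying HDR content before compositing. The function names, the Rec. 709 luma weights, the percentile statistics, and the peak-anchoring rule are all assumptions chosen for illustration, not the claimed method.

```python
import numpy as np

def luminance(rgb):
    # Rec. 709 luma weights -- an assumption; the actual pipeline may
    # operate in a different color space (e.g., IPT-PQ).
    return rgb @ np.array([0.2126, 0.7152, 0.0722])

def appearance_map_overlay(hdr_frame, overlay_rgba, headroom=1.2):
    """Hypothetical sketch: rescale the overlay so its peak luminance sits
    near the HDR content's mid-range rather than its absolute peak, then
    alpha-composite it onto the content."""
    y = luminance(hdr_frame)
    mid, peak = np.percentile(y, 50), np.percentile(y, 99)
    target_peak = min(mid * headroom, peak)          # keep graphics comfortable
    ov_rgb, alpha = overlay_rgba[..., :3], overlay_rgba[..., 3:]
    ov_peak = max(float(luminance(ov_rgb).max()), 1e-6)
    ov_rgb = ov_rgb * (target_peak / ov_peak)        # remap overlay brightness
    return hdr_frame * (1 - alpha) + ov_rgb * alpha  # composite
```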
Abstract:
Representation and coding of multi-view images using tapestry encoding are described, with compatibility for standard and enhanced dynamic ranges. A tapestry comprises a tapestry image, a left-shift displacement map, and a right-shift displacement map. Perspective images of a scene can be generated from the tapestry image and the displacement maps. Different methods for achieving compatibility are described.
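A minimal sketch of how a perspective image might be generated from a tapestry image and its displacement maps is shown below, assuming a simple horizontal forward warp. The linear blending of the two maps, the viewpoint parameter t, and the lack of occlusion or hole handling are assumptions for illustration, not the codec's actual reconstruction rule.

```python
import numpy as np

def synthesize_view(tapestry, disp_left, disp_right, t):
    """Hypothetical forward warp: t in [0, 1] slides the viewpoint from the
    leftmost (t=0) to the rightmost (t=1) perspective."""
    h, w = tapestry.shape[:2]
    disp = (1 - t) * (-disp_left) + t * disp_right   # signed shift per pixel
    view = np.zeros_like(tapestry)
    xs = np.arange(w)
    for row in range(h):
        tx = np.clip(np.round(xs + disp[row]).astype(int), 0, w - 1)
        view[row, tx] = tapestry[row]                # later writes win
    return view
```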
Abstract:
Motion characteristics related to the images are determined. A motion characteristics metadata portion is generated based on the motion characteristics and is to be used by a downstream device for determining an optimal frame rate conversion (FRC) operational mode for the images. The images are encoded into a video stream. The motion characteristics metadata portion is encoded into the video stream as a part of image metadata. The video stream is transmitted to the downstream device. The downstream device receives the video stream and operates in the optimal FRC operational mode to generate additional images based on the images. The images and the additional images are rendered on a display device at an image refresh rate different from the input image refresh rate represented by the images encoded in the video stream.
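An illustrative downstream-side policy for choosing an FRC operational mode from decoded motion-characteristics metadata might look like the sketch below. The metadata fields, mode names, and thresholds are hypothetical; the actual metadata syntax and decision logic are defined by the system described above.

```python
from dataclasses import dataclass

@dataclass
class MotionMetadata:
    # Hypothetical fields; the actual syntax of the motion characteristics
    # metadata portion is defined by the bitstream, not by this sketch.
    mean_motion: float   # average motion vector magnitude, pixels/frame
    scene_cut: bool

def select_frc_mode(md: MotionMetadata) -> str:
    """Illustrative policy for picking an FRC operational mode."""
    if md.scene_cut or md.mean_motion < 0.5:
        return "frame_repeat"        # cuts and static content: duplicate frames
    if md.mean_motion < 8.0:
        return "motion_compensated"  # smooth motion: MC interpolation pays off
    return "frame_blend"             # very fast motion: blending avoids artifacts
```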
Abstract:
One or more derived versions of image content may be obtained by interpolating between two or more source versions of the same image content. A derived version may be targeted for a class of displays that differs from the classes of displays targeted by the source versions. Source images in a source version may have been color graded in a creative process by a content creator/colorist. Interpolation of the source versions may be performed with interpolation parameters that take two or more different values in two or more different clusters in at least one of the source images. A normalized version may be used to allow efficient distribution of multiple versions of the same content to a variety of downstream media processing devices, and to preserve or restore image details otherwise lost in one or more of the source versions.
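The cluster-wise interpolation might be sketched as follows, assuming a per-pixel cluster index map and one interpolation weight per cluster; how clusters and weights are actually derived is not specified here.

```python
import numpy as np

def interpolate_versions(img_a, img_b, cluster_map, alphas):
    """Blend two source grades with a different interpolation weight per
    cluster. cluster_map holds a cluster index per pixel; alphas[k] is the
    weight for cluster k."""
    alpha = np.asarray(alphas)[cluster_map][..., None]  # per-pixel weight
    return (1.0 - alpha) * img_a + alpha * img_b
```

With two clusters, e.g. alphas = [0.25, 0.8], highlights and shadows (or any other segmentation) can be pulled toward different source grades in a single pass.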
Abstract:
3D images may be encoded into reduced resolution image data in a base layer and enhancement layer (EL) image data in one or more enhancement layers. Different types of data compositions may be used in the EL image data. These may include unfiltered full resolution image data for one or both of the left eye and right eye perspectives, unfiltered full resolution image data for a color channel (e.g., the luminance channel), unfiltered full resolution image data for selected portions of image frames, or fallback data compositions. Based on deciding factors including bitrate requirements and bandwidth constraints, different types of data compositions may be selected by an upstream device to deliver the best possible 3D image data to a wide variety of downstream devices. The upstream device may inform a downstream device of the specific type of data composition in use with EL image data descriptors.
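An upstream device's selection among data composition types might be sketched as below; the composition names, the rate model, and the preference order are assumptions for illustration only, not signaled descriptor values.

```python
def select_el_composition(available_kbps, required_kbps):
    """Illustrative selector. required_kbps maps each hypothetical
    composition type to its estimated rate."""
    for comp in ("full_res_both_eyes", "full_res_one_eye",
                 "full_res_luma_only", "selected_regions"):
        if required_kbps[comp] <= available_kbps:
            return comp              # richest composition that fits the channel
    return "fallback"                # guaranteed-decodable minimal composition
```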
Abstract:
In a method to code and transmit scalable HDR video signals, HDR signals are processed and encoded in the IPT-PQ color space to generate a base layer (BL) at reduced spatial resolution and/or dynamic range, and an enhancement layer (EL) with a residual signal. A signal reshaping block before the base layer encoder allows for improved coding of HDR signals using a reduced bit depth. A decoder can use a BL decoder and backward reshaping to generate a decoded BL HDR signal at a reduced dynamic range and/or spatial resolution, or it can combine the decoded BL HDR signal and the EL stream to generate a decoded HDR signal at full dynamic range and full resolution.
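The role of the signal reshaping block might be illustrated with the simple power-law sketch below, which redistributes codewords so the HDR signal better survives quantization to a reduced bit depth. The power-law form and its exponent are assumptions; practical reshaping functions are typically signaled as parametric or piecewise curves.

```python
import numpy as np

def forward_reshape(x, alpha=1.5, bits=10):
    """Redistribute codewords before BL encoding, then simulate the
    quantization to a reduced bit depth."""
    levels = (1 << bits) - 1
    return np.round(np.clip(x, 0, 1) ** (1 / alpha) * levels) / levels

def backward_reshape(y, alpha=1.5):
    # Decoder-side inverse of the forward curve.
    return np.clip(y, 0, 1) ** alpha
```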
Abstract:
Methods are disclosed for adaptive display management using look-up table interpolation. Given a target maximum brightness value for a display, a new look-up table (LUT) for color gamut mapping is determined by interpolating values from two pre-computed color gamut LUTs: one computed for a first maximum display brightness larger than the target maximum brightness value, and one computed for a second maximum display brightness lower than the target maximum brightness value. An interpolation scale is computed based at least on the target maximum brightness value and the first maximum display brightness. Methods to reduce the computational load of translating RGB data from one color representation (say, ST 2084) to another (say, BT 1886) using fast interpolation methods are also presented.
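The LUT interpolation lends itself to a direct sketch. Below, two pre-computed gamut-mapping LUTs are blended linearly in peak brightness; the exact form of the interpolation scale is an assumption, since the method only requires it to be computed based at least on the target maximum brightness and the first maximum display brightness.

```python
import numpy as np

def interpolate_gamut_luts(lut_hi, lut_lo, t_hi, t_lo, t_target):
    """Blend two pre-computed gamut-mapping LUTs for a target display whose
    peak brightness t_target lies between t_lo and t_hi."""
    a = (t_hi - t_target) / (t_hi - t_lo)   # 0 at t_hi, 1 at t_lo
    return (1.0 - a) * np.asarray(lut_hi) + a * np.asarray(lut_lo)
```

For example, a 600-nit target between 1000-nit and 400-nit source LUTs gives a = 400/600 ≈ 0.67, i.e., two thirds of the weight on the 400-nit table.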
Abstract:
Systems and methods are disclosed for dynamically adjusting the backlight of a display during video playback. Given an input video stream and associated minimum, average, and maximum luminance values of the video frames in the video stream, values of a function of the frame min, mid (average), or max luminance values are filtered using a temporal filter to generate a filtered output value for each frame. The instantaneous dynamic range of a target display is determined based on the filtered output value and the minimum and maximum brightness values of the display. A backlight control level is computed based on the instantaneous dynamic range, and the input signal is tone mapped by a display management process to be displayed on the target display at the selected backlight level. The design of a temporal filter based on an exponential moving average filter and scene-change detection is also presented.
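A minimal sketch of such a temporal filter, assuming an exponential moving average that resets at scene changes so backlight state does not carry across a cut; the smoothing factor and the reset-on-cut behavior are plausible assumptions, not the disclosed filter design:

```python
def filter_frame_luminance(values, scene_cuts, alpha=0.1):
    """Exponential moving average over a per-frame luminance statistic,
    reset whenever a scene change is flagged."""
    out, state = [], None
    for v, cut in zip(values, scene_cuts):
        state = v if (state is None or cut) else alpha * v + (1 - alpha) * state
        out.append(state)
    return out
```

Each filtered value, together with the display's minimum and maximum brightness, would then determine the instantaneous dynamic range and hence the backlight control level for that frame.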
Abstract:
A high resolution 3D image may be encoded into a first multiplexed image frame and a second multiplexed image frame in a base layer (BL) video signal and an enhancement layer (EL) video signal. The first multiplexed image frame may comprise horizontal high resolution image data for both eyes, while the second multiplexed image frame may comprise vertical high resolution image data for both eyes. Encoded symmetric-resolution image data for the 3D image may be distributed to a wide variety of devices for 3D image processing and rendering. A recipient device may reconstruct a reduced resolution 3D image from either the first multiplexed image frame or the second multiplexed image frame. A recipient device may also reconstruct a high resolution 3D image by combining high resolution image data from both the first multiplexed image frame and the second multiplexed image frame.
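A toy reconstruction combining the two multiplexed frames might look like the sketch below, assuming top-and-bottom packing for the horizontal-high-resolution frame, side-by-side packing for the vertical-high-resolution frame, and simple averaging as the fusion rule; none of these specifics are fixed by the description above.

```python
import numpy as np

def reconstruct_full_res(tb_frame, sbs_frame):
    """tb_frame: top-and-bottom packing, full width but half height per eye
    (horizontal high resolution). sbs_frame: side-by-side packing, full
    height but half width per eye (vertical high resolution)."""
    h, w = tb_frame.shape[:2]            # both mux frames share the frame size
    left_h, right_h = tb_frame[: h // 2], tb_frame[h // 2:]
    left_v, right_v = sbs_frame[:, : w // 2], sbs_frame[:, w // 2:]
    def fuse(eye_h, eye_v):
        up_h = np.repeat(eye_h, 2, axis=0)   # restore vertical samples
        up_v = np.repeat(eye_v, 2, axis=1)   # restore horizontal samples
        return 0.5 * (up_h + up_v)           # naive fusion of the two views
    return fuse(left_h, left_v), fuse(right_h, right_v)
```

Using only one of the two frames with a single np.repeat upsampling would yield the reduced-resolution reconstruction mentioned above.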