Abstract:
In various embodiments, a method and a decoder include identifying a directional intra prediction mode with an angle of prediction. The method also includes identifying a first and second reference neighboring samples in a block of the video along the angle of prediction; the angle of prediction intersects a pixel to be predicted. The method further includes determining which of the first and second reference samples is nearest the angle of prediction and applying a value of the nearest reference neighboring sample to the pixel as a predictor. Also, a method and a decoder include determining whether a block type of a block of the video is intra block copy. The method also includes responsive to the block type being the intra block copy, determining a transform block size of the block and, responsive to the transform block size being 4×4, applying a discrete sine transform to the block.
Abstract:
To improve precision of visual search processing, SIFT points within a query image are forward matched to features in each of a plurality of repository images and SIFT points within each repository image are backward matched to features within the query image. Forward-only, backward-only and forward-and-backward matches may be weighted differently in determining an image match. Two way matching may be triggered by query image bit rate in excess of a threshold or by a sum of weighted distances between matching points exceeding a threshold. Significant performance gains in eliminating false positive matches are achieved.
Abstract:
A method includes identifying an optimal backlight value for at least one quality level of a first video segment. The method also includes transmitting data for the first video segment. The transmitted data for the first video segment includes a message containing a first set of display adaptation information. The first set of display adaptation information includes the optimal backlight value for the at least one quality level of the first video segment. The method further includes identifying a backlight value for the at least one quality level of a second video segment. The method also includes determining a maximum backlight value change threshold between successive video segments. In addition, the method includes applying temporal smoothing between the optimal backlight value and the backlight value based on the maximum backlight value change threshold.
Abstract:
Global descriptors for images within an image repository accessible to a visual search server are compared based on order statistics processing including sorting (which is a non-linear transform) and heat kernel matching. Affinity scores are computed for Hamming distances between Fisher vector components corresponding to different clusters of global descriptors from a pair of images and normalized to [0, 1], with zero affinity scores assigned to non-active cluster pairs. Linear Discriminant Analysis is employed to determine a sorted vector of affinity scores to obtain a new global descriptor. The resulting global descriptors produce significantly more accurate matching.
Abstract:
To improve feature selection accuracy during a visual search, interest points within a query image are two-way matched to features in an affine transformed image or otherwise transformed version of the query image. A user device implements a method for selecting local descriptors in the visual search. The method includes: detecting a first set of interest points for the original image; computing an affine transform matrix; computing a new image as a transformation of the original image using the affine transform matrix; detecting a second set of interest points from the and new image; performing a two-way matching between the first set of interest points and the second set of interest points; sorting matching pairs according to a specified self-matching score (SMS); assigning an infinite value to SMS of unmatched interest points from the original image; selecting the interest points based on SMS. Significant performance gains reduce false positive matches.
Abstract:
A method is provided that includes receiving a bitstream. The method also includes parsing the bitstream for a flag indicating whether a palette was used from a first or second coding unit. The method also includes decoding the first coding unit using the palette from the first or second coding unit indicated by the flag. The palette is determined based on which palette of the first or second coding unit improves compression performance. Also, a method is provided that includes receiving a bitstream with a predicted pixel. A coding unit and a reference unit are identified. A number of pixels of the coding unit and the reference unit overlap. A set of available pixels and a set of unavailable pixels of the reference unit are identified. The predicted pixel of the set of unavailable pixels is estimated as a pixel of the set of available pixels.
Abstract:
A method includes identifying an optimal backlight value for at least one quality level of a first video segment. The method also includes transmitting data for the first video segment. The transmitted data for the first video segment includes a message containing a first set of display adaptation information. The first set of display adaptation information includes the optimal backlight value for the at least one quality level of the first video segment. The method further includes identifying a backlight value for the at least one quality level of a second video segment. The method also includes determining a maximum backlight value change threshold between successive video segments. In addition, the method includes applying temporal smoothing between the optimal backlight value and the backlight value based on the maximum backlight value change threshold.
Abstract:
A decoder for video processing includes a receiver configured to receive a bitstream associated with a video from a coder. The decoder also includes a processor configured to parse the bitstream to determine a percentage of at least one a number of six tap filterings or a number of alpha point deblocking instances, in a specified period. The processor is further configured to determine a voltage and frequency to be used for decoding the video proportional to the percentage of the at least one of the number of six tap filterings or the number of alpha point deblocking instances. The processor is configured to decode the video at the determined voltage and frequency. Other embodiments including a encoder and method also are disclosed.
Abstract:
A decoder for video processing includes a receiver configured to receive a bitstream associated with a video from a coder. The decoder also includes a processor configured to parse the bitstream to determine a percentage of at least one a number of six tap filterings or a number of alpha point deblocking instances, in a specified period. The processor is further configured to determine a voltage and frequency to be used for decoding the video proportional to the percentage of the at least one of the number of six tap filterings or the number of alpha point deblocking instances. The processor is configured to decode the video at the determined voltage and frequency. Other embodiments including a encoder and method also are disclosed.
Abstract:
A method for real-time multi-frame super resolution (SR) of video content is provided. The method includes receiving a bitstream including an encoded video, motion metadata for a plurality of blocks of a frame of video content, and parameters. The motion metadata is estimated from the original video before downsampling and encoding. The motion metadata is averaged over consecutive blocks. The method includes upscaling the motion metadata for the plurality of blocks. The method also includes upscaling the decoded video using the upscaled motion metadata. The method also includes deblurring and denoising the upscaled video.