Abstract:
Wavelet-based multiresolution video representations generated by multi-scale motion compensated temporal filtering (MCTF) and spatial wavelet transform are disclosed. Since temporal filtering and spatial filtering are separated in generating such representations, there are many different ways to intertwine single-level MCTF and single-level spatial filtering, resulting in many different video representation schemes with spatially scalable motion vectors that support different combinations of spatial scalability and temporal scalability. The problem of designing such a video representation scheme to fulfill the spatial/temporal scalability requirements is studied. Signaling of the scheme to the decoder is also investigated. Since MCTF is performed subband by subband, motion vectors are available for reconstructing video sequences of any possible reduced spatial resolution, restricted by the dyadic decomposition pattern and the maximal spatial decomposition level. The family of decomposition schemes thus provides efficient and versatile multiresolution video representations for fully scalable video coding.
Abstract:
A method for determining optimal video encoding parameters is disclosed. The method generally includes the steps of (A) storing a plurality of configurable parameters each comprising a respective trial value, (B) generating a bitstream by encoding a test sequence of pictures using (i) a plurality of non-configurable parameters fixed in a design of the encoder, (ii) the configurable parameters and (iii) a plurality of dynamic parameters adjustable in real time by the encoder, (C) generating a reconstructed sequence of pictures by decoding the bitstream, (D) generating a quality metric based on the reconstructed sequence of pictures compared with the test sequence of pictures and (E) adjusting the respective trial values to optimize the quality metric.
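Steps (A) through (E) can be read as a search loop over the trial values. The sketch below is only an illustration under stated assumptions: the function `tune`, the coordinate-search strategy, and the toy encoder, decoder, and negative-MSE metric are all hypothetical stand-ins, not the method described in the abstract. The dynamic parameters adjusted in real time by the encoder are not modeled.

```python
def tune(trial_values, candidates, encode, decode, quality, test_sequence):
    """Hypothetical coordinate search over configurable parameters.

    Higher quality scores are treated as better.
    """
    best = dict(trial_values)                       # (A) stored trial values
    best_score = quality(test_sequence, decode(encode(test_sequence, best)))
    for name, options in candidates.items():
        for value in options:
            trial = dict(best)
            trial[name] = value
            bitstream = encode(test_sequence, trial)        # (B) encode
            reconstructed = decode(bitstream)               # (C) decode
            score = quality(test_sequence, reconstructed)   # (D) quality metric
            if score > best_score:                          # (E) adjust trial value
                best, best_score = trial, score
    return best, best_score


# Toy stand-ins (assumptions for illustration only).
STEP = 8  # plays the role of a non-configurable parameter fixed in the design


def toy_encode(pictures, params):
    # Quantize each non-negative sample with a configurable rounding offset.
    return [int(p / STEP + params["rounding_offset"]) for p in pictures]


def toy_decode(bitstream):
    return [q * STEP for q in bitstream]


def neg_mse(original, reconstructed):
    # Negative mean squared error, so that larger means better.
    return -sum((o - r) ** 2 for o, r in zip(original, reconstructed)) / len(original)


best, score = tune(
    {"rounding_offset": 0.0},
    {"rounding_offset": [0.0, 0.25, 0.5]},
    toy_encode, toy_decode, neg_mse, list(range(64)),
)
```

With this toy setup the search settles on a rounding offset of 0.5. A real encoder would more likely need a metric that also accounts for bitrate.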
Abstract:
A method for encoding an image is disclosed. The method generally includes the steps of (A) generating a quantization matrix as a function of at least four parameters, (B) optimizing the parameters to maximize a quality metric for encoding the image and (C) encoding the image with the quantization matrix as optimized.
Abstract:
An apparatus comprising a first circuit and a second circuit. The first circuit may be configured to (i) receive an image data stream comprising a plurality of frames each having a plurality of regions, (ii) select a particular region to be marked as being homogeneous or not homogeneous, and (iii) determine whether a group of neighboring regions to the selected region are qualified or not qualified. The second circuit may be configured to mark the selected region as being homogeneous when one or more of the adjacent regions are (i) qualified and (ii) previously marked as being homogeneous.
Abstract:
A method of generating a value for a missing pixel “x” by determining a “least harmful” local edge direction between pixels, or sub-pixels, on substantially opposing sides of the missing pixel, and interpolating along that direction to arrive at a value for pixel “x”. The method involves generating sub-pixel values for locations within neighboring pixels; the sub-pixels may comprise half-pixels, quarter-pixels, three-quarter pixels, and so forth, such that any fractional pixel quantity may be created. Absolute difference values are calculated between neighboring pixels, or sub-pixel values, to determine a least harmful local edge direction along which a value is generated for pixel “x” by interpolation.
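The idea can be sketched for a single missing pixel lying between a row above and a row below. This is a minimal illustration, not the disclosed method: the names `sample` and `interpolate_missing`, the use of linear interpolation for sub-pixel values, the half-pixel candidate offsets, and the tie-break in favor of the vertical direction are all assumptions.

```python
def sample(row, pos):
    """Linearly interpolate a sub-pixel value at fractional position pos."""
    pos = min(max(pos, 0.0), len(row) - 1.0)  # clamp to the row bounds
    i = int(pos)
    frac = pos - i
    if frac == 0.0:
        return float(row[i])
    return (1.0 - frac) * row[i] + frac * row[i + 1]


def interpolate_missing(above, below, x, offsets=(0.0, -0.5, 0.5, -1.0, 1.0)):
    """Pick the direction with the smallest absolute difference and average along it.

    Each offset d pairs the sample at x + d in the row above with the sample
    at x - d in the row below, i.e. points on opposing sides of pixel x.
    Offsets are tried nearest-to-vertical first, so ties favor the vertical.
    """
    best_diff = None
    best_value = None
    for d in offsets:
        up = sample(above, x + d)
        down = sample(below, x - d)
        diff = abs(up - down)
        if best_diff is None or diff < best_diff:
            best_diff = diff
            best_value = (up + down) / 2.0
    return best_value


# A diagonal edge: the bright region starts two columns later in the row below.
above = [10, 10, 200, 200, 200]
below = [10, 10, 10, 10, 200]
left_of_edge = interpolate_missing(above, below, 2)
right_of_edge = interpolate_missing(above, below, 3)
```

In this example, plain vertical averaging would give 105 at both positions and blur the edge, whereas following the matched diagonal yields 10 at column 2 and 200 at column 3.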
Abstract:
A method for coding video is disclosed. The method generally includes the steps of (A) receiving a video signal having a series of pictures, each of the pictures having a plurality of blocks, (B) analyzing the blocks to forecast if coding the blocks in a zero-residual coding mode would generate a plurality of artifacts, (C) disabling the zero-residual coding mode for the blocks forecasted to generate at least one of the artifacts and (D) enabling the zero-residual coding mode for the blocks forecasted to generate none of the artifacts.
Abstract:
In one embodiment, a plurality of attendee devices may participate in an online collaborative computing session to receive video and audio content for the online collaborative computing session. A particular attendee device may then either initiate or receive a communicated signal between a “whisperer” and “whisperee” that indicates a desire of the whisperer to establish a video whisper session with the whisperee. In response, the video whisper session may be established between the whisperer and whisperee devices, such as through a mutual subscription by the whisperer and whisperee to a video channel and audio channel of each other's corresponding device. In this manner, users of the whisperer and whisperee devices may see and hear each other via the video whisper session, and attendee devices other than the whisperer and whisperee are prevented from playing audio from the video whisper session between the whisperer and whisperee.
Abstract:
Methods are disclosed for enhancing the performance of quantization operations by converting division operations into a combination of multiplication and shift operations, which are preferably performed on a processor supporting single-instruction multiple-data (SIMD) instructions. A table of mantissa and exponent values is created for a sufficient range of values of 1/a. During quantization, the mantissa A and exponent n associated with 1/a are looked up in the table, and a given quantization division b/a is computed according to the formula b/a=(b×A)>>n. Aspects are described for application to processors that do not support non-uniform shift operations, and for reducing the necessary bit-width of the operations to increase efficiency. The quantization method may be applied to protocols such as MPEG-2 and other similar formats.
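The formula b/a=(b×A)>>n can be illustrated with a small scalar sketch. The names `build_reciprocal_table` and `quantize`, the restriction to unsigned 16-bit dividends, and the ceiling rounding of the mantissa are assumptions chosen for illustration; the abstract does not specify them, and the SIMD and reduced bit-width aspects are not shown.

```python
def build_reciprocal_table(max_divisor, dividend_bits=16):
    """Return table[a] = (A, n) with b // a == (b * A) >> n for 0 <= b < 2**dividend_bits.

    Assumed construction: n = dividend_bits + ceil(log2(a)), A = ceil(2**n / a).
    """
    table = [None]  # index 0 is unused; division by zero is undefined
    for a in range(1, max_divisor + 1):
        e = (a - 1).bit_length()      # smallest e with 2**e >= a
        n = dividend_bits + e         # exponent (shift amount)
        A = -(-(1 << n) // a)         # mantissa: ceiling of 2**n / a
        table.append((A, n))
    return table


def quantize(b, a, table):
    """Compute b // a with one table lookup, one multiply, and one shift."""
    A, n = table[a]
    return (b * A) >> n


table = build_reciprocal_table(255)
level = quantize(1000, 7, table)  # same result as 1000 // 7
```

With this rounding the result matches integer division exactly for every unsigned 16-bit dividend, at the cost of a mantissa that can need one bit more than the dividend width.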