Abstract:
According to the invention, a device (101, 101′) for postprocessing at least one channel signal of a plurality of channel signals of a multi-channel signal is described, the at least one channel signal being generated from a decoded downmix signal by a low-bit-rate audio coding/decoding system, the device comprising: a receiver (103; 103′) for receiving the at least one channel signal generated from the decoded downmix signal, a time envelope of the decoded downmix signal and a classification indication indicating a transient type of the at least one channel signal, wherein the classification indication is associated to the at least one channel signal, and a postprocessor (105; 105′) for postprocessing the at least one channel signal based on the time envelope of the decoded downmix signal weighted by a respective weighting factor and in dependence on the classification indication.
Abstract:
A point cloud decoding method related to the field of coding technologies and includes reconstructing a point cloud comprising one or more patches. The one or more patches comprise a current patch. The reconstructing process may include transforming coordinates (x2, y2) of a second point of the current patch in a second coordinate system to coordinates (x1, y1) of a first point of the current patch in a first coordinate system, where the coordinates (x1, y1) of the first point of the current patch in the first coordinate system may be obtained based on the coordinates (x2, y2) of the second point of the current patch in the second coordinate system and a transform matrix.
Abstract:
A method includes obtaining a location of a viewpoint video to which a user currently pays attention in a multi-view video, obtaining a speed at which the user viewpoint switches to the location of the viewpoint video to which attention is currently paid, determining a quantity of predictive viewpoint videos (NNV) that need to be downloaded before the user switches to another viewpoint, determining locations of the predictive viewpoint videos in the multi-view video according to a preset rule and according to the location of the viewpoint video to which the user currently pays attention, the first speed, and the NNV, downloading the predictive viewpoint videos corresponding to the locations of the predictive viewpoint videos from a server end, and transmitting the predictive viewpoint videos.
Abstract:
According to the invention, a device for post-processing at least one channel signal of a plurality of channel signals of a multi-channel signal is described, the at least one channel signal being generated from a decoded downmix signal by a low-bit-rate audio coding/decoding system, the device comprising: a receiver for receiving the at least one channel signal generated from the decoded downmix signal, a time envelope of the decoded downmix signal, an interchannel time difference between the channel signal and the downmix signal, and a classification indication indicating a transient type of the downmix signal; and a post-processor for post-processing the at least one channel signal based on the time envelope of the decoded downmix signal weighted by a respective weighting factor and in dependence on the classification indication and the interchannel time difference.
Abstract:
Embodiments of the present invention provide a method and an apparatus for generating a sideband residual signal. The method includes: comparing energy of a first signal input by a first sound channel with energy of a second signal input by a second sound channel; if the energy of the first signal is greater than the energy of the second signal, generating a sideband residual signal by allocating a monophonic quantization error to the first signal; and if the energy of the first signal is smaller than the energy of the second signal, generating a sideband residual signal by allocating a monophonic quantization error to the second signal. By using the method and apparatus provided in the embodiments of the present invention, it can be avoided that a monophonic quantization error has a greater impact on a signal whose energy is smaller.