THREE-DIMENSIONAL AUDIO SIGNAL CODING METHOD AND APPARATUS, AND ENCODER

    公开(公告)号:US20240087579A1

    公开(公告)日:2024-03-14

    申请号:US18511061

    申请日:2023-11-16

    CPC classification number: G10L19/008 H04S7/30 H04S2420/11

    Abstract: This application discloses a three-dimensional audio signal coding method and apparatus, and an encoder, and relates to the multimedia field. The method includes: After determining a first quantity of virtual speakers and a first quantity of vote values based on a current frame of a three-dimensional audio signal, a candidate virtual speaker set, and a voting round quantity, the encoder selects a second quantity of representative virtual speakers for the current frame from the first quantity of virtual speakers based on the first quantity of vote values, and further encodes the current frame based on the second quantity of representative virtual speakers for the current frame to obtain a bitstream. This achieves efficient data compression.

    AUDIO ENCODING AND DECODING METHOD AND APPARATUS

    公开(公告)号:US20230298601A1

    公开(公告)日:2023-09-21

    申请号:US18202930

    申请日:2023-05-28

    CPC classification number: G10L19/008

    Abstract: Audio encoding and decoding methods and apparatuses are disclosed, to reduce an amount of encoded and decoded data, so as to improve encoding and decoding efficiency. The method includes: selecting a first target virtual speaker from a preset virtual speaker set based on a first scene audio signal; generating a first virtual speaker signal based on the first scene audio signal and attribute information of the first target virtual speaker; obtaining a second scene audio signal using the attribute information of the first target virtual speaker and the first virtual speaker signal; generating a residual signal based on the first scene audio signal and the second scene audio signal; and encoding the first virtual speaker signal and the residual signal, to produce encoded signals, and writing the encoded signals into a bitstream.

    AUDIO ENCODING AND DECODING METHOD AND APPARATUS

    公开(公告)号:US20230298600A1

    公开(公告)日:2023-09-21

    申请号:US18202553

    申请日:2023-05-26

    CPC classification number: G10L19/008

    Abstract: An audio encoding and decoding method and apparatus, and a non-transitory readable storage medium are provided. The encoding method includes: selecting a first target virtual speaker from a preset virtual speaker set based on a current scene audio signal; generating a first virtual speaker signal based on the current scene audio signal and attribute information of the first target virtual speaker; and encoding the first virtual speaker signal to obtain a bitstream. According to the encoding method, an amount of encoded data is reduced, to improve encoding efficiency.

    THREE-DIMENSIONAL AUDIO SIGNAL PROCESSING METHOD AND APPARATUS

    公开(公告)号:US20240105187A1

    公开(公告)日:2024-03-28

    申请号:US18521944

    申请日:2023-11-28

    CPC classification number: G10L19/008 G10L19/02 H04S7/30 H04S2420/11

    Abstract: Embodiments of this application disclose a three-dimensional audio signal processing method and apparatus, to implement sound field classification of a three-dimensional audio signal, to accurately identify the three-dimensional audio signal. An embodiment of this application provides a three-dimensional audio signal processing method, including: performing linear decomposition on a current frame of a three-dimensional audio signal, to obtain a linear decomposition result; obtaining, based on the linear decomposition result, a sound field classification parameter corresponding to the current frame; and determining a sound field classification result of the current frame based on the sound field classification parameter.

    THREE-DIMENSIONAL AUDIO SIGNAL CODING METHOD AND APPARATUS, AND ENCODER

    公开(公告)号:US20240087578A1

    公开(公告)日:2024-03-14

    申请号:US18511025

    申请日:2023-11-16

    CPC classification number: G10L19/008 G10L19/167 H04S7/00 H04S2420/11

    Abstract: A three-dimensional audio signal coding method, apparatus, and encoder are described. The method includes, after obtaining a first correlation between a current frame of a three-dimensional audio signal and a representative virtual speaker set for a previous frame, the encoder determines whether the first correlation satisfies a reuse condition, where the first correlation is used to determine whether to reuse the representative virtual speaker set for the previous frame when the current frame is encoded. The method further encodes the current frame based on the representative virtual speaker set for the previous frame when the first correlation satisfies the reuse condition, to obtain a bitstream. A virtual speaker in the representative virtual speaker set for the previous frame is a virtual speaker used for encoding the previous frame of the three-dimensional audio signal.

    METHOD AND APPARATUS FOR DETERMINING VIRTUAL SPEAKER SET

    公开(公告)号:US20230412981A1

    公开(公告)日:2023-12-21

    申请号:US18241698

    申请日:2023-09-01

    CPC classification number: H04R5/02 H04S2420/11 H04R2205/024

    Abstract: This application provides a method and an apparatus for determining a virtual speaker set. The method for determining a virtual speaker set includes: determining a target virtual speaker from F preset virtual speakers based on a to-be-processed audio signal, where each of the F virtual speakers corresponds to S virtual speakers, F is a positive integer, and S is a positive integer greater than 1; and obtaining, from a preset virtual speaker distribution table, respective position information of S virtual speakers corresponding to the target virtual speaker, where the virtual speaker distribution table includes position information of K virtual speakers, the position information includes an elevation angle index and an azimuth angle index, K is a positive integer greater than 1, F≤K, and F×S≥K. This application can improve audio signal playback effect.

Patent Agency Ranking