ENCODING METHOD AND APPARATUS, DECODING METHOD AND APPARATUS, DEVICE, STORAGE MEDIUM, AND COMPUTER PROGRAM

    公开(公告)号:US20240105189A1

    公开(公告)日:2024-03-28

    申请号:US18533612

    申请日:2023-12-08

    CPC classification number: G10L19/0204 G10L19/167 G10L25/18 G10L25/21

    Abstract: This application disclose an encoding method and apparatus, a decoding method and apparatus, a device, a storage medium, and a computer program, and belong to the field of encoding and decoding technologies. In embodiments of this application, a first whitened spectrum for media data is whitened to obtain a second whitened spectrum, and then encoding is performed based on the second whitened spectrum. A spectral amplitude of the second whitened spectrum in a target frequency band is greater than or equal to a spectral amplitude of the first whitened spectrum in the target frequency band. It can be learned that, in this solution, the spectral amplitude of the first whitened spectrum in the target frequency band is increased, so that a difference between statistical average energy of spectral lines for different frequencies in the obtained second whitened spectrum is small.

    VEHICLE FAULT DIAGNOSTIC METHOD AND ON-BOARD DIAGNOSTIC APPARATUS

    公开(公告)号:US20240053738A1

    公开(公告)日:2024-02-15

    申请号:US18486707

    申请日:2023-10-13

    Inventor: Zhe WANG

    Abstract: A vehicle diagnostics technology is provided. Further, a vehicle fault diagnostic method, an on-board diagnostic apparatus, and a related system that are used for advanced intelligent driving are provided. In an example method, a first diagnostic module monitors and records first fault information, and a second diagnostic module monitors and records second fault information. The second diagnostic module sends the second fault information to the first diagnostic module. The example method further includes the first diagnostic module communicating with a master computer through a first communication interface and the second diagnostic module communicating with the master computer through a second communication interface.

    BIT ALLOCATION METHOD AND APPARATUS FOR AUDIO OBJECT

    公开(公告)号:US20230368801A1

    公开(公告)日:2023-11-16

    申请号:US18224237

    申请日:2023-07-20

    CPC classification number: G10L19/002 G10L19/02 G10L25/21

    Abstract: A bit allocation method and apparatus for an audio object are disclosed, which relate to the field of audio encoding and decoding technologies. The method includes: separately pre-rendering a plurality of audio objects to be pre-rendered in an audio frame, to obtain a plurality of pre-rendered audio objects; obtaining respective perceptual importance parameter values of the plurality of pre-rendered audio objects; obtaining a bit allocation parameter value of a current audio object to be pre-rendered based on the respective perceptual importance parameter values of the plurality of pre-rendered audio objects; and determining, based on the bit allocation parameter value of the current audio object to be pre-rendered and a total quantity of to-be-allocated bits corresponding to the plurality of audio objects to be pre-rendered, a target quantity of bits allocated to the current audio object to be pre-rendered.

    AUDIO ENCODING AND DECODING METHOD AND APPARATUS

    公开(公告)号:US20230298601A1

    公开(公告)日:2023-09-21

    申请号:US18202930

    申请日:2023-05-28

    CPC classification number: G10L19/008

    Abstract: Audio encoding and decoding methods and apparatuses are disclosed, to reduce an amount of encoded and decoded data, so as to improve encoding and decoding efficiency. The method includes: selecting a first target virtual speaker from a preset virtual speaker set based on a first scene audio signal; generating a first virtual speaker signal based on the first scene audio signal and attribute information of the first target virtual speaker; obtaining a second scene audio signal using the attribute information of the first target virtual speaker and the first virtual speaker signal; generating a residual signal based on the first scene audio signal and the second scene audio signal; and encoding the first virtual speaker signal and the residual signal, to produce encoded signals, and writing the encoded signals into a bitstream.

    MULTI-CHANNEL AUDIO SIGNAL ENCODING AND DECODING METHOD AND APPARATUS

    公开(公告)号:US20230154471A1

    公开(公告)日:2023-05-18

    申请号:US18153128

    申请日:2023-01-11

    CPC classification number: G10L19/008 G10L25/06

    Abstract: A multi-channel audio signal encoding method includes: obtaining a to-be-encoded first audio frame; obtaining a correlation value set, where the correlation value set includes respective correlation values of a plurality of channel pairs; selecting M correlation values from the correlation value set, where all the M correlation values are greater than correlation values other than the M correlation values in the correlation value set, and all the M correlation values are greater than or equal to a pairing threshold; obtaining M channel pair sets; determining a target channel pair set from the M channel pair sets, where a sum of correlation values of all channel pairs in the target channel pair set is the largest in those of the M channel pair sets; and encoding the first audio frame based on the target channel pair set.

    AUDIO ENCODING METHOD AND DEVICE AND AUDIO DECODING METHOD AND DEVICE

    公开(公告)号:US20220335962A1

    公开(公告)日:2022-10-20

    申请号:US17857725

    申请日:2022-07-05

    Abstract: An audio encoding method and device and an audio decoding method and device are provided. The audio encoding method includes: obtaining a current frame of an audio signal, where the current frame includes a high frequency band signal and a low frequency band signal; obtaining a compatible layer encoding parameter of the current frame based on the high frequency band signal and the low frequency band signal; obtaining an enhancement layer encoding parameter of the current frame based on the high frequency band signal; and performing bitstream multiplexing on the compatible layer encoding parameter and the enhancement layer encoding parameter to obtain an encoded bitstream.

    DATA EXCHANGE METHOD, TERMINAL DEVICE, AND NETWORK DEVICE

    公开(公告)号:US20200029343A1

    公开(公告)日:2020-01-23

    申请号:US16587042

    申请日:2019-09-29

    Abstract: Embodiments provide a data exchange method, a terminal device, and a network device. In accordance with the disclosure, a terminal device can obtain resource information for communicating with another terminal device. The resource information can indicate a time-frequency resource and an antenna port corresponding to an antenna polarization direction. The terminal device can then send scheduling information and data information to the another terminal device using the time-frequency resource and the antenna polarization direction. Time-frequency resources and/or antenna polarization directions used by any two terminal devices to send scheduling information and data information to other terminal devices can be different. In this way, terminal devices using different transmit antenna ports can use a same time-frequency resource, and an optional dimension of resource information is increased, thereby increasing an overall system communication capacity.

    NOISE SIGNAL PROCESSING METHOD, NOISE SIGNAL GENERATION METHOD, ENCODER, DECODER, AND ENCODING AND DECODING SYSTEM

    公开(公告)号:US20170323648A1

    公开(公告)日:2017-11-09

    申请号:US15662043

    申请日:2017-07-27

    Inventor: Zhe WANG

    Abstract: Present disclosure provide a linear prediction-based noise signal processing method includes: acquiring a noise signal, and obtaining a linear prediction coefficient according to the noise signal; filtering the noise signal according to the linear prediction coefficient, to obtain a linear prediction residual signal; obtaining a spectral envelope of the linear prediction residual signal according to the linear prediction residual signal; and encoding the spectral envelope of the linear prediction residual signal. According to the noise processing method, the noise generation method, the encoder, the decoder, and the encoding and decoding system that are in the embodiments of the present disclosure, more spectral details of an original background noise signal can be recovered, so that comfort noise can be closer to original background noise in terms of subjective auditory perception of a user, and subjective perception quality of the user is improved.

    METHOD FOR DETECTING AUDIO SIGNAL AND APPARATUS
    29.
    发明申请
    METHOD FOR DETECTING AUDIO SIGNAL AND APPARATUS 审中-公开
    检测音频信号和设备的方法

    公开(公告)号:US20160379670A1

    公开(公告)日:2016-12-29

    申请号:US15262263

    申请日:2016-09-12

    Inventor: Zhe WANG

    CPC classification number: G10L25/78 G10L25/18 G10L2025/783

    Abstract: Embodiments disclosed herein provide a method for detecting an audio signal and an apparatus, where the method includes determining an input audio signal as a to-be-determined audio signal; determining an enhanced segmental signal-to-noise ratio (SSNR) of the audio signal, where the enhanced SSNR is greater than a reference SSNR; and comparing the enhanced SSNR with a voice activity detection (VAD) decision threshold to determine whether the audio signal is an active signal. According to the method and the apparatus provided in the embodiments, an active voice and an inactive voice can be accurately distinguished.

    Abstract translation: 本文公开的实施例提供了一种用于检测音频信号和装置的方法,其中该方法包括将输入音频信号确定为待定音频信号; 确定所述音频信号的增强的分段信噪比(SSNR),其中所述增强的SSNR大于参考SSNR; 以及将增强的SSNR与语音活动检测(VAD)判定阈值进行比较,以确定音频信号是否是活动信号。 根据实施例中提供的方法和装置,可以准确地区分主动语音和非活动语音。

    METHOD AND APPARATUS FOR ENCODING THREE-DIMENSIONAL AUDIO SIGNAL, ENCODER, AND SYSTEM

    公开(公告)号:US20240119950A1

    公开(公告)日:2024-04-11

    申请号:US18538708

    申请日:2023-12-13

    CPC classification number: G10L19/008 G10L25/21 H04S7/30

    Abstract: A method for encoding a three-dimensional audio signal is provided. The method includes: An encoder obtains a current frame of a three-dimensional audio signal; obtains coding efficiency of an initial virtual speaker for the current frame based on the current frame of the three-dimensional audio signal; and when the coding efficiency of the initial virtual speaker for the current frame meets a preset condition, determines an updated virtual speaker for the current frame from a set of candidate virtual speakers; encodes the current frame based on the updated virtual speaker for the current frame, to obtain a first bitstream; or when the coding efficiency of the initial virtual speaker for the current frame does not meet the preset condition, encodes the current frame based on the initial virtual speaker for the current frame, to obtain a second bitstream.

Patent Agency Ranking