Audio Encoding Method and Apparatus
    81.
    Patent application

    Publication No.: US20190311727A1

    Publication date: 2019-10-10

    Application No.: US16439954

    Filing date: 2019-06-13

    Inventor: Zhe Wang

    Abstract: An audio encoding method includes dividing an energy spectrum of a current audio frame into P FFT energy spectrum coefficients; determining a minimum bandwidth of distribution, on the spectrum, of first-preset-proportion energy of the current audio frame according to the energy of the P FFT energy spectrum coefficients, wherein this minimum bandwidth indicates the sparseness of distribution, on the spectrum, of the energy of the current audio frame; and determining to use a linear-prediction-based encoding method to encode the current audio frame when the minimum bandwidth of distribution is greater than a first preset value.
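
The decision rule described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names, the 0.9 proportion, and the threshold of 8 bins are hypothetical, and counting the largest coefficients first is one assumed reading of "minimum bandwidth".

```python
def min_energy_bandwidth(energies, proportion):
    """Smallest number of spectral bins whose energies together reach
    `proportion` of the frame's total energy (assumed interpretation)."""
    total = sum(energies)
    if total <= 0:
        return 0
    target = proportion * total
    accumulated = 0.0
    # Taking the largest coefficients first gives the fewest bins needed.
    for count, energy in enumerate(sorted(energies, reverse=True), start=1):
        accumulated += energy
        if accumulated >= target:
            return count
    return len(energies)


def use_linear_prediction(energies, proportion=0.9, first_preset_value=8):
    """True when the energy is spread widely (bandwidth above the threshold).
    Both default values are hypothetical, chosen only for the example."""
    return min_energy_bandwidth(energies, proportion) > first_preset_value


# A frame dominated by one bin is sparse; a flat frame is not.
tonal_frame = [100.0] + [0.1] * 15
flat_frame = [1.0] * 16
print(min_energy_bandwidth(tonal_frame, 0.9), use_linear_prediction(tonal_frame))
print(min_energy_bandwidth(flat_frame, 0.9), use_linear_prediction(flat_frame))
```

A small bandwidth means the energy is concentrated in a few bins (a sparse spectrum); the abstract only states the outcome for the opposite case, so the sketch returns a boolean rather than naming an alternative encoder.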

    Audio encoding method and apparatus

    Publication No.: US10347267B2

    Publication date: 2019-07-09

    Application No.: US15682097

    Filing date: 2017-08-21

    Inventor: Zhe Wang

    Abstract: An audio encoding method and an apparatus are provided. The method includes: determining sparseness of distribution, on spectrums, of energy of N input audio frames (101), where the N audio frames include a current audio frame, and N is a positive integer; and determining, according to the sparseness of distribution, on the spectrums, of the energy of the N audio frames, whether to use a first encoding method or a second encoding method to encode the current audio frame (102), where the first encoding method is an encoding method that is based on time-frequency transform and transform coefficient quantization and that is not based on linear prediction, and the second encoding method is a linear-prediction-based encoding method. The method can reduce encoding complexity and ensure that encoding is of relatively high accuracy.

    Method for detecting audio signal and apparatus

    Publication No.: US10304478B2

    Publication date: 2019-05-28

    Application No.: US15262263

    Filing date: 2016-09-12

    Inventor: Zhe Wang

    Abstract: Embodiments disclosed herein provide a method for detecting an audio signal and an apparatus, where the method includes determining an input audio signal as a to-be-determined audio signal; determining an enhanced segmental signal-to-noise ratio (SSNR) of the audio signal, where the enhanced SSNR is greater than a reference SSNR; and comparing the enhanced SSNR with a voice activity detection (VAD) decision threshold to determine whether the audio signal is an active signal. According to the method and the apparatus provided in the embodiments, an active voice and an inactive voice can be accurately distinguished.

    Method and apparatus for detecting a voice activity in an input audio signal

    Publication No.: US10134417B2

    Publication date: 2018-11-20

    Application No.: US15700165

    Filing date: 2017-09-10

    Inventor: Zhe Wang

    Abstract: The disclosure provides a method and an apparatus for detecting a voice activity in an input audio signal composed of frames. A noise characteristic of the input signal is determined based on a received frame of the input audio signal. A voice activity detection (VAD) parameter is derived based on the noise characteristic of the input audio signal using an adaptive function. The derived VAD parameter is compared with a threshold value to provide a voice activity detection decision. The input audio signal is processed according to the voice activity detection decision.
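
The processing chain in this abstract (noise characteristic, adaptive function, threshold comparison) can be illustrated roughly as below. The abstract does not specify the adaptive function, so the weighting formula, the sub-band SNR inputs, and the threshold of 1.0 are all hypothetical choices made only to show the data flow.

```python
import math


def adaptive_vad_parameter(subband_snrs, noise_level):
    """Derive a VAD parameter from per-band SNRs.

    Hypothetical adaptive function: the weight shrinks as the estimated
    noise level grows, so the same SNR contributes less in loud noise.
    """
    weight = 1.0 / (1.0 + noise_level)
    return sum(weight * math.log10(1.0 + snr) for snr in subband_snrs)


def vad_decision(subband_snrs, noise_level, threshold=1.0):
    """Compare the derived parameter with a threshold (assumed value)."""
    return adaptive_vad_parameter(subband_snrs, noise_level) > threshold


# Linear (not dB) SNR values per sub-band, chosen for illustration.
print(vad_decision([9.0, 9.0, 9.0], noise_level=0.0))  # strong signal
print(vad_decision([0.0, 0.0, 0.0], noise_level=0.0))  # no signal above noise
```

The point of the sketch is only that the noise estimate changes how the parameter is computed, rather than changing the threshold alone.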

    AUDIO ENCODING METHOD AND APPARATUS
    86.
    Patent application

    Publication No.: US20170345436A1

    Publication date: 2017-11-30

    Application No.: US15682097

    Filing date: 2017-08-21

    Inventor: Zhe Wang

    Abstract: An audio encoding method and an apparatus are provided. The method includes: determining sparseness of distribution, on spectrums, of energy of N input audio frames (101), where the N audio frames include a current audio frame, and N is a positive integer; and determining, according to the sparseness of distribution, on the spectrums, of the energy of the N audio frames, whether to use a first encoding method or a second encoding method to encode the current audio frame (102), where the first encoding method is an encoding method that is based on time-frequency transform and transform coefficient quantization and that is not based on linear prediction, and the second encoding method is a linear-prediction-based encoding method. The method can reduce encoding complexity and ensure that encoding is of relatively high accuracy.

    METHOD AND APPARATUS FOR DETECTING A VOICE ACTIVITY IN AN INPUT AUDIO SIGNAL
    87.
    Patent application
    Status: under examination (published)

    Publication No.: US20160260443A1

    Publication date: 2016-09-08

    Application No.: US15157424

    Filing date: 2016-05-18

    Inventor: Zhe Wang

    Abstract: The disclosure provides a method and an apparatus for detecting a voice activity in an input audio signal composed of frames. A noise attribute of the input signal is determined based on a received frame of the input audio signal. A voice activity detection (VAD) parameter is derived based on the noise attribute of the input audio signal using an adaptive function. The derived VAD parameter is compared with a threshold value to provide a voice activity detection decision. The input audio signal is processed according to the voice activity detection decision.

    Method and apparatus for performing voice activity detection
    88.
    Granted patent
    Status: in force

    Publication No.: US09390729B2

    Publication date: 2016-07-12

    Application No.: US14341114

    Filing date: 2014-07-25

    Inventor: Zhe Wang

    CPC classification number: G10L25/93 G10L25/78 G10L2025/786

    Abstract: A voice activity detection (VAD) apparatus is configured to provide a voice activity detection decision for an input audio signal. The VAD apparatus includes a state detector and a voice activity calculator. The state detector is configured to determine, based on the input audio signal, a current working state of the VAD apparatus among at least two different working states. Each of the at least two different working states is associated with a corresponding working state parameter decision set which includes at least one voice activity detection parameter. The voice activity calculator is configured to calculate a voice activity detection parameter value for the at least one voice activity detection parameter of the working state parameter decision set associated with the current working state, and to provide the voice activity detection decision by comparing the calculated voice activity detection parameter value with a threshold.
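
The two-component structure in this abstract (a state detector selecting a working state, and a calculator using that state's parameter decision set) might be organized as in the sketch below. The state names, the 10 dB switching point, the parameter names, and the thresholds are hypothetical placeholders; the abstract does not define them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ParameterDecisionSet:
    parameter: str    # which VAD parameter this working state relies on
    threshold: float  # decision threshold for that parameter


# Hypothetical working states, each with its own parameter decision set.
DECISION_SETS = {
    "normal": ParameterDecisionSet("segmental_snr", 3.0),
    "low_snr": ParameterDecisionSet("long_term_energy_ratio", 1.5),
}


def detect_state(long_term_snr_db):
    """State detector: pick a working state from the input signal
    (here reduced to an assumed long-term SNR estimate in dB)."""
    return "low_snr" if long_term_snr_db < 10.0 else "normal"


def vad_decision(long_term_snr_db, frame_features):
    """Voice activity calculator: evaluate only the parameter that belongs
    to the current working state and compare it with that state's threshold."""
    decision_set = DECISION_SETS[detect_state(long_term_snr_db)]
    value = frame_features[decision_set.parameter]
    return value > decision_set.threshold


features = {"segmental_snr": 2.0, "long_term_energy_ratio": 2.0}
print(vad_decision(20.0, features))  # "normal" state uses segmental_snr
print(vad_decision(5.0, features))   # "low_snr" state uses the energy ratio
```

Keeping the parameter and its threshold together per state means the same frame can yield different decisions depending on the detected working state, which is the behavior the abstract describes.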

    Method, Apparatus, and System for Processing Audio Data
    89.
    Patent application
    Status: in force

    Publication No.: US20140316774A1

    Publication date: 2014-10-23

    Application No.: US14318899

    Filing date: 2014-06-30

    Inventor: Zhe Wang

    Abstract: A method, an apparatus, and a system for processing audio data are provided that pertain to the field of communications technologies. The method includes: obtaining a noise frame of an audio signal, and decomposing the current noise frame into a noise low-band signal and a noise high-band signal; and encoding and transmitting the noise low-band signal by using a first discontinuous transmission mechanism, and encoding and transmitting the noise high-band signal by using a second discontinuous transmission mechanism. Because different processing manners are used for the high-band signal and the low-band signal, computational load and encoded bits may be saved without lowering the subjective quality of the codec, and the saved bits may help reduce transmission bandwidth or improve overall encoding quality.
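
The idea of applying a different discontinuous transmission (DTX) mechanism to each band can be sketched as below. The two policies shown (a fixed update interval for the low band, a change-triggered update for the high band) are hypothetical examples, since the abstract does not say what the two mechanisms are; band splitting is assumed to have happened already, so each frame is represented only by its per-band energy in dB.

```python
class IntervalDtx:
    """Hypothetical policy: send a noise update every `interval` frames."""

    def __init__(self, interval):
        self.interval = interval
        self.count = 0

    def should_send(self, energy_db):
        send = self.count % self.interval == 0
        self.count += 1
        return send


class ChangeDtx:
    """Hypothetical policy: send an update only when the band energy has
    moved more than `delta_db` away from the last transmitted value."""

    def __init__(self, delta_db):
        self.delta_db = delta_db
        self.last_sent = None

    def should_send(self, energy_db):
        if self.last_sent is None or abs(energy_db - self.last_sent) > self.delta_db:
            self.last_sent = energy_db
            return True
        return False


def schedule(frames, low_dtx, high_dtx):
    """frames: list of (low_band_energy_db, high_band_energy_db) pairs.
    Returns, per frame, whether each band transmits an update."""
    return [(low_dtx.should_send(low), high_dtx.should_send(high))
            for low, high in frames]


noise_frames = [(-50.0, -60.0)] * 4 + [(-50.0, -50.0)]
print(schedule(noise_frames, IntervalDtx(2), ChangeDtx(3.0)))
```

In this example the high band transmits only on the first frame and again when its energy jumps by 10 dB, while the low band keeps its regular schedule, which shows how bits can be saved on one band independently of the other.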
