Compression for speech intelligibility enhancement
    1.
    发明授权
    Compression for speech intelligibility enhancement 有权
    压缩语音清晰度增强

    公开(公告)号:US09336785B2

    公开(公告)日:2016-05-10

    申请号:US12464498

    申请日:2009-05-12

    摘要: A speech intelligibility enhancement (SIE) system and method is described that improves the intelligibility of a speech signal to be played back by an audio device when the audio device is located in an environment with loud acoustic background noise. In an embodiment, the audio device comprises a near-end telephony terminal and the speech signal comprises a speech signal received over a communication network from a far-end telephony terminal for playback at the near-end telephony terminal.

    摘要翻译: 描述了一种语音清晰度增强(SIE)系统和方法,当音频设备位于具有大声背景噪声的环境中时,提高了由音频设备重放的语音信号的可懂度。 在一个实施例中,音频设备包括近端电话终端,并且语音信号包括通过通信网络从远端电话终端接收的用于在近端电话终端回放的语音信号。

    Method and apparatus for wind noise detection and suppression using multiple microphones
    2.
    发明授权
    Method and apparatus for wind noise detection and suppression using multiple microphones 有权
    使用多个麦克风进行风噪声检测和抑制的方法和装置

    公开(公告)号:US09330675B2

    公开(公告)日:2016-05-03

    申请号:US13250355

    申请日:2011-09-30

    摘要: Unlike sound based pressure waves that go everywhere, air turbulence caused by wind is usually a fairly local event. Therefore, in a system that utilizes two or more spatially separated microphones to pick up sound signals (e.g., speech), wind noise picked up by one of the microphones often will not be picked up (or at least not to the same extent) by the other microphone(s). Embodiments of methods and apparatuses that utilize this tact and others to effectively detect and suppress wind noise using multiple microphones that are spatially separated are described.

    摘要翻译: 不同于声音的压力波,无处不在,由风引起的空气湍流通常是一个相当地方的事件。 因此,在使用两个或多个空间分离的麦克风拾取声音信号(例如语音)的系统中,由麦克风之一拾取的风噪声通常不会被拾取(或至少不相同的程度),由 另一个麦克风。 描述利用这种技巧和其他技术来有效地检测和抑制使用空间分离的多个麦克风的风噪声的方法和装置的实施例。

    Systems and methods for enhancing audio quality of FM receivers
    3.
    发明授权
    Systems and methods for enhancing audio quality of FM receivers 有权
    提高FM接收机音频质量的系统和方法

    公开(公告)号:US09130643B2

    公开(公告)日:2015-09-08

    申请号:US13362244

    申请日:2012-01-31

    摘要: Systems and methods are described for enhancing the audio quality of an FM receiver. In embodiments described herein, quadrature L−R demodulation is applied to a composite baseband signal output by an FM demodulator to obtain an L−R noise signal. A channel quality measure is calculated based on the L−R noise signal and is used to control whether a pop suppression technique is applied to an L+R signal obtained from the composite baseband signal to detect and remove noise pulses therefrom. The channel quality measure and the L−R noise signal are also leveraged to perform single-channel noise suppression in the frequency domain on an L−R signal obtained from the composite baseband signal and on the L+R signal. The channel quality measure is also used to control the application of a fast fading compensation process that replaces noisy segments of the L−R and L+R signal with replacement waveforms generated via waveform extrapolation.

    摘要翻译: 描述了用于增强FM接收机的音频质量的系统和方法。 在本文描述的实施例中,正交L-R解调被应用于由FM解调器输出的复合基带信号以获得L-R噪声信号。 基于L-R噪声信号计算信道质量度量,并且用于控制是否将弹出抑制技术应用于从复合基带信号获得的L + R信号,以从其检测和去除噪声脉冲。 信道质量测量和L-R噪声信号也被用于在从复合基带信号和L + R信号获得的L-R信号上对频域执行单信道噪声抑制。 信道质量测量还用于控制快速衰落补偿过程的应用,其中替换了通过波形外推产生的替换波形的L-R和L + R信号的噪声段。

    Dynamic time scale modification for reduced bit rate audio coding
    4.
    发明授权
    Dynamic time scale modification for reduced bit rate audio coding 有权
    用于降低比特率音频编码的动态时间尺度修改

    公开(公告)号:US08670990B2

    公开(公告)日:2014-03-11

    申请号:US12847120

    申请日:2010-07-30

    IPC分类号: G10L21/04 G10L11/06

    CPC分类号: G10L19/22 G10L19/08

    摘要: Systems and methods are described that utilize dynamic time scale modification (TSM) to achieve reduced bit rate audio coding. In accordance with embodiments, different levels of TSM compression are selectively applied to segments of an input speech signal prior to encoding thereof by an encoder. Encoded TSM-compressed segments are received at a decoder which decodes such segments and then applies an appropriate level of TSM decompression to each based on information received from the encoder. By selectively applying different levels of TSM compression to segments of an input speech signal prior to encoding, a coding bit rate associated with the encoder/decoder is reduced. Furthermore, by selecting a level of TSM compression for each segment of the input speech signal that takes into account certain local characteristics of that signal, such bit rate reduction is provided without introducing unacceptable levels of distortion into an output speech signal produced by the decoder.

    摘要翻译: 描述了利用动态时间尺度修正(TSM)来实现降低的比特率音频编码的系统和方法。 根据实施例,在由编码器对TSM压缩进行编码之前,将不同级别的TSM压缩选择性地应用于输入语音信号的段。 编码的TSM压缩段在解码器处被接收,解码器对这些段进行解码,然后基于从编码器接收的信息向每个TSM解压缩应用适当级别的TSM解压缩。 通过在编码之前选择性地对输入语音信号的段应用不同级别的TSM压缩,减少与编码器/解码器相关联的编码比特率。 此外,通过选择考虑到该信号的某些局部特性的输入语音信号的每个段的TSM压缩级别,提供这样的比特率降低,而不会将不可接受的失真电平引入到由解码器产生的输出语音信号中。

    Method and Apparatus For Wind Noise Detection and Suppression Using Multiple Microphones
    5.
    发明申请
    Method and Apparatus For Wind Noise Detection and Suppression Using Multiple Microphones 有权
    使用多个麦克风进行风噪声检测和抑制的方法和装置

    公开(公告)号:US20120121100A1

    公开(公告)日:2012-05-17

    申请号:US13250355

    申请日:2011-09-30

    IPC分类号: G10K11/16

    摘要: Unlike sound based pressure waves that go everywhere, air turbulence caused by wind is usually a fairly local event. Therefore, in a system that utilizes two or more spatially separated microphones to pick up sound signals (e.g., speech), wind noise picked up by one of the microphones often will not be picked up (or at least not to the same extent) by the other microphone(s). Embodiments of methods and apparatuses that utilize this tact and others to effectively detect and suppress wind noise using multiple microphones that are spatially separated are described.

    摘要翻译: 不同于声音的压力波,无处不在,由风引起的空气湍流通常是一个相当地方的事件。 因此,在使用两个或多个空间分离的麦克风拾取声音信号(例如语音)的系统中,由麦克风之一拾取的风噪声通常不会被拾取(或至少不相同的程度),由 另一个麦克风。 描述利用这种技巧和其他技术来有效地检测和抑制使用空间分离的多个麦克风的风噪声的方法和装置的实施例。

    Adaptive postfiltering methods and systems for decoding speech
    6.
    发明授权
    Adaptive postfiltering methods and systems for decoding speech 有权
    自适应后置滤波方法和解码语音系统

    公开(公告)号:US08032363B2

    公开(公告)日:2011-10-04

    申请号:US10215048

    申请日:2002-08-09

    IPC分类号: G10L19/14 G10L21/02

    CPC分类号: G10L19/26

    摘要: A method of processing a decoded speech (DS) signal including successive DS frames, each DS frame including DS samples. The method comprises: adaptively filtering the DS signal to produce a filtered signal; gain-scaling the filtered signal with an adaptive gain updated once a DS frame, thereby producing a gain-scaled signal; and performing a smoothing operation to smooth possible waveform discontinuities in the gain-scaled signal.

    摘要翻译: 一种处理包括连续DS帧的解码语音(DS)信号的方法,每个DS帧包括DS采样。 该方法包括:对DS信号进行自适应滤波以产生滤波信号; 利用DS帧更新一次自适应增益来对滤波后的信号进行增益放大,从而产生增益标度信号; 并且执行平滑操作以平滑增益调节信号中的可能的波形不连续性。

    Packet loss concealment for block-independent speech codecs
    7.
    发明授权
    Packet loss concealment for block-independent speech codecs 有权
    分组丢失隐藏块独立语音编解码器

    公开(公告)号:US07930176B2

    公开(公告)日:2011-04-19

    申请号:US11234291

    申请日:2005-09-26

    申请人: Juin-Hwey Chen

    发明人: Juin-Hwey Chen

    IPC分类号: G10L21/00 G10L19/00 G10L21/02

    CPC分类号: G10L19/005

    摘要: A technique for performing frame erasure concealment (FEC) in a speech decoder. One or more non-erased frames of a speech signal are decoded in a block-independent manner. When an erased frame is detected, a short-term predictive filter and a long-term predictive filter are derived based on previously-decoded portions of the speech signal. A periodic waveform component is generated using the short-term predictive filter and the long-term predictive filter. A random waveform component is generated using the short-term predictive filter. A replacement frame is generated for the erased frame. The replacement frame may be generated based on the periodic waveform component, the random waveform component, or a mixture of both.

    摘要翻译: 一种用于在语音解码器中执行帧擦除隐藏(FEC)的技术。 语音信号的一个或多个未擦除的帧以块独立的方式被解码。 当检测到擦除的帧时,基于语音信号的先前解码的部分导出短期预测滤波器和长期预测滤波器。 使用短期预测滤波器和长期预测滤波器生成周期波形分量。 使用短期预测滤波器生成随机波形分量。 为已擦除的帧生成替换帧。 可以基于周期性波形分量,随机波形分量或两者的混合来生成替换帧。

    DYNAMIC TIME SCALE MODIFICATION FOR REDUCED BIT RATE AUDIO CODING
    8.
    发明申请
    DYNAMIC TIME SCALE MODIFICATION FOR REDUCED BIT RATE AUDIO CODING 有权
    用于减少比特率音频编码的动态时间尺度修改

    公开(公告)号:US20110029317A1

    公开(公告)日:2011-02-03

    申请号:US12847120

    申请日:2010-07-30

    IPC分类号: G10L19/00

    CPC分类号: G10L19/22 G10L19/08

    摘要: Systems and methods are described that utilize dynamic time scale modification (TSM) to achieve reduced bit rate audio coding. In accordance with embodiments, different levels of TSM compression are selectively applied to segments of an input speech signal prior to encoding thereof by an encoder. Encoded TSM-compressed segments are received at a decoder which decodes such segments and then applies an appropriate level of TSM decompression to each based on information received from the encoder. By selectively applying different levels of TSM compression to segments of an input speech signal prior to encoding, a coding bit rate associated with the encoder/decoder is reduced. Furthermore, by selecting a level of TSM compression for each segment of the input speech signal that takes into account certain local characteristics of that signal, such bit rate reduction is provided without introducing unacceptable levels of distortion into an output speech signal produced by the decoder.

    摘要翻译: 描述了利用动态时间尺度修正(TSM)来实现降低的比特率音频编码的系统和方法。 根据实施例,在由编码器对TSM压缩进行编码之前,将不同级别的TSM压缩选择性地应用于输入语音信号的段。 编码的TSM压缩段在解码器处被接收,解码器对这些段进行解码,然后基于从编码器接收的信息向每个TSM解压缩应用适当级别的TSM解压缩。 通过在编码之前选择性地对输入语音信号的段应用不同级别的TSM压缩,减少与编码器/解码器相关联的编码比特率。 此外,通过选择考虑到该信号的某些局部特性的输入语音信号的每个段的TSM压缩级别,提供这样的比特率降低,而不会将不可接受的失真电平引入到由解码器产生的输出语音信号中。

    Pitch extraction methods and systems for speech coding using sub-multiple time lag extraction
    9.
    发明授权
    Pitch extraction methods and systems for speech coding using sub-multiple time lag extraction 失效
    使用次多时滞提取语音编码的音调提取方法和系统

    公开(公告)号:US07752037B2

    公开(公告)日:2010-07-06

    申请号:US10284339

    申请日:2002-10-31

    申请人: Juin-Hwey Chen

    发明人: Juin-Hwey Chen

    IPC分类号: G10L11/04

    CPC分类号: G10L25/90

    摘要: A method of determining a pitch period of an audio signal using a correlation-based signal derived from the audio signal. The correlation-based signal includes known peaks each corresponding to a respective one of known time lags. The known peaks includes a global maximum peak. The method comprises: (a) determining if a candidate peak among the local peaks exceeds a peak threshold; (b) determining if a candidate time lag corresponding to the candidate peak is within a predetermined range of at least one integer sub-multiple of the time lag corresponding to the global maximum peak; and (c) setting the pitch period equal to the candidate time lag when the determinations of both steps (a) and (b) are true.

    摘要翻译: 一种使用从音频信号导出的基于相关的信号确定音频信号的音调周期的方法。 基于相关的信号包括已知的峰值,其各自对应于已知时间滞后的相应的一个。 已知峰包括全局最大峰。 该方法包括:(a)确定局部峰值中的候选峰值是否超过峰值阈值; (b)确定与所述候选峰值相对应的候选时间延迟是否在对应于全局最大峰值的时间延迟的至少一个整数子倍数的预定范围内; 和(c)当步骤(a)和(b)的确定为真时,将音调周期设置为等于候选时间延迟。

    Method and system for frame erasure concealment for predictive speech coding based on extrapolation of speech waveform
    10.
    发明授权
    Method and system for frame erasure concealment for predictive speech coding based on extrapolation of speech waveform 有权
    基于语音波形外推预测语音编码的帧擦除隐藏方法和系统

    公开(公告)号:US07711563B2

    公开(公告)日:2010-05-04

    申请号:US10183608

    申请日:2002-06-28

    申请人: Juin-Hwey Chen

    发明人: Juin-Hwey Chen

    IPC分类号: G10L13/00

    CPC分类号: G10L19/005

    摘要: A method and system are provided for synthesizing a corrupted frame output from a decoder including one or more predictive filters. The corrupted frame is representative of one segment of a decoded signal output from the decoder. The method comprises extrapolating a replacement frame based upon another segment of the decoded signal and substituting the replacement frame for the corrupted frame. Finally, the internal states of the filters are updated based upon the substituting.

    摘要翻译: 提供了一种用于合成从包括一个或多个预测滤波器的解码器输出的损坏帧的方法和系统。 损坏的帧代表从解码器输出的解码信号的一个段。 该方法包括基于解码信号的另一段外推替换帧,并将替换帧替换为损坏的帧。 最后,过滤器的内部状态基于替换来更新。