Stereo encoding device, stereo decoding device, and stereo encoding method
    1.
    Granted patent
    Status: in force

    Publication No.: US08457319B2

    Publication date: 2013-06-04

    Application No.: US12064995

    Filing date: 2006-08-30

    IPC classification: H04H20/88

    CPC classification: G10L19/24 G10L19/008

    Abstract: Disclosed is a stereo encoding device capable of accurately encoding a stereo signal at a low bit rate while suppressing delay in audio communication. The device performs monaural encoding in its first layer (110). In a second layer (120), a filtering unit (103) generates LPC (Linear Predictive Coding) coefficients and a left channel drive sound source (excitation) signal. A time region evaluation unit (104) and a frequency region evaluation unit (105) perform signal evaluation and prediction in the time domain and the frequency domain, respectively. A residual encoding unit (106) encodes the residual signal. A bit distribution control unit (107) adaptively distributes bits among the time region evaluation unit (104), the frequency region evaluation unit (105), and the residual encoding unit (106) according to the condition of the audio signal.
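
The adaptive bit distribution described above can be illustrated with a minimal sketch. The abstract does not say how the "condition of the audio signal" maps to bit counts, so the per-unit weights, the function name `distribute_bits`, and the largest-remainder rounding are all assumptions made for illustration.

```python
def distribute_bits(total_bits, weights):
    """Split a bit budget across units in proportion to hypothetical weights.

    `weights` maps a unit name to a non-negative importance score that some
    signal analysis (not shown, not specified in the abstract) would provide.
    """
    total_weight = sum(weights.values())
    if total_weight <= 0:
        # Fall back to an even split when no unit is favoured.
        weights = {name: 1.0 for name in weights}
        total_weight = float(len(weights))
    exact = {name: total_bits * w / total_weight for name, w in weights.items()}
    bits = {name: int(value) for name, value in exact.items()}
    # Hand out the bits lost to rounding, largest fractional part first.
    leftover = total_bits - sum(bits.values())
    by_fraction = sorted(exact, key=lambda name: exact[name] - bits[name], reverse=True)
    for name in by_fraction[:leftover]:
        bits[name] += 1
    return bits


allocation = distribute_bits(100, {"time": 1.0, "frequency": 1.0, "residual": 2.0})
```

The rounding step only guarantees that the allocations sum to the full budget; any real codec would also enforce per-unit minimums.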

    Pulse allocating method in voice coding
    2.
    Granted patent
    Status: in force

    Publication No.: US08024187B2

    Publication date: 2011-09-20

    Application No.: US11815916

    Filing date: 2006-02-09

    IPC classification: G10L15/00

    CPC classification: G10L19/10 G10L19/008

    Abstract: A pulse allocating method is provided that can code stereo speech signals efficiently. In the fixed codebook search of this pulse allocating method, the stereo speech signals are compared for each subframe to judge the similarity between channels, and the characteristics of the signals are also judged. Based on the inter-channel similarity and the characteristics of the stereo signals, the number of pulses to allocate to each channel is determined. A pulse search is then executed to determine the pulse positions for each channel, and the determined pulses are coded.
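
A rough sketch of the per-subframe decision described above follows. The abstract names neither the similarity measure nor the allocation rule, so the zero-lag normalized cross-correlation, the 0.8 threshold, the even split for similar channels, and the energy-proportional split otherwise are assumptions chosen only to make the idea concrete.

```python
import math


def channel_similarity(left, right):
    """Normalized cross-correlation at lag 0 (an assumed similarity measure)."""
    numerator = sum(l * r for l, r in zip(left, right))
    denominator = math.sqrt(sum(l * l for l in left) * sum(r * r for r in right))
    return numerator / denominator if denominator > 0 else 0.0


def allocate_pulses(left, right, total_pulses=8, threshold=0.8):
    """Return (pulses_left, pulses_right) for one subframe.

    Hypothetical rule: similar channels share the budget evenly; dissimilar
    channels split it by energy, with at least one pulse per channel.
    """
    if channel_similarity(left, right) >= threshold:
        n_left = total_pulses // 2
    else:
        e_left = sum(x * x for x in left)
        e_right = sum(x * x for x in right)
        total = e_left + e_right
        share = e_left / total if total > 0 else 0.5
        n_left = min(total_pulses - 1, max(1, round(total_pulses * share)))
    return n_left, total_pulses - n_left
```

The pulse position search itself (the step that follows allocation in the abstract) is not shown.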

    STEREO ENCODING DEVICE, STEREO DECODING DEVICE, AND STEREO ENCODING METHOD
    3.
    Patent application
    Status: in force

    Publication No.: US20090262945A1

    Publication date: 2009-10-22

    Application No.: US12064995

    Filing date: 2006-08-30

    IPC classification: H04H20/88

    CPC classification: G10L19/24 G10L19/008

    Abstract: Disclosed is a stereo encoding device capable of accurately encoding a stereo signal at a low bit rate while suppressing delay in audio communication. The device performs monaural encoding in its first layer (110). In a second layer (120), a filtering unit (103) generates LPC (Linear Predictive Coding) coefficients and a left channel drive sound source (excitation) signal. A time region evaluation unit (104) and a frequency region evaluation unit (105) perform signal evaluation and prediction in the time domain and the frequency domain, respectively. A residual encoding unit (106) encodes the residual signal. A bit distribution control unit (107) adaptively distributes bits among the time region evaluation unit (104), the frequency region evaluation unit (105), and the residual encoding unit (106) according to the condition of the audio signal.

    Segmenting a Humming Signal Into Musical Notes
    4.
    Patent application
    Status: in force

    Publication No.: US20090171485A1

    Publication date: 2009-07-02

    Application No.: US11921593

    Filing date: 2005-06-07

    IPC classification: G06F17/00 G06N5/02

    Abstract: A method (100) and apparatus (200) are disclosed for transcribing a humming signal into a sequence of musical notes. The method begins by grouping (305) the signal into frames of data samples. Each frame is then processed to derive (320) a frequency distribution for that frame. The frequency distributions are processed to derive (410) a Harmonic Product Energy (HPE) distribution over the frames. The HPE distribution is then segmented (115, 120) to obtain boundaries of musical notes. The frequency distributions of the frames are also processed to derive (412) a fundamental frequency distribution. A pitch for each note is determined (125) from the fundamental frequency distribution.
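
The frame-to-pitch part of this pipeline can be sketched as follows. The abstract does not define the Harmonic Product Energy measure, so this example substitutes the well-known harmonic product spectrum as a stand-in for fundamental frequency estimation; the function names, the naive DFT, and the MIDI note mapping are assumptions, and note boundary segmentation is not shown.

```python
import cmath
import math


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0 .. N/2 - 1 (adequate for short frames)."""
    n = len(frame)
    return [
        abs(sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
        for k in range(n // 2)
    ]


def hps_pitch(frame, sample_rate, harmonics=3):
    """Estimate the fundamental frequency with a harmonic product spectrum."""
    mag = magnitude_spectrum(frame)
    best_bin, best_value = 1, -1.0
    for k in range(1, len(mag) // harmonics):
        product = 1.0
        for h in range(1, harmonics + 1):
            product *= mag[k * h]  # multiply energy found at each harmonic of bin k
        if product > best_value:
            best_bin, best_value = k, product
    return best_bin * sample_rate / len(frame)


def midi_note(frequency_hz):
    """Map a frequency to the nearest MIDI note number (A4 = 440 Hz = note 69)."""
    return round(69 + 12 * math.log2(frequency_hz / 440.0))


# Synthetic "hum": a 250 Hz tone with two weaker overtones.
rate, size = 8000, 256
hum = [
    math.sin(2 * math.pi * 250 * t / rate)
    + 0.5 * math.sin(2 * math.pi * 500 * t / rate)
    + 0.25 * math.sin(2 * math.pi * 750 * t / rate)
    for t in range(size)
]
pitch = hps_pitch(hum, rate)
```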

    Audio encoding apparatus and spectrum modifying method
    5.
    Granted patent
    Status: in force

    Publication No.: US08296134B2

    Publication date: 2012-10-23

    Application No.: US11914296

    Filing date: 2006-05-11

    IPC classification: G10L19/14

    Abstract: Provided are a spectrum modifying method and related apparatus that improve the efficiency of signal estimation and prediction and allow the spectrum to be encoded more efficiently. According to this method, the pitch period is calculated from an original signal, which serves as a reference signal, and a basic pitch frequency (f0) is then calculated. Thereafter, the spectrum of a target signal, which is the target of spectrum modification, is divided into a plurality of partitions. The width of each partition is set to the basic pitch frequency. The spectra of the bands are then interleaved such that a plurality of peaks having similar amplitudes are unified into a group. The basic pitch frequency is used as the interleave pitch.
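
One way to picture the interleaving step is as a stride permutation: coefficients at the same offset within each pitch-wide partition are gathered next to each other, so harmonic peaks end up in one group. The abstract does not give the exact interleaving rule, so the functions below, the `pitch_bins` parameter (f0 expressed in spectral bins), and the treatment of leftover bins are assumptions.

```python
def interleave_spectrum(spectrum, pitch_bins):
    """Group coefficients that sit at the same offset inside each partition.

    Bins beyond the last full partition are appended unchanged.
    """
    usable = len(spectrum) - len(spectrum) % pitch_bins
    partitions = [spectrum[i:i + pitch_bins] for i in range(0, usable, pitch_bins)]
    interleaved = [part[offset] for offset in range(pitch_bins) for part in partitions]
    return interleaved + list(spectrum[usable:])


def deinterleave_spectrum(interleaved, pitch_bins):
    """Invert interleave_spectrum so a decoder can restore the bin order."""
    usable = len(interleaved) - len(interleaved) % pitch_bins
    count = usable // pitch_bins  # number of full partitions
    groups = [interleaved[i:i + count] for i in range(0, usable, count)] if count else []
    restored = [groups[offset][p] for p in range(count) for offset in range(pitch_bins)]
    return restored + list(interleaved[usable:])


# Three partitions of width 3, each with a peak in its first bin, plus one spare bin.
spectrum = [9, 1, 0, 8, 1, 0, 7, 2, 0, 5]
grouped = interleave_spectrum(spectrum, 3)
```

After interleaving, the three peaks (9, 8, 7) are adjacent, which is the grouping of similar-amplitude peaks that the abstract describes.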

    Stereo signal generating apparatus and stereo signal generating method
    6.
    Granted patent
    Status: in force

    Publication No.: US08019087B2

    Publication date: 2011-09-13

    Application No.: US11573760

    Filing date: 2005-08-29

    IPC classification: H04R5/00 G10L19/00 G10L13/00

    CPC classification: G10L19/008

    Abstract: A stereo signal generating apparatus is provided that obtains stereo signals with excellent reproducibility at a low bit rate. In this stereo signal generating apparatus (90), an FT part (901) converts a time-domain monaural signal (M′t) into a frequency-domain monaural signal (M′). A power spectrum calculating part (902) determines a power spectrum (PM′). A scaling ratio calculating part (904a) determines a scaling ratio (SL) for the left channel, while a scaling ratio calculating part (904b) determines a scaling ratio (SR) for the right channel. A multiplying part (905a) multiplies the frequency-domain monaural signal (M′) by the scaling ratio (SL) to produce a left channel signal (L″) of a stereo signal, while a multiplying part (905b) multiplies the frequency-domain monaural signal (M′) by the scaling ratio (SR) to produce a right channel signal (R″) of the stereo signal.
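
The scaling step above can be sketched in a few lines. The abstract does not state how SL and SR are derived, so this example assumes each ratio is the square root of a target channel power divided by the monaural power in the same bin; the target powers `left_power` and `right_power` are hypothetical inputs standing in for whatever side information the decoder receives.

```python
import math


def power_spectrum(spectrum):
    """Per-bin power |X[k]|^2 of a complex spectrum."""
    return [abs(c) ** 2 for c in spectrum]


def scaling_ratios(mono_power, channel_power, eps=1e-12):
    """Assumed rule: ratio = sqrt(channel power / monaural power) per bin."""
    return [math.sqrt(pc / max(pm, eps)) for pm, pc in zip(mono_power, channel_power)]


def generate_stereo(mono_spectrum, left_power, right_power):
    """Scale the monaural spectrum M' into left and right channel spectra."""
    mono_power = power_spectrum(mono_spectrum)
    s_left = scaling_ratios(mono_power, left_power)
    s_right = scaling_ratios(mono_power, right_power)
    left = [m * s for m, s in zip(mono_spectrum, s_left)]
    right = [m * s for m, s in zip(mono_spectrum, s_right)]
    return left, right


left, right = generate_stereo([2 + 0j, 0 + 1j], [16.0, 1.0], [1.0, 4.0])
```

Because only magnitudes are scaled, both output channels keep the phase of the monaural signal in this sketch.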

    Multichannel signal coding equipment and multichannel signal decoding equipment
    7.
    Granted patent
    Status: in force

    Publication No.: US07630396B2

    Publication date: 2009-12-08

    Application No.: US11573100

    Filing date: 2005-08-24

    IPC classification: H04J15/00

    Abstract: Multichannel signal coding equipment is provided for presenting high-quality sound at a low bit rate. In the multichannel signal coding equipment (2), a down mix part (10) generates a monaural reference channel signal from N channel signals. A coding part (11) codes the generated reference channel signal. A signal analyzing part (12) extracts parameters indicating the characteristics of each of the N channel signals. A MUX part (13) multiplexes the coded reference channel signal with the extracted parameters.
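
The downmix, analysis, and multiplexing stages can be sketched as below. The abstract does not specify the downmix rule or which parameters are extracted, so averaging the channels and using one RMS gain per channel relative to the reference are assumptions; the reference signal is passed through uncoded, standing in for the coding part.

```python
import math


def downmix(channels):
    """Average N equal-length channels into one monaural reference signal."""
    return [sum(samples) / len(channels) for samples in zip(*channels)]


def channel_parameters(channels, reference):
    """Assumed parameter: RMS gain of each channel relative to the reference."""
    ref_energy = sum(x * x for x in reference)
    if ref_energy <= 0:
        return [0.0 for _ in channels]
    return [math.sqrt(sum(x * x for x in ch) / ref_energy) for ch in channels]


def encode_frame(channels):
    """Bundle the reference signal and per-channel parameters into one frame."""
    reference = downmix(channels)
    return {"reference": reference, "params": channel_parameters(channels, reference)}


frame = encode_frame([[1.0, 1.0], [3.0, 3.0]])
```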

    Stereo Signal Generating Apparatus and Stereo Signal Generating Method
    8.
    Patent application
    Status: in force

    Publication No.: US20080154583A1

    Publication date: 2008-06-26

    Application No.: US11573760

    Filing date: 2005-08-29

    IPC classification: H04S1/00

    CPC classification: G10L19/008

    Abstract: A stereo signal generating apparatus is provided that obtains stereo signals with excellent reproducibility at a low bit rate. In this stereo signal generating apparatus (90), an FT part (901) converts a time-domain monaural signal (M′t) into a frequency-domain monaural signal (M′). A power spectrum calculating part (902) determines a power spectrum (PM′). A scaling ratio calculating part (904a) determines a scaling ratio (SL) for the left channel, while a scaling ratio calculating part (904b) determines a scaling ratio (SR) for the right channel. A multiplying part (905a) multiplies the frequency-domain monaural signal (M′) by the scaling ratio (SL) to produce a left channel signal (L″) of a stereo signal, while a multiplying part (905b) multiplies the frequency-domain monaural signal (M′) by the scaling ratio (SR) to produce a right channel signal (R″) of the stereo signal.

    Segmenting a humming signal into musical notes
    9.
    Granted patent
    Status: in force

    Publication No.: US08193436B2

    Publication date: 2012-06-05

    Application No.: US11921593

    Filing date: 2005-06-07

    IPC classification: G10H7/00

    Abstract: A method (100) and apparatus (200) are disclosed for transcribing a humming signal into a sequence of musical notes. The method begins by grouping (305) the signal into frames of data samples. Each frame is then processed to derive (320) a frequency distribution for that frame. The frequency distributions are processed to derive (410) a Harmonic Product Energy (HPE) distribution over the frames. The HPE distribution is then segmented (115, 120) to obtain boundaries of musical notes. The frequency distributions of the frames are also processed to derive (412) a fundamental frequency distribution. A pitch for each note is determined (125) from the fundamental frequency distribution.

    Error conceal device and error conceal method
    10.
    Granted patent
    Status: in force

    Publication No.: US07809556B2

    Publication date: 2010-10-05

    Application No.: US10591601

    Filing date: 2005-03-01

    IPC classification: G10L19/10

    Abstract: Conventional error concealment processing generates a strongly fluctuating, irregular sound that is unpleasant to the ear and causes noticeable echo effects and click noise. A notification signal detection unit (301) determines the processing for an input frame. For an error frame, a sound detection unit (303) judges whether the preceding non-error data frame is a sound signal. If it is a sound frame, a sound copying unit (304) generates a replacement frame. If it is a non-sound frame, a transient signal detection unit (305) judges by transient signal detection whether it is an attack signal and selects an appropriate area from the preceding non-error frame.
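
The decision flow in this abstract can be sketched as a small function. The abstract does not describe the detectors or say which area is "appropriate", so the half-frame energy test for an attack, the choice of the post-onset half as the replacement source, and the tiling of that region are all assumptions; the sound/non-sound classification is taken as a given input flag.

```python
def energy(samples):
    """Sum of squared sample values."""
    return sum(v * v for v in samples)


def is_attack(frame, ratio=4.0):
    """Hypothetical transient test: second half much louder than the first."""
    half = len(frame) // 2
    return energy(frame[half:]) > ratio * max(energy(frame[:half]), 1e-12)


def conceal_frame(prev_good, prev_is_sound):
    """Build a replacement for an error frame from the last good frame.

    prev_is_sound is the (externally supplied) result of sound detection.
    """
    if not prev_good:
        return []
    if prev_is_sound:
        # Sound frame: copy the preceding non-error frame.
        return list(prev_good)
    if is_attack(prev_good):
        # Assumed choice: reuse only the part after the onset.
        region = prev_good[len(prev_good) // 2:]
    else:
        region = prev_good
    repeats = -(-len(prev_good) // len(region))  # ceiling division
    return (list(region) * repeats)[:len(prev_good)]
```

For example, `conceal_frame([0.0, 0.0, 1.0, 1.0], False)` treats the frame as an attack and tiles its second half to full length.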
