OPERATOR RECOGNITION DEVICE, OPERATOR RECOGNITION METHOD AND OPERATOR RECOGNITION PROGRAM
    1.
    发明申请
    OPERATOR RECOGNITION DEVICE, OPERATOR RECOGNITION METHOD AND OPERATOR RECOGNITION PROGRAM 失效
    操作员识别装置,操作者识别方法和操作者识别程序

    公开(公告)号:US20090254757A1

    公开(公告)日:2009-10-08

    申请号:US11910415

    申请日:2006-03-24

    IPC分类号: G06F21/00 G06K9/00 G10L17/00

    CPC分类号: G10L17/16 G10L17/10

    摘要: An operator recognition device is provided that eliminates the registration of data such as HMM data having a characteristic amount for which error in recognition occurs easily when recognizing an operator, and thus reduces the possibility of errors in recognition, and has stable recognition performance. When registering HMM data that is used when performing recognition processing, a speaker recognition device 100 eliminates the registration of HMM data of a password having a characteristic amount of the spoken voice component that is similar to a characteristic amount that is indicated by HMM data that is already registered, and does not allow the registration of HMM data for which it is estimated that error in recognition will occur easily during the recognition process.

    摘要翻译: 提供了一种操作者识别装置,其消除了在识别操作者时容易识别出具有识别错误的特征量的HMM数据的登记,从而降低识别错误的可能性,并且具有稳定的识别性能。 当注册执行识别处理时使用的HMM数据时,说话人识别装置100消除了具有与由HMM数据表示的特征量相似的口语语音成分的特征量的密码的HMM数据的注册, 已经注册,并且不允许HMM数据的注册,估计在识别过程中容易发生识别错误。

    Operator recognition device, operator recognition method and operator recognition program
    2.
    发明授权
    Operator recognition device, operator recognition method and operator recognition program 失效
    操作员识别装置,操作员识别方法和操作员识别程序

    公开(公告)号:US07979718B2

    公开(公告)日:2011-07-12

    申请号:US11910415

    申请日:2006-03-24

    IPC分类号: H04K1/00 H04L9/00

    CPC分类号: G10L17/16 G10L17/10

    摘要: An operator recognition device is provided that eliminates the registration of data such as HMM data having a characteristic amount for which error in recognition occurs easily when recognizing an operator, and thus reduces the possibility of errors in recognition, and has stable recognition performance. When registering HMM data that is used when performing recognition processing, a speaker recognition device 100 eliminates the registration of HMM data of a password having a characteristic amount of the spoken voice component that is similar to a characteristic amount that is indicated by HMM data that is already registered, and does not allow the registration of HMM data for which it is estimated that error in recognition will occur easily during the recognition process.

    摘要翻译: 提供了一种操作者识别装置,其消除了在识别操作者时容易识别出具有识别错误的特征量的HMM数据的登记,从而降低识别错误的可能性,并且具有稳定的识别性能。 当注册执行识别处理时使用的HMM数据时,说话人识别装置100消除了具有与由HMM数据表示的特征量相似的口语语音成分的特征量的密码的HMM数据的注册, 已经注册,并且不允许HMM数据的注册,估计在识别过程中容易发生识别错误。

    Voice recognition system
    3.
    发明授权
    Voice recognition system 失效
    语音识别系统

    公开(公告)号:US06937981B2

    公开(公告)日:2005-08-30

    申请号:US09954151

    申请日:2001-09-18

    摘要: A multiplicative distortion Hm(cep) is subtracted from a voice HMM 5, a multiplicative distortion Ha(cep) of the uttered voice is subtracted from a noise HMM 6 formed by HMM, and the subtraction results Sm(cep) and {Nm(cep)−Ha (cep)} are combined with each other to thereby form a combined HMM 18 in the cepstrum domain. A cepstrum R^a(cep) obtained by subtracting the multiplicative distortion Ha (cep) from the cepstrum Ra (cep) of the uttered voice is compared with the distribution R^m(cep) of the combined HMM 18 in the cepstrum domain, and the combined HMM with the maximum likelihood is output as the voice recognition result.

    摘要翻译: 从语音HMM 5中减去乘法失真Hm(cep),从由HMM形成的噪声HMM6中减去所发出的语音的乘法失真Ha(cep),并且减法结果Sm(cep)和{Nm (cep)通过从发射的倒谱中的倒谱谱(cep)中减去乘法失真Ha(cep)而得到的倒谱R ^ a(cep) 将语音与倒谱域中组合HMM18的分布R ^ m(cep)进行比较,并输出具有最大似然性的组合HMM作为语音识别结果。

    Voice recognition system
    4.
    发明申请
    Voice recognition system 审中-公开
    语音识别系统

    公开(公告)号:US20050091053A1

    公开(公告)日:2005-04-28

    申请号:US10995509

    申请日:2004-11-24

    CPC分类号: G10L25/78

    摘要: A trained vector creating part 15 creates a characteristic of an unvoiced sound in advance as a trained vector V. Meanwhile, a threshold value THD for distinguishing a voice from a background sound is created based on a predictive residual power ε of a sound which is created during a non-voice period. As a voice is actually uttered, an inner product computation part 18 calculates an inner product of a feature vector A of an input signal Sa and a trained vector V, and a first threshold value judging part 19 judges that it is a voice section when the inner product has a value which is equal to or larger than a predetermined value θ while a second threshold value judging part 21 judges that it is a voice section when the predictive residual power ε of the input signal Sa is larger than a threshold value THD. As at least one of the first threshold value judging part 19 and the second threshold value judging part 21 judges that it is a voice section, a voice section determining part 300 finally judges that it is a voice section and cuts out an input signal Saf which are in units of frames and corresponds to this voice section as a voice Svc which is to be recognized.

    摘要翻译: 经训练的矢量创建部分15预先创建无声声音的特性作为训练矢量V.同时,基于产生的声音的预测剩余功率ε创建用于区分语音与背景声音的阈值THD 在非语音期间。 由于实际上发出声音,内积计算部18计算输入信号Sa的特征矢量A和训练矢量V的内积,第一阈值判定部19判断为声音部时, 内积具有等于或大于预定值θ的值,而当输入信号Sa的预测残余功率ε大于阈值THD时,第二阈值判断部21判断为语音区。 由于第一阈值判定部19和第二阈值判定部21中的至少一个判断为声音部,所以语音部确定部300最终判断为声音部,切断输入信号Saf, 是以帧为单位,并且对应于该声音部分作为要识别的声音Svc。

    Voice recognition system
    5.
    发明授权
    Voice recognition system 失效
    语音识别系统

    公开(公告)号:US07016837B2

    公开(公告)日:2006-03-21

    申请号:US09953905

    申请日:2001-09-18

    IPC分类号: G10L15/20

    CPC分类号: G10L15/20 G10L15/142

    摘要: An initial combination HMM 16 is generated from a voice HMM 10 having multiplicative distortions and an initial noise HMM of additive noise, and at the same time, a Jacobian matrix J is calculated by a Jacobian matrix calculating section 19. Noise variation Namh (cep), in which an estimated value Ha^(cep) of the multiplicative distortions that are obtained from voice that is actually uttered, additive noise Na(cep) that is obtained in a non-utterance period, and additive noise Nm(cep) of the initial noise HMM 17 are combined, is multiplied by a Jacobian matrix, wherein the result of the multiplication and initial combination HMM 16 are combined, and an adaptive HMM 26 is generated. Thereby, an adaptive HMM 26 that is matched to the observation value series RNah(cep) generated from actual utterance voice can be generated in advance. When performing voice recognition by collating the observation value series RNah(cep) with adaptive HMM 26, influences due to the multiplicative distortions and additive distortions are counterbalanced, wherein an effect that is equivalent to a case where voice recognition is carried out with clean voice can be obtained, and a robust voice recognition system can be achieved.

    摘要翻译: 从具有乘法失真的语音HMM 10和加性噪声的初始噪声HMM生成初始组合HMM 16,同时由雅可比矩阵计算部分19计算雅可比矩阵J.噪声变化Namh(cep) ,其中从实际发出的语音获得的乘法失真的估计值Ha ^(cep),在非话语周期中获得的加性噪声​​Na(cep)和加法噪声Nm(cep) 初始噪声HMM 17被组合,乘以雅可比矩阵,其中乘法和初始组合HMM 16的结果被组合,并且生成自适应HMM 26。 由此,可以预先生成与实际的话语语音产生的观察值序列RNah(cep)相匹配的自适应HMM26。 通过将观测值序列RNah(cep)与自适应HMM26进行比较来进行语音识别时,由于乘法失真和附加失真引起的影响被平衡,其中与用干净的语音执行语音识别的情况相当的效果可以 并且可以实现鲁棒的语音识别系统。

    Noise Suppressing Device, Noise Suppressing Method, Noise Suppressing Program, and Computer Readable Recording Medium
    6.
    发明申请
    Noise Suppressing Device, Noise Suppressing Method, Noise Suppressing Program, and Computer Readable Recording Medium 失效
    噪声抑制装置,噪声抑制方法,噪声抑制程序和计算机可读记录介质

    公开(公告)号:US20080010063A1

    公开(公告)日:2008-01-10

    申请号:US11794130

    申请日:2005-12-01

    申请人: Mitsuya Komamura

    发明人: Mitsuya Komamura

    IPC分类号: G10L21/02

    CPC分类号: G10L21/0208 G10L21/0216

    摘要: A noise suppression apparatus calculates a sound spectrum and a noise spectrum from an input sound, further calculates gain based on the sound spectrum and noise spectrum, and suppresses noise in the input sound. The noise suppression apparatus includes a first frame-dividing unit that divides the input sound into frames having a predetermined frame length, a second frame-dividing unit that divides the input sound into frames having a longer frame length than the frame length of the first frame-dividing unit, a second converting unit that converts, into a spectrum, the input sound divided into frames by the second frame-dividing unit, a smoothing unit that smoothes the converted spectrum in a frequency direction, and a gain calculating unit that calculates gain based on the smoothed spectrum and the noise spectrum.

    摘要翻译: 噪声抑制装置从输入声音计算声谱和噪声谱,进一步根据声谱和噪声谱计算增益,并抑制输入声音中的噪声。 噪声抑制装置包括将输入声音分割为具有预定帧长度的帧的第一帧分割单元,将输入声音分成具有比第一帧的帧长度更长的帧长度的帧的第二帧分割单元 分割单元,第二转换单元,其将通过第二分割单元划分成帧的输入声音转换为频谱,平滑化单元,其对频率方向的转换频谱进行平滑;以及增益计算单元,其计算增益 基于平滑的频谱和噪声谱。

    Digital signal conversion apparatus, digital signal conversion method, and computer-readable recording medium in which digital signal conversion program is recorded
    7.
    发明授权
    Digital signal conversion apparatus, digital signal conversion method, and computer-readable recording medium in which digital signal conversion program is recorded 失效
    数字信号转换装置,数字信号转换方法和记录数字信号转换程序的计算机可读记录介质

    公开(公告)号:US06873275B2

    公开(公告)日:2005-03-29

    申请号:US10465893

    申请日:2003-06-20

    CPC分类号: H03M5/08 H03M1/822

    摘要: The digital signal conversion apparatus comprises: an over-sampling circuit that samples the input digital signal at high frequency; a polarity-inversion circuit that inverts the polarity of the sampled digital signal; interpolation circuit (A) and interpolation circuit (B) that perform interpolation of each respective digital signal; noise-shaping circuit (A) and noise-shaping circuit (B) that perform noise shaping on the interpolated signals; PWM conversion circuit (A) and PWM conversion circuit (B) that perform PWM conversion on the noise-shaped signals; and a switching circuit for driving a load based on the PWM signals from PWM conversion circuit (A) and PWM conversion circuit (B).

    摘要翻译: 数字信号转换装置包括:以高频率对输入数字信号进行采样的过采样电路; 极性反转电路,反转采样数字信号的极性; 内插电路(A)和内插电路(B),对各个数字信号进行插值; 对内插信号进行噪声整形的噪声整形电路(A)和噪声整形电路(B); PWM转换电路(A)和PWM转换电路(B),对噪声信号进行PWM转换; 以及用于基于来自PWM转换电路(A)和PWM转换电路(B)的PWM信号来驱动负载的开关电路。

    Apparatus and methods for noise suppression in sound signals
    8.
    发明授权
    Apparatus and methods for noise suppression in sound signals 失效
    声音信号噪声抑制的装置和方法

    公开(公告)号:US07957964B2

    公开(公告)日:2011-06-07

    申请号:US11794130

    申请日:2005-12-01

    申请人: Mitsuya Komamura

    发明人: Mitsuya Komamura

    IPC分类号: G10L21/02

    CPC分类号: G10L21/0208 G10L21/0216

    摘要: A noise suppression apparatus calculates a sound spectrum and a noise spectrum from an input sound, further calculates gain based on the sound spectrum and noise spectrum, and suppresses noise in the input sound. The noise suppression apparatus includes a first frame-dividing unit that divides the input sound into frames having a predetermined frame length, a second frame-dividing unit that divides the input sound into frames having a longer frame length than the frame length of the first frame-dividing unit, a second converting unit that converts, into a spectrum, the input sound divided into frames by the second frame-dividing unit, a smoothing unit that smoothes the converted spectrum in a frequency direction, and a gain calculating unit that calculates gain based on the smoothed spectrum and the noise spectrum.

    摘要翻译: 噪声抑制装置从输入声音计算声谱和噪声谱,进一步根据声谱和噪声谱计算增益,并抑制输入声音中的噪声。 噪声抑制装置包括将输入声音分割为具有预定帧长度的帧的第一帧分割单元,将输入声音分成具有比第一帧的帧长度更长的帧长度的帧的第二帧分割单元 分割单元,第二转换单元,其将通过第二分割单元划分成帧的输入声音转换为频谱,平滑化单元,其对频率方向的转换频谱进行平滑;以及增益计算单元,其计算增益 基于平滑的频谱和噪声谱。

    Pulse width modulator and pulse width modulation method
    9.
    发明授权
    Pulse width modulator and pulse width modulation method 失效
    脉宽调制器和脉宽调制方法

    公开(公告)号:US07224728B2

    公开(公告)日:2007-05-29

    申请号:US10464485

    申请日:2003-06-19

    申请人: Mitsuya Komamura

    发明人: Mitsuya Komamura

    IPC分类号: H03K7/08

    CPC分类号: H03M5/08

    摘要: A pulse width modulator for producing a PWM signal having reduced nonlinear distortion through interpolation processing with fewer computation steps is provided. The computational processing, W={1+α(X2−X1)}{0.25 (X1+X2)}+0.5, is performed in accordance with successive sample values (X1, X2) in a PCM data train to determine a weighting factor (W). The computational processing, Xq=0.5·X0(W2−W)+X1(1−W2)+0.5·X2(W2+W), is performed using the sample values (X1, X2), a sample value (X0) previous to the sample value (X1), and the weighting factor (W) to thereby determine an interpolated sample value (Xq) having an amplitude close to that of an original analog signal (X(t)) generating the PCM data train. A point in time (tq) at which a reference signal (R(t)) takes on the interpolated sample value (Xq) is then determined to produce a PWM signal (Spwm) which is logically inverted at the point in time (tq).

    摘要翻译: 提供了一种用于通过具有较少计算步骤的内插处理来产生具有减小的非线性失真的PWM信号的脉宽调制器。 计算处理W = {1 +α(X 2 -X 1 -X 1)} {0.25(X 1 + X 2) )} + 0.5,根据PCM数据列中的连续采样值(X 1,X 2> 2)进行,以确定加权因子(W )。 计算处理Xq = 0.5.X 0(W-2 -W)+ X 1(1 -W 2)/ SUP>)+ 0.5XX 2(W 2 SUP + W),使用样本值(X 1,X 2, 2),样本值(X 1> 1)之前的样本值(X <0> 0 )和加权因子(W),从而确定内插 具有接近生成PCM数据列的原始模拟信号(X(t))的幅度的采样值(Xq)。 然后确定参考信号(R(t))获取内插采样值(Xq)的时间点(tq)以产生在时间点(tq)逻辑反相的PWM信号(Spwm) 。

    Band extending apparatus and method
    10.
    发明授权
    Band extending apparatus and method 失效
    带扩展装置和方法

    公开(公告)号:US08144762B2

    公开(公告)日:2012-03-27

    申请号:US12373898

    申请日:2006-07-31

    申请人: Mitsuya Komamura

    发明人: Mitsuya Komamura

    IPC分类号: H04B1/66

    CPC分类号: G10L21/038

    摘要: A band extending apparatus (1) is provided with: first generating device (111, 112) for generating a baseband signal (XB(n)) by up-sampling an input signal (X(n)) and then transmitting it through a low-pass filter; a second generating device (21) for generating a high-frequency signal (XH(n)), by extracting a signal component on a higher-frequency side of a signal which is obtained by squaring a band limited signal (Xb(n)) which is a signal component with a predetermined band of the baseband signal; and a third generating device (141) for generating an output signal (XE(n)) by adding the high-frequency signal to the baseband signal.

    摘要翻译: 带扩展装置(1)具有:通过对输入信号(X(n))进行上采样,然后通过低频发送来产生基带信号(XB(n))的第一生成装置(111,112) 通过滤波器 通过提取通过对频带限制信号(Xb(n))进行平方而获得的信号的高频侧的信号分量,生成高频信号(XH(n))的第二生成装置(21) 其是具有基带信号的预定频带的信号分量; 以及用于通过将高频信号加到基带信号来产生输出信号(XE(n))的第三产生装置(141)。