Mixed audio separation apparatus
    1.
    发明授权
    Mixed audio separation apparatus 有权
    混合音频分离装置

    公开(公告)号:US07974420B2

    公开(公告)日:2011-07-05

    申请号:US11665265

    申请日:2006-04-11

    CPC classification number: G10L21/0272 G10L19/0204

    Abstract: A mixed audio separation system (100) which separates a specific audio from among a mixed audio (S100) includes a local frequency information generation unit (105) which obtains pieces of local frequency information (S103) corresponding to local reference waveforms (S102), based on the local reference waveforms (S102) and an analysis waveform which is the waveform of the mixed audio (S100). Each of the local reference waveforms (S102) (i) constitutes a part of a reference waveform for analyzing a predetermined frequency, (ii) has a predetermined temporal/spatial resolution and (iii) includes at least one of an amplification spectrum and a phase spectrum in the predetermined frequency. The system includes: a specific audio's frequency feature value extraction unit (106) which performs pattern matching between a first set which is the pieces of local frequency information and a second set of pieces of frequency information (S103) of a predetermined specific audio, and extracts the first set of the pieces of local frequency information (S103), based on a result of the pattern matching; and an audio signal generation unit which generates a signal of the specific audio, based on the first set of the pieces of local frequency information (S103) extracted by the specific audio's frequency feature value extraction unit.

    Abstract translation: 从混合音频(S100)中分离特定音频的混合音频分离系统(100)包括本地频率信息生成单元(105),其获取与局部参考波形相对应的本地频率信息(S103)(S102), 基于本地参考波形(S102)和作为混合音频的波形的分析波形(S100)。 每个局部参考波形(S102)(i)构成用于分析预定频率的参考波形的一部分,(ii)具有预定的时间/空间分辨率,以及(iii)包括放大光谱和相位 频谱在预定频率。 该系统包括:特定音频频率特征值提取单元(106),其执行作为本地频率信息的第一组与预定特定音频的第二组频率信息(S103)之间的模式匹配,以及 基于模式匹配的结果提取第一组本地频率信息(S103); 以及音频信号生成单元,其基于由特定音频的频率特征值提取单元提取的第一组本地频率信息(S103),生成特定音频的信号。

    AUDIO SOURCE DIRECTION DETECTING DEVICE
    2.
    发明申请
    AUDIO SOURCE DIRECTION DETECTING DEVICE 有权
    音频源方向检测装置

    公开(公告)号:US20100303254A1

    公开(公告)日:2010-12-02

    申请号:US12446499

    申请日:2008-09-10

    CPC classification number: G01S3/8083

    Abstract: A sound source direction detector comprises FFT analysis sections (103(1) to 103(3)) for generating a frequency spectrum in at least one frequency band of acoustic signals for each of the acoustic signals collected by two or more microphones arranged apart from one another, detection sound identifying sections (104(1) to 104(3)) for identifying a time portion of the frequency spectrum of a detection sound which obtains a sound source direction from the frequency spectrum in the frequency band, and a direction detecting section (105) for obtaining the difference between the times at which the detection sound reaches the microphones, obtaining the sound source direction from the time difference, the distance between the microphones, and the sound velocity, and outputting it depending on the degree of coincidence between the microphones of the frequency spectrum in the time portion identified by the detection sound identifying sections (104(1) to 104(3)) in a time interval which is the time unit to detect the sound source direction.

    Abstract translation: 声源方向检测器包括FFT分析部分(103(1)至103(3)),用于在由两个或多个麦克风分离的每个声信号收集的声信号的至少一个频带中产生频谱 另一个用于识别从频带中的频谱获得声源方向的检测声音的频谱的时间部分的检测声音识别部分(104(1)至104(3)),以及方向检测部分 (105),用于获得检测声到达麦克风的时间之间的差异,从时差获得声源方向,麦克风之间的距离和声速,并根据其中的一致程度输出 由检测声音识别部分(104(1)至104(3))识别的时间部分中的频谱的麦克风在时间间隔内为ti 我单位来检测声源方向。

    Audio restoration apparatus and audio restoration method
    3.
    发明申请
    Audio restoration apparatus and audio restoration method 有权
    音频恢复装置和音频恢复方法

    公开(公告)号:US20060193671A1

    公开(公告)日:2006-08-31

    申请号:US11401263

    申请日:2006-04-11

    CPC classification number: G10L19/005 G10L21/0208

    Abstract: An audio restoration apparatus which restores an audio to be restored having a missing audio part and being included in a mixed audio. The audio restoration apparatus includes: a mixed audio separation unit which extracts the audio to be restored included in the mixed audio; an audio structure analysis unit which generates at least one of a phoneme sequence, a character sequence and a musical note sequence of the missing audio part in the extracted audio to be restored, based on an audio structure knowledge database in which semantics of audio are registered; an unchanged audio characteristic domain analysis unit which segments the extracted audio to be restored into time domains in each of which an audio characteristic remains unchanged; an audio characteristic extraction unit which identifies a time domain where the missing audio part is located, from among the segmented time domains, and extract audio characteristics of the identified time domain in the audio to be restored; and an audio restoration unit which restores the missing audio part in the audio to be restored, using the extracted audio characteristics and the generated one or more of phoneme sequence, character sequence and musical note sequence.

    Abstract translation: 一种音频恢复装置,其恢复具有丢失的音频部分并被包括在混合音频中的要恢复的音频。 音频恢复装置包括:混合音频分离单元,其提取包含在混合音频中的要恢复的音频; 音频结构分析单元,其基于音频结构知识数据库中生成音频结构知识数据库中的音素序列,字符序列和所提取的要恢复的音频中的音符序列中的至少一个。 ; 一个不变的音频特征域分析单元,其将提取的音频分段成恢复到每个音频特性保持不变的时域; 音频特征提取单元,从分段时域中识别缺失音频部分所在的时域,并且提取要恢复的音频中所识别的时域的音频特征; 以及音频恢复单元,其使用所提取的音频特性和所生成的一个或多个音素序列,字符序列和音符序列来恢复要恢复的音频中的丢失音频部分。

    Speech recognition apparatus and speech recognition method
    4.
    发明申请
    Speech recognition apparatus and speech recognition method 有权
    语音识别装置和语音识别方法

    公开(公告)号:US20060100876A1

    公开(公告)日:2006-05-11

    申请号:US11296268

    申请日:2005-12-08

    CPC classification number: G10L15/32 G10L15/183

    Abstract: To provide a speech recognition apparatus which appropriately performs speech recognition by generating, in real time, language models adapted to a new topic even in the case where topics are changed. The speech recognition apparatus includes: a word specification unit for obtaining and specifying a word; a language model information storage unit for storing language models for recognizing speech and the respectively corresponding pieces of tag information; a combination coefficient calculation unit for calculating the weights of the respective language models, as combination coefficients, according to the word obtained by the word specification unit, based on the relevance degree between the word obtained by the word specification unit and the tag information of each language model; a language probability calculation unit for calculating the probabilities of word appearance by combining the respective language models according to the calculated combination coefficients; and a speech recognition unit for recognizing speech using the calculated probabilities of word appearance.

    Abstract translation: 为了提供一种语音识别装置,通过即使在主题改变的情况下实时地生成适应于新主题的语言模型来适当地执行语音识别。 语音识别装置包括:字指定单元,用于获取并指定单词; 语言模型信息存储单元,用于存储用于识别语音的语言模型和分别对应的标签信息; 组合系数计算单元,用于根据由单词指定单元获得的单词,根据由单词指定单元获得的单词与每个单词指定单元的标签信息之间的相关度计算各语言模型的权重作为组合系数 语言模型; 语言概率计算单元,用于通过根据所计算的组合系数组合各个语言模型来计算出词概率; 以及语音识别单元,用于使用计算出的单词外观的概率来识别语音。

    SOUND SOURCE LOCALIZATION DEVICE
    5.
    发明申请
    SOUND SOURCE LOCALIZATION DEVICE 有权
    声源本地化设备

    公开(公告)号:US20090285409A1

    公开(公告)日:2009-11-19

    申请号:US12094724

    申请日:2007-11-06

    Abstract: Provided is a sound source localization device which can detect a source location of an extraction sound, including at least two microphones; an analysis unit (103) which (i) analyze frequencies of the mixed sound including the noise and received by each microphone, and (ii) generates frequency signals; and an extraction unit (105) which, for each source location candidate, (a) adjusts time axes of the frequency signals corresponding to the microphones, so that there is no time difference between when the mixed sound reaches one microphone from the source location candidate and when the mixed sound reaches another microphone from the source location candidate, and (b) determines frequency signals having a difference distance equal to or smaller than a threshold value, from among the frequency signals corresponding to the microphones with the time axis having been adjusted, the difference distance representing a degree of a difference in the frequency signals between the microphones, and (c) extracts the source location of the extraction sound from among the source location candidates, in accordance with a degree of matching of the determined frequency signals between the microphones.

    Abstract translation: 提供了一种声源定位装置,其可以检测包括至少两个麦克风的提取声音的源位置; 分析单元(103),其(i)分析包括噪声并由每个麦克风接收的混合声音的频率,以及(ii)产生频率信号; 以及提取单元(105),其针对每个源位置候选,(a)调整与所述麦克风相对应的频率信号的时间轴,使得当所述混合声音从所述源位置候选者到达一个麦克风时不存在时间差 并且当所述混合声音从所述源位置候选者到达另一个麦克风时,以及(b)从对应于所述时间轴已被调整的麦克风的频率信号中确定具有等于或小于阈值的差距的频率信号 ,所述差距表示所述麦克风之间的频率信号的差异程度,以及(c)根据所确定的频率信号的匹配程度,从所述源位置候选中提取所述提取声音的源位置, 麦克风。

    Sound identification apparatus
    6.
    发明授权
    Sound identification apparatus 有权
    声音识别装置

    公开(公告)号:US07473838B2

    公开(公告)日:2009-01-06

    申请号:US11783376

    申请日:2007-04-09

    CPC classification number: G10L25/48

    Abstract: A sound identification apparatus which reduces the chance of a drop in the identification rate, including: a frame sound feature extraction unit which extracts a sound feature per frame of an inputted audio signal; a frame likelihood calculation unit which calculates a frame likelihood of the sound feature in each frame, for each of a plurality of sound models; a confidence measure judgment unit which judges a confidence measure based on the frame likelihood; a cumulative likelihood output unit time determination unit which determines a cumulative likelihood output unit time based on the confidence measure; a cumulative likelihood calculation unit which calculates a cumulative likelihood in which the frame likelihoods of the frames included in the cumulative likelihood output unit time are cumulated, for each sound model; a sound type candidate judgment unit which determines, for each cumulative likelihood output unit time, a sound type corresponding to the sound model that has a maximum cumulative likelihood; a sound type frequency calculation unit which calculates the frequency of the sound type candidate; and a sound type interval determination unit which determines the sound type of the inputted audio signal and the interval of the sound type, based on the frequency of the sound type.

    Abstract translation: 一种声音识别装置,其减少识别率下降的可能性,包括:帧声音特征提取单元,其提取每帧输入的音频信号的声音特征; 帧似然计算单元,对于多个声音模型中的每一个,计算每个帧中的声音特征的帧似然度; 置信度判断单元,其基于所述帧可能性判断置信度量; 累积似然度输出单元时间确定单元,其基于所述置信度测量来确定累积似然度输出单位时间; 对于每个声音模型,计算累积似然输出单元时间中包括的帧的帧似然性的累积似然度的累积似然度计算单元; 声音候选判定单元,对于每个累积似然度输出单位时间,确定与具有最大累积似然性的声音模型对应的声音类型; 声音型频率计算单元,其计算声音类型候选的频率; 以及声音类型间隔确定单元,其基于声音类型的频率来确定输入的音频信号的声音类型和声音类型的间隔。

    Target sound analysis apparatus, target sound analysis method and target sound analysis program
    7.
    发明申请
    Target sound analysis apparatus, target sound analysis method and target sound analysis program 有权
    目标声音分析仪器,目标声音分析方法和目标声音分析程序

    公开(公告)号:US20080304672A1

    公开(公告)日:2008-12-11

    申请号:US11902731

    申请日:2007-09-25

    CPC classification number: G10L25/48 G08G1/017 G10L21/028 G10L25/90

    Abstract: A target sound analysis apparatus capable of distinguishing between a sound having the same fundamental period as a target sound but which differs therefrom and the target sound and analyzing whether or not the target sound is contained in an evaluation sound is an target sound analysis apparatus that analyzes whether or not a target sound is included in an evaluation sound, and includes: a target sound preparation unit that prepares a target sound that is an analysis waveform to be used for analyzing a fundamental period; an evaluation sound preparation unit that prepares an evaluation sound that is an analyzed waveform in which its fundamental period will be analyzed; and an analysis unit that temporally shifts the target sound with respect to the evaluation sound to sequentially calculate differential values of the evaluation sound and the target sound at corresponding points in time, calculate an iterative interval between the points in time where the differential value is equal to or lower than a predetermined threshold value, and judge whether or not the target sound exists in the evaluation sound based on a period of the iterative interval and the fundamental period of the target sound.

    Abstract translation: 能够区别具有与目标声音相同但与之不同的基准周期的声音与目标声音并且分析目标声音是否包含在评价声音中的目标声音分析装置是分析 无论目标声音是否被包括在评价声音中,并且包括:目标声音准备单元,其准备作为用于分析基本周期的分析波形的目标声音; 评估声音准备单元,其准备作为其分析基本周期的分析波形的评价声音; 以及分析单元,其相对于所述评价声音暂时移动所述目标声音,以在相应的时间点顺序地计算所述评价声音和所述目标声音的差分值,计算所述差分值相等的时间点之间的迭代间隔 达到或低于预定阈值,并且基于迭代间隔的周期和目标声音的基本周期来判断评估声中是否存在目标声音。

    Vehicle-in-blind-spot detecting apparatus and method thereof
    8.
    发明授权
    Vehicle-in-blind-spot detecting apparatus and method thereof 有权
    车内盲点检测装置及其方法

    公开(公告)号:US08525654B2

    公开(公告)日:2013-09-03

    申请号:US12774060

    申请日:2010-05-05

    CPC classification number: G01S3/802 G01S3/801 G08G1/16 G08G1/166 G08G1/167

    Abstract: A vehicle-in-blind-spot detecting apparatus detects a vehicle positioned in a blind spot by mounting the apparatus on an operator's vehicle. The vehicle-in-blind-spot includes a presenting unit which presents information; at least one microphone which detects a sound; a vehicle sound extracting unit which extracts a vehicle sound from the sound detected by the microphone; and a sound source direction detecting unit which detects a sound source direction of the vehicle sound extracted by the vehicle sound extracting unit. A vehicle-in-blind-spot determining unit causes the presenting unit to present the information indicating that a vehicle is found in a blind spot in the case where the sound source direction of the vehicle sound detected by the sound source direction detecting unit is a first direction representing above the vehicle-in-blind-spot detecting apparatus with respect to a ground.

    Abstract translation: 车内盲点检测装置通过将设备安装在驾驶员车辆上来检测位于盲点的车辆。 车内盲点包括提供信息的呈现单元; 至少一个麦克风,其检测声音; 车辆声音提取单元,从由麦克风检测到的声音中提取车辆声音; 以及声源方向检测单元,其检测由车辆声音提取单元提取的车辆声音的声源方向。 在车内盲点确定单元使得呈现单元在由声源方向检测单元检测到的车辆声音的声源方向为a的情况下,呈现指示车辆被发现在盲点中的信息 第一方向表示相对于地面的车载盲点检测装置。

    Target sound analysis apparatus, target sound analysis method and target sound analysis program
    9.
    发明授权
    Target sound analysis apparatus, target sound analysis method and target sound analysis program 有权
    目标声音分析仪器,目标声音分析方法和目标声音分析程序

    公开(公告)号:US08223978B2

    公开(公告)日:2012-07-17

    申请号:US11902731

    申请日:2007-09-25

    CPC classification number: G10L25/48 G08G1/017 G10L21/028 G10L25/90

    Abstract: A target sound analysis apparatus capable of distinguishing between a sound having the same fundamental period as a target sound but which differs therefrom and the target sound and analyzing whether or not the target sound is contained in an evaluation sound is an target sound analysis apparatus that analyzes whether or not a target sound is included in an evaluation sound, and includes: a target sound preparation unit that prepares a target sound that is an analysis waveform to be used for analyzing a fundamental period; an evaluation sound preparation unit that prepares an evaluation sound that is an analyzed waveform in which its fundamental period will be analyzed; and an analysis unit that temporally shifts the target sound with respect to the evaluation sound to sequentially calculate differential values of the evaluation sound and the target sound at corresponding points in time, calculate an iterative interval between the points in time where the differential value is equal to or lower than a predetermined threshold value, and judge whether or not the target sound exists in the evaluation sound based on a period of the iterative interval and the fundamental period of the target sound.

    Abstract translation: 能够区别具有与目标声音相同但与之不同的基准周期的声音与目标声音并且分析目标声音是否包含在评价声音中的目标声音分析装置是分析 无论目标声音是否被包括在评价声音中,并且包括:目标声音准备单元,其准备作为用于分析基本周期的分析波形的目标声音; 评估声音准备单元,其准备作为其分析基本周期的分析波形的评价声音; 以及分析单元,其相对于所述评价声音暂时移动所述目标声音,以在相应的时间点顺序地计算所述评价声音和所述目标声音的差分值,计算所述差分值相等的时间点之间的迭代间隔 达到或低于预定阈值,并且基于迭代间隔的周期和目标声音的基本周期来判断评估声中是否存在目标声音。

    Sound source position detector
    10.
    发明授权
    Sound source position detector 有权
    声源位置检测器

    公开(公告)号:US08184827B2

    公开(公告)日:2012-05-22

    申请号:US12094724

    申请日:2007-11-06

    Abstract: Provided is a sound source localization device which can detect a source location of an extraction sound, including at least two microphones; an analysis unit (103) which (i) analyze frequencies of the mixed sound including the noise and received by each microphone, and (ii) generates frequency signals; and an extraction unit (105) which, for each source location candidate, (a) adjusts time axes of the frequency signals corresponding to the microphones, so that there is no time difference between when the mixed sound reaches one microphone from the source location candidate and when the mixed sound reaches another microphone from the source location candidate, and (b) determines frequency signals having a difference distance equal to or smaller than a threshold value, from among the frequency signals corresponding to the microphones with the time axis having been adjusted, the difference distance representing a degree of a difference in the frequency signals between the microphones, and (c) extracts the source location of the extraction sound from among the source location candidates, in accordance with a degree of matching of the determined frequency signals between the microphones.

    Abstract translation: 提供了一种声源定位装置,其可以检测包括至少两个麦克风的提取声音的源位置; 分析单元(103),其(i)分析包括噪声并由每个麦克风接收的混合声音的频率,以及(ii)产生频率信号; 以及提取单元(105),其针对每个源位置候选,(a)调整与所述麦克风相对应的频率信号的时间轴,使得当所述混合声音从所述源位置候选者到达一个麦克风时不存在时间差 并且当所述混合声音从所述源位置候选者到达另一个麦克风时,以及(b)从对应于所述时间轴已被调整的麦克风的频率信号中确定具有等于或小于阈值的差距的频率信号 ,所述差距表示所述麦克风之间的频率信号的差异程度,以及(c)根据所确定的频率信号的匹配程度,从所述源位置候选中提取所述提取声音的源位置, 麦克风。

Patent Agency Ranking