-
Publication No.: US08428945B2
Publication date: 2013-04-23
Application No.: US13105621
Filing date: 2011-05-11
Applicant: Pierre Zakarauskas
Inventor: Pierre Zakarauskas
IPC classification: G10L15/20
Abstract: A system classifies the source of an input signal. The system determines whether a sound source belongs to classes that may include human speech, musical instruments, machine noise, or other classes of sound sources. The system is robust, performing classification despite variation in sound level and noise masking. Additionally, the system consumes relatively few computational resources and adapts over time to provide consistently accurate classification.
-
Publication No.: US07957967B2
Publication date: 2011-06-07
Application No.: US11540153
Filing date: 2006-09-29
Applicant: Pierre Zakarauskas
Inventor: Pierre Zakarauskas
IPC classification: G10L15/20
Abstract: A system classifies the source of an input signal. The system determines whether a sound source belongs to classes that may include human speech, musical instruments, machine noise, or other classes of sound sources. The system is robust, performing classification despite variation in sound level and noise masking. Additionally, the system consumes relatively few computational resources and adapts over time to provide consistently accurate classification.
-
Publication No.: US07885420B2
Publication date: 2011-02-08
Application No.: US10410736
Filing date: 2003-04-10
Applicants: Phil Hetherington, Xueman Li, Pierre Zakarauskas
Inventors: Phil Hetherington, Xueman Li, Pierre Zakarauskas
CPC classification: G10L21/0232, G10L21/0208, G10L21/0264, G10L2021/02163, H04R2410/07
Abstract: The invention includes a method, apparatus, and computer program to selectively suppress wind noise while preserving narrow-band signals in acoustic data. Sound from one or several microphones is digitized into binary data. A time-frequency transform is applied to the data to produce a series of spectra. The spectra are analyzed to detect the presence of wind noise and narrow band signals. Wind noise is selectively suppressed while preserving the narrow band signals. The narrow band signal is interpolated through the times and frequencies when it is masked by the wind noise. A time series is then synthesized from the signal spectral estimate that can be listened to. This invention overcomes prior art limitations that require more than one microphone and an independent measurement of wind speed. Its application results in good-quality speech from data severely degraded by wind noise.
-
Publication No.: US09020818B2
Publication date: 2015-04-28
Application No.: US13589977
Filing date: 2012-08-20
IPC classification: G10L15/00, G10L15/14, G10L15/26, G10L21/00, G10L21/02, H04R25/00, G10L25/15, G10L25/75
CPC classification: G10L19/012, G10L19/0017, G10L21/02, G10L25/15, G10L25/75, G10L2019/0007, H04R25/00
Abstract: Implementations of systems, methods and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or the like. In particular, in some implementations, systems, methods and devices are operable to generate a machine readable formant-based codebook. In some implementations, the method includes determining whether or not a candidate codebook tuple includes a sufficient amount of new information to warrant either adding the candidate codebook tuple to the codebook or using at least a portion of the candidate codebook tuple to update an existing codebook tuple. Additionally and/or alternatively, in some implementations, systems, methods and devices are operable to reconstruct a target voice signal by detecting formants in an audible signal, using the detected formants to select codebook tuples, and using the formant information in the selected codebook tuples to reconstruct the target voice signal.
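The add-or-update decision described in the abstract of US09020818B2 can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: a tuple is modeled as a list of formant frequencies in Hz, "new information" is measured as Euclidean distance to the nearest existing tuple, and the names `update_codebook`, `add_threshold`, and `rate` are hypothetical.

```python
import math


def update_codebook(codebook, candidate, add_threshold=100.0, rate=0.1):
    """Add `candidate` as a new tuple, or blend it into the nearest one.

    Assumption: distance to the nearest existing tuple stands in for
    "amount of new information"; the patent does not specify this measure.
    """
    if not codebook:
        codebook.append(list(candidate))
        return "added"

    # Distance from the candidate to every existing tuple.
    distances = [math.dist(entry, candidate) for entry in codebook]
    nearest = min(range(len(codebook)), key=distances.__getitem__)

    if distances[nearest] > add_threshold:
        # Far from everything known: treat as sufficiently new.
        codebook.append(list(candidate))
        return "added"

    # Close to an existing tuple: nudge that tuple toward the candidate.
    entry = codebook[nearest]
    for i, value in enumerate(candidate):
        entry[i] += rate * (value - entry[i])
    return "updated"
```

With these assumed defaults, a candidate within 100 Hz of an existing tuple refines that tuple, while a more distant candidate becomes a new codebook entry.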
-
Publication No.: US20130231932A1
Publication date: 2013-09-05
Application No.: US13590022
Filing date: 2012-08-20
IPC classification: G10L15/00
Abstract: Implementations include systems, methods and/or devices operable to detect voice activity in an audible signal by detecting glottal pulses. The dominant frequency of a series of glottal pulses is perceived as the intonation pattern or melody of natural speech, which is also referred to as the pitch. However, as noted above, spoken communication typically occurs in the presence of noise and/or other interference. In turn, the undulation of voiced speech is masked in some portions of the frequency spectrum associated with human speech by the noise and/or other interference. In some implementations, detection of voice activity is facilitated by dividing the frequency spectrum associated with human speech into multiple sub-bands in order to identify glottal pulses that dominate the noise and/or other interference in particular sub-bands. Additionally and/or alternatively, in some implementations the analysis is furthered to provide a pitch estimate of the detected voice activity.
-
Publication No.: US20110213612A1
Publication date: 2011-09-01
Application No.: US13105621
Filing date: 2011-05-11
Applicant: Pierre Zakarauskas
Inventor: Pierre Zakarauskas
IPC classification: G10L15/20
Abstract: A system classifies the source of an input signal. The system determines whether a sound source belongs to classes that may include human speech, musical instruments, machine noise, or other classes of sound sources. The system is robust, performing classification despite variation in sound level and noise masking. Additionally, the system consumes relatively few computational resources and adapts over time to provide consistently accurate classification.
-
Publication No.: US20110123044A1
Publication date: 2011-05-26
Application No.: US13013358
Filing date: 2011-01-25
Applicants: Phil Hetherington, Xueman Li, Pierre Zakarauskas
Inventors: Phil Hetherington, Xueman Li, Pierre Zakarauskas
IPC classification: H04B15/00
CPC classification: G10L21/0232, G10L21/0208, G10L21/0264, G10L2021/02163, H04R2410/07
Abstract: The invention includes a method, apparatus, and computer program to selectively suppress wind noise while preserving narrow-band signals in acoustic data. Sound from one or several microphones is digitized into binary data. A time-frequency transform is applied to the data to produce a series of spectra. The spectra are analyzed to detect the presence of wind noise and narrow band signals. Wind noise is selectively suppressed while preserving the narrow band signals. The narrow band signal is interpolated through the times and frequencies when it is masked by the wind noise. A time series is then synthesized from the signal spectral estimate that can be listened to. This invention overcomes prior art limitations that require more than one microphone and an independent measurement of wind speed. Its application results in good-quality speech from data severely degraded by wind noise.
-
Publication No.: US07895036B2
Publication date: 2011-02-22
Application No.: US10688802
Filing date: 2003-10-16
IPC classification: G10L21/02
CPC classification: G10L21/0208, G10L21/0232
Abstract: A voice enhancement logic improves the perceptual quality of a processed voice. The voice enhancement system includes a noise detector and a noise attenuator. The noise detector detects a wind buffet and a continuous noise by modeling the wind buffet. The noise attenuator dampens the wind buffet to improve the intelligibility of an unvoiced, a fully voiced, or a mixed voice segment.
-
Publication No.: US07231347B2
Publication date: 2007-06-12
Application No.: US11136829
Filing date: 2005-05-24
Applicant: Pierre Zakarauskas
Inventor: Pierre Zakarauskas
IPC classification: G10L15/20
CPC classification: G10L21/0208, G10L21/0232, G10L21/0264, H04R2225/43
Abstract: A signal enhancement system improves the quality of a noisy input signal. The system finds a low noise signal model which best matches the noisy input signal. Noisy portions of the input signal are replaced with portions of the low noise signal models. As the input signal increases in noise content, the output signal includes an increasing amount of the low noise signal model. The system thereby produces an output signal with very low noise which corresponds to the input signal.
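The noise-dependent substitution described in the abstract of US07231347B2 can be illustrated with a minimal sketch. The linear crossfade, the SNR-based weight, and the names `enhance`, `snr_db`, `low_db`, and `high_db` are all assumptions made for illustration; the patent abstract only states that the output includes more of the low-noise model as the input grows noisier.

```python
def enhance(noisy, model, snr_db, low_db=0.0, high_db=20.0):
    """Blend a noisy frame with its best-matching low-noise model frame.

    Assumption: the model weight falls linearly from 1 at `low_db`
    (very noisy input) to 0 at `high_db` (clean input).
    """
    weight = (high_db - snr_db) / (high_db - low_db)
    weight = min(1.0, max(0.0, weight))  # clamp to [0, 1]
    return [(1.0 - weight) * x + weight * m for x, m in zip(noisy, model)]
```

Under these assumed thresholds, a frame at 20 dB SNR or better passes through unchanged, while a frame at 0 dB or worse is replaced entirely by the model.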
-
Publication No.: US09373340B2
Publication date: 2016-06-21
Application No.: US13013358
Filing date: 2011-01-25
Applicants: Phil Hetherington, Xueman Li, Pierre Zakarauskas
Inventors: Phil Hetherington, Xueman Li, Pierre Zakarauskas
IPC classification: G10L21/02, H04B15/00, G10L21/0208, G10L21/0216, G10L21/0232
CPC classification: G10L21/0232, G10L21/0208, G10L21/0264, G10L2021/02163, H04R2410/07
Abstract: The invention includes a method, apparatus, and computer program to selectively suppress wind noise while preserving narrow-band signals in acoustic data. Sound from one or several microphones is digitized into binary data. A time-frequency transform is applied to the data to produce a series of spectra. The spectra are analyzed to detect the presence of wind noise and narrow band signals. Wind noise is selectively suppressed while preserving the narrow band signals. The narrow band signal is interpolated through the times and frequencies when it is masked by the wind noise. A time series is then synthesized from the signal spectral estimate that can be listened to. This invention overcomes prior art limitations that require more than one microphone and an independent measurement of wind speed. Its application results in good-quality speech from data severely degraded by wind noise.