-
Publication No.: US10375473B2
Publication date: 2019-08-06
Application No.: US15270460
Filing date: 2016-09-20
Applicant: Vocollect, Inc.
Inventor: Sean Nickel, Dale McGary, Matthew Aaron Nichols, Michael Kloehn
IPC: G10L19/00, H04R3/00, G10L15/20, G10L21/0216, G10L21/0264, H04R29/00, G10L21/0208
Abstract: A device, system, and method whereby a speech-driven system used in an industrial environment distinguishes speech obtained from users of the system from other background sounds. In one aspect, the present system and method provides for a first audio stream from a user microphone collocated with a source of human speech (that is, a user) and a second audio stream from an environmental microphone which is proximate to the source of human speech but more remote than the user microphone. The audio signals from the two microphones are asynchronous. A processor is configured to identify a common, distinctive sound event in the environment, such as an impulse sound or a periodic sound signal. Based on the common sound event, the processor provides for synchronization of the two audio signals. In another aspect, the present system and method provides for a determination of whether or not the sound received at the user microphone is suitable for identification of words in a human voice, based on a comparison of sound elements in the first audio stream and the second audio stream, for example based on a comparison of the sound intensities of the sound elements in the audio streams.
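The synchronization step described in the abstract can be illustrated with a minimal sketch. It is not the patented method: it assumes the shared sound event is a single impulse that appears as the loudest sample in both streams, and the names `impulse_index`, `estimate_offset`, and `align` are hypothetical.

```python
def impulse_index(samples):
    """Index of the loudest sample, taken here as the shared impulse event (assumption)."""
    return max(range(len(samples)), key=lambda i: abs(samples[i]))


def estimate_offset(user, env):
    """Number of samples by which the environmental stream lags the user stream."""
    return impulse_index(env) - impulse_index(user)


def align(user, env):
    """Trim the leading samples of the later stream so the impulse lines up in both."""
    offset = estimate_offset(user, env)
    if offset >= 0:
        env = env[offset:]
    else:
        user = user[-offset:]
    n = min(len(user), len(env))
    return user[:n], env[:n]


# Toy streams: the same impulse arrives two samples later on the environmental microphone.
user = [0.0, 0.1, 0.9, 0.1, 0.0, 0.2, 0.0]
env = [0.0, 0.0, 0.0, 0.05, 0.6, 0.05, 0.0, 0.1]
user_aligned, env_aligned = align(user, env)
```

A real system would likely need a more robust event detector, for example cross-correlation around the impulse, since the loudest sample in each stream is not guaranteed to belong to the same event.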
-
Publication No.: US20180082702A1
Publication date: 2018-03-22
Application No.: US15270460
Filing date: 2016-09-20
Applicant: Vocollect, Inc.
Inventor: Sean Nickel, Dale McGary, Matthew Aaron Nichols, Michael Kloehn
IPC: G10L21/0308, G10L21/0224
CPC classification number: H04R3/005, G10L15/20, G10L21/0208, G10L21/0264, G10L25/06, G10L25/51, G10L2021/02161, G10L2021/02165, H04R5/033, H04R29/005, H04R2201/107, H04R2420/07
Abstract: A device, system, and method whereby a speech-driven system used in an industrial environment distinguishes speech obtained from users of the system from other background sounds. In one aspect, the present system and method provides for a first audio stream from a user microphone collocated with a source of human speech (that is, a user) and a second audio stream from an environmental microphone which is proximate to the source of human speech but more remote than the user microphone. The audio signals from the two microphones are asynchronous. A processor is configured to identify a common, distinctive sound event in the environment, such as an impulse sound or a periodic sound signal. Based on the common sound event, the processor provides for synchronization of the two audio signals. In another aspect, the present system and method provides for a determination of whether or not the sound received at the user microphone is suitable for identification of words in a human voice, based on a comparison of sound elements in the first audio stream and the second audio stream, for example based on a comparison of the sound intensities of the sound elements in the audio streams.
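The intensity comparison mentioned at the end of the abstract can be sketched as follows. This is an illustrative assumption, not the claimed procedure: it treats a frame as suitable for word recognition when the user microphone is louder than the environmental microphone by some margin, on the reasoning that the user's own speech arrives stronger at the closer microphone. The function names and the 6 dB default margin are hypothetical, and the frames are assumed to be already time-aligned.

```python
import math


def rms(frame):
    """Root-mean-square level of one frame of samples."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def level_difference_db(user_frame, env_frame, floor=1e-12):
    """User-microphone level minus environmental-microphone level, in decibels."""
    return 20.0 * math.log10(max(rms(user_frame), floor) / max(rms(env_frame), floor))


def is_suitable_for_recognition(user_frame, env_frame, min_margin_db=6.0):
    """True when the user microphone is louder by at least min_margin_db (assumed threshold)."""
    return level_difference_db(user_frame, env_frame) >= min_margin_db


# Speech-like case: much stronger at the user microphone than at the environmental one.
speech_user = [0.5, -0.5, 0.5, -0.5]
speech_env = [0.05, -0.05, 0.05, -0.05]

# Background-like case: equally loud at both microphones.
noise_user = [0.2, -0.2, 0.2, -0.2]
noise_env = [0.2, -0.2, 0.2, -0.2]
```

With these toy frames the speech-like pair passes the check and the background-like pair does not; an appropriate margin for real hardware would have to be determined empirically.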
-