-
Publication No.: US20240363105A1
Publication date: 2024-10-31
Application No.: US18290574
Filing date: 2022-05-13
Applicant: Boris Fridman-Mintz
Inventor: Boris Fridman-Mintz
IPC classification: G10L15/187 , G10L15/20 , G10L21/0232 , G10L21/0264 , G10L25/78
CPC classification: G10L15/187 , G10L15/20 , G10L21/0232 , G10L21/0264 , G10L25/78 , G10L2025/783
Abstract: Within each harmonic spectrum of a sequence of spectra derived from analysis of a waveform representing human speech, two or more fundamental or harmonic components are identified whose frequencies are separated by integer multiples of a fundamental acoustic frequency. The highest harmonic frequency that is also greater than 410 Hz is a primary cap frequency, which is used to select a primary phonetic note that corresponds to a subset of phonetic chords from a set of phonetic chords for which acoustic spectral data is available. The spectral data can also include frequencies for primary band, secondary band (or secondary note), basal band, or reduced basal band acoustic components, which can be used to select a phonetic chord from the subset of phonetic chords corresponding to the selected primary note.
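The cap-frequency rule in this abstract can be illustrated with a minimal sketch. It assumes the spectral peaks and the fundamental frequency have already been estimated; the function name `primary_cap_frequency` and the matching tolerance `tolerance_hz` are hypothetical and do not come from the patent. Only the 410 Hz threshold and the requirement of two or more fundamental or harmonic components are taken from the abstract.

```python
from typing import Optional, Sequence

CAP_FLOOR_HZ = 410.0  # threshold stated in the abstract


def primary_cap_frequency(
    peaks_hz: Sequence[float],
    f0_hz: float,
    tolerance_hz: float = 5.0,  # assumed matching tolerance, not from the source
) -> Optional[float]:
    """Return the highest peak that is a harmonic of f0_hz and exceeds 410 Hz.

    Returns None when fewer than two fundamental/harmonic components are
    found, or when no harmonic lies above the 410 Hz floor.
    """
    harmonics = []
    for peak in peaks_hz:
        multiple = round(peak / f0_hz)
        # Keep peaks that sit close to an integer multiple of the fundamental.
        if multiple >= 1 and abs(peak - multiple * f0_hz) <= tolerance_hz:
            harmonics.append(peak)
    # The abstract calls for two or more fundamental or harmonic components.
    if len(harmonics) < 2:
        return None
    candidates = [h for h in harmonics if h > CAP_FLOOR_HZ]
    return max(candidates) if candidates else None
```

With peaks at 120, 240, 363, 480, and 610 Hz and a 120 Hz fundamental, the 610 Hz peak is rejected as off-harmonic under the assumed tolerance, so the sketch returns 480 Hz. How the resulting cap frequency maps to a primary phonetic note is not described in the abstract and is left out here.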
-
Publication No.: US12104815B2
Publication date: 2024-10-01
Application No.: US17139250
Filing date: 2020-12-31
IPC classification: F24F11/63 , F24F120/10 , G05B13/02 , G05B13/04 , G10L15/20 , H04B17/318 , H04L67/12
CPC classification: F24F11/63 , G05B13/0265 , G05B13/048 , G10L15/20 , H04B17/318 , H04L67/12 , F24F2120/10
Abstract: An occupancy tracking device configured to receive sound samples, to identify voices within the sound samples, and to determine a first occupancy level based on the identified voices. The device is further configured to identify user devices connected to an access point and to determine a second occupancy level based on the user devices that are connected to the access point. The device is further configured to measure a signal strength of a network connection with the access point and to determine a third occupancy level based on the signal strength of the network connection with the access point. The device is further configured to determine a predicted occupancy level based on the first occupancy level, the second occupancy level, and the third occupancy level and to control a Heating, Ventilation, and Air Conditioning (HVAC) system based on the predicted occupancy level.
-
Publication No.: US12094455B2
Publication date: 2024-09-17
Application No.: US18242860
Filing date: 2023-09-06
IPC classification: G10L15/08 , G10L15/04 , G10L15/20 , G10L15/22 , G10L21/0208 , G10L21/028
CPC classification: G10L15/08 , G10L15/04 , G10L15/20 , G10L21/028 , G10L2015/088 , G10L15/22 , G10L2021/02082
Abstract: Systems and methods for selectively ignoring an occurrence of a wakeword within audio input data are provided herein. In some embodiments, a wakeword may be detected to have been uttered by an individual within a modified time window, which may account for hardware delays and echoing offsets. The detected wakeword that occurs during this modified time window may, in some embodiments, correspond to a word included within audio that is outputted by a voice activated electronic device. This may cause the voice activated electronic device to activate itself, stopping the audio from being outputted. By identifying when these occurrences of the wakeword within outputted audio are going to happen, the voice activated electronic device may selectively determine when to ignore the wakeword, and furthermore, when not to ignore the wakeword.
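One way to picture the modified time window is the sketch below. The abstract does not say how the window is adjusted, so the arithmetic here is an assumption: the window is shifted by the hardware delay and its end is extended by the echo offset. The names `TimeWindow`, `modified_window`, and `should_ignore_wakeword` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TimeWindow:
    """Interval, in seconds, during which the output audio contains the wakeword."""
    start_s: float
    end_s: float

    def contains(self, t_s: float) -> bool:
        return self.start_s <= t_s <= self.end_s


def modified_window(
    expected: TimeWindow, hardware_delay_s: float, echo_offset_s: float
) -> TimeWindow:
    # Assumed adjustment (not from the source): shift the whole window by the
    # playback hardware delay and lengthen its tail by the echo offset.
    return TimeWindow(
        expected.start_s + hardware_delay_s,
        expected.end_s + hardware_delay_s + echo_offset_s,
    )


def should_ignore_wakeword(
    detected_at_s: float,
    expected: TimeWindow,
    hardware_delay_s: float,
    echo_offset_s: float,
) -> bool:
    """Ignore a detection that falls inside the modified window."""
    window = modified_window(expected, hardware_delay_s, echo_offset_s)
    return window.contains(detected_at_s)
```

For example, if the device knows its own output contains the wakeword between 10.0 s and 10.5 s, a detection at 10.9 s would be ignored under a 0.2 s hardware delay and a 0.3 s echo offset, while a detection at 11.5 s would be treated as a genuine user utterance.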
-
Publication No.: US20240304187A1
Publication date: 2024-09-12
Application No.: US18662334
Filing date: 2024-05-13
Applicant: GOOGLE LLC
IPC classification: G10L15/20 , G10L15/02 , G10L15/08 , G10L15/22 , G10L21/0216 , G10L21/0232 , G10L25/84
CPC classification: G10L15/20 , G10L15/02 , G10L15/08 , G10L15/22 , G10L21/0232 , G10L25/84 , G10L2015/025 , G10L2015/088 , G10L2015/223 , G10L2021/02166
Abstract: Techniques are described for selectively adapting and/or selectively utilizing a noise reduction technique in detection of one or more features of a stream of audio data frames. For example, various techniques are directed to selectively adapting and/or utilizing a noise reduction technique in detection of an invocation phrase in a stream of audio data frames, detection of voice characteristics in a stream of audio data frames (e.g., for speaker identification), etc. Utilization of described techniques can result in more robust and/or more accurate detections of features of a stream of audio data frames in various situations, such as in environments with strong background noise. In various implementations, described techniques are implemented in combination with an automated assistant, and feature(s) detected utilizing techniques described herein are utilized to adapt the functionality of the automated assistant.
-
Publication No.: US12080317B2
Publication date: 2024-09-03
Application No.: US17639317
Filing date: 2020-08-27
IPC classification: G10L15/20 , G10L21/02 , G10L21/0208 , G10L21/0316
CPC classification: G10L21/0316 , G10L15/20 , G10L21/0208 , G10L2021/02082
Abstract: An apparatus and method of pre-conditioning audio for machine perception. Machine perception differs from human perception, and different processing parameters are used for machine perception applications (e.g., speech to text processing) as compared to those used for human perception applications (e.g., voice communications). These different parameters may result in pre-conditioned audio that is worsened for human perception yet improved for machine perception.
-
Publication No.: US20240249744A1
Publication date: 2024-07-25
Application No.: US18625507
Filing date: 2024-04-03
Inventor: Masaki Yamauchi , Nanami FUJIWARA
CPC classification: G10L25/78 , G10L15/20 , G10L15/22 , G10L2015/228 , G10L2025/783
Abstract: An information providing method includes: generating first information indicating that a friendly gathering is occurring in a home when (i) a threshold amount of time or longer has elapsed from a start time of food preparation by a user and (ii) the volume of sound in a dining space is a first threshold volume or greater; obtaining, from a second information processing apparatus connected to a first information processing apparatus, information indicating first request content over a network; and when content of the first information is included in the first request content, outputting, to the second information processing apparatus, second information including information for identifying the user or the home, using the first information generated.
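The two-condition trigger in step (i)/(ii) of this abstract reduces to a conjunction of threshold checks, sketched below. The function name, the units, and the default threshold values are hypothetical placeholders; the abstract states only that both an elapsed-time threshold and a volume threshold must be met.

```python
def gathering_detected(
    elapsed_since_prep_start_min: float,
    dining_volume_db: float,
    min_elapsed_min: float = 30.0,  # hypothetical threshold, not from the source
    min_volume_db: float = 60.0,    # hypothetical threshold, not from the source
) -> bool:
    """Return True when both conditions from the abstract hold.

    (i)  at least the threshold amount of time has elapsed since food
         preparation started, and
    (ii) the sound volume in the dining space is at or above the first
         threshold volume.
    """
    return (
        elapsed_since_prep_start_min >= min_elapsed_min
        and dining_volume_db >= min_volume_db
    )
```

Both comparisons are inclusive, matching the "or longer" and "or greater" wording of the abstract.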
-
Publication No.: US12039988B1
Publication date: 2024-07-16
Application No.: US18424695
Filing date: 2024-01-26
Applicant: Nantong University
Inventor: Shibing Zhang , Jianrong Wu
CPC classification: G10L21/02 , G10L15/063 , G10L15/20 , G10L25/51 , G10L2015/0631
Abstract: The present application discloses a method and a system for saturation diving heliumspeech unscrambling based on multi-objective optimization. In a system including at least a diver and a filter, a working language phonetic symbol library and a common working word library for divers are constructed. The divers read them one by one, and a phonetic symbol standard speech library, a phonetic symbol heliumspeech library and a common working word speech library are generated. The filter uses the multi-objective optimization algorithm to design its impulse response coefficients, corrects and unscrambles the tagged and sampled heliumspeech signal word by word, and continuously updates the impulse response coefficients to complete the heliumspeech unscrambling.
-
Publication No.: US12014730B2
Publication date: 2024-06-18
Application No.: US17322238
Filing date: 2021-05-17
Inventor: Xiangyan Xu
CPC classification: G10L15/20 , G10L15/02 , G10L2015/025
Abstract: A voice processing method includes: collecting a voice signal by a microphone of an electronic device, and signal-processing the collected voice signal to obtain a first voice frame segment; performing voice recognition on the first voice frame segment to obtain a first recognition result; in response to the first recognition result not matching a target content and a plurality of tokens in the first recognition result meeting a preset condition, performing frame compensation on the first voice frame segment to obtain a second voice frame segment; and performing voice recognition on the second voice frame segment to obtain a second recognition result. A matching degree between the second recognition result and the target content is greater than a matching degree between the first recognition result and the target content.
-
Publication No.: US20240194195A1
Publication date: 2024-06-13
Application No.: US18581960
Filing date: 2024-02-20
Inventor: Andrew J. Garner, IV , Tyua Larsen Fraser , Kimberly Ann Maclnnis , Paul R. McMahon , Darrell Lee Suen , Zhong Wan
Abstract: Systems and techniques for are described herein. A voice profile may be generated for a user. An audio stream may be received including an authentication voice of the user. It may be determined that the authentication voice does not match a first set of authentication criteria. The audio stream may be compared to a second set of authentication criteria. The user may be authenticated based on the comparison.
-
Publication No.: US11996091B2
Publication date: 2024-05-28
Application No.: US16989844
Filing date: 2020-08-10
IPC classification: G10L15/20 , G10L15/02 , G10L15/16 , G10L15/22 , G10L17/06 , G10L21/02 , G10L21/0272 , G10L21/0208
CPC classification: G10L15/20 , G10L15/02 , G10L15/16 , G10L15/22 , G10L17/06 , G10L21/02 , G10L21/0272 , G10L2015/223 , G10L2021/02087
Abstract: A mixed speech recognition method, a mixed speech recognition apparatus, and a computer-readable storage medium are provided. The mixed speech recognition method includes: monitoring speech input and detecting an enrollment speech and a mixed speech; acquiring speech features of a target speaker based on the enrollment speech; and determining speech belonging to the target speaker in the mixed speech based on the speech features of the target speaker. The enrollment speech includes preset speech information, and the mixed speech is non-enrollment speech inputted after the enrollment speech.