-
Publication No.: US20190074009A1
Publication date: 2019-03-07
Application No.: US16181138
Filing date: 2018-11-05
Applicant: Apple Inc.
Inventor: Yoon KIM , John BRIDLE , Joshua D. ATKINS , Feipeng LI , Mehrez SOUDEN
Abstract: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors, memory, and a plurality of microphones, sampling, at each of the plurality of microphones of the electronic device, an audio signal to obtain a plurality of audio signals; processing the plurality of audio signals to obtain a plurality of audio streams; and determining, based on the plurality of audio streams, whether any of the plurality of audio signals corresponds to a spoken trigger. The method further includes, in accordance with a determination that the plurality of audio signals corresponds to the spoken trigger, initiating a session of the digital assistant; and in accordance with a determination that the plurality of audio signals does not correspond to the spoken trigger, foregoing initiating a session of the digital assistant.
-
Publication No.: US20180218735A1
Publication date: 2018-08-02
Application No.: US15937653
Filing date: 2018-03-27
Applicant: Apple Inc.
Inventor: Melvyn HUNT , John BRIDLE
CPC classification number: G10L15/26 , G10L15/30 , G10L2015/025 , G10L2015/0631
Abstract: A system and method of speech recognition involving a mobile device. Speech input is received (202) on a mobile device (102) and converted (204) to a set of phonetic symbols. Data relating to the phonetic symbols is transferred (206) from the mobile device over a communications network (104) to a remote processing device (106) where it is used (208) to identify at least one matching data item from a set of data items (114). Data relating to the at least one matching data item is transferred (210) from the remote processing device to the mobile device and presented (214) thereon.
-
Publication No.: US20180336892A1
Publication date: 2018-11-22
Application No.: US15920091
Filing date: 2018-03-13
Applicant: Apple Inc.
Inventor: Yoon KIM , John BRIDLE , Joshua D. ATKINS , Feipeng LI , Mehrez SOUDEN
CPC classification number: G10L15/22 , G10L15/1822 , G10L15/30 , G10L25/51 , G10L2015/228 , G10L2021/02166 , H04R3/005
Abstract: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors, memory, and a plurality of microphones, sampling, at each of the plurality of microphones of the electronic device, an audio signal to obtain a plurality of audio signals; processing the plurality of audio signals to obtain a plurality of audio streams; and determining, based on the plurality of audio streams, whether any of the plurality of audio signals corresponds to a spoken trigger. The method further includes, in accordance with a determination that the plurality of audio signals corresponds to the spoken trigger, initiating a session of the digital assistant; and in accordance with a determination that the plurality of audio signals does not correspond to the spoken trigger, foregoing initiating a session of the digital assistant.
-
Publication No.: US20230111509A1
Publication date: 2023-04-13
Application No.: US18080550
Filing date: 2022-12-13
Applicant: Apple Inc.
Inventor: Yoon KIM , John BRIDLE , Joshua D. ATKINS , Feipeng LI , Mehrez SOUDEN
IPC: G10L15/22 , H04R1/40 , G10L15/08 , G10L15/04 , H04R3/00 , G10L15/30 , G10L15/18 , G10L15/28 , G10L21/0216
Abstract: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors, memory, and a plurality of microphones, sampling, at each of the plurality of microphones of the electronic device, an audio signal to obtain a plurality of audio signals; processing the plurality of audio signals to obtain a plurality of audio streams; and determining, based on the plurality of audio streams, whether any of the plurality of audio signals corresponds to a spoken trigger. The method further includes, in accordance with a determination that the plurality of audio signals corresponds to the spoken trigger, initiating a session of the digital assistant; and in accordance with a determination that the plurality of audio signals does not correspond to the spoken trigger, foregoing initiating a session of the digital assistant.
-
Publication No.: US20210097998A1
Publication date: 2021-04-01
Application No.: US17111132
Filing date: 2020-12-03
Applicant: Apple Inc.
Inventor: Yoon KIM , John BRIDLE , Joshua D. ATKINS , Feipeng LI , Mehrez SOUDEN
Abstract: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors, memory, and a plurality of microphones, sampling, at each of the plurality of microphones of the electronic device, an audio signal to obtain a plurality of audio signals; processing the plurality of audio signals to obtain a plurality of audio streams; and determining, based on the plurality of audio streams, whether any of the plurality of audio signals corresponds to a spoken trigger. The method further includes, in accordance with a determination that the plurality of audio signals corresponds to the spoken trigger, initiating a session of the digital assistant; and in accordance with a determination that the plurality of audio signals does not correspond to the spoken trigger, foregoing initiating a session of the digital assistant.
-
Publication No.: US20160077794A1
Publication date: 2016-03-17
Application No.: US14834194
Filing date: 2015-08-24
Applicant: Apple Inc.
Inventor: Yoon KIM , Thomas R. GRUBER , John BRIDLE
Abstract: Systems and processes are disclosed for dynamically adjusting a speech trigger threshold, which can be used in triggering a virtual assistant. Audio input can be received via a microphone. The received audio input can be sampled, and a confidence level can be determined of whether the sampled audio input includes a portion of a spoken trigger. In response to the confidence level exceeding a threshold, a virtual assistant can be triggered to receive a user command from the audio input. The threshold can be dynamically adjusted in response to perceived events (e.g., events indicating a user may be more or less likely to initiate speech interactions, events indicating a trigger may be difficult to detect, events indicating a trigger was missed, etc.), thereby minimizing both missed triggers and false positive triggering events.