-
公开(公告)号:US11217255B2
公开(公告)日:2022-01-04
申请号:US15679108
申请日:2017-08-16
Applicant: Apple Inc.
Inventor: Yoon Kim , Charles Srisuwananukorn , David A. Carson , Thomas R. Gruber , Justin G. Binder
Abstract: Systems and processes for operating an intelligent automated assistant to provide extension of digital assistant services are provided. An example method includes, at an electronic device having one or more processors, receiving, from a first user, a first speech input representing a user request. The method further includes obtaining an identity of the first user; and in accordance with the user identity, providing a representation of the user request to at least one of a second electronic device or a third electronic device. The method further includes receiving, based on a determination of whether the second electronic device or the third electronic device, or both, is to provide the response to the first electronic device, the response to the user request from the second electronic device or the third electronic device. The method further includes providing a representation of the response to the first user.
-
公开(公告)号:US10395659B2
公开(公告)日:2019-08-27
申请号:US15679098
申请日:2017-08-16
Applicant: Apple Inc.
Inventor: Aimee Piercy , Cyrus Daniel Irani , Yoon Kim , David Chance Graham , Patrick L. Coffman
IPC: G10L17/22 , G06F16/9537 , G06F3/16 , G06F16/2457 , G10L15/22 , G10L13/033 , G10L21/00 , G06N20/00 , G10L13/027 , G10L15/30
Abstract: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors and memory, receiving a natural-language speech input indicative of a request to the digital assistant; obtaining, by the digital assistant, context information; determining, by the digital assistant, a text-to-speech mode from a plurality of text-to-speech modes based on the obtained context information; and providing, by the digital assistant, an audio output with the determined text-to-speech mode, where the audio output is indicative of a speech response to the user request.
-
公开(公告)号:US11532306B2
公开(公告)日:2022-12-20
申请号:US17111132
申请日:2020-12-03
Applicant: Apple Inc.
Inventor: Yoon Kim , John Bridle , Joshua D. Atkins , Feipeng Li , Mehrez Souden
IPC: G10L15/22 , H04R1/40 , G10L15/08 , G10L15/04 , H04R3/00 , G10L15/30 , G10L15/18 , G10L15/28 , G10L21/0216 , G10L25/51 , H04R27/00
Abstract: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors, memory, and a plurality of microphones, sampling, at each of the plurality of microphones of the electronic device, an audio signal to obtain a plurality of audio signals; processing the plurality of audio signals to obtain a plurality of audio streams; and determining, based on the plurality of audio streams, whether any of the plurality of audio signals corresponds to a spoken trigger. The method further includes, in accordance with a determination that the plurality of audio signals corresponds to the spoken trigger, initiating a session of the digital assistant; and in accordance with a determination that the plurality of audio signals does not correspond to the spoken trigger, foregoing initiating a session of the digital assistant.
-
公开(公告)号:US10789041B2
公开(公告)日:2020-09-29
申请号:US14834194
申请日:2015-08-24
Applicant: Apple Inc.
Inventor: Yoon Kim , Thomas R. Gruber , John Bridle
Abstract: Systems and processes are disclosed for dynamically adjusting a speech trigger threshold, which can be used in triggering a virtual assistant. Audio input can be received via a microphone. The received audio input can be sampled, and a confidence level can be determined of whether the sampled audio input includes a portion of a spoken trigger. In response to the confidence level exceeding a threshold, a virtual assistant can be triggered to receive a user command from the audio input. The threshold can be dynamically adjusted in response to perceived events (e.g., events indicating a user may be more or less likely to initiate speech interactions, events indicating a trigger may be difficult to detect, events indicating a trigger was missed, etc.), thereby minimizing both missed triggers and false positive triggering events.
-
公开(公告)号:US10747498B2
公开(公告)日:2020-08-18
申请号:US15147726
申请日:2016-05-05
Applicant: Apple Inc.
Inventor: William F. Stasior , David A. Carson , Rohit Dasari , Yoon Kim
Abstract: An electronic device can implement a zero-latency digital assistant by capturing audio input from a microphone and using a first processor to write audio data representing the captured audio input to a memory buffer. In response to detecting a user input while capturing the audio input, the device can determine whether the user input meets a predetermined criteria. If the user input meets the criteria, the device can use a second processor to identify and execute a task based on at least a portion of the contents of the memory buffer.
-
公开(公告)号:US10127911B2
公开(公告)日:2018-11-13
申请号:US14835169
申请日:2015-08-25
Applicant: Apple Inc.
Inventor: Yoon Kim , Sachin S. Kajarekar
Abstract: Systems and processes for generating a speaker profile for use in performing speaker identification for a virtual assistant are provided. One example process can include receiving an audio input including user speech and determining whether a speaker of the user speech is a predetermined user based on a speaker profile for the predetermined user. In response to determining that the speaker of the user speech is the predetermined user, the user speech can be added to the speaker profile and operation of the virtual assistant can be triggered. In response to determining that the speaker of the user speech is not the predetermined user, the user speech can be added to an alternate speaker profile and operation of the virtual assistant may not be triggered. In some examples, contextual information can be used to verify results produced by the speaker identification process.
-
公开(公告)号:US10074360B2
公开(公告)日:2018-09-11
申请号:US14834239
申请日:2015-08-24
Applicant: Apple Inc.
Inventor: Yoon Kim
CPC classification number: G10L15/01 , G10L15/22 , G10L25/60 , H04R29/008
Abstract: This relates to providing an indication of the suitability of an acoustic environment for performing speech recognition. One process can include receiving an audio input and determining a speech recognition suitability based on the audio input. The speech recognition suitability can include a numerical, textual, graphical, or other representation of the suitability of an acoustic environment for performing speech recognition. The process can further include displaying a visual representation of the speech recognition suitability to indicate the likelihood that a spoken user input will be interpreted correctly. This allows a user to determine whether to proceed with the performance of a speech recognition process, or to move to a different location having a better acoustic environment before performing the speech recognition process. In some examples, the user device can disable operation of a speech recognition process in response to determining that the speech recognition suitability is below a threshold suitability.
-
8.
公开(公告)号:US12254887B2
公开(公告)日:2025-03-18
申请号:US17543292
申请日:2021-12-06
Applicant: Apple Inc.
Inventor: Yoon Kim , Charles Srisuwananukorn , David A. Carson , Thomas R. Gruber , Justin G. Binder
Abstract: Systems and processes for operating an intelligent automated assistant to provide extension of digital assistant services are provided. An example method includes, at an electronic device having one or more processors, receiving, from a first user, a first speech input representing a user request. The method further includes obtaining an identity of the first user; and in accordance with the user identity, providing a representation of the user request to at least one of a second electronic device or a third electronic device. The method further includes receiving, based on a determination of whether the second electronic device or the third electronic device, or both, is to provide the response to the first electronic device, the response to the user request from the second electronic device or the third electronic device. The method further includes providing a representation of the response to the first user.
-
公开(公告)号:US11954405B2
公开(公告)日:2024-04-09
申请号:US17982337
申请日:2022-11-07
Applicant: Apple Inc.
Inventor: William F. Stasior , David A. Carson , Rohit Dasari , Yoon Kim
CPC classification number: G06F3/167 , G06F3/038 , G06F3/0481 , G06F3/0604 , G06F3/0656 , G06F3/0673 , G10L15/22 , G10L15/32 , G10L2015/088 , G10L2015/223 , G10L15/285 , H04M2201/40 , H04M2250/74
Abstract: An electronic device can implement a zero-latency digital assistant by capturing audio input from a microphone and using a first processor to write audio data representing the captured audio input to a memory buffer. In response to detecting a user input while capturing the audio input, the device can determine whether the user input meets a predetermined criteria. If the user input meets the criteria, the device can use a second processor to identify and execute a task based on at least a portion of the contents of the memory buffer.
-
公开(公告)号:US11550542B2
公开(公告)日:2023-01-10
申请号:US17403674
申请日:2021-08-16
Applicant: Apple Inc.
Inventor: William F. Stasior , David A. Carson , Rohit Dasari , Yoon Kim
Abstract: An electronic device can implement a zero-latency digital assistant by capturing audio input from a microphone and using a first processor to write audio data representing the captured audio input to a memory buffer. In response to detecting a user input while capturing the audio input, the device can determine whether the user input meets a predetermined criteria. If the user input meets the criteria, the device can use a second processor to identify and execute a task based on at least a portion of the contents of the memory buffer.
-
-
-
-
-
-
-
-
-