-
Publication number: US12118999B2
Publication date: 2024-10-15
Application number: US18231135
Filing date: 2023-08-07
Applicant: Apple Inc.
Inventor: Philippe P. Piernot, Justin G. Binder
CPC classification number: G10L15/22, G06F3/013, G06F3/167, G10L15/26, H04W4/025, G06F2203/0381, G10L15/1815, G10L15/1822, G10L2015/223, G10L2015/227, G10L2015/228, G10L17/00
Abstract: Systems and processes for selectively processing and responding to a spoken user input are provided. In one example, audio input containing a spoken user input can be received at a user device. The spoken user input can be identified from the audio input by identifying start and end-points of the spoken user input. It can be determined whether or not the spoken user input was intended for a virtual assistant based on contextual information. The determination can be made using a rule-based system or a probabilistic system. If it is determined that the spoken user input was intended for the virtual assistant, the spoken user input can be processed and an appropriate response can be generated. If it is instead determined that the spoken user input was not intended for the virtual assistant, the spoken user input can be ignored and/or no response can be generated.
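The abstract above names two ways of deciding whether speech was meant for the assistant: a rule-based system and a probabilistic system. The following is a minimal sketch of both, not the patented method. The contextual signals (`seconds_since_last_interaction`, `user_facing_device`, `media_playing`), the weights, and the thresholds are all hypothetical and chosen only for illustration.

```python
import math
from dataclasses import dataclass
from typing import Optional


@dataclass
class Context:
    """Hypothetical contextual signals; none of these names come from the patent."""
    seconds_since_last_interaction: float
    user_facing_device: bool
    media_playing: bool


def rule_based_intended(ctx: Context) -> bool:
    """Rule-based variant: any single satisfied rule accepts the input."""
    if ctx.seconds_since_last_interaction < 5.0:
        return True
    if ctx.user_facing_device and not ctx.media_playing:
        return True
    return False


def probabilistic_intended(ctx: Context, threshold: float = 0.5) -> bool:
    """Probabilistic variant: combine weighted cues into a likelihood score."""
    # Illustrative weights only; a real system would learn or tune these.
    score = -1.0
    score += 2.0 if ctx.seconds_since_last_interaction < 5.0 else 0.0
    score += 1.5 if ctx.user_facing_device else 0.0
    score -= 1.0 if ctx.media_playing else 0.0
    probability = 1.0 / (1.0 + math.exp(-score))  # logistic squashing to (0, 1)
    return probability >= threshold


def respond(spoken_input: str, ctx: Context, use_rules: bool = True) -> Optional[str]:
    """Process the input if it seems intended for the assistant; otherwise ignore it."""
    if use_rules:
        intended = rule_based_intended(ctx)
    else:
        intended = probabilistic_intended(ctx)
    return f"processing: {spoken_input}" if intended else None
```

Returning `None` stands in for the "ignored and/or no response" branch; the string return value stands in for whatever processing and response generation a real assistant would perform.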
-
Publication number: US11127397B2
Publication date: 2021-09-21
Application number: US16139648
Filing date: 2018-09-24
Applicant: Apple Inc.
Inventor: Philippe P. Piernot, Justin G. Binder
Abstract: Systems and processes for device voice control are provided. An example process includes, at an electronic device, receiving a spoken user input and interpreting the spoken user input to derive a representation of user intent. The process further includes determining whether a task may be identified based on the representation of user intent. In accordance with a determination that a task may be identified based on the representation of user intent, the task is performed, and in accordance with a determination that a task may not be identified based on the representation of user intent, the spoken user input is disambiguated.
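The control flow in this abstract (interpret the input, try to identify a task, perform it or disambiguate) can be sketched roughly as follows. This is an illustrative sketch only: the task registry `TASKS`, the keyword table `KEYWORDS`, and the choice to disambiguate by asking a clarifying question are assumptions, not details taken from the patent.

```python
from typing import Callable, Dict, Optional

# Hypothetical task registry keyed by an intent label.
TASKS: Dict[str, Callable[[], str]] = {
    "set_timer": lambda: "timer set",
    "play_music": lambda: "playing music",
}

# Hypothetical keyword table standing in for a real natural-language interpreter.
KEYWORDS: Dict[str, str] = {"timer": "set_timer", "music": "play_music"}


def interpret(spoken_input: str) -> Optional[str]:
    """Derive a representation of user intent, or None if unknown or ambiguous."""
    text = spoken_input.lower()
    matches = {intent for word, intent in KEYWORDS.items() if word in text}
    return matches.pop() if len(matches) == 1 else None


def handle(spoken_input: str) -> str:
    """Perform the task if one can be identified; otherwise disambiguate."""
    intent = interpret(spoken_input)
    task = TASKS.get(intent) if intent is not None else None
    if task is not None:
        return task()
    # Assumed disambiguation strategy: ask the user to choose among known tasks.
    options = ", ".join(sorted(TASKS))
    return f"Did you mean one of: {options}?"
```

For instance, `handle("Set a timer")` runs the timer task, while an input matching zero or several intents falls through to the clarifying question.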
-
Publication number: US09715875B2
Publication date: 2017-07-25
Application number: US14502737
Filing date: 2014-09-30
Applicant: Apple Inc.
Inventor: Philippe P. Piernot, Justin G. Binder
CPC classification number: G10L15/22, G06F3/167, G10L15/1815, G10L15/1822, G10L15/26, G10L17/00, G10L2015/223, G10L2015/227, G10L2015/228, H04W4/025
Abstract: Systems and processes for selectively processing and responding to a spoken user input are provided. In one example, audio input containing a spoken user input can be received at a user device. The spoken user input can be identified from the audio input by identifying start and end-points of the spoken user input. It can be determined whether or not the spoken user input was intended for a virtual assistant based on contextual information. The determination can be made using a rule-based system or a probabilistic system. If it is determined that the spoken user input was intended for the virtual assistant, the spoken user input can be processed and an appropriate response can be generated. If it is instead determined that the spoken user input was not intended for the virtual assistant, the spoken user input can be ignored and/or no response can be generated.
-
Publication number: US09620104B2
Publication date: 2017-04-11
Application number: US14298690
Filing date: 2014-06-06
Applicant: Apple Inc.
Inventor: Devang K. Naik, Thomas R. Gruber, Liam Weiner, Justin G. Binder, Charles Srisuwananukorn, Gunnar Evermann, Shaun Eric Williams, Hong Chen, Lia T. Napolitano
CPC classification number: G10L13/027, G10L13/04, G10L13/08, G10L15/063, G10L15/22, G10L15/26, G10L15/265, G10L2015/0631, G10L2015/0638
Abstract: The method is performed at an electronic device with one or more processors and memory storing one or more programs for execution by the one or more processors. A first speech input including at least one word is received. A first phonetic representation of the at least one word is determined, the first phonetic representation comprising a first set of phonemes selected from a speech recognition phonetic alphabet. The first set of phonemes is mapped to a second set of phonemes to generate a second phonetic representation, where the second set of phonemes is selected from a speech synthesis phonetic alphabet. The second phonetic representation is stored in association with a text string corresponding to the at least one word.
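The mapping step described in this abstract (recognition-alphabet phonemes translated to synthesis-alphabet phonemes, then stored against the text string) might look something like the sketch below. The phoneme symbols and the table `RECOGNITION_TO_SYNTHESIS` are invented for illustration and do not correspond to any real phonetic alphabet; a one-to-one lookup is also an assumption, since the abstract does not say how the mapping is performed.

```python
from typing import Dict, List

# Hypothetical one-to-one mapping table; the symbols are invented for illustration.
RECOGNITION_TO_SYNTHESIS: Dict[str, str] = {
    "f": "F",
    "ih": "I",
    "l": "L",
    "iy": "EE",
    "p": "P",
}

# Stored pronunciations, keyed by the text string for the spoken word.
pronunciations: Dict[str, List[str]] = {}


def map_phonemes(recognition_phonemes: List[str]) -> List[str]:
    """Map each recognition-alphabet phoneme to its synthesis-alphabet counterpart."""
    try:
        return [RECOGNITION_TO_SYNTHESIS[p] for p in recognition_phonemes]
    except KeyError as exc:
        raise ValueError(f"no synthesis phoneme for {exc.args[0]!r}") from exc


def store_pronunciation(text: str, recognition_phonemes: List[str]) -> List[str]:
    """Generate the second phonetic representation and store it with the text."""
    synthesis_phonemes = map_phonemes(recognition_phonemes)
    pronunciations[text] = synthesis_phonemes
    return synthesis_phonemes
```

Raising on an unmapped phoneme is a design choice of this sketch; a real system could instead fall back to a default pronunciation.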
-
Publication number: US11217255B2
Publication date: 2022-01-04
Application number: US15679108
Filing date: 2017-08-16
Applicant: Apple Inc.
Inventor: Yoon Kim, Charles Srisuwananukorn, David A. Carson, Thomas R. Gruber, Justin G. Binder
Abstract: Systems and processes for operating an intelligent automated assistant to provide extension of digital assistant services are provided. An example method includes, at an electronic device having one or more processors, receiving, from a first user, a first speech input representing a user request. The method further includes obtaining an identity of the first user; and in accordance with the user identity, providing a representation of the user request to at least one of a second electronic device or a third electronic device. The method further includes receiving, based on a determination of whether the second electronic device or the third electronic device, or both, is to provide the response to the first electronic device, the response to the user request from the second electronic device or the third electronic device. The method further includes providing a representation of the response to the first user.
-
Publication number: US10187440B2
Publication date: 2019-01-22
Application number: US15167898
Filing date: 2016-05-27
Applicant: APPLE INC.
Inventor: Devang K. Naik, Justin G. Binder
Abstract: In some implementations, a user device can personalize a media stream by converting notifications into audio speech data and presenting the audio speech data at locations within the media stream that do not interrupt the enjoyment of the media stream by the user. In some implementations, the user device can receive notifications from various communication services, applications installed on the user device, and/or other sources, determine information describing the notifications, and present the information to the user using the audio speech data. In some implementations, the user device can generate personalized notifications based on the media stream and/or media items selected by the user. The user device can generate personalized notifications based on the user's context (e.g., environment, location, activity, etc.). The personalized notifications can then be presented to the user using audio speech data at appropriate locations in the media stream.
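One way to picture the idea of presenting spoken notifications at non-interrupting locations is to interleave them at boundaries between media items. The sketch below assumes exactly that; the abstract does not specify which locations qualify. `to_speech` is a hypothetical placeholder that returns a label instead of synthesized audio.

```python
from collections import deque
from typing import Deque, Iterable, Iterator


def to_speech(notification: str) -> str:
    """Placeholder for text-to-speech conversion; returns a label, not audio."""
    return f"[spoken] {notification}"


def personalize_stream(
    media_items: Iterable[str], notifications: Iterable[str]
) -> Iterator[str]:
    """Yield media items, placing at most one spoken notification after each item.

    Assumption: the end of a media item is a location that does not interrupt
    playback. Notifications still pending when the stream ends are not yielded.
    """
    pending: Deque[str] = deque(notifications)
    for item in media_items:
        yield item
        if pending:
            yield to_speech(pending.popleft())
```

For example, a three-item stream with one pending notification yields the first item, then the spoken notification, then the remaining two items.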
-
Publication number: US10083688B2
Publication date: 2018-09-25
Application number: US14838331
Filing date: 2015-08-27
Applicant: Apple Inc.
Inventor: Philippe P. Piernot, Justin G. Binder
Abstract: Systems and processes for device voice control are provided. An example process includes, at an electronic device, receiving a spoken user input and interpreting the spoken user input to derive a representation of user intent. The process further includes determining whether a task may be identified based on the representation of user intent. In accordance with a determination that a task may be identified based on the representation of user intent, the task is performed, and in accordance with a determination that a task may not be identified based on the representation of user intent, the spoken user input is disambiguated.
-
Publication number: US20170346872A1
Publication date: 2017-11-30
Application number: US15167898
Filing date: 2016-05-27
Applicant: APPLE INC.
Inventor: Devang K. Naik, Justin G. Binder
CPC classification number: H04L65/604, G01C21/3629, G01C21/3661, H04L65/4023, H04L65/4069, H04L67/18, H04L67/26, H04L67/325
Abstract: In some implementations, a user device can personalize a media stream by converting notifications into audio speech data and presenting the audio speech data at locations within the media stream that do not interrupt the enjoyment of the media stream by the user. In some implementations, the user device can receive notifications from various communication services, applications installed on the user device, and/or other sources, determine information describing the notifications, and present the information to the user using the audio speech data. In some implementations, the user device can generate personalized notifications based on the media stream and/or media items selected by the user. The user device can generate personalized notifications based on the user's context (e.g., environment, location, activity, etc.). The personalized notifications can then be presented to the user using audio speech data at appropriate locations in the media stream.
-
Publication number: US12254887B2
Publication date: 2025-03-18
Application number: US17543292
Filing date: 2021-12-06
Applicant: Apple Inc.
Inventor: Yoon Kim, Charles Srisuwananukorn, David A. Carson, Thomas R. Gruber, Justin G. Binder
Abstract: Systems and processes for operating an intelligent automated assistant to provide extension of digital assistant services are provided. An example method includes, at an electronic device having one or more processors, receiving, from a first user, a first speech input representing a user request. The method further includes obtaining an identity of the first user; and in accordance with the user identity, providing a representation of the user request to at least one of a second electronic device or a third electronic device. The method further includes receiving, based on a determination of whether the second electronic device or the third electronic device, or both, is to provide the response to the first electronic device, the response to the user request from the second electronic device or the third electronic device. The method further includes providing a representation of the response to the first user.
-
Publication number: US11810562B2
Publication date: 2023-11-07
Application number: US17461018
Filing date: 2021-08-30
Applicant: Apple Inc.
Inventor: Philippe P. Piernot, Justin G. Binder
CPC classification number: G10L15/22, G06F3/013, G06F3/167, G10L15/26, H04W4/025, G06F2203/0381, G10L15/1815, G10L15/1822, G10L17/00, G10L2015/223, G10L2015/227, G10L2015/228
Abstract: Systems and processes for selectively processing and responding to a spoken user input are provided. In one example, audio input containing a spoken user input can be received at a user device. The spoken user input can be identified from the audio input by identifying start and end-points of the spoken user input. It can be determined whether or not the spoken user input was intended for a virtual assistant based on contextual information. The determination can be made using a rule-based system or a probabilistic system. If it is determined that the spoken user input was intended for the virtual assistant, the spoken user input can be processed and an appropriate response can be generated. If it is instead determined that the spoken user input was not intended for the virtual assistant, the spoken user input can be ignored and/or no response can be generated.
-