-
Publication No.: US12190873B2
Publication date: 2025-01-07
Application No.: US17952005
Application date: 2022-09-23
Applicant: Apple Inc.
Inventor: Ahmed S. Hussen Abdelaziz, Saurabh Adya, Alexander W. Churchill, Pranay Dighe, Sachin S. Kajarekar, Chaitanya Mannemala, Erik Marchi, Seyedmahdad Mirsamadi, Ognjen Rudovic, Ahmed H. Tewfik, Barry-John Theobald, Srikanth Vishnubhotla
Abstract: An example process includes: receiving a speech input representing a user utterance; determining, based on a textual representation of the speech input, a first score corresponding to a type of the user utterance; determining, based on the textual representation of the speech input, a second score representing a correspondence between the user utterance and a domain recognized by a digital assistant; determining, based on the first score and the second score, whether the speech input is intended for the digital assistant; in accordance with a determination that the speech input is intended for the digital assistant: initiating, by the digital assistant, a task based on the speech input; and providing an output indicative of the initiated task.
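Illustration: the two-score decision in the abstract above can be sketched as follows. This is a minimal sketch and not the patented method. The abstract does not say how the first and second scores are combined, so the weighted average, the 0.5 threshold, and the names `combine_scores`, `handle_utterance`, `type_scorer`, and `domain_scorer` are all assumptions made for this example.

```python
from typing import Callable, Optional


def combine_scores(type_score: float, domain_score: float, weight: float = 0.5) -> float:
    """Blend the two scores with a weighted average (an assumed combination rule)."""
    return weight * type_score + (1.0 - weight) * domain_score


def handle_utterance(
    text: str,
    type_scorer: Callable[[str], float],    # hypothetical: scores the utterance type
    domain_scorer: Callable[[str], float],  # hypothetical: scores the match to a known domain
    threshold: float = 0.5,                 # assumed cutoff, not taken from the abstract
) -> Optional[str]:
    """Return an output describing the initiated task, or None if the speech is ignored."""
    first_score = type_scorer(text)
    second_score = domain_scorer(text)
    if combine_scores(first_score, second_score) < threshold:
        # Speech is judged not intended for the assistant, so no task is started.
        return None
    # Stand-in for initiating a task and reporting it to the user.
    return f"Started task for: {text}"
```

In practice the two scorers would be models operating on the textual representation of the speech input; here they are plain callables so the decision logic can be read in isolation.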
-
Publication No.: US11620999B2
Publication date: 2023-04-04
Application No.: US17123428
Application date: 2020-12-16
Applicant: Apple Inc.
Inventor: Pranay Dighe, Erik Marchi, Srikanth Vishnubhotla, Sachin Kajarekar, Devang K. Naik
Abstract: An example process includes: receiving an audio stream; determining a plurality of acoustic representations of the audio stream, where each acoustic representation of the plurality of acoustic representations corresponds to a respective frame of the audio stream; obtaining a respective plurality of scores indicating whether each respective frame of the audio stream is directed to an electronic device, where the obtaining includes: determining, using a triggering model operating on the electronic device, for each acoustic representation, a score indicating whether the respective frame of the audio stream is directed to the electronic device; determining, based on the respective plurality of scores, a likelihood that the audio stream is directed to the electronic device; determining whether the likelihood is above or below a threshold; and in response to determining that the likelihood is below the threshold, ceasing to process the audio stream.
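Illustration: the frame-level scoring and threshold check in the abstract above can be sketched as follows. This is a minimal sketch and not the patented method. The abstract does not say how per-frame scores become a single likelihood, so the mean used here, the 0.5 threshold, and the names `stream_likelihood`, `should_continue`, and `triggering_model` are assumptions made for this example.

```python
from typing import Callable, Iterable, Sequence


def stream_likelihood(frame_scores: Iterable[float]) -> float:
    """Aggregate per-frame scores into one likelihood using the mean (an assumption)."""
    scores = list(frame_scores)
    if not scores:
        return 0.0  # no frames, so no evidence that the audio is device-directed
    return sum(scores) / len(scores)


def should_continue(
    frames: Iterable[Sequence[float]],                    # one acoustic representation per frame
    triggering_model: Callable[[Sequence[float]], float],  # hypothetical on-device scorer
    threshold: float = 0.5,                                # assumed cutoff
) -> bool:
    """Return False when the likelihood falls below the threshold (stop processing)."""
    scores = [triggering_model(frame) for frame in frames]
    # The abstract leaves the equal-to-threshold case open; this sketch keeps processing.
    return stream_likelihood(scores) >= threshold
```

A caller would stop processing the audio stream whenever `should_continue` returns False, which corresponds to the "ceasing to process the audio stream" step.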
-