-
公开(公告)号:US20230368783A1
公开(公告)日:2023-11-16
申请号:US17952005
申请日:2022-09-23
Applicant: Apple Inc.
Inventor: Eric MARCHI , Ognjen RUDOVIC , Pranay DIGHE , Sachin S. KAJAREKAR , Saurabh ADYA , Barry-John THEOBALD , Seyedmahdad MIRSAMADI , Ahmed S. HUSSEN ABDELAZIZ
IPC: G10L15/197 , G10L15/22 , G10L15/16
CPC classification number: G10L15/197 , G10L15/16 , G10L15/22 , G10L2015/088
Abstract: An example process includes: receiving a speech input representing a user utterance; determining, based on a textual representation of the speech input, a first score corresponding to a type of the user utterance; determining, based on the textual representation of the speech input, a second score representing a correspondence between the user utterance and a domain recognized by a digital assistant; determining, based on the first score and the second score, whether the speech input is intended for the digital assistant; in accordance with a determination that the speech input is intended for the digital assistant: initiating, by the digital assistant, a task based on the speech input; and providing an output indicative of the initiated task.
-
公开(公告)号:US20220093095A1
公开(公告)日:2022-03-24
申请号:US17123428
申请日:2020-12-16
Applicant: Apple Inc.
Inventor: Pranay DIGHE , Erik MARCHI , Srikanth VISHNUBHOTLA , Sachin KAJAREKAR , Devang K. NAIK
Abstract: An example process includes: receiving an audio stream; determining a plurality of acoustic representations of the audio stream, where each acoustic representation of the plurality of acoustic representations corresponds to a respective frame of the audio stream; obtaining a respective plurality of scores indicating whether each respective frame of the audio stream is directed to an electronic device, where the obtaining includes: determining, using a triggering model operating on the electronic device, for each acoustic representation, a score indicating whether the respective frame of the audio stream is directed to the electronic device; determining, based on the respective plurality of scores, a likelihood that the audio stream is directed to the electronic device; determining whether the likelihood is above or below a threshold; and in response to determining that the likelihood is below the threshold, ceasing to process the audio stream.
-