-
公开(公告)号:US20220293125A1
公开(公告)日:2022-09-15
申请号:US17330862
申请日:2021-05-26
Applicant: Apple Inc.
Inventor: Sreeneel MADDIKA , Ahmed Serag El Din HUSSEN ABDELAZIZ , Chaitanya MANNEMALA , Srikanth VISHNUBHOTLA , Garrett L. WEINBERG
IPC: G10L25/78 , G10L25/51 , G10L21/0208 , G06K9/00
Abstract: Systems and processes for operating an intelligent automated assistant are provided. For example, a first speech input is received from a user. In response to receiving the first speech input, a response is provided. A first output is provided corresponding to a digital assistant in a first state, and a second speech input is received from the user. A first plurality of values is obtained. Based on the first plurality of values, a first confidence level corresponding to the second speech input is obtained. In accordance with a determination that the first confidence level exceeds a first threshold confidence level, a second output is provided corresponding to the digital assistant in a second state. The second speech input continues to be received.
-
公开(公告)号:US20240371378A1
公开(公告)日:2024-11-07
申请号:US18777427
申请日:2024-07-18
Applicant: Apple Inc.
Inventor: Saurabh ADYA , Sameer BADASKAR , Akanksha BINDAL , Ahmed S. HUSSEN ABDELAZIZ , Xiaochuan NIU , Alkeshkumar M. PATEL , Srikanth VISHNUBHOTLA
Abstract: Systems and processes for operating a digital assistant are provided. An example method for processing an image include receiving an image, generating, based on the image, a question corresponding to a first object in the image, generating, based on the image, a caption corresponding to a second object of the image, receiving an utterance from a user, and determining a plurality of speech recognition results from the utterance based on the question and the caption.
-
公开(公告)号:US20240055017A1
公开(公告)日:2024-02-15
申请号:US18237362
申请日:2023-08-23
Applicant: Apple Inc.
Inventor: Sreeneel MADDIKA , Ahmed Serag El Din HUSSEN ABDELAZIZ , Chaitanya MANNEMALA , Srikanth VISHNUBHOTLA , Garrett L. WEINBERG
IPC: G10L25/78 , G10L25/51 , G10L21/0208 , G06V40/16
CPC classification number: G10L25/78 , G10L25/51 , G10L21/0208 , G06V40/171 , G10L2021/02082
Abstract: Systems and processes for operating an intelligent automated assistant are provided. For example, a first speech input is received from a user. In response to receiving the first speech input, a response is provided. A first output is provided corresponding to a digital assistant in a first state, and a second speech input is received from the user. A first plurality of values is obtained. Based on the first plurality of values, a first confidence level corresponding to the second speech input is obtained. In accordance with a determination that the first confidence level exceeds a first threshold confidence level, a second output is provided corresponding to the digital assistant in a second state. The second speech input continues to be received.
-
公开(公告)号:US20220093095A1
公开(公告)日:2022-03-24
申请号:US17123428
申请日:2020-12-16
Applicant: Apple Inc.
Inventor: Pranay DIGHE , Erik MARCHI , Srikanth VISHNUBHOTLA , Sachin KAJAREKAR , Devang K. NAIK
Abstract: An example process includes: receiving an audio stream; determining a plurality of acoustic representations of the audio stream, where each acoustic representation of the plurality of acoustic representations corresponds to a respective frame of the audio stream; obtaining a respective plurality of scores indicating whether each respective frame of the audio stream is directed to an electronic device, where the obtaining includes: determining, using a triggering model operating on the electronic device, for each acoustic representation, a score indicating whether the respective frame of the audio stream is directed to the electronic device; determining, based on the respective plurality of scores, a likelihood that the audio stream is directed to the electronic device; determining whether the likelihood is above or below a threshold; and in response to determining that the likelihood is below the threshold, ceasing to process the audio stream.
-
-
-