Patent search ap:("Apple Inc.") AND inv:"Srikanth VISHNUBHOTLA" Page 1

1.

发明申请
MULTIPLE STATE DIGITAL ASSISTANT FOR CONTINUOUS DIALOG 有权

公开(公告)号：US20220293125A1

公开(公告)日：2022-09-15

申请号：US17330862

申请日：2021-05-26

Applicant: Apple Inc.

Inventor： Sreeneel MADDIKA , Ahmed Serag El Din HUSSEN ABDELAZIZ , Chaitanya MANNEMALA , Srikanth VISHNUBHOTLA , Garrett L. WEINBERG

IPC: G10L25/78 , G10L25/51 , G10L21/0208 , G06K9/00

Abstract: Systems and processes for operating an intelligent automated assistant are provided. For example, a first speech input is received from a user. In response to receiving the first speech input, a response is provided. A first output is provided corresponding to a digital assistant in a first state, and a second speech input is received from the user. A first plurality of values is obtained. Based on the first plurality of values, a first confidence level corresponding to the second speech input is obtained. In accordance with a determination that the first confidence level exceeds a first threshold confidence level, a second output is provided corresponding to the digital assistant in a second state. The second speech input continues to be received.

2.

发明申请
USING VISUAL CONTEXT TO IMPROVE A VIRTUAL ASSISTANT 有权

公开(公告)号：US20240371378A1

公开(公告)日：2024-11-07

申请号：US18777427

申请日：2024-07-18

Applicant: Apple Inc.

Inventor： Saurabh ADYA , Sameer BADASKAR , Akanksha BINDAL , Ahmed S. HUSSEN ABDELAZIZ , Xiaochuan NIU , Alkeshkumar M. PATEL , Srikanth VISHNUBHOTLA

IPC: G10L15/22 , G06F18/214 , G06V10/82 , G06V20/50 , G10L15/06 , G10L15/16 , G10L15/18 , G10L15/24

Abstract: Systems and processes for operating a digital assistant are provided. An example method for processing an image include receiving an image, generating, based on the image, a question corresponding to a first object in the image, generating, based on the image, a caption corresponding to a second object of the image, receiving an utterance from a user, and determining a plurality of speech recognition results from the utterance based on the question and the caption.

3.

发明公开
MULTIPLE STATE DIGITAL ASSISTANT FOR CONTINUOUS DIALOG 审中-公开

公开(公告)号：US20240055017A1

公开(公告)日：2024-02-15

申请号：US18237362

申请日：2023-08-23

Applicant: Apple Inc.

Inventor： Sreeneel MADDIKA , Ahmed Serag El Din HUSSEN ABDELAZIZ , Chaitanya MANNEMALA , Srikanth VISHNUBHOTLA , Garrett L. WEINBERG

IPC: G10L25/78 , G10L25/51 , G10L21/0208 , G06V40/16

CPC classification number: G10L25/78 , G10L25/51 , G10L21/0208 , G06V40/171 , G10L2021/02082

Abstract: Systems and processes for operating an intelligent automated assistant are provided. For example, a first speech input is received from a user. In response to receiving the first speech input, a response is provided. A first output is provided corresponding to a digital assistant in a first state, and a second speech input is received from the user. A first plurality of values is obtained. Based on the first plurality of values, a first confidence level corresponding to the second speech input is obtained. In accordance with a determination that the first confidence level exceeds a first threshold confidence level, a second output is provided corresponding to the digital assistant in a second state. The second speech input continues to be received.

4.

发明申请
REDUCING DEVICE PROCESSING OF UNINTENDED AUDIO 有权

公开(公告)号：US20220093095A1

公开(公告)日：2022-03-24

申请号：US17123428

申请日：2020-12-16

Applicant: Apple Inc.

Inventor： Pranay DIGHE , Erik MARCHI , Srikanth VISHNUBHOTLA , Sachin KAJAREKAR , Devang K. NAIK

IPC: G10L15/22 , G10L15/26 , G10L15/30

Abstract: An example process includes: receiving an audio stream; determining a plurality of acoustic representations of the audio stream, where each acoustic representation of the plurality of acoustic representations corresponds to a respective frame of the audio stream; obtaining a respective plurality of scores indicating whether each respective frame of the audio stream is directed to an electronic device, where the obtaining includes: determining, using a triggering model operating on the electronic device, for each acoustic representation, a score indicating whether the respective frame of the audio stream is directed to the electronic device; determining, based on the respective plurality of scores, a likelihood that the audio stream is directed to the electronic device; determining whether the likelihood is above or below a threshold; and in response to determining that the likelihood is below the threshold, ceasing to process the audio stream.

Patent Agency Ranking