-
公开(公告)号:US20250037018A1
公开(公告)日:2025-01-30
申请号:US18658919
申请日:2024-05-08
Applicant: Apple Inc.
Inventor: Minsik CHO , Keivan ALIZADEH VAHID , Qichen FU , Saurabh ADYA , Carlo Eduardo Cabanero DEL MUNDO , Mohammad RASTEGARI , Devang K. NAIK , Peter ZATLOUKAL
IPC: G06N20/00
Abstract: The subject technology provides memory-efficient differentiable weight clustering for large language model compression. An apparatus determines a tensor including an attention map between learned weights of a trained machine learning model and corresponding centroids. The apparatus also determines a compressed attention table and a plurality of index lists during compression of the trained machine learning model based on an uniquification of the attention map and sharding of an associated index list. The apparatus determines whether the tensor exists at a destination device during compression of the trained machine learning model using a marshaling layer. The apparatus refrains from copying the tensor to the destination device when the tensor exists at the destination device, or copies the tensor to the destination device when the tensor does not exist at the destination device. The apparatus deploys a compressed machine learning model based on the compression of the trained machine learning model.
-
公开(公告)号:US20240144590A1
公开(公告)日:2024-05-02
申请号:US18279752
申请日:2022-02-25
Applicant: APPLE INC.
Inventor: Alkeshkumar M. PATEL , Saurabh ADYA , Shruti BHARGAVA , Angela BLECHSCHMIDT , Vikas R. NAIR , Alexander S. POLICHRONIADIS , Kendal SANDRIDGE , Daniel ULBRICHT , Hong YU
CPC classification number: G06T17/00 , G06V10/25 , G10L15/22 , G06V2201/07 , G10L2015/223
Abstract: In an exemplary process, a speech input including a referenced virtual object is received. Based on the speech input, a first reference set is obtained. The first reference set is then compared to a plurality of second reference sets. Based on the comparison, a second reference set from the plurality of second reference sets is obtained. The second reference set may be identified based on a matching score between the first reference set and the second reference set. An object is then identified based on the second reference set. Based on the identified object, the reference virtual object is displayed.
-
公开(公告)号:US20230042224A1
公开(公告)日:2023-02-09
申请号:US17588887
申请日:2022-01-31
Applicant: Apple Inc.
Inventor: Alkeshkumar PATEL , Saurabh ADYA , Karan DARYANANI , Myra LUKENS , Aswath MANOHARAN
Abstract: Systems and processes for operating an intelligent automated assistant are provided. An example process includes receiving an utterance including a user request and determining whether at least a portion of the user request is ambiguous. If at least the portion of the user request is ambiguous then a set of context data based on the ambiguous portion of the user request is determined, metadata is extracted from the context data and a response to the user request is determined based on the extracted metadata.
-
公开(公告)号:US20240371378A1
公开(公告)日:2024-11-07
申请号:US18777427
申请日:2024-07-18
Applicant: Apple Inc.
Inventor: Saurabh ADYA , Sameer BADASKAR , Akanksha BINDAL , Ahmed S. HUSSEN ABDELAZIZ , Xiaochuan NIU , Alkeshkumar M. PATEL , Srikanth VISHNUBHOTLA
Abstract: Systems and processes for operating a digital assistant are provided. An example method for processing an image include receiving an image, generating, based on the image, a question corresponding to a first object in the image, generating, based on the image, a caption corresponding to a second object of the image, receiving an utterance from a user, and determining a plurality of speech recognition results from the utterance based on the question and the caption.
-
公开(公告)号:US20230368812A1
公开(公告)日:2023-11-16
申请号:US17952038
申请日:2022-09-23
Applicant: Apple Inc.
Inventor: Eric MARCHI , Ognjen RUDOVIC , Sachin S. KAJAREKAR , Saurabh ADYA , Barry-John THEOBALD , Ahmed S. HUSSEN ABDELAZIZ
CPC classification number: G10L25/78 , G06T7/70 , G06V40/161 , G06T2207/30201
Abstract: An example process includes: receiving a speech input representing a user utterance; determining, based on a textual representation of the speech input, a first score corresponding to a type of the user utterance; determining, based on the textual representation of the speech input, a second score representing a correspondence between the user utterance and a domain recognized by a digital assistant; determining, based on the first score and the second score, whether the speech input is intended for the digital assistant; in accordance with a determination that the speech input is intended for the digital assistant: initiating, by the digital assistant, a task based on the speech input; and providing an output indicative of the initiated task.
-
公开(公告)号:US20230401795A1
公开(公告)日:2023-12-14
申请号:US18202849
申请日:2023-05-26
Applicant: Apple Inc.
Inventor: Lynn I. STREJA , Saurabh ADYA , Keith P. AVERY , Karan M. DARYANANI , Stephen O. LEMAY , Myra C. LUKENS , Sreeneel K. MADDIKA , Chaitanya MANNEMALA , Aswath MANOHARAN , Pedro MARI , Jay MOON , Abhishek RAWAT , Garrett L. WEINBERG
CPC classification number: G06T19/003 , G06T13/40 , G10L15/22 , G06F3/013 , G06F3/017 , G06T2219/2012 , G06T2219/2016 , G06T2219/2021 , G10L2015/223
Abstract: An example process includes: while displaying a portion of an extended reality (XR) environment representing a current field of view of a user: detecting a user gaze at a first object displayed in the XR environment, where the first object is persistent in the current field of view of the XR environment; in response to detecting the user gaze at the first object, expanding the first object into a list of objects including a second object representing a digital assistant; detecting a user gaze at the second object; in accordance with detecting the user gaze at the second object, displaying a first animation of the second object indicating that a digital assistant session is initiated; receiving a first audio input from the user; and displaying a second animation of the second object indicating that the digital assistant is actively listening to the user.
-
公开(公告)号:US20230368783A1
公开(公告)日:2023-11-16
申请号:US17952005
申请日:2022-09-23
Applicant: Apple Inc.
Inventor: Eric MARCHI , Ognjen RUDOVIC , Pranay DIGHE , Sachin S. KAJAREKAR , Saurabh ADYA , Barry-John THEOBALD , Seyedmahdad MIRSAMADI , Ahmed S. HUSSEN ABDELAZIZ
IPC: G10L15/197 , G10L15/22 , G10L15/16
CPC classification number: G10L15/197 , G10L15/16 , G10L15/22 , G10L2015/088
Abstract: An example process includes: receiving a speech input representing a user utterance; determining, based on a textual representation of the speech input, a first score corresponding to a type of the user utterance; determining, based on the textual representation of the speech input, a second score representing a correspondence between the user utterance and a domain recognized by a digital assistant; determining, based on the first score and the second score, whether the speech input is intended for the digital assistant; in accordance with a determination that the speech input is intended for the digital assistant: initiating, by the digital assistant, a task based on the speech input; and providing an output indicative of the initiated task.
-
公开(公告)号:US20230199297A1
公开(公告)日:2023-06-22
申请号:US18112371
申请日:2023-02-21
Applicant: Apple Inc.
Inventor: Saurabh ADYA , Myra C. LUKENS , Aswath MANOHARAN , Alkeshkumar M. PATEL
CPC classification number: H04N23/631 , G10L15/1815 , G06F3/013 , H04N23/633
Abstract: Systems and processes for operating a digital assistant are provided. An example process for determining a response includes, at an electronic device having one or more processors and memory, receiving a spoken input including a request, performing a semantic analysis on the spoken input, determining, based on the semantic analysis, a likelihood that the electronic device requires additional contextual data to satisfy the request, and in accordance with the determined likelihood exceeding a threshold, enabling a camera of the electronic device and determining a response to the request based on data captured by the camera of the electronic device.
-
公开(公告)号:US20230046337A1
公开(公告)日:2023-02-16
申请号:US17402328
申请日:2021-08-13
Applicant: Apple Inc.
Inventor: Hong YU , Saurabh ADYA , Shruti BHARGAVA , Myra LUKENS , Jianpeng CHENG , Lin LI , Alkeshkumar PATEL , Dhivya PIRAVIPERUMAL , Stephen Guy PULMAN
Abstract: Systems and processes for operating a digital assistant are provided. An example process for performing a task includes, at an electronic device having one or more processors and memory, receiving a spoken input including a request, receiving an image input including a plurality of objects, selecting a reference resolution module of a plurality of reference resolution modules based on the request and the image input, determining, with the selected reference resolution module, whether the request references a first object of the plurality of objects based on at least the spoken input, and in accordance with a determination that the request references the first object of the plurality of objects, determining a response to the request including information about the first object.
-
-
-
-
-
-
-
-