-
Publication No.: US20250029305A1
Publication date: 2025-01-23
Application No.: US18353693
Filing date: 2023-07-17
Applicant: Meta Platforms, Inc.
Inventor: Abhay Suresh Harpale, Meryem Pinar Donmez Ediz, Kshitiz Malik, Omer Muzaffar, Mridul Gupta, Vijay Manikandan Janakiraman
Abstract: In one embodiment, a method includes receiving a voice input having acoustic features from a first client system associated with a first user, determining emotions associated with the voice input based on one or more of the acoustic features by machine-learning models, determining facial features for a first extended-reality (XR) avatar representing the first user based on the emotions, and sending instructions for rendering the first XR avatar representing the first user to a second client system associated with a second user, wherein the first XR avatar is rendered with the determined facial features.
-
Publication No.: US12125297B2
Publication date: 2024-10-22
Application No.: US17394159
Filing date: 2021-08-04
Applicant: Meta Platforms, Inc.
Inventor: Elizabeth Kelsey Santoro, Denis Savenkov, Koon Hui Geoffrey Goh, Kshitiz Malik, Ruchir Srivastava
CPC classification number: G06V20/63, G06F3/013, G06F40/295, G06N3/045, G06N20/00, G06V20/20, G06V30/10
Abstract: In one embodiment, a method includes accessing visual signals comprising images portraying textual content in a real-world environment associated with a first user from a client system associated with the first user, recognizing the textual content based on machine-learning models and the visual signals, determining a context associated with the first user with respect to the real-world environment based on the visual signals, executing tasks determined based on the textual content and the determined context for the first user, and sending instructions for presenting execution results of the tasks to the first user to the client system.
-
Publication No.: US20220188361A1
Publication date: 2022-06-16
Application No.: US17120013
Filing date: 2020-12-11
Applicant: Meta Platforms, Inc.
Inventor: Fadi Botros, Nanshu Wang, Fan Wang, Meryem Pinar Donmez Ediz, Omer Muzaffar, Kshitiz Malik, Vikas Seshagiri Rao Bhardwaj, Anuj Kumar, Shreyan Bakshi
IPC: G06F16/9032, G10L15/16, G10L15/06, H04L12/58, G02B27/01
Abstract: In one embodiment, a method includes receiving a first input by a user from a client system associated with the user, wherein the first input is in a voice modality, analyzing the first input to generate one or more candidate hypotheses, determining one or more modalities for presenting output generated by the one or more computing systems to the user at the client system, and sending instructions to the client system for presenting one or more suggested auto-completions corresponding to one or more of the candidate hypotheses, respectively, wherein each suggested auto-completion comprises the corresponding candidate hypothesis, and wherein the one or more suggested auto-completions are presented in the one or more determined modalities.
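The flow in this abstract (input, candidate hypotheses, modality selection, suggested auto-completions) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the application: the phrase list `PHRASES`, the prefix-matching rule, the `has_display` flag, and the function names are all hypothetical, and the voice input is represented by already-transcribed text.

```python
from typing import Dict, List

# Hypothetical phrase inventory; a real system would use a trained model.
PHRASES = ["call mom", "call mark", "play music", "play the news"]


def candidate_hypotheses(partial: str, limit: int = 3) -> List[str]:
    """Return up to `limit` phrases that start with the partial input."""
    prefix = partial.strip().lower()
    return [p for p in PHRASES if p.startswith(prefix)][:limit]


def choose_modalities(has_display: bool) -> List[str]:
    """Pick output modalities; the display check is an assumed heuristic."""
    return ["visual", "audio"] if has_display else ["audio"]


def build_instructions(partial: str, has_display: bool) -> Dict[str, List[str]]:
    """Bundle the suggestions with the modalities they should be shown in."""
    return {
        "modalities": choose_modalities(has_display),
        "suggestions": candidate_hypotheses(partial),
    }


if __name__ == "__main__":
    print(build_instructions("Call", has_display=True))
```

Each suggestion here is the candidate hypothesis itself, mirroring the abstract's statement that each suggested auto-completion comprises the corresponding hypothesis.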
-
Publication No.: US20250148811A1
Publication date: 2025-05-08
Application No.: US18922119
Filing date: 2024-10-21
Applicant: Meta Platforms, Inc.
Inventor: Elizabeth Kelsey Santoro, Denis Savenkov, Koon Hui Geoffrey Goh, Ruchir Srivastava, Kshitiz Malik
Abstract: In one embodiment, a method includes accessing visual signals comprising images portraying textual content in a real-world environment associated with a first user from a client system associated with the first user, recognizing the textual content based on machine-learning models and the visual signals, determining a context associated with the first user with respect to the real-world environment based on the visual signals, executing tasks determined based on the textual content and the determined context for the first user, and sending instructions for presenting execution results of the tasks to the first user to the client system.
-
Publication No.: US20230245654A1
Publication date: 2023-08-03
Application No.: US18157413
Filing date: 2023-01-20
Applicant: Meta Platforms, Inc.
Inventor: Akshat Shrivastava, Shrey Desai, Anchit Gupta, Ali Elkahky, Aleksandr Livshits, Alexander Kolmykov-Zotov, Ahmed Aly, Jinsong Yu, Manali Anand Naik, Shuhui Yang, Baiyang Liu, Surya Teja Appini, Tarun Vir Singh, Hang Su, Jiedan Zhu, Fuchun Peng, Shoubhik Bhattacharya, Kshitiz Malik, Shreyan Bakshi, Akash Bharadwaj, Harish Srinivas, Xiao Yang, Zhuangqun Huang, Gil Keren, Duc Hoang Le, Ahmed Kamal Atwa Mohamed, Zhe Liu, Pranab Mohanty
CPC classification number: G10L15/22, G10L15/1815, G10L15/30, G10L15/063, G10L15/197, H04L63/0428, G10L2015/223, G10L2015/086
Abstract: In one embodiment, a system includes an automatic speech recognition (ASR) module, a natural-language understanding (NLU) module, a dialog manager, one or more agents, an arbitrator, a delivery system, one or more processors, and a non-transitory memory coupled to the processors comprising instructions executable by the processors, the processors operable when executing the instructions to receive a user input, process the user input using the ASR module, the NLU module, the dialog manager, one or more of the agents, the arbitrator, and the delivery system, and provide a response to the user input.
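The component chain named in this abstract (ASR module, NLU module, dialog manager, agents, arbitrator, delivery system) can be pictured with a toy sketch. It only shows how such stages might be composed; every function body, the keyword-based intent rule, the agent names, and the scores are hypothetical stand-ins and not the system described in the application. The user input is assumed to be already-transcribed text.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Candidate:
    agent: str
    text: str
    score: float


def asr(user_input: str) -> str:
    """Stand-in for speech recognition: normalizes already-transcribed text."""
    return user_input.strip().lower()


def nlu(text: str) -> Dict[str, str]:
    """Toy intent detection based on a single keyword (an assumption)."""
    intent = "weather" if "weather" in text else "unknown"
    return {"intent": intent, "text": text}


def dialog_manager(parse: Dict[str, str]) -> List[str]:
    """Choose which agents to consult for the parsed input."""
    if parse["intent"] == "weather":
        return ["weather_agent", "fallback_agent"]
    return ["fallback_agent"]


# Hypothetical agents; each returns one scored candidate response.
AGENTS: Dict[str, Callable[[Dict[str, str]], Candidate]] = {
    "weather_agent": lambda p: Candidate("weather_agent", "It is sunny.", 0.9),
    "fallback_agent": lambda p: Candidate(
        "fallback_agent", "Sorry, I did not catch that.", 0.1
    ),
}


def arbitrate(candidates: List[Candidate]) -> Candidate:
    """Pick the highest-scoring candidate response."""
    return max(candidates, key=lambda c: c.score)


def deliver(candidate: Candidate) -> str:
    """Format the chosen response for the user."""
    return f"[{candidate.agent}] {candidate.text}"


def respond(user_input: str) -> str:
    """Run the input through every stage in order."""
    parse = nlu(asr(user_input))
    candidates = [AGENTS[name](parse) for name in dialog_manager(parse)]
    return deliver(arbitrate(candidates))


if __name__ == "__main__":
    print(respond("What is the WEATHER today?"))
```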
-
Publication No.: US20220374645A1
Publication date: 2022-11-24
Application No.: US17394159
Filing date: 2021-08-04
Applicant: Meta Platforms, Inc.
Inventor: Elizabeth Kelsey Santoro, Denis Savenkov, Koon Hui Geoffrey Goh, Kshitiz Malik, Ruchir Srivastava
Abstract: In one embodiment, a method includes accessing visual signals comprising images portraying textual content in a real-world environment associated with a first user from a client system associated with the first user, recognizing the textual content based on machine-learning models and the visual signals, determining a context associated with the first user with respect to the real-world environment based on the visual signals, executing tasks determined based on the textual content and the determined context for the first user, and sending instructions for presenting execution results of the tasks to the first user to the client system.
-
Publication No.: US11501081B1
Publication date: 2022-11-15
Application No.: US16731304
Filing date: 2019-12-31
Applicant: Meta Platforms, Inc.
Inventor: Prince Gill, Honglei Liu, Wenhai Yang, Kshitiz Malik, Nanshu Wang, David Reiss
Abstract: Exemplary embodiments relate to methods, mediums, and systems for moving language models from a server to the client device. Such embodiments may be deployed in an environment where the server is not able to provide modeling services to the clients, such as an end-to-end encrypted (E2EE) environment. Several different techniques are described to address issues of size and complexity reduction, model architecture optimization, model training, battery power reduction, and latency reduction.
-
Publication No.: US11455555B1
Publication date: 2022-09-27
Application No.: US16731321
Filing date: 2019-12-31
Applicant: Meta Platforms, Inc.
Inventor: Prince Gill, Honglei Liu, Wenhai Yang, Kshitiz Malik, Nanshu Wang, David Reiss
Abstract: Exemplary embodiments relate to methods, mediums, and systems for moving language models from a server to the client device. Such embodiments may be deployed in an environment where the server is not able to provide modeling services to the clients, such as an end-to-end encrypted (E2EE) environment. Several different techniques are described to address issues of size and complexity reduction, model architecture optimization, model training, battery power reduction, and latency reduction.