Patent search ap:("META PLATFORMS Page INC.") AND inv:"Kshitiz Malik"

1.

发明申请
Rendering XR Avatars Based on Acoustical Features 有权

公开(公告)号：US20250029305A1

公开(公告)日：2025-01-23

申请号：US18353693

申请日：2023-07-17

Applicant: Meta Platforms, Inc.

Inventor： Abhay Suresh Harpale , Meryem Pinar Donmez Ediz , Kshitiz Malik , Omer Muzaffar , Mridul Gupta , Vijay Manikandan Janakiraman

IPC: G06T13/20 , G06T13/40 , G10L25/63

Abstract: In one embodiment, a method includes receiving a voice input having acoustic features from a first client system associated with a first user, determining emotions associated with the voice input based on one or more of the acoustic features by machine-learning models, determining facial features for a first extended-reality (XR) avatar representing the first user based on the emotions, and sending instructions for rendering the first XR avatar representing the first user to a second client system associated with a second user, wherein the first XR avatar is rendered with the determined facial features.

2.

发明授权
Task execution based on real-world text detection for assistant systems 有权

公开(公告)号：US12125297B2

公开(公告)日：2024-10-22

申请号：US17394159

申请日：2021-08-04

Applicant: Meta Platforms, Inc.

Inventor： Elizabeth Kelsey Santoro , Denis Savenkov , Koon Hui Geoffrey Goh , Kshitiz Malik , Ruchir Srivastava

IPC: G06K9/32 , G06F3/01 , G06F40/295 , G06N3/045 , G06N20/00 , G06V20/20 , G06V20/62 , G06V30/10

CPC classification number: G06V20/63 , G06F3/013 , G06F40/295 , G06N3/045 , G06N20/00 , G06V20/20 , G06V30/10

Abstract: In one embodiment, a method includes accessing visual signals comprising images portraying textual content in a real-world environment associated with a first user from a client system associated with the first user, recognizing the textual content based on machine-learning models and the visual signals, determining a context associated with the first user with respect to the real-world environment based on the visual signals, executing tasks determined based on the textual content and the determined context for the first user, and sending instructions for presenting execution results of the tasks to the first user to the client system.

3.

发明申请
Voice-based Auto-Completions and Auto-Responses for Assistant Systems 有权

公开(公告)号：US20220188361A1

公开(公告)日：2022-06-16

申请号：US17120013

申请日：2020-12-11

Applicant: Meta Platforms, Inc.

Inventor： Fadi Botros , Nanshu Wang , Fan Wang , Meryem Pinar Donmez Ediz , Omer Muzaffar , Kshitiz Malik , Vikas Seshagiri Rao Bhardwaj , Anuj Kumar , Shreyan Bakshi

IPC: G06F16/9032 , G10L15/16 , G10L15/06 , H04L12/58 , G02B27/01

Abstract: In one embodiment, a method includes receiving a first input by a user from a client system associated with the user, wherein the first input is in a voice modality, analyzing the first input to generate one or more candidate hypotheses, determining one or more modalities for presenting output generated by the one or more computing systems to the user at the client system, and sending instructions to the client system for presenting one or more suggested auto-completions corresponding to one or more of the candidate hypotheses, respectively, wherein each suggested auto-completion comprises the corresponding candidate hypothesis, and wherein the one or more suggested auto-completions are presented in the one or more determined modalities.

4.

发明申请
Task Execution Based on Real-world Text Detection for Assistant Systems 有权

公开(公告)号：US20250148811A1

公开(公告)日：2025-05-08

申请号：US18922119

申请日：2024-10-21

Applicant: Meta Platforms, Inc.

Inventor： Elizabeth Kelsey Santoro , Denis Savenkov , Koon Hui Geoffrey Goh , Ruchir Srivastava , Kshitiz Malik

IPC: G06V20/62 , G06F3/01 , G06F40/295 , G06N3/045 , G06N20/00 , G06V20/20 , G06V30/10

Abstract: In one embodiment, a method includes accessing visual signals comprising images portraying textual content in a real-world environment associated with a first user from a client system associated with the first user, recognizing the textual content based on machine-learning models and the visual signals, determining a context associated with the first user with respect to the real-world environment based on the visual signals, executing tasks determined based on the textual content and the determined context for the first user, and sending instructions for presenting execution results of the tasks to the first user to the client system.

5.

发明公开
Systems and Methods for Implementing Smart Assistant Systems 审中-公开

公开(公告)号：US20230245654A1

公开(公告)日：2023-08-03

申请号：US18157413

申请日：2023-01-20

Applicant: Meta Platforms, Inc.

Inventor： Akshat Shrivastava , Shrey Desai , Anchit Gupta , Ali Elkahky , Aleksandr Livshits , Alexander Kolmykov-Zotov , Ahmed Aly , Jinsong Yu , Manali Anand Naik , Shuhui Yang , Baiyang Liu , Surya Teja Appini , Tarun Vir Singh , Hang Su , Jiedan Zhu , Fuchun Peng , Shoubhik Bhattacharya , Kshitiz Malik , Shreyan Bakshi , Akash Bharadwaj , Harish Srinivas , Xiao Yang , Zhuangqun Huang , Gil Keren , Duc Hoang Le , Ahmed Kamal Atwa Mohamed , Zhe Liu , Pranab Mohanty

IPC: G10L15/22 , G10L15/18 , G10L15/30 , G10L15/06 , G10L15/197 , H04L9/40

CPC classification number: G10L15/22 , G10L15/1815 , G10L15/30 , G10L15/063 , G10L15/197 , H04L63/0428 , G10L2015/223 , G10L2015/086

Abstract: In one embodiment, a system includes an automatic speech recognition (ASR) module, a natural-language understanding (NLU) module, a dialog manager, one or more agents, an arbitrator, a delivery system, one or more processors, and a non-transitory memory coupled to the processors comprising instructions executable by the processors, the processors operable when executing the instructions to receive a user input, process the user input using the ASR module, the NLU module, the dialog manager, one or more of the agents, the arbitrator, and the delivery system, and provide a response to the user input.

6.

发明申请
Task Execution Based on Real-world Text Detection for Assistant Systems 有权

公开(公告)号：US20220374645A1

公开(公告)日：2022-11-24

申请号：US17394159

申请日：2021-08-04

Applicant: Meta Platforms, Inc.

Inventor： Elizabeth Kelsey Santoro , Denis Savenkov , Koon Hui Geoffrey Goh , Kshitiz Malik , Ruchir Srivastava

IPC: G06K9/32 , G06N20/00 , G06N3/04 , G06F3/01 , G06K9/00 , G06F40/295

Abstract: In one embodiment, a method includes accessing visual signals comprising images portraying textual content in a real-world environment associated with a first user from a client system associated with the first user, recognizing the textual content based on machine-learning models and the visual signals, determining a context associated with the first user with respect to the real-world environment based on the visual signals, executing tasks determined based on the textual content and the determined context for the first user, and sending instructions for presenting execution results of the tasks to the first user to the client system.

7.

发明授权
Methods, mediums, and systems for providing a model for an end-user device 有权

公开(公告)号：US11501081B1

公开(公告)日：2022-11-15

申请号：US16731304

申请日：2019-12-31

Applicant: META PLATFORMS, INC.

Inventor： Prince Gill , Honglei Liu , Wenhai Yang , Kshitiz Malik , Nanshu Wang , David Reiss

IPC: G06F40/30 , H04L9/40 , G06N5/04 , G06N20/00 , H04L51/52

Abstract: Exemplary embodiments relate to methods, mediums, and systems for moving language models from a server to the client device. Such embodiments may be deployed in an environment where the server is not able to provide modeling services to the clients, such as an end-to-end encrypted (E2EE) environment. Several different techniques are described to address issues of size and complexity reduction, model architecture optimization, model training, battery power reduction, and latency reduction.

8.

发明授权
Methods, mediums, and systems for training a model 有权

公开(公告)号：US11455555B1

公开(公告)日：2022-09-27

申请号：US16731321

申请日：2019-12-31

Applicant: META PLATFORMS, INC.

Inventor： Prince Gill , Honglei Liu , Wenhai Yang , Kshitiz Malik , Nanshu Wang , David Reiss

IPC: G06N5/04 , G06N20/00

Abstract: Exemplary embodiments relate to methods, mediums, and systems for moving language models from a server to the client device. Such embodiments may be deployed in an environment where the server is not able to provide modeling services to the clients, such as an end-to-end encrypted (E2EE) environment. Several different techniques are described to address issues of size and complexity reduction, model architecture optimization, model training, battery power reduction, and latency reduction.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification