Patent search ap:("Meta Platforms Page Inc.") AND inv:"Satwik Kottur"

1.

发明申请
Multimodal State Tracking via Scene Graphs for Assistant Systems 有权

公开(公告)号：US20250148784A1

公开(公告)日：2025-05-08

申请号：US19018019

申请日：2025-01-13

Applicant: Meta Platforms, Inc.

Inventor： Satwik Kottur

IPC: G06V20/00 , G06F16/901 , G06F16/9536 , G06V20/20

Abstract: In one embodiment, a method includes receiving, from a client system associated with a user, a first user request that includes a reference to a target object and one or more of an attribute or a relationship of the target object. Visual data including one or more images portraying the target object may then be accessed, and the reference may be resolved to the target object portrayed in the one or more images. Object information of the target object that corresponds to the referenced attribute or relationship of the first user request may be determined based on a visual analysis of the one or more images. Finally, responsive to receiving the first user request, the object information of the target object may be stored in a multimodal dialog state.

2.

发明授权
Creating digital stories based on memory graphs and multi-turn dialogs for assistant systems 有权

公开(公告)号：US12142298B1

公开(公告)日：2024-11-12

申请号：US18152064

申请日：2023-01-09

Applicant: Meta Platforms, Inc.

Inventor： Satwik Kottur , Seungwhan Moon , Aram Markosyan , Hardik Shah

IPC: G11B27/031 , G06F40/284 , G06F40/35 , G10L15/22 , G10L15/30 , G10L15/18

Abstract: In one embodiment, a method includes receiving a first user request from a first user for generating a media montage from a client system during a dialog session with the first user, generating an initial media montage during the dialog session based on media collections associated with the first user, sending instructions for presenting the initial media montage to the client system during the dialog session, receiving a second user request from the first user from the client system during the dialog session for editing the initial media montage, generating an edited media montage from the initial media montage during the dialog session based on the second user request and a memory graph associated with the first user, and sending instructions for presenting the edited media montage to the client system during the dialog session.

3.

发明公开
Systems and Methods for Providing User Experiences on Smart Assistant Systems 审中-公开

公开(公告)号：US20230409615A1

公开(公告)日：2023-12-21

申请号：US18334235

申请日：2023-06-13

Applicant: Meta Platforms, Inc.

Inventor： Piyush Khemka , Brandon Ramos , Ryan Wolff , Stephen Chee-Ching Wu , Ashley Gustafson , Gabrielle Catherine Moskey , Hyundong Cho , Andrea Madotto , Zhaojiang Lin , Satwik Kottur , Chinnadhurai Sankar , Ashish Vishwanath Shenoy , Jiangning Chen , Rahim Manji , Bing Liu , Xin Liu , Ziyun Zhang

IPC: G06F16/332 , G06F9/451 , G06V20/68 , G10L15/18

CPC classification number: G06F16/3329 , G06F9/451 , G06V20/68 , G10L15/1822

Abstract: In one embodiment, a system includes an automatic speech recognition (ASR) module, a natural-language understanding (NLU) module, a dialog manager, one or more agents, an arbitrator, a delivery system, one or more processors, and a non-transitory memory coupled to the processors comprising instructions executable by the processors, the processors operable when executing the instructions to receive a user input, process the user input using the ASR module, the NLU module, the dialog manager, one or more of the agents, the arbitrator, and the delivery system, and provide a response to the user input.

4.

发明授权
Multimodal state tracking via scene graphs for assistant systems 有权

公开(公告)号：US12198430B1

公开(公告)日：2025-01-14

申请号：US17009542

申请日：2020-09-01

Applicant: Meta Platforms, Inc.

Inventor： Satwik Kottur

IPC: G06V20/00 , G06F16/901 , G06F16/9536 , G06V20/20

Abstract: In one embodiment, a method includes receiving, from a client system associated with a user, a first user request that includes a reference to a target object and one or more of an attribute or a relationship of the target object. Visual data including one or more images portraying the target object may then be accessed, and the reference may be resolved to the target object portrayed in the one or more images. Object information of the target object that corresponds to the referenced attribute or relationship of the first user request may be determined based on a visual analysis of the one or more images. Finally, responsive to receiving the first user request, the object information of the target object may be stored in a multimodal dialog state.

5.

发明公开
Exploration of User Memories in Multi-turn Dialogs for Assistant Systems 审中-公开

公开(公告)号：US20230401170A1

公开(公告)日：2023-12-14

申请号：US17504276

申请日：2021-10-18

Applicant: Meta Platforms, Inc.

Inventor： Satwik Kottur , Seungwhan Moon

IPC: G06F16/11 , G06F40/205 , G06F40/35 , G06F16/95 , G06N20/00

CPC classification number: G06F16/122 , G06F40/205 , H04L51/02 , G06F16/95 , G06N20/00 , G06F40/35

Abstract: In one embodiment, a method includes receiving a first user input from a user from a client system associated with the user, accessing a memory-data store comprising digital memories, wherein each digital memory comprises media content items, temporal information associated with the media content items, and relational information pointing to second digital memories, selecting priming digital memories from the digital memories based on the first user input, proactively selecting related digital memories based on the relational information associated with the priming digital memories, generating a priming response based on the priming digital memories and a proactive suggestion based on the related digital memories, and sending instructions for presenting the priming response and the proactive suggestion to the client system responsive the first user input.

6.

发明申请
Systems and Methods for Providing User Experiences in AR/VR Environments by Assistant Systems 有权

公开(公告)号：US20220358727A1

公开(公告)日：2022-11-10

申请号：US17719148

申请日：2022-04-12

Applicant: Meta Platforms, Inc.

Inventor： Abhay Kumar Gupta , Gregory Francis Mazurek , Benjamin Gordon Jaeger , Lionel Laurent Reyero , Lihan Bin , Noah Cushing , Seungwhan Moon , Satwik Kottur

IPC: G06T19/00 , G06T7/70

Abstract: In one embodiment, a system includes an automatic speech recognition (ASR) module, a natural-language understanding (NLU) module, a dialog manager, one or more agents, an arbitrator, a delivery system, one or more processors, and a non-transitory memory coupled to the processors comprising instructions executable by the processors, the processors operable when executing the instructions to receive a user input, process the user input using the ASR module, the NLU module, the dialog manager, one or more of the agents, the arbitrator, and the delivery system, and provide a response to the user input.

Patent Agency Ranking