-
公开(公告)号:US20250148784A1
公开(公告)日:2025-05-08
申请号:US19018019
申请日:2025-01-13
Applicant: Meta Platforms, Inc.
Inventor: Satwik Kottur
IPC: G06V20/00 , G06F16/901 , G06F16/9536 , G06V20/20
Abstract: In one embodiment, a method includes receiving, from a client system associated with a user, a first user request that includes a reference to a target object and one or more of an attribute or a relationship of the target object. Visual data including one or more images portraying the target object may then be accessed, and the reference may be resolved to the target object portrayed in the one or more images. Object information of the target object that corresponds to the referenced attribute or relationship of the first user request may be determined based on a visual analysis of the one or more images. Finally, responsive to receiving the first user request, the object information of the target object may be stored in a multimodal dialog state.
-
2.
公开(公告)号:US12142298B1
公开(公告)日:2024-11-12
申请号:US18152064
申请日:2023-01-09
Applicant: Meta Platforms, Inc.
Inventor: Satwik Kottur , Seungwhan Moon , Aram Markosyan , Hardik Shah
IPC: G11B27/031 , G06F40/284 , G06F40/35 , G10L15/22 , G10L15/30 , G10L15/18
Abstract: In one embodiment, a method includes receiving a first user request from a first user for generating a media montage from a client system during a dialog session with the first user, generating an initial media montage during the dialog session based on media collections associated with the first user, sending instructions for presenting the initial media montage to the client system during the dialog session, receiving a second user request from the first user from the client system during the dialog session for editing the initial media montage, generating an edited media montage from the initial media montage during the dialog session based on the second user request and a memory graph associated with the first user, and sending instructions for presenting the edited media montage to the client system during the dialog session.
-
公开(公告)号:US20230409615A1
公开(公告)日:2023-12-21
申请号:US18334235
申请日:2023-06-13
Applicant: Meta Platforms, Inc.
Inventor: Piyush Khemka , Brandon Ramos , Ryan Wolff , Stephen Chee-Ching Wu , Ashley Gustafson , Gabrielle Catherine Moskey , Hyundong Cho , Andrea Madotto , Zhaojiang Lin , Satwik Kottur , Chinnadhurai Sankar , Ashish Vishwanath Shenoy , Jiangning Chen , Rahim Manji , Bing Liu , Xin Liu , Ziyun Zhang
IPC: G06F16/332 , G06F9/451 , G06V20/68 , G10L15/18
CPC classification number: G06F16/3329 , G06F9/451 , G06V20/68 , G10L15/1822
Abstract: In one embodiment, a system includes an automatic speech recognition (ASR) module, a natural-language understanding (NLU) module, a dialog manager, one or more agents, an arbitrator, a delivery system, one or more processors, and a non-transitory memory coupled to the processors comprising instructions executable by the processors, the processors operable when executing the instructions to receive a user input, process the user input using the ASR module, the NLU module, the dialog manager, one or more of the agents, the arbitrator, and the delivery system, and provide a response to the user input.
-
公开(公告)号:US12198430B1
公开(公告)日:2025-01-14
申请号:US17009542
申请日:2020-09-01
Applicant: Meta Platforms, Inc.
Inventor: Satwik Kottur
IPC: G06V20/00 , G06F16/901 , G06F16/9536 , G06V20/20
Abstract: In one embodiment, a method includes receiving, from a client system associated with a user, a first user request that includes a reference to a target object and one or more of an attribute or a relationship of the target object. Visual data including one or more images portraying the target object may then be accessed, and the reference may be resolved to the target object portrayed in the one or more images. Object information of the target object that corresponds to the referenced attribute or relationship of the first user request may be determined based on a visual analysis of the one or more images. Finally, responsive to receiving the first user request, the object information of the target object may be stored in a multimodal dialog state.
-
公开(公告)号:US20230401170A1
公开(公告)日:2023-12-14
申请号:US17504276
申请日:2021-10-18
Applicant: Meta Platforms, Inc.
Inventor: Satwik Kottur , Seungwhan Moon
IPC: G06F16/11 , G06F40/205 , G06F40/35 , G06F16/95 , G06N20/00
CPC classification number: G06F16/122 , G06F40/205 , H04L51/02 , G06F16/95 , G06N20/00 , G06F40/35
Abstract: In one embodiment, a method includes receiving a first user input from a user from a client system associated with the user, accessing a memory-data store comprising digital memories, wherein each digital memory comprises media content items, temporal information associated with the media content items, and relational information pointing to second digital memories, selecting priming digital memories from the digital memories based on the first user input, proactively selecting related digital memories based on the relational information associated with the priming digital memories, generating a priming response based on the priming digital memories and a proactive suggestion based on the related digital memories, and sending instructions for presenting the priming response and the proactive suggestion to the client system responsive the first user input.
-
6.
公开(公告)号:US20220358727A1
公开(公告)日:2022-11-10
申请号:US17719148
申请日:2022-04-12
Applicant: Meta Platforms, Inc.
Inventor: Abhay Kumar Gupta , Gregory Francis Mazurek , Benjamin Gordon Jaeger , Lionel Laurent Reyero , Lihan Bin , Noah Cushing , Seungwhan Moon , Satwik Kottur
Abstract: In one embodiment, a system includes an automatic speech recognition (ASR) module, a natural-language understanding (NLU) module, a dialog manager, one or more agents, an arbitrator, a delivery system, one or more processors, and a non-transitory memory coupled to the processors comprising instructions executable by the processors, the processors operable when executing the instructions to receive a user input, process the user input using the ASR module, the NLU module, the dialog manager, one or more of the agents, the arbitrator, and the delivery system, and provide a response to the user input.
-
-
-
-
-