-
Publication No.: US12142298B1
Publication date: 2024-11-12
Application No.: US18152064
Filing date: 2023-01-09
Applicant: Meta Platforms, Inc.
Inventor: Satwik Kottur , Seungwhan Moon , Aram Markosyan , Hardik Shah
IPC: G11B27/031 , G06F40/284 , G06F40/35 , G10L15/22 , G10L15/30 , G10L15/18
Abstract: In one embodiment, a method includes receiving a first user request from a first user for generating a media montage from a client system during a dialog session with the first user, generating an initial media montage during the dialog session based on media collections associated with the first user, sending instructions for presenting the initial media montage to the client system during the dialog session, receiving a second user request from the first user from the client system during the dialog session for editing the initial media montage, generating an edited media montage from the initial media montage during the dialog session based on the second user request and a memory graph associated with the first user, and sending instructions for presenting the edited media montage to the client system during the dialog session.
-
Publication No.: US20220199079A1
Publication date: 2022-06-23
Application No.: US17524598
Filing date: 2021-11-11
Applicant: Meta Platforms, Inc.
Inventor: Michael Robert Hanson , Swati Goel , Leif Haven Martinson , Megha Tiwari , Megha Jhunjhunwala , Ilana Orly Shalowitz , Nicholas Jorge Flores , Kyle Archie , Piyush Khemka , Seungwhan Moon , Kai Sun , Mark Parent , Michael Glueck , Jackson Rushing , Daniel John Wigdor , Stephanie Santosa , Christopher De Paoli
Abstract: In one embodiment, a system includes an automatic speech recognition (ASR) module, a natural-language understanding (NLU) module, a dialog manager, one or more agents, an arbitrator, a delivery system, one or more processors, and a non-transitory memory coupled to the processors comprising instructions executable by the processors, the processors operable when executing the instructions to receive a user input, process the user input using the ASR module, the NLU module, the dialog manager, one or more of the agents, the arbitrator, and the delivery system, and provide a response to the user input.
-
Publication No.: US11704745B2
Publication date: 2023-07-18
Application No.: US17006339
Filing date: 2020-08-28
Applicant: Meta Platforms, Inc.
Inventor: Shivani Poddar , Seungwhan Moon , Paul Anthony Crook , Rajen Subba
IPC: G06F40/30 , G06F3/01 , G06F9/451 , G06F9/48 , G06F9/54 , G06F16/332 , G06F16/9032 , G06F16/9536 , G06F40/205 , G06F40/253 , G06Q50/00 , H04N7/14 , G06N20/00 , G06F40/35 , G06F40/56 , G06F40/242 , G06V20/20 , G06V10/82 , G06V40/16 , G06V20/30 , G06V10/20 , G06V10/764 , G06V20/00 , G06V40/20 , H04L51/222 , H04L51/224 , H04L51/52 , H04L51/212 , H04L67/75 , G06N3/047 , G06N3/045 , G06F18/2321 , G06N3/08 , G06Q10/109 , G10L15/06 , G10L15/08 , G10L15/16 , G10L15/18 , G10L15/22 , G10L15/30 , G10L15/32 , H04L51/18 , H04L67/306 , G06V20/40 , G06F3/16
CPC classification number: G06Q50/01 , G06F3/011 , G06F3/013 , G06F9/453 , G06F9/485 , G06F9/4881 , G06F9/547 , G06F16/3329 , G06F16/90332 , G06F16/9536 , G06F18/2321 , G06F40/205 , G06F40/242 , G06F40/253 , G06F40/30 , G06F40/35 , G06F40/56 , G06N3/045 , G06N3/047 , G06N3/08 , G06N20/00 , G06Q10/109 , G06V10/255 , G06V10/764 , G06V10/82 , G06V20/00 , G06V20/20 , G06V20/30 , G06V40/16 , G06V40/25 , G10L15/063 , G10L15/08 , G10L15/16 , G10L15/1815 , G10L15/1822 , G10L15/22 , G10L15/30 , G10L15/32 , H04L51/18 , H04L51/212 , H04L51/222 , H04L51/224 , H04L51/52 , H04L67/306 , H04L67/75 , H04N7/147 , G06F3/017 , G06F3/167 , G06V20/41 , G06V40/174 , G06V2201/10 , G10L2015/088 , G10L2015/223 , G10L2015/227
Abstract: In one embodiment, a method includes receiving, from a client system associated with a user, a user request comprising a reference to a target object, accessing visual data from the client system, wherein the visual data comprises images portraying the target object and one or more additional objects, and wherein attribute information of the target object is recorded in a multimodal dialog state, resolving the reference to the target object based on the attribute information recorded in the multimodal dialog state, determining relational information between the target object and one or more of the additional objects portrayed in the visual data, and sending, to the client system, instructions for presenting a response to the user request, wherein the response comprises the attribute information and the determined relational information.
-
Publication No.: US20220382989A1
Publication date: 2022-12-01
Application No.: US17878778
Filing date: 2022-08-01
Applicant: Meta Platforms, Inc.
Inventor: Shivani Poddar , Seungwhan Moon , Paul Anthony Crook , Rajen Subba
IPC: G06F40/30 , G06F9/54 , G06F40/205 , G06F40/242 , G06N3/04 , G06N3/08 , G06F16/9536 , G10L15/18 , G10L15/22 , G10L15/30 , G10L15/32 , G06F40/253 , G06N20/00 , G06F3/01 , G06Q50/00 , G06F16/9032 , G06F9/48 , G10L15/08 , H04N7/14 , H04L67/306 , G06V10/20 , G06V20/20 , G06V20/30 , G06V20/40 , G06V40/16 , H04L51/52 , H04L51/212 , H04L67/75 , G06F9/451 , G06F16/332 , G06F40/35 , G06K9/62 , G10L15/06 , G10L15/16
Abstract: In one embodiment, a method includes receiving, at a client system, an audio input, where the audio input comprises a coreference to a target object, accessing visual data from one or more cameras associated with the client system, where the visual data comprises images portraying one or more objects, resolving the coreference to the target object from among the one or more objects, resolving the target object to a specific entity, and providing, at the client system, a response to the audio input, where the response comprises information about the specific entity.
-
Publication No.: US20240331058A1
Publication date: 2024-10-03
Application No.: US18623449
Filing date: 2024-04-01
Applicant: Meta Platforms, Inc.
Inventor: Shivani Poddar , Seungwhan Moon , Paul Anthony Crook , Rajen Subba
IPC: G06Q50/00 , G06F3/01 , G06F3/16 , G06F9/451 , G06F9/48 , G06F9/54 , G06F16/332 , G06F16/9032 , G06F16/9536 , G06F18/2321 , G06F40/205 , G06F40/242 , G06F40/253 , G06F40/295 , G06F40/30 , G06F40/35 , G06F40/56 , G06N3/04 , G06N3/045 , G06N3/047 , G06N3/08 , G06N20/00 , G06Q10/109 , G06Q30/0601 , G06V10/20 , G06V10/764 , G06V10/82 , G06V20/00 , G06V20/20 , G06V20/30 , G06V20/40 , G06V40/16 , G06V40/20 , G10L15/06 , G10L15/08 , G10L15/16 , G10L15/18 , G10L15/22 , G10L15/30 , G10L15/32 , H04L51/18 , H04L51/212 , H04L51/222 , H04L51/224 , H04L51/52 , H04L67/306 , H04L67/75 , H04N7/14
CPC classification number: G06Q50/01 , G06F3/011 , G06F3/013 , G06F9/453 , G06F9/485 , G06F9/4862 , G06F9/4881 , G06F9/547 , G06F16/3329 , G06F16/90332 , G06F16/9536 , G06F18/2321 , G06F40/205 , G06F40/242 , G06F40/253 , G06F40/295 , G06F40/30 , G06F40/35 , G06F40/56 , G06N3/04 , G06N3/045 , G06N3/047 , G06N3/08 , G06N20/00 , G06Q10/109 , G06Q30/0603 , G06Q30/0631 , G06Q30/0633 , G06Q30/0643 , G06V10/255 , G06V10/764 , G06V10/82 , G06V20/00 , G06V20/20 , G06V20/30 , G06V40/16 , G06V40/25 , G10L15/063 , G10L15/08 , G10L15/16 , G10L15/1815 , G10L15/1822 , G10L15/22 , G10L15/30 , G10L15/32 , H04L51/18 , H04L51/212 , H04L51/222 , H04L51/224 , H04L51/52 , H04L67/306 , H04L67/75 , H04N7/147 , G06F3/017 , G06F3/167 , G06V20/41 , G06V40/174 , G06V2201/10 , G10L2015/0631 , G10L2015/088 , G10L2015/223 , G10L2015/227 , G10L2015/228
Abstract: In one embodiment, a method includes receiving, at a client system, an audio input, where the audio input comprises a coreference to a target object, accessing visual data from one or more cameras associated with the client system, where the visual data comprises images portraying one or more objects, resolving the coreference to the target object from among the one or more objects, resolving the target object to a specific entity, and providing, at the client system, a response to the audio input, where the response comprises information about the specific entity.
-
Publication No.: US11966986B2
Publication date: 2024-04-23
Application No.: US17878778
Filing date: 2022-08-01
Applicant: Meta Platforms, Inc.
Inventor: Shivani Poddar , Seungwhan Moon , Paul Anthony Crook , Rajen Subba
IPC: H04L67/306 , G06F3/01 , G06F9/451 , G06F9/48 , G06F9/54 , G06F16/332 , G06F16/9032 , G06F16/9536 , G06F18/2321 , G06F40/205 , G06F40/242 , G06F40/253 , G06F40/30 , G06F40/35 , G06F40/56 , G06N3/045 , G06N3/047 , G06N3/08 , G06N20/00 , G06Q10/109 , G06Q50/00 , G06V10/20 , G06V10/764 , G06V10/82 , G06V20/00 , G06V20/20 , G06V20/30 , G06V40/16 , G06V40/20 , G10L15/06 , G10L15/08 , G10L15/16 , G10L15/18 , G10L15/22 , G10L15/30 , G10L15/32 , H04L51/18 , H04L51/212 , H04L51/222 , H04L51/224 , H04L51/52 , H04L67/75 , H04N7/14 , G06F3/16 , G06V20/40
CPC classification number: G06Q50/01 , G06F3/011 , G06F3/013 , G06F9/453 , G06F9/485 , G06F9/4881 , G06F9/547 , G06F16/3329 , G06F16/90332 , G06F16/9536 , G06F18/2321 , G06F40/205 , G06F40/242 , G06F40/253 , G06F40/30 , G06F40/35 , G06F40/56 , G06N3/045 , G06N3/047 , G06N3/08 , G06N20/00 , G06Q10/109 , G06V10/255 , G06V10/764 , G06V10/82 , G06V20/00 , G06V20/20 , G06V20/30 , G06V40/16 , G06V40/25 , G10L15/063 , G10L15/08 , G10L15/16 , G10L15/1815 , G10L15/1822 , G10L15/22 , G10L15/30 , G10L15/32 , H04L51/18 , H04L51/212 , H04L51/222 , H04L51/224 , H04L51/52 , H04L67/306 , H04L67/75 , H04N7/147 , G06F3/017 , G06F3/167 , G06V20/41 , G06V40/174 , G06V2201/10 , G10L2015/088 , G10L2015/223 , G10L2015/227
Abstract: In one embodiment, a method includes receiving, at a client system, an audio input, where the audio input comprises a coreference to a target object, accessing visual data from one or more cameras associated with the client system, where the visual data comprises images portraying one or more objects, resolving the coreference to the target object from among the one or more objects, resolving the target object to a specific entity, and providing, at the client system, a response to the audio input, where the response comprises information about the specific entity.
-
Publication No.: US11948563B1
Publication date: 2024-04-02
Application No.: US16917664
Filing date: 2020-06-30
Applicant: Meta Platforms, Inc.
Inventor: Xiaohu Liu , Paul Anthony Crook , Zhiguang Wang , Shivani Poddar , Seungwhan Moon , Krishna Mittal , Shubham Khandelwal , Xin Ming Fan , Eun Joon Cho
CPC classification number: G10L15/22 , G06F40/56 , G10L15/063 , G10L15/08 , G10L2015/0631 , G10L2015/223
Abstract: In one embodiment, a method includes receiving a user request from a client system associated with a user, determining that the user request corresponds to a first suspended task, retrieving a first dialog state of the first suspended task from a dialog history associated with the user, generating a summary of the first suspended task based on the first dialog state using a natural-language generating (NLG) module, and sending instructions to the client system for providing the summary of the first suspended task to the user.
-
Publication No.: US11694281B1
Publication date: 2023-07-04
Application No.: US16921665
Filing date: 2020-07-06
Applicant: Meta Platforms, Inc.
Inventor: Honglei Liu , Hao Zhou , Seungwhan Moon , Bing Liu , Yulong Qiu , Daniel Chai , Pararth Paresh Shah , Xiaolei Li , Rajen Subba , Hu Xu
IPC: H04L51/52 , H04L51/18 , G06F16/9032 , G06F40/56 , G06F9/451
CPC classification number: H04L51/52 , G06F9/453 , G06F16/90332 , G06F40/56 , H04L51/18
Abstract: In one embodiment, a method includes receiving a user request from a client system associated with a user, generating a response to the user request which references one or more entities, generating a personalized recommendation based on the user request and the response, wherein the personalized recommendation references one or more of the entities of the response, and sending instructions for presenting the response and the personalized recommendation to the client system.
-
Publication No.: US20230334338A1
Publication date: 2023-10-19
Application No.: US16024310
Filing date: 2018-06-29
Applicant: Meta Platforms, Inc.
Inventor: Seungwhan Moon , Xiao Wu
Abstract: A system for user behavior prediction generates a first series of behavior event elements describing a first set of behaviors of one or more users, upon processing user interactions with an online system. In a first flow, the system generates a first series of time-distributed embeddings of the behavior event elements, and in a second flow parallel with the first flow, the system generates a proposed future embedding of a proposed future behavior of a user at a future time point subsequent to the first set of time points. Using a predictive model (e.g., a recursive neural network), the system transforms components of the first and second flows into an output describing plausibility of occurrence of the proposed future behavior of the user.
-
Publication No.: US11663678B2
Publication date: 2023-05-30
Application No.: US17006339
Filing date: 2020-08-28
Applicant: Meta Platforms, Inc.
Inventor: Shivani Poddar , Seungwhan Moon , Paul Anthony Crook , Rajen Subba
IPC: G06F40/30 , G06F3/01 , G06F9/451 , G06F9/48 , G06F9/54 , G06F16/332 , G06F16/9032 , G06F16/9536 , G06F40/205 , G06F40/253 , G06F40/242 , G06N3/08 , G10L15/18 , G10L15/22 , G10L15/30 , G10L15/32 , G06N20/00 , G06Q50/00 , G10L15/08 , H04N7/14 , H04L67/306 , G06V10/20 , G06V20/20 , G06V20/30 , G06V20/40 , G06V40/16 , H04L51/52 , H04L51/212 , H04L67/75 , G06F40/35 , G10L15/06 , G10L15/16 , G06F18/2321 , G06N3/045 , G06N3/047 , G06F3/16
CPC classification number: G06F40/30 , G06F3/011 , G06F3/013 , G06F9/453 , G06F9/485 , G06F9/4881 , G06F9/547 , G06F16/3329 , G06F16/90332 , G06F16/9536 , G06F18/2321 , G06F40/205 , G06F40/242 , G06F40/253 , G06F40/35 , G06N3/045 , G06N3/047 , G06N3/08 , G06N20/00 , G06Q50/01 , G06V10/255 , G06V20/20 , G06V20/30 , G06V20/41 , G06V40/174 , G10L15/063 , G10L15/08 , G10L15/16 , G10L15/1815 , G10L15/1822 , G10L15/22 , G10L15/30 , G10L15/32 , H04L51/212 , H04L51/52 , H04L67/306 , H04L67/75 , H04N7/147 , G06F3/017 , G06F3/167 , G06V2201/10 , G10L2015/088 , G10L2015/223 , G10L2015/227
Abstract: In one embodiment, a method includes receiving, from a client system associated with a user, a user request comprising a reference to a target object, accessing visual data from the client system, wherein the visual data comprises images portraying the target object and one or more additional objects, and wherein attribute information of the target object is recorded in a multimodal dialog state, resolving the reference to the target object based on the attribute information recorded in the multimodal dialog state, determining relational information between the target object and one or more of the additional objects portrayed in the visual data, and sending, to the client system, instructions for presenting a response to the user request, wherein the response comprises the attribute information and the determined relational information.