Multimodal Dialog State Tracking and Action Prediction for Assistant Systems

    Publication No.: US20210117681A1

    Publication date: 2021-04-22

    Application No.: US17006339

    Filing date: 2020-08-28

    Applicant: Facebook, Inc.

    Abstract: In one embodiment, a method includes receiving, from a client system associated with a user, a user request comprising a reference to a target object, accessing visual data from the client system, wherein the visual data comprises images portraying the target object and one or more additional objects, and wherein attribute information of the target object is recorded in a multimodal dialog state, resolving the reference to the target object based on the attribute information recorded in the multimodal dialog state, determining relational information between the target object and one or more of the additional objects portrayed in the visual data, and sending, to the client system, instructions for presenting a response to the user request, wherein the response comprises the attribute information and the determined relational information.

    Content summarization for assistant systems

    Publication No.: US10977258B1

    Publication date: 2021-04-13

    Application No.: US16247439

    Filing date: 2019-01-14

    Applicant: Facebook, Inc.

    Abstract: In one embodiment, a method includes, by one or more computing systems, receiving, from a client system associated with a user, a request for a summary of user communications from a content source, accessing a plurality of user communications from the content source, identifying a plurality of segments associated with the plurality of user communications, wherein the plurality of segments is associated with a plurality of topics, respectively, calculating, for each segment of the plurality of segments, a user interest score for the segment, selecting one or more of the segments for summarization based on their user interest scores, generating one or more personalized summaries of the one or more selected segments, wherein the personalization of the summary is based on the user profile of the user, and sending, to the client system, instructions to present the personalized summaries to the user responsive to the request.

    Methods and systems for providing notifications to users of a social networking service

    Publication No.: US10412037B2

    Publication date: 2019-09-10

    Application No.: US15439652

    Filing date: 2017-02-22

    Applicant: FACEBOOK, INC.

    Abstract: A method of providing notifications to users of a social networking service includes determining a user intent associated with a post from a user on the social networking service, based at least in part on content of the post. The method further includes generating a first notification of the user intent associated with the post and selecting a plurality of users of the social networking service to receive the first notification. The method further includes providing the first notification to the plurality of users, and after providing the first notification to the plurality of users: (1) receiving one or more responses to the first notification from one or more users of the plurality of users, the one or more responses including information responsive to the first notification, and (2) providing the information responsive to the first notification to the user.

    Generating compositional natural language by assistant systems

    Publication No.: US11042554B1

    Publication date: 2021-06-22

    Application No.: US16176081

    Filing date: 2018-10-31

    Applicant: Facebook, Inc.

    Abstract: In one embodiment, a method includes receiving a user query from a client system associated with a first user, executing tasks via agents which return responses, each response comprising information items, analyzing the responses to determine slots, each slot corresponding to one of the information items, determining compositional sub-goals for each response, wherein each compositional sub-goal indicates a semantic-intent of the respective response, generating compositional fragments by a compositional natural-language generation (NLG) model, each compositional fragment comprising a partial natural-language response, determining a top-level compositional goal, generating a communication content by the compositional NLG model, wherein the communication content comprises a complete natural-language response to the user query, and wherein the complete natural-language response is based on the partial natural-language responses of the compositional fragments, and sending instructions for presenting the communication content to the client system.

    Multiple Wake Words for Systems with Multiple Smart Assistants

    Publication No.: US20210183397A1

    Publication date: 2021-06-17

    Application No.: US17182951

    Filing date: 2021-02-23

    Applicant: Facebook, Inc.

    Abstract: In one embodiment, a method includes by a client system associated with a user, receiving, at the client system, a user input from the user, parsing, by the client system, the first user input to identify a request to execute a function to be performed by an assistant system of several assistant systems associated with the client system, determining whether the user is authorized to access the assistant system by comparing a voiceprint of the user to several voiceprints stored on the client system, sending, from the client system to the assistant system in response to determining the user is authorized to access the assistant system, a request to set an assistant xbot of the assistant system into a listening mode, and receiving, at the client system from the assistant system, an indication that the assistant xbot is in listening mode.

    Multimodal Entity and Coreference Resolution for Assistant Systems

    Publication No.: US20210118442A1

    Publication date: 2021-04-22

    Application No.: US17006377

    Filing date: 2020-08-28

    Applicant: Facebook, Inc.

    Abstract: In one embodiment, a method includes accessing visual data from a client system associated with a user, wherein the visual data comprises images portraying one or more objects, receiving, from the client system, a user request, wherein the user request comprises a coreference to a target object, resolving the coreference to the target object from among the one or more objects, resolving the target object to a specific entity, and sending, to the client system, instructions for providing a response to the user request, wherein the response comprises attribute information about the specific entity.

    Multiple wake words for systems with multiple smart assistants

    Publication No.: US10957329B1

    Publication date: 2021-03-23

    Application No.: US16183650

    Filing date: 2018-11-07

    Applicant: Facebook, Inc.

    Abstract: In one embodiment, a method includes by a client system associated with a user, receiving, at the client system associated with the user, a user input, parsing the user input to identify an n-gram associated with a wake word from a plurality of wake words corresponding to a plurality of assistant systems associated with the client system, wherein each assistant system provides a particular set of functions, determining that the wake word corresponds to a first assistant system of the plurality of assistant systems, wherein the first assistant system provides a first set of functions, sending, to the first assistant system, a request to set an assistant xbot of the first assistant system into a listening mode, and receiving, from the first assistant system, an indication that the assistant xbot is in listening mode responsive to a determination that the user has permission to access the first assistant system.

    Identifying users through conversations for assistant systems

    Publication No.: US10854206B1

    Publication date: 2020-12-01

    Application No.: US16229828

    Filing date: 2018-12-21

    Applicant: Facebook, Inc.

    Abstract: In one embodiment, a method includes receiving from a client system a user request from a first user, determining a necessity for resolving the first user to a known entity to execute one or more tasks associated with the user request based on privacy restrictions associated with the user request, determining a set of candidate entities for the first user based on one or more machine-learning models, each candidate entity being associated with a respective confidence score greater than a threshold score, sending instructions for prompting the first user to select a candidate entity from the set of candidate entities, resolving the first user to a selected candidate entity responsive to receiving a selection from the first user, and executing the one or more tasks associated with the user request based on a user profile associated with the selected candidate entity.
