-
公开(公告)号:US10482874B2
公开(公告)日:2019-11-19
申请号:US15678065
申请日:2017-08-15
Applicant: Apple Inc.
Inventor: Blaise Thomson , Anders Johannsen , Diarmuid Ó Séaghdha , Federico Flego , Luca Simonelli , Stephen J. Young , Thomas David Voice , Thorvaldur Pall Helgason
Abstract: Systems and processes for operating a digital assistant using a hierarchical belief state are disclosed. In an example process, a user utterance of a dialog is received. A belief state for the dialog is determined. The belief state comprises a plurality of dialog slots. Each dialog slot of the plurality of dialog slots includes a respective marginal certainty for a concept or property represented by the respective dialog slot. A first dialog slot of the plurality of dialog slots further includes one or more joint certainties for one or more interpretations arising from the first dialog slot. Based on the marginal certainty of each dialog slot of the plurality of dialog slots and the one or more joint certainties of the first dialog slot, a policy action is selected from a plurality of candidate policy actions that correspond to the belief state. The selected policy action is performed.
-
公开(公告)号:US10810274B2
公开(公告)日:2020-10-20
申请号:US15678059
申请日:2017-08-15
Applicant: Apple Inc.
Inventor: Blaise Thomson , David J. Vandyke , Gennaro Frazzingaro , Silvia Frias Delgado , Thomas Benedict Gunter , Thomas David Voice , Thorvaldur Pall Helgason , Stephen J. Young , Diarmuid Ó Śeaghdha , Dain Kaplan
IPC: G06F16/9535 , H04N21/466 , H04N21/233 , G10L15/18 , G10L15/22 , H04N21/482 , H04N21/414 , H04N21/422 , H04N21/239 , G10L15/08 , G10L13/02 , G10L15/02 , G10L15/30 , G10L13/00
Abstract: Systems and processes for optimizing dialogue policy decisions for digital assistants using implicit feedback are provided. In an example process, a user utterance is received. Based on a text representation of the user utterance, one or more user intents corresponding to the user utterance are determined. A policy action is selected from a plurality of candidate policy actions based on a belief state for the one or more user intents and a policy model. The policy action is performed, including outputting results of the policy action for presentation. A success score for the policy action is determined based on whether one or more predetermined types of implicit user feedback are detected after performing the policy action. A set of parameter values of the policy model is modified using the determined success score.
-