-
公开(公告)号:US20180329998A1
公开(公告)日:2018-11-15
申请号:US15678059
申请日:2017-08-15
Applicant: Apple Inc.
Inventor: Blaise THOMSON , David J. VANDYKE , Gennaro FRAZZINGARO , Silvia FRIAS DELGADO , Thomas Benedict GUNTER , Thomas David VOICE , Thorvaldur Pall HELGASON , Stephen J. YOUNG , Diarmuid Ó SEAGHDHA , Dain KAPLAN
IPC: G06F17/30 , H04N21/466
Abstract: Systems and processes for optimizing dialogue policy decisions for digital assistants using implicit feedback are provided. In an example process, a user utterance is received. Based on a text representation of the user utterance, one or more user intents corresponding to the user utterance are determined. A policy action is selected from a plurality of candidate policy actions based on a belief state for the one or more user intents and a policy model. The policy action is performed, including outputting results of the policy action for presentation. A success score for the policy action is determined based on whether one or more predetermined types of implicit user feedback are detected after performing the policy action. A set of parameter values of the policy model is modified using the determined success score.