-
公开(公告)号:US20230298080A1
公开(公告)日:2023-09-21
申请号:US18108916
申请日:2023-02-13
Applicant: Maplebear Inc. (dba Instacart)
Inventor: Tilman Drerup , Nour Alkhatib , Jonathan Gu , Amin Akbari , Changyao Chen
IPC: G06Q30/0601 , G06N3/092
CPC classification number: G06Q30/0617 , G06N3/092
Abstract: An online system may receive, from a content provider, a content presentation campaign that includes one or more objectives. The online system may define a set of one or more policy functions that automatically controls the content presentation campaign. A policy function may control one or more criteria in bidding content slots. The online system may monitor a realized outcome of the content presentation campaign. The online system may apply a reinforcement learning algorithm in adjusting the set of policy functions. The reinforcement learning algorithm adjusts one or more parameters in the set of policy functions to reduce a difference between the realized outcome and the desired outcome set by the content provider. The online system generates an adjusted set of policy functions and uses the adjusted set of policy functions in bidding content slots to present one or more content items provided by the content provider.