Patent search ap:("Maplebear Inc. (dba Instacart)") AND inv:"Nour Alkhatib" Page 1

1.

发明公开
AUTOMATED POLICY FUNCTION ADJUSTMENT USING REINFORCEMENT LEARNING ALGORITHM 审中-公开

公开(公告)号：US20230298080A1

公开(公告)日：2023-09-21

申请号：US18108916

申请日：2023-02-13

Applicant: Maplebear Inc. (dba Instacart)

Inventor： Tilman Drerup , Nour Alkhatib , Jonathan Gu , Amin Akbari , Changyao Chen

IPC: G06Q30/0601 , G06N3/092

CPC classification number: G06Q30/0617 , G06N3/092

Abstract: An online system may receive, from a content provider, a content presentation campaign that includes one or more objectives. The online system may define a set of one or more policy functions that automatically controls the content presentation campaign. A policy function may control one or more criteria in bidding content slots. The online system may monitor a realized outcome of the content presentation campaign. The online system may apply a reinforcement learning algorithm in adjusting the set of policy functions. The reinforcement learning algorithm adjusts one or more parameters in the set of policy functions to reduce a difference between the realized outcome and the desired outcome set by the content provider. The online system generates an adjusted set of policy functions and uses the adjusted set of policy functions in bidding content slots to present one or more content items provided by the content provider.

Patent Agency Ranking