AUTOMATED POLICY FUNCTION ADJUSTMENT USING REINFORCEMENT LEARNING ALGORITHM

    Publication number: US20230298080A1

    Publication date: 2023-09-21

    Application number: US18108916

    Filing date: 2023-02-13

    CPC classification number: G06Q30/0617 G06N3/092

    Abstract: An online system may receive, from a content provider, a content presentation campaign that includes one or more objectives. The online system may define a set of one or more policy functions that automatically controls the content presentation campaign. A policy function may control one or more criteria in bidding on content slots. The online system may monitor a realized outcome of the content presentation campaign. The online system may apply a reinforcement learning algorithm in adjusting the set of policy functions. The reinforcement learning algorithm adjusts one or more parameters in the set of policy functions to reduce a difference between the realized outcome and the desired outcome set by the content provider. The online system generates an adjusted set of policy functions and uses the adjusted set of policy functions in bidding on content slots to present one or more content items provided by the content provider.
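
The adjustment loop in this abstract can be illustrated with a very small reinforcement learning setup. The sketch below is not the patented method: it assumes a single policy parameter (a hypothetical bid multiplier), a fixed list of candidate values, and an epsilon-greedy bandit whose reward is the negative gap between the realized and desired outcome. The names `tune_bid_multiplier` and `toy_campaign` are invented for illustration.

```python
import random


def tune_bid_multiplier(simulate_outcome, desired, multipliers,
                        rounds=200, epsilon=0.1, seed=0):
    """Epsilon-greedy bandit over candidate bid multipliers (illustrative only)."""
    rng = random.Random(seed)
    counts = [0] * len(multipliers)
    values = [0.0] * len(multipliers)  # running mean reward per candidate

    for step in range(rounds):
        if step < len(multipliers):
            arm = step  # try every candidate once before exploiting
        elif rng.random() < epsilon:
            arm = rng.randrange(len(multipliers))  # explore
        else:
            arm = max(range(len(multipliers)), key=lambda i: values[i])  # exploit

        realized = simulate_outcome(multipliers[arm])
        reward = -abs(realized - desired)  # smaller gap gives higher reward
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]

    best = max(range(len(multipliers)), key=lambda i: values[i])
    return multipliers[best]


def toy_campaign(multiplier):
    # Assumption: a made-up environment where the outcome scales with the bid.
    return 1000.0 * multiplier


best = tune_bid_multiplier(toy_campaign, desired=1200.0,
                           multipliers=[0.8, 1.0, 1.2, 1.5])
print(best)
```

A real campaign would have noisy outcomes and several interacting parameters, so a production system would likely need a richer state and policy representation than this bandit.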

    TRAINING A MODEL TO PREDICT LIKELIHOODS OF USERS PERFORMING AN ACTION AFTER BEING PRESENTED WITH A CONTENT ITEM

    Publication number: US20220398605A1

    Publication date: 2022-12-15

    Application number: US17343026

    Filing date: 2021-06-09

    Abstract: An online concierge system trains a user interaction model to predict a probability of a user performing an interaction after one or more content items are displayed to the user. This provides a measure of an effect of displaying content items to the user on the user performing one or more interactions. The user interaction model is trained from displaying content items to certain users of the online concierge system and withholding display of the content items to other users of the online concierge system. To train the user interaction model, the user interaction model is applied to labeled examples identifying a user and a value based on interactions the user performed after one or more content items were displayed to the user and interactions the user performed when one or more content items were not displayed.
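
The quantity this model targets, the effect of displaying content on user interactions, can be approximated at group level by comparing users who saw the content with users from whom it was withheld. The sketch below is only that baseline comparison, not the trained model described in the abstract. The record layout `(was_shown, interacted)` and the function name `estimated_uplift` are assumptions.

```python
def estimated_uplift(records):
    """Difference in interaction rate between shown and withheld groups.

    records: iterable of (was_shown: bool, interacted: bool) pairs.
    """
    records = list(records)
    shown = [interacted for was_shown, interacted in records if was_shown]
    withheld = [interacted for was_shown, interacted in records if not was_shown]
    if not shown or not withheld:
        raise ValueError("both a shown group and a withheld group are required")
    # Booleans sum as 0/1, so each ratio is a group interaction rate.
    return sum(shown) / len(shown) - sum(withheld) / len(withheld)


sample = [
    (True, True), (True, True), (True, True), (True, False),       # shown: 3 of 4
    (False, True), (False, False), (False, False), (False, False),  # withheld: 1 of 4
]
uplift = estimated_uplift(sample)
print(uplift)
```

A per-user model would replace these group averages with predictions conditioned on user features.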

    SELECTING AN ITEM FOR INCLUSION IN AN ORDER FROM A USER OF AN ONLINE CONCIERGE SYSTEM FROM A GENERIC ITEM DESCRIPTION RECEIVED FROM THE USER

    Publication number: US20220358560A1

    Publication date: 2022-11-10

    Application number: US17308993

    Filing date: 2021-05-05

    Abstract: An online concierge system maintains a taxonomy associating one or more specific items offered by a warehouse with a generic item description. When the online concierge system receives a generic item description from a user for inclusion in an order, the online concierge system uses the taxonomy to select a set of items associated with the generic item description. Based on probabilities of the user purchasing various items of the set, the online concierge system selects an item of the set for inclusion in the order. For example, the online concierge system selects the item of the set that the user has the maximum probability of purchasing. Subsequently, the online concierge system displays an interface for the user that is prepopulated with information identifying the selected item of the set.
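
The selection step can be sketched as a lookup followed by an argmax. This is a minimal illustration that assumes the taxonomy is a plain dictionary from generic descriptions to item identifiers and that per-user purchase probabilities are already available. The names `select_item`, `taxonomy`, and `purchase_prob`, and the sample data, are hypothetical.

```python
def select_item(generic_description, taxonomy, purchase_prob):
    """Return the candidate item with the highest purchase probability, or None."""
    candidates = taxonomy.get(generic_description, [])
    if not candidates:
        return None
    # Items without a known probability are treated as probability 0.0.
    return max(candidates, key=lambda item: purchase_prob.get(item, 0.0))


taxonomy = {"milk": ["whole_milk_1gal", "oat_milk_64oz", "skim_milk_1gal"]}
purchase_prob = {"whole_milk_1gal": 0.15, "oat_milk_64oz": 0.60}

chosen = select_item("milk", taxonomy, purchase_prob)
print(chosen)
```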

    USER INTERFACE THAT PRE-POPULATES ITEMS IN AN ORDER MODULE FOR A USER OF AN ONLINE CONCIERGE SYSTEM USING A PREDICTION MODEL

    Publication number: US20220335493A1

    Publication date: 2022-10-20

    Application number: US17232651

    Filing date: 2021-04-16

    Abstract: An online concierge system maintains historical orders received from a user that include one or more items. For items included in one or more historical orders, the online concierge system determines an interval between orders including an item, providing an indication of a frequency with which the user orders the item. When the online concierge system receives a request to create an order from the user, in response to an amount of time between a most recently received order including the item and a time when the request was received being within a threshold duration of the interval between orders including the item, the online concierge system selects an item from a category including the item. The selected item may be the item or an alternative item in the category. Subsequently, the online concierge system displays an interface for the user that is prepopulated with information identifying the selected item.
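
The timing check in this abstract can be sketched as follows. The abstract does not say how the interval is computed, so this example assumes it is the mean gap between consecutive orders containing the item. The function names `mean_interval` and `should_prepopulate` are invented for illustration.

```python
from datetime import datetime, timedelta


def mean_interval(order_times):
    """Mean gap between consecutive orders, or None with fewer than two orders."""
    times = sorted(order_times)
    if len(times) < 2:
        return None
    gaps = [later - earlier for earlier, later in zip(times, times[1:])]
    return sum(gaps, timedelta()) / len(gaps)


def should_prepopulate(order_times, request_time, threshold):
    """True when time since the last order is within `threshold` of the interval."""
    interval = mean_interval(order_times)
    if interval is None:
        return False
    elapsed = request_time - max(order_times)
    return abs(elapsed - interval) <= threshold


orders = [datetime(2021, 1, 1), datetime(2021, 1, 8), datetime(2021, 1, 15)]
due = should_prepopulate(orders, datetime(2021, 1, 21), timedelta(days=2))
too_early = should_prepopulate(orders, datetime(2021, 1, 17), timedelta(days=2))
print(due, too_early)
```

Here the item is ordered every 7 days, so a request 6 days after the last order falls within the 2-day threshold, while a request 2 days after it does not.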
