REINFORCEMENT LEARNING (RL) MODEL FOR OPTIMIZING LONG TERM REVENUE

    公开(公告)号:US20240273575A1

    公开(公告)日:2024-08-15

    申请号:US18108090

    申请日:2023-02-10

    Applicant: ROKU, INC.

    CPC classification number: G06Q30/0269 G06Q30/0261

    Abstract: Disclosed herein are system, apparatus, article of manufacture, method and/or computer program product embodiments, and/or combinations and sub-combinations thereof, for optimizing user experience/engagement and revenue. An example embodiment operates by a computer-implemented method for providing one or more advertisements to a media device. The method includes receiving, by at least one computer processor, a user state associated with a user of the media device, where the user state corresponds to a time step. The method further includes receiving a revenue value associated with the user of the media device, where the revenue value corresponds to the time step. The method also include determining an action associated with the user based on the user state and the revenue value. The action includes one or more parameters associated with the one or more advertisements. The method further includes providing the action to the user.

Patent Agency Ranking