Recommending sequences of content with bootstrapped reinforcement learning

Invention Grant

US11429892B2 Recommending sequences of content with bootstrapped reinforcement learning 有权

Please log in to see more content

Patent Title: Recommending sequences of content with bootstrapped reinforcement learning
Application No.: US15934531

Application Date: 2018-03-23
Publication No.: US11429892B2

Publication Date: 2022-08-30
Inventor: Sorathan Chaturapruek , Georgios Theocharous , Kent Andrew Edmonds
Applicant: ADOBE INC.
Applicant Address: US CA San Jose
Assignee: ADOBE INC.
Current Assignee: ADOBE INC.
Current Assignee Address: US CA San Jose
Agency: Shook, Hardy & Bacon LLP
Main IPC: G06K9/62
IPC: G06K9/62 ; G06N20/00 ; G06N5/04 ; G06F9/451 ; G06T11/60

Recommending sequences of content with bootstrapped reinforcement learning

Abstract:

Systems and methods provide a recommendation system for recommending sequential content. The training of a reinforcement learning (RL) agent is bootstrapped from passive data. The RL agent of the sequential recommendations system is trained using the passive data over a number of epochs involving interactions between the sequential recommendation system and user devices. At each epoch, available active data from previous epochs is obtained, and transition probabilities are generated from the passive data and at least one parameter derived from the currently available active data. Recommended content is selected based on a current state and the generated transition probabilities, and the active data is updated from the current epoch based on the recommended content and a resulting new state. A clustering approach can also be employed when deriving parameters from active data to balance model expressiveness and data sparsity.

Public/Granted literature

US20190295004A1 RECOMMENDING SEQUENCES OF CONTENT WITH BOOTSTRAPPED REINFORCEMENT LEARNING Public/Granted day:2019-09-26

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06K	图形数据读取（图像或视频识别或理解G06V）；数据的呈现；记录载体；处理记录载体
G06K9/00	识别模式的方法或装置（图形读取或将机械参数模式（例如力或存在）转换为电信号的方法或装置 G06K11/00）（图像或视频识别或理解 G06V）（语音识别 G10L15/00 )
G06K9/62	.应用电子设备进行识别的方法或装置