Patent search ap:("BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO. Page LTD.") AND inv:"Lihang Liu"

1.

发明授权
Method and apparatus for generating information 有权

公开(公告)号：US11412070B2

公开(公告)日：2022-08-09

申请号：US16891822

申请日：2020-06-03

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Xiaomin Fang , Yaxue Chen , Lihang Liu , Lingke Zeng , Fan Wang , Jingzhou He

IPC: H04L67/50 , G06N5/04 , H04L67/55 , G06Q10/10 , G06N20/00 , G06Q10/04 , G06Q30/02

Abstract: Embodiment of the disclosure provide a method and apparatus for generating information. The method includes: acquiring vectors of a plurality of users, the vector being used to characterize behavior habits of the users; inputting the vectors of the plurality of users and push information pushed by a push system to the plurality of users into a feedback information generating model established in advance, to generate the feedback information of the plurality of users for the push information, wherein the feedback information generating model is used to characterize a corresponding relationship between the vectors, the push information and the feedback information; and generating an evaluation report of the push system based on the feedback information.

2.

发明授权
Method and apparatus for optimizing recommendation system, device and computer storage medium 有权

公开(公告)号：US11836222B2

公开(公告)日：2023-12-05

申请号：US17083704

申请日：2020-10-29

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Lihang Liu , Xiaomin Fang , Fan Wang , Jingzhou He

IPC: G06Q30/00 , G06F18/21 , G06N20/00 , G06F16/9535 , G06Q30/0207 , G05B19/418 , G06Q30/0601

CPC classification number: G06F18/2178 , G06F16/9535 , G06F18/2193 , G06N20/00 , G06Q30/0221 , G06Q30/0225 , G06Q30/0631

Abstract: A method and apparatus for optimizing a recommendation system, a device and a computer storage medium are described, which relates to the technical field of deep learning and intelligent search in artificial intelligence. A specific implementation solution is: taking the recommendation system as an agent, a user as an environment, each recommended content of the recommendation system as an action of the agent, and a long-term behavioral revenue of the user as a reward of the environment; and optimizing to-be-optimized parameters in the recommendation system by reinforcement learning to maximize the reward of the environment. The present disclosure can effectively optimize long-term behavioral revenues of users.

Patent Agency Ranking