-
公开(公告)号:US11412070B2
公开(公告)日:2022-08-09
申请号:US16891822
申请日:2020-06-03
Inventor: Xiaomin Fang , Yaxue Chen , Lihang Liu , Lingke Zeng , Fan Wang , Jingzhou He
Abstract: Embodiment of the disclosure provide a method and apparatus for generating information. The method includes: acquiring vectors of a plurality of users, the vector being used to characterize behavior habits of the users; inputting the vectors of the plurality of users and push information pushed by a push system to the plurality of users into a feedback information generating model established in advance, to generate the feedback information of the plurality of users for the push information, wherein the feedback information generating model is used to characterize a corresponding relationship between the vectors, the push information and the feedback information; and generating an evaluation report of the push system based on the feedback information.
-
2.
公开(公告)号:US11836222B2
公开(公告)日:2023-12-05
申请号:US17083704
申请日:2020-10-29
Inventor: Lihang Liu , Xiaomin Fang , Fan Wang , Jingzhou He
IPC: G06Q30/00 , G06F18/21 , G06N20/00 , G06F16/9535 , G06Q30/0207 , G05B19/418 , G06Q30/0601
CPC classification number: G06F18/2178 , G06F16/9535 , G06F18/2193 , G06N20/00 , G06Q30/0221 , G06Q30/0225 , G06Q30/0631
Abstract: A method and apparatus for optimizing a recommendation system, a device and a computer storage medium are described, which relates to the technical field of deep learning and intelligent search in artificial intelligence. A specific implementation solution is: taking the recommendation system as an agent, a user as an environment, each recommended content of the recommendation system as an action of the agent, and a long-term behavioral revenue of the user as a reward of the environment; and optimizing to-be-optimized parameters in the recommendation system by reinforcement learning to maximize the reward of the environment. The present disclosure can effectively optimize long-term behavioral revenues of users.
-