Publication number: US20240160943A1
Publication date: 2024-05-16
Application number: US18054009
Filing date: 2022-11-09
Applicant: Salesforce.com, inc.
Inventors: Soham Phade, Stefano Ermon, Stephan Zheng
IPC: G06N3/092
CPC classification number: G06N3/092
Abstract: Embodiments described herein provide systems and methods for solving and applying a multi-agent decision process. A system performs an iterative process, where at each iterative step, the system determines policies for a plurality of agents that optimize respective reward values based on a plurality of costs and on characteristics of the plurality of agents. The system simulates the multi-agent decision process using the determined policies, thereby generating respective reward values and aggregated resource contribution values. The system increments or decrements the plurality of costs based on constraints and the aggregated resource contribution values. The system updates a final reward value based on the respective reward values. The system updates a final plurality of costs based on the plurality of costs. After performing the iterative step for a predetermined number of iterations, the system outputs the final reward value and the final plurality of costs.
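
The iterative step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method; it is a toy instance under loudly stated assumptions: a single shared resource with one cost (the abstract describes a plurality), a hypothetical quadratic reward `value * x - 0.5 * x**2 - cost * x`, a closed-form best response standing in for policy determination, a fixed step size for the cost update, and running averages as the "final" reward and cost. The names `best_response`, `solve`, `values`, `capacity`, and `step` are all illustrative and do not appear in the source.

```python
from typing import List, Tuple


def best_response(value: float, cost: float) -> float:
    """Hypothetical agent policy: the usage x in [0, 1] that maximizes
    value * x - 0.5 * x**2 - cost * x (assumed reward, not from the source)."""
    return min(1.0, max(0.0, value - cost))


def solve(values: List[float], capacity: float,
          iterations: int = 200, step: float = 0.05) -> Tuple[float, float]:
    """Run the iterative loop for a fixed number of iterations.

    values   -- per-agent characteristic (assumed: a scalar valuation)
    capacity -- constraint on the aggregated resource contribution
    Returns (final_reward, final_cost) as running averages (an assumption).
    """
    cost = 0.0
    final_reward = 0.0
    final_cost = 0.0
    for t in range(1, iterations + 1):
        # 1. Determine each agent's policy from the current cost and its characteristic.
        usages = [best_response(v, cost) for v in values]

        # 2. Simulate: compute per-agent rewards and the aggregated contribution.
        rewards = [v * x - 0.5 * x * x - cost * x for v, x in zip(values, usages)]
        total_usage = sum(usages)

        # 3. Increment the cost if the constraint is violated, otherwise decrement it
        #    (never below zero).
        if total_usage > capacity:
            cost += step
        else:
            cost = max(0.0, cost - step)

        # 4. Update the final reward and final cost as running averages.
        final_reward += (sum(rewards) - final_reward) / t
        final_cost += (cost - final_cost) / t
    return final_reward, final_cost


if __name__ == "__main__":
    reward, price = solve(values=[0.9, 0.6, 0.3], capacity=1.0)
    print(f"final reward: {reward:.3f}, final cost: {price:.3f}")
```

With a fixed step size the cost in this toy oscillates around the level at which total usage meets the capacity, which is why the sketch reports averages rather than the last iterate. Extending it to several resources would mean keeping one cost per resource and updating each against its own constraint.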