Multi-agent deep reinforcement learning for dynamically controlling electrical equipment in buildings

    公开(公告)号:US12111620B2

    公开(公告)日:2024-10-08

    申请号:US17029788

    申请日:2020-09-23

    CPC classification number: G05B13/027 F24F11/62 G05B15/02 G06N3/088

    Abstract: Reinforcement Learning agent interacting with a real-world building to determine optimal policy may not be viable due to comfort constraints. Embodiments of the present disclosure provide multi-deep agent RL for dynamically controlling electrical equipment in buildings, wherein a simulation model is generated using design specification of (i) controllable electrical equipment (or subsystem) and (ii) building. Each RL agent is trained using simulation model and deployed in the subsystem. Reward function for each subsystem includes some portion of reward from other subsystem(s). Based on reward function of each RL agent, each RL agent learns an optimal control parameter during execution of RL agent in subsystem. Further, a global optimal control parameter list is generated using the optimal control parameter. The control parameters in the global optimal control parameters list are fine-tuned to improve subsystem's performance. Information on fine-tuning parameters of the subsystem and reward function are used for training RL agents.

    POINT-OF-INTEREST RECOMMENDATION METHOD AND SYSTEM BASED ON BRAIN-INSPIRED SPATIOTEMPORAL PERCEPTUAL REPRESENTATION

    公开(公告)号:US20240330690A1

    公开(公告)日:2024-10-03

    申请号:US17756849

    申请日:2021-09-13

    CPC classification number: G06N3/088 G06N3/045

    Abstract: A POI recommendation method and system based on brain-inspired spatiotemporal perceptual representation is provided. The method includes: constructing a POI context graph structure based on a POI check-in dataset; sampling a check-in sequence context graph, and training a POI check-in sequence embedding model in a brain-inspired spatiotemporal perceptual embedding model by unsupervised learning; sampling a spatial context graph and a spatiotemporal context graph to train a spatiotemporal embedding model in a brain-inspired spatiotemporal perceptual embedding model; combining a POI sequence representation vector and a POI spatiotemporal union representation vector into a POI spatiotemporal perceptual representation vector; training a recurrent neural network recommender based on the POI spatiotemporal perceptual representation vector; and recommending a next POI through the trained recurrent neural network recommender. By mining the spatiotemporal complexity and check-in sequences of POIs, the POI recommendation method and system enable efficient representation of POIs from multiple perspectives.

Patent Agency Ranking