EFFICIENT SCALING OF PARTITIONED NEURAL NETWORK INFERENCE

    公开(公告)号:US20250094823A1

    公开(公告)日:2025-03-20

    申请号:US18368801

    申请日:2023-09-15

    Abstract: In one implementation, a controller determines performance of a partitioned neural network. The controller identifies, based on the performance, a particular partition of the partitioned neural network as a bottleneck. The controller configures a first device to execute a replica of the particular partition. The controller configures a multiplexer that provides an output of the particular partition or the replica of the particular partition as input to a downstream partition of the partitioned neural network.

    GROUP BIAS MITIGATION IN FEDERATED LEARNING SYSTEMS

    公开(公告)号:US20250036961A1

    公开(公告)日:2025-01-30

    申请号:US18227535

    申请日:2023-07-28

    Abstract: In one embodiment, a supervisory device in a federated learning system generates an aggregated model that aggregates a plurality of machine learning models trained by trainer nodes in a federated learning system during a training round. The supervisory device computes an accuracy loss metric for the aggregated model. The supervisory device also computes a fairness loss metric for the aggregated model based on fairness-related metrics associated with the plurality of machine learning models trained by the trainer nodes. The supervisory device initiates an additional training round during which the trainer nodes retrain their machine learning models for aggregation by the apparatus, in accordance with a constrained optimization problem that seeks to optimize a tradeoff between accuracy and fairness associated with the aggregated model.

Patent Agency Ranking