COMMUNICATION EFFICIENT MACHINE LEARNING OF DATA ACROSS MULTIPLE SITES

    公开(公告)号:US20200090002A1

    公开(公告)日:2020-03-19

    申请号:US16131150

    申请日:2018-09-14

    Abstract: In one embodiment, a service receives machine learning-based generative models from a plurality of distributed sites. Each generative model is trained locally at a site using unlabeled data observed at that site to generate synthetic unlabeled data that mimics the unlabeled data used to train the generative model. The service receives, from each of the distributed sites, a subset of labeled data observed at that site. The service uses the generative models to generate synthetic unlabeled data. The service trains a global machine learning-based model using the received subsets of labeled data received from the distributed sites and the synthetic unlabeled data generated by the generative models.

Patent Agency Ranking