-
公开(公告)号:US20250021664A1
公开(公告)日:2025-01-16
申请号:US18900188
申请日:2024-09-27
Applicant: Oracle International Corporation
Inventor: Pallika Haridas Kanani , Virendra J. Marathe , Daniel Wyde Peterson , Anshuman Suri
Abstract: Subject level privacy attack analysis for federated learning may be performed. A request that selects an analysis of one or more inference attacks may be received to determine a presence of data of a subject in a training set of a federated machine learning model. The selected inference attacks may be performed to determine the presence of the data of subject in the training set of the federated machine learning model. Respective success measurements may be generated for the selected inference attacks based on the performance of the selected inference attacks, which may then be provided.
-
公开(公告)号:US20230274004A1
公开(公告)日:2023-08-31
申请号:US17681638
申请日:2022-02-25
Applicant: Oracle International Corporation
Inventor: Pallika Haridas Kanani , Virendra J. Marathe , Daniel Wyde Peterson , Anshuman Suri
CPC classification number: G06F21/577 , G06F21/6245 , G06F2221/033
Abstract: Subject level privacy attack analysis for federated learning may be performed. A request that selects an analysis of one or more inference attacks may be received to determine a presence of data of a subject in a training set of a federated machine learning model. The selected inference attacks may be performed to determine the presence of the data of subject in the training set of the federated machine learning model. Respective success measurements may be generated for the selected inference attacks based on the performance of the selected inference attacks, which may then be provided.
-
公开(公告)号:US20230047092A1
公开(公告)日:2023-02-16
申请号:US17663008
申请日:2022-05-11
Applicant: Oracle International Corporation
Inventor: Virendra Marathe , Pallika Haridas Kanani , Daniel Peterson , Swetasudha Panda
Abstract: User-level privacy preservation is implemented within federated machine learning. An aggregation server may distribute a machine learning model to multiple users each including respective private datasets. Individual users may train the model using the local, private dataset to generate one or more parameter updates. Prior to sending the generated parameter updates to the aggregation server for incorporation into the machine learning model, a user may modify the parameter updates by applying respective noise values to individual ones of the parameter updates to ensure differential privacy for the dataset private to the user. The aggregation server may then receive the respective modified parameter updates from the multiple users and aggregate the updates into a single set of parameter updates to update the machine learning model. The federated machine learning may further include iteratively performing said sending, training, modifying, receiving, aggregating and updating steps.
-
公开(公告)号:US20200372290A1
公开(公告)日:2020-11-26
申请号:US16781955
申请日:2020-02-04
Applicant: Oracle International Corporation
Inventor: Jean-Baptiste Frederic George Tristan , Pallika Haridas Kanani , Michael Louis Wick , Swetasudha Panda , Haniyeh Mahmoudian
Abstract: A Bayesian test of demographic parity for learning to rank may be applied to determine ranking modifications. A fairness control system receiving a ranking of items may apply Bayes factors to determine a likelihood of bias for the ranking. These Bayes factors may include a factor for determining bias in each item and a factor for determining bias in the ranking of the items. An indicator of bias may be generated using the applied Bayes factors and the fairness control system may modify the ranking if the determines likelihood of bias satisfies modification criteria for the ranking.
-
公开(公告)号:US12130929B2
公开(公告)日:2024-10-29
申请号:US17681638
申请日:2022-02-25
Applicant: Oracle International Corporation
Inventor: Pallika Haridas Kanani , Virendra J. Marathe , Daniel Wyde Peterson , Anshuman Suri
CPC classification number: G06F21/577 , G06F21/6245 , G06F2221/033
Abstract: Subject level privacy attack analysis for federated learning may be performed. A request that selects an analysis of one or more inference attacks may be received to determine a presence of data of a subject in a training set of a federated machine learning model. The selected inference attacks may be performed to determine the presence of the data of subject in the training set of the federated machine learning model. Respective success measurements may be generated for the selected inference attacks based on the performance of the selected inference attacks, which may then be provided.
-
公开(公告)号:US20230394374A1
公开(公告)日:2023-12-07
申请号:US17805674
申请日:2022-06-06
Applicant: Oracle International Corporation
Inventor: Virendra J. Marathe , Pallika Haridas Kanani
CPC classification number: G06N20/20 , G06F21/6245
Abstract: Hierarchical gradient averaging is performed as part of training a machine learning model to enforce subject level privacy. A sample of data items from a training data set is identified and respective gradients for the data items are determined. The gradients are then clipped. Each subject's clipped gradients in the sample are averaged. A noise value is added to a sum of the averaged gradients of each of the subjects in the sample. An average gradient for the entire sample is determined from the averaged gradients of the individual subjects with the added noise value. This average gradient for the entire sample is used for determining machine learning model updates.
-
公开(公告)号:US10410139B2
公开(公告)日:2019-09-10
申请号:US15168309
申请日:2016-05-31
Applicant: ORACLE INTERNATIONAL CORPORATION
Inventor: Pallika Haridas Kanani , Michael Louis Wick , Katherine Silverstein
Abstract: A system that performs natural language processing receives a text corpus that includes a plurality of documents and receives a knowledge base. The system generates a set of document n-grams from the text corpus and considers all n-grams as candidate mentions. The system, for each candidate mention, queries the knowledge base and in response retrieves results. From the results retrieved by the queries, the system generates a search space and generates a joint model from the search space.
-
公开(公告)号:US20240394597A1
公开(公告)日:2024-11-28
申请号:US18597771
申请日:2024-03-06
Applicant: Oracle International Corporation
Inventor: Virendra J. Marathe , Pallika Haridas Kanani
IPC: G06N20/00
Abstract: Federated training of a machine learning model with enforcement of subject level privacy is implemented. Respective samples of data items from a training data set are generated at multiple nodes of a federated machine learning system. Noise values are determined for individual ones of the sampled data items according to respective counts of data items of particular subjects and the cumulative counts of the items of the subjects. Respective gradients for the data items are the determined The gradients are then clipped and noise values are applied. Each subject's noisy clipped gradients in the sample are then aggregated. The aggregasted gradients for the entire sample are then used for determining machine learning model updates.
-
公开(公告)号:US20230052231A1
公开(公告)日:2023-02-16
申请号:US17663009
申请日:2022-05-11
Applicant: Oracle International Corporation
Inventor: Virendra J. Marathe , Pallika Haridas Kanani
Abstract: Group-level privacy preservation is implemented within federated machine learning. An aggregation server may distribute a machine learning model to multiple users each including respective private datasets. The private datasets may individually include multiple items associated with a single group. Individual users may train the model using their local, private dataset to generate one or more parameter updates and to determine a count of the largest number of items associated with any single group of a number of groups in the dataset. Parameter updates generated by the individual users may be modified by applying respective noise values to individual ones of the parameter updates according to the respective counts to ensure differential privacy for the groups of the dataset. The aggregation server may aggregate the updates into a single set of parameter updates to update the machine learning model.
-
公开(公告)号:US11443240B2
公开(公告)日:2022-09-13
申请号:US16829433
申请日:2020-03-25
Applicant: Oracle International Corporation
Inventor: Daniel Peterson , Pallika Haridas Kanani , Virendra J. Marathe
Abstract: Herein are techniques for domain adaptation of a machine learning (ML) model. These techniques impose differential privacy onto federated learning by the ML model. In an embodiment, each of many client devices receive, from a server, coefficients of a general ML model. For respective new data point(s), each client device operates as follows. Based on the new data point(s), a respective private ML model is trained. Based on the new data point(s), respective gradients are calculated for the coefficients of the general ML model. Random noise is added to the gradients to generate respective noisy gradients. A combined inference may be generated based on: the private ML model, the general ML model, and one of the new data point(s). The noisy gradients are sent to the server. The server adjusts the general ML model based on the noisy gradients from the client devices. This client/server process may be repeated indefinitely.
-
-
-
-
-
-
-
-
-