-
公开(公告)号:US20240265294A1
公开(公告)日:2024-08-08
申请号:US18156915
申请日:2023-01-19
Applicant: Google LLC
Inventor: Badih Ghazi , Pritish Kamath , Shanmugasundaram Ravikumar , Ethan Jacob Leeman , Pasin Manurangsi , Avinash Vaidyanathan Varadarajan , Chiyuan Zhang
IPC: G06N20/00
CPC classification number: G06N20/00
Abstract: An example method is provided for conducting differentially private communication of training data for training a machine-learned model. Initial label data can be obtained that corresponds to feature data. A plurality of label bins can be determined to respectively provide representative values for initial label values assigned to the plurality of label bins. Noised label data can be generated, based on a probability distribution over the plurality of label bins, to correspond to the initial label data, the probability distribution characterized by, for a respective noised label corresponding to a respective initial label of the initial label data, a first probability for returning a representative value of a label bin to which the respective initial label is assigned, and a second probability for returning another value. The noised label data can be communicated for training the machine-learned model.