DISTRIBUTED LABELING FOR SUPERVISED LEARNING

    公开(公告)号:US20240028890A1

    公开(公告)日:2024-01-25

    申请号:US18225656

    申请日:2023-07-24

    Applicant: Apple Inc.

    CPC classification number: G06N3/08 G06N3/04 G06N3/10 G06N3/044 G06N3/045

    Abstract: Embodiments described herein provide a technique to crowdsource labeling of training data for a machine learning model while maintaining the privacy of the data provided by crowdsourcing participants. Client devices can be used to generate proposed labels for a unit of data to be used in a training dataset. One or more privacy mechanisms are used to protect user data when transmitting the data to a server. The server can aggregate the proposed labels and use the most frequently proposed labels for an element as the label for the element when generating training data for the machine learning model. The machine learning model is then trained using the crowdsourced labels to improve the accuracy of the model.

Patent Agency Ranking