System, Method, and Computer Program Product for Debiasing Embedding Vectors of Machine Learning Models

    公开(公告)号:US20240160854A1

    公开(公告)日:2024-05-16

    申请号:US18280792

    申请日:2022-03-30

    CPC classification number: G06F40/40

    Abstract: Described are a system, method, and computer program product for debiasing embedding vectors of machine learning models. The method includes receiving embedding vectors and generating two clusters thereof. The method includes determining a first mean vector of the first cluster and a second mean vector of the second cluster. The method includes determining a bias associated with each of a plurality of first candidate vectors and replacing the first mean vector with a first candidate vector based on the bias. The method includes determining a bias associated with each of a plurality of second candidate vectors and replacing the second mean vector with a second candidate vector based on the bias. The method includes repeatedly replacing the first and second mean vectors until an extremum of the bias score is reached, and debiasing the embedding vectors by linear projection using a direction defined by the first and second mean vectors.

Patent Agency Ranking