Active entity resolution model recommendation system

    公开(公告)号:US11720601B2

    公开(公告)日:2023-08-08

    申请号:US16920189

    申请日:2020-07-02

    Applicant: SAP SE

    CPC classification number: G06F16/285 G06N20/00

    Abstract: Systems and methods are provided for accessing master data comprising a plurality of representative data records, where each representative data record represents a cluster of similar data records, and each similar data record has a confidence score indicating a confidence level that the similar data record corresponds to the cluster, and comparing a new data record to each representative data record of the plurality of representative data records using a machine learning model to generate a distance score. The systems and methods further provide for analyzing the cluster of similar data records corresponding to each representative data record in a selected set of representative data records to generate candidate values for the requested data field of the new data record, and generating a candidate score for each of the candidate values using the distance score and the confidence score to use in providing a recommended candidate value.

    ACTIVE ENTITY RESOLUTION MODEL RECOMMENDATION SYSTEM

    公开(公告)号:US20220004567A1

    公开(公告)日:2022-01-06

    申请号:US16920189

    申请日:2020-07-02

    Applicant: SAP SE

    Abstract: Systems and methods are provided for accessing master data comprising a plurality of representative data records, where each representative data record represents a cluster of similar data records, and each similar data record has a confidence score indicating a confidence level that the similar data record corresponds to the cluster, and comparing a new data record to each representative data record of the plurality of representative data records using a machine learning model to generate a distance score. The systems and methods further provide for analyzing the cluster of similar data records corresponding to each representative data record in a selected set of representative data records to generate candidate values for the requested data field of the new data record, and generating a candidate score for each of the candidate values using the distance score and the confidence score to use in providing a recommended candidate value.

Patent Agency Ranking