Software version fingerprint generation and identification

    公开(公告)号:US10474456B2

    公开(公告)日:2019-11-12

    申请号:US16415192

    申请日:2019-05-17

    Applicant: SAP SE

    Abstract: Systems and methods are provided for accessing a source code repository comprising a plurality of versions of code, analyzing the plurality of versions of code of the component to compute metrics to identify each version of code, analyzing the metrics to determine a subset of the metrics to use to as a fingerprint definition to identify each version of the code, generating a fingerprint for each version of code using the fingerprint definition, generating a fingerprint matrix with the fingerprint for each version of code for the software component and storing the fingerprint definition and the fingerprint matrix

    Software version fingerprint generation and identification

    公开(公告)号:US10338916B2

    公开(公告)日:2019-07-02

    申请号:US15371678

    申请日:2016-12-07

    Applicant: SAP SE

    Abstract: Systems and methods are provided for accessing a source code repository comprising a plurality of versions of code, analyzing the plurality of versions of code of the component to compute metrics to identify each version of code, analyzing the metrics to determine a subset of the metrics to use to as a fingerprint definition to identify each version of the code, generating a fingerprint for each version of code using the fingerprint definition, generating a fingerprint matrix with the fingerprint for each version of code for the software component and storing the fingerprint definition and the fingerprint matrix.

    Automatic Discriminatory Pattern Detection in Data Sets Using Machine Learning

    公开(公告)号:US20190026345A1

    公开(公告)日:2019-01-24

    申请号:US15655753

    申请日:2017-07-20

    Applicant: SAP SE

    Inventor: Michele Bezzi

    Abstract: A front end receives a request for data specifying a data type. A query handler retrieves data of the data type comprising a plurality of data records from at least one database. The query handler assigns a classification attribute to each data record using a pre-defined classification policy stored in a policy store. A discrimination detection engine statistically evaluates the classification attributes for the data to identify a mutual information metric. The query handler generates a listing of one or more discriminatory attributes and corresponding mutual information metric contributing to discriminatory data patterns based on the mutual information metric.

    SOFTWARE VERSION FINGERPRINT GENERATION AND IDENTIFICATION

    公开(公告)号:US20180157486A1

    公开(公告)日:2018-06-07

    申请号:US15371678

    申请日:2016-12-07

    Applicant: SAP SE

    CPC classification number: G06F8/71 G06F8/36 G06F8/77

    Abstract: Systems and methods are provided for accessing a source code repository comprising a plurality of versions of code, analyzing the plurality of versions of code of the component to compute metrics to identify each version of code, analyzing the metrics to determine a subset of the metrics to use to as a fingerprint definition to identify each version of the code, generating a fingerprint for each version of code using the fingerprint definition, generating a fingerprint matrix with the fingerprint for each version of code for the software component and storing the fingerprint definition and the fingerprint matrix

    Commit conformity verification system

    公开(公告)号:US11972258B2

    公开(公告)日:2024-04-30

    申请号:US17850380

    申请日:2022-06-27

    Applicant: SAP SE

    CPC classification number: G06F8/77

    Abstract: Systems and methods are provided for training a machine learning model to generate a score indicating a level of discrepancy between a commit message and a corresponding code change. The computing system receives a commit comprising a given commit message and a given corresponding code change and analyzes, using the trained machine learning model, the given commit message and given corresponding code change to generate a score indicating the level of discrepancy between the given commit message and the given corresponding code change of the received commit.

    Distributed Vectorized Representations of Source Code Commits

    公开(公告)号:US20220129261A1

    公开(公告)日:2022-04-28

    申请号:US17080520

    申请日:2020-10-26

    Applicant: SAP SE

    Abstract: Distributed vector representations of source code commits, are generated to become part of a data corpus for machine learning (ML) for analyzing source code. The code commit is received, and time information is referenced to split the source code into pre-change source code and post-change source code. The pre-change source code is converted into a first code representation (e.g., based on a graph model), and the post-change source code into a second code representation. A first particle is generated from the first code representation, and a second particle is generated from the second code representation. The first particle and the second particle are compared to create a delta. The delta is transformed into a first commit vector by referencing an embedding matrix to numerically encode the first particle and the second particle. Following classification, the commit vector is stored in a data corpus for performing ML analysis upon source code.

    Automatic discriminatory pattern detection in data sets using machine learning

    公开(公告)号:US10963474B2

    公开(公告)日:2021-03-30

    申请号:US15655753

    申请日:2017-07-20

    Applicant: SAP SE

    Inventor: Michele Bezzi

    Abstract: A front end receives a request for data specifying a data type. A query handler retrieves data of the data type comprising a plurality of data records from at least one database. The query handler assigns a classification attribute to each data record using a pre-defined classification policy stored in a policy store. A discrimination detection engine statistically evaluates the classification attributes for the data to identify a mutual information metric. The query handler generates a listing of one or more discriminatory attributes and corresponding mutual information metric contributing to discriminatory data patterns based on the mutual information metric.

    SOFTWARE VERSION FINGERPRINT GENERATION AND IDENTIFICATION

    公开(公告)号:US20190272170A1

    公开(公告)日:2019-09-05

    申请号:US16415192

    申请日:2019-05-17

    Applicant: SAP SE

    Abstract: Systems and methods are provided for accessing a source code repository comprising a plurality of versions of code, analyzing the plurality of versions of code of the component to compute metrics to identify each version of code, analyzing the metrics to determine a subset of the metrics to use to as a fingerprint definition to identify each version of the code, generating a fingerprint for each version of code using the fingerprint definition, generating a fingerprint matrix with the fingerprint for each version of code for the software component and storing the fingerprint definition and the fingerprint matrix

    COMMIT CONFORMITY VERIFICATION SYSTEM
    9.
    发明公开

    公开(公告)号:US20230418599A1

    公开(公告)日:2023-12-28

    申请号:US17850380

    申请日:2022-06-27

    Applicant: SAP SE

    CPC classification number: G06F8/77

    Abstract: Systems and methods are provided for training a machine learning model to generate a score indicating a level of discrepancy between a commit message and a corresponding code change. The computing system receives a commit comprising a given commit message and a given corresponding code change and analyzes, using the trained machine learning model, the given commit message and given corresponding code change to generate a score indicating the level of discrepancy between the given commit message and the given corresponding code change of the received commit.

    Security-relevant code detection system

    公开(公告)号:US10831899B2

    公开(公告)日:2020-11-10

    申请号:US15978691

    申请日:2018-05-14

    Applicant: SAP SE

    Abstract: Systems and methods are provided for retrieving a set of code changes to source code from a source code repository, analyzing the set of code changes to generate a vector representation of each code change of the set of code changes, analyzing the vector representation of each code change of the set of code changes using a trained security-relevant code detection machine learning model, receiving a prediction from the security-relevant code detection machine learning model representing a probability that each code change of the set of code changes contains security-relevant changes, analyzing the prediction to determine whether the prediction is below or above a predetermined threshold, and generating results based on determining whether the prediction is below or above a predetermined threshold.

Patent Agency Ranking