SYSTEMS AND METHODS FOR NORMALIZATION OF MACHINE LEARNING DATASETS

    公开(公告)号:US20240256225A1

    公开(公告)日:2024-08-01

    申请号:US18018836

    申请日:2021-08-06

    申请人: PRENOSIS, INC.

    IPC分类号: G06F7/76 G16H10/40

    CPC分类号: G06F7/76 G16H10/40

    摘要: A system for biomarker data normalization in training datasets. The system includes one or more processors and one or more memory devices storing instructions that configure the one or more processors to perform operations. The operations may include receiving, data files comprising biomarker records (each of the biomarker records comprising a plurality of metadata fields), identifying a template record for normalization, the template record comprising template metadata fields, and generating a normalization vector comprising mismatching biomarker records. The operations may also include identifying adjustment functions for each one of the plurality of metadata fields, modifying data fields of biomarker records in the normalization vector by applying the adjustment functions, and generating a normalized data file comprising the modified biomarker records.

    A TOOL FOR SELECTING RELEVANT FEATURES IN PRECISION DIAGNOSTICS

    公开(公告)号:US20230042330A1

    公开(公告)日:2023-02-09

    申请号:US17791880

    申请日:2021-01-12

    申请人: PRENOSIS, INC.

    IPC分类号: G16H50/50

    摘要: A method for ranking an unmeasured feature for an instance given at least one feature is measured is provided. The method includes imputing a first value to the unmeasured feature in the instance while holding the other remaining unmeasured features constant and evaluating a first outcome with a model using the first value in the instance. The method includes imputing a second value to the unmeasured feature in the dataset while holding the other remaining unmeasured features constant, evaluating a second outcome with the model using the second value in the instance, and determining a statistical parameter with the first outcome and the second outcome. The method also includes assigning the unmeasured feature a ranking corresponding to the determined statistical parameter. A system and a non-transitory, computer readable medium storing instructions to perform the above method are also presented.

    A TIME-SENSITIVE TRIGGER FOR A STREAMING DATA ENVIRONMENT

    公开(公告)号:US20230040185A1

    公开(公告)日:2023-02-09

    申请号:US17791879

    申请日:2021-01-12

    申请人: PRENOSIS, INC.

    IPC分类号: G16H50/30

    摘要: A method for making dynamic risk predictions is provided. The method includes receiving a dataset with a first data field and a second data field. The first data field is populated with a measured value. The method also includes imputing a first predicted value to the second data field, generating a first risk score and a first set of associated metrics based on the measured value and the first predicted value, and imputing a second predicted value to the second data field. The method also includes calculating a statistically derived metric and determining whether the statistically derived metric exceeds a predetermined threshold, wherein a predetermined action is recommended if the statistically derived metric exceeds the predetermined threshold. A system and a non-transitory, computer readable medium storing instructions to cause the system to perform the above method are also provided.