-
Publication No.: US11893036B2
Publication date: 2024-02-06
Application No.: US16517320
Application date: 2019-07-19
Applicant: Ab Initio Technology LLC
Inventor: Andrew Blom, Darren Miller, Marshall A. Isman
IPC: G06F7/00, G06F17/00, G06F16/25, G06F16/901, G06F8/34, H04L67/565
CPC classification number: G06F16/254, G06F8/34, G06F16/258, G06F16/9024, H04L67/565
Abstract: A method for generating an executable application to transform and load data into a structured dataset includes receiving a metadata file that specifies values for parameters for structuring data feeds, received from a networked data source, into a structured database. The metadata file specifies logical rules for transforming the data feeds. The values of the parameters and the logical rules for transforming the plurality of the data feeds are validated to ensure logical consistency for each data feed. Data rules are generated that specify standards for transforming each data feed in accordance with the validated values of the parameters and logical rules. The executable application is generated that is configured to receive source data comprising a data feed from one or more data sources and transform the source data into structured data that satisfies the one or more standards for the structured data record in compliance with the data rules.
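The flow described in this abstract (validate the metadata, derive data rules, then produce a transformer) can be illustrated with a minimal Python sketch. This is not the patented implementation: the metadata layout (`feeds`, `fields`, `key`), the three type names, and the functions `validate_feed`, `generate_rules`, and `build_transform` are all hypothetical and chosen only for illustration.

```python
from datetime import date

# Hypothetical set of field types a metadata file may declare.
ALLOWED_TYPES = {"string", "integer", "date"}

def validate_feed(feed):
    """Return a list of consistency errors for one feed definition."""
    errors = []
    fields = {f["name"]: f for f in feed.get("fields", [])}
    for name, spec in fields.items():
        if spec.get("type") not in ALLOWED_TYPES:
            errors.append(f"{feed['name']}.{name}: unknown type {spec.get('type')!r}")
    key = feed.get("key")
    if key not in fields:
        errors.append(f"{feed['name']}: key {key!r} is not a declared field")
    return errors

def generate_rules(metadata):
    """Validate every feed, then emit per-feed rules as (field, type) pairs."""
    rules = {}
    for feed in metadata["feeds"]:
        errors = validate_feed(feed)
        if errors:
            raise ValueError("; ".join(errors))
        rules[feed["name"]] = [(f["name"], f["type"]) for f in feed["fields"]]
    return rules

def build_transform(rules, feed_name):
    """Stand-in for the 'executable application': a function that structures one record."""
    casts = {"string": str, "integer": int, "date": date.fromisoformat}
    spec = rules[feed_name]

    def transform(record):
        return {name: casts[type_name](record[name]) for name, type_name in spec}

    return transform
```

In this sketch a feed whose key is not among its declared fields fails validation before any rule is generated, which mirrors the abstract's requirement that values and logical rules be checked for consistency first.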
-
Publication No.: US20230112179A1
Publication date: 2023-04-13
Application No.: US17704469
Application date: 2022-03-25
Applicant: Ab Initio Technology LLC
Inventor: John Joyce, Marshall A. Isman, Edward Alan Bach
Abstract: A method for updating a computer program includes receiving a computer program hosted on and configured to be executed by a first computing system. The method includes analyzing the computer program to obtain characterization of a lineage, an architecture, and an operation of the computer program. The lineage includes relationships among elements of the computer program, the architecture includes a characteristic of the data source, the data target, and one or more processors configured to process the data contained in data records, and the operation includes processes that are executed to process the data from the data records. The method includes receiving a characterization of an update to be made to the computer program, in which when the computer program is modified according to the update, at least some of the modified computer program is configured to be hosted on and executed by a second computing system; and modifying the computer program to implement the update to generate the modified computer program.
-
Publication No.: US10936289B2
Publication date: 2021-03-02
Application No.: US15433467
Application date: 2017-02-15
Applicant: Ab Initio Technology LLC
Inventor: Marshall A. Isman, John Joyce
Abstract: A method includes analyzing, by a processor, a first version of a computer program, the analyzing including identifying a first process included in the first version of the computer program, the first process configured to perform an operation on data having a first format; and by a processor, generating a second version of at least a portion of the computer program, including omitting the first process and including in the second version of the at least portion of the computer program one or more second processes configured to perform a second operation on data of a second format different from the first format, wherein the second operation is based on the first operation.
-
Publication No.: US10705807B2
Publication date: 2020-07-07
Application No.: US15881979
Application date: 2018-01-29
Applicant: Ab Initio Technology LLC
Inventor: Marshall A. Isman, John Joyce
Abstract: A method includes analyzing, by a processor, a first version of a computer program. The analyzing includes identifying a first data processing element included in the first version of the computer program. The first data processing element references a first data source external to the first version of the computer program. The method includes generating a data source element that represents a second data source different from the first data source. The method includes generating a second version of the computer program. The second version of the computer program includes the generated data source element and a second data processing element that is based on the first data processing element. In the second version of the computer program, the second data processing element references the generated data source element.
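As a rough illustration of the idea in this abstract, the Python sketch below models a program as a list of processing elements and produces a second version in which elements that referenced one data source now reference a generated replacement. The classes `DataSource` and `ProcessingElement` and the function `retarget` are hypothetical names, not taken from the patent.

```python
from dataclasses import dataclass, replace

@dataclass(frozen=True)
class DataSource:
    name: str
    location: str

@dataclass(frozen=True)
class ProcessingElement:
    name: str
    source: DataSource

def retarget(program, old_source_name, new_source):
    """Return a second version of the program.

    Elements that referenced the old source are copied with the new
    source element; all other elements are reused unchanged.
    """
    return [
        replace(element, source=new_source)
        if element.source.name == old_source_name
        else element
        for element in program
    ]
```

Because the dataclasses are frozen, the first version of the program is left untouched, so both versions can be compared or run side by side.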
-
Publication No.: US20200026711A1
Publication date: 2020-01-23
Application No.: US16517320
Application date: 2019-07-19
Applicant: Ab Initio Technology LLC
Inventor: Andrew Blom, Darren Miller, Marshall A. Isman
IPC: G06F16/25, H04L29/08, G06F16/901, G06F8/34
Abstract: A method for generating an executable application to transform and load data into a structured dataset includes receiving a metadata file that specifies values for parameters for structuring data feeds, received from a networked data source, into a structured database. The metadata file specifies logical rules for transforming the data feeds. The values of the parameters and the logical rules for transforming the plurality of the data feeds are validated to ensure logical consistency for each data feed. Data rules are generated that specify standards for transforming each data feed in accordance with the validated values of the parameters and logical rules. The executable application is generated that is configured to receive source data comprising a data feed from one or more data sources and transform the source data into structured data that satisfies the one or more standards for the structured data record in compliance with the data rules.
-
Publication No.: US20190073289A1
Publication date: 2019-03-07
Application No.: US16181582
Application date: 2018-11-06
Applicant: Ab Initio Technology LLC
Inventor: Marshall A. Isman, Richard A. Epstein
IPC: G06F11/36
CPC classification number: G06F11/36, G06F11/3688
Abstract: A method includes receiving data indicative of a number of times each of one or more rules was executed by a data processing application during processing of one or more records; based on the number of times each of the rules was executed by the data processing application, determining a content criterion for each of one or more particular fields; generating content for each of the particular fields based on the content criterion; and populating each of the particular fields with the generated content.
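One way to read this abstract is as coverage-driven test data generation: rules that were rarely or never executed determine which field values to generate next. The Python sketch below illustrates that reading only. The rule layout (`name`, `field`, `low`, `high`), the `min_executions` threshold, and the midpoint strategy are assumptions, not details from the patent.

```python
def content_criteria(rules, execution_counts, min_executions=1):
    """Map each under-exercised rule's field to the value range that triggers it."""
    criteria = {}
    for rule in rules:
        if execution_counts.get(rule["name"], 0) < min_executions:
            # Keep only the first criterion per field so the result is deterministic.
            criteria.setdefault(rule["field"], (rule["low"], rule["high"]))
    return criteria

def populate(record, criteria):
    """Return a copy of the record with each targeted field set to a value in its range."""
    filled = dict(record)
    for field, (low, high) in criteria.items():
        filled[field] = (low + high) // 2  # midpoint chosen arbitrarily for the sketch
    return filled
```

For example, if a hypothetical `senior_discount` rule covering ages 65 to 120 never fired, `content_criteria` returns an age range of `(65, 120)` and `populate` writes an age inside that range into the test record.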
-
Publication No.: US12242444B2
Publication date: 2025-03-04
Application No.: US18545416
Application date: 2023-12-19
Applicant: Ab Initio Technology LLC
Inventor: John Joyce, Marshall A. Isman, Sandrick Melbouci
Abstract: Methods and systems are configured to determine a semantic meaning for data and generate data processing rules based on the semantic meaning of the data. The semantic meaning includes syntactical or contextual meaning for the data that is determined, for example, by profiling, by the data processing system, values stored in a field included in data records of one or more datasets; applying, by the data processing system, one or more classifiers to the profiled values; identifying, based on applying the one or more classifiers, one or more attributes indicative of a logical or syntactical characteristic for the values of the field, with each of the one or more attributes having a respective confidence level that is based on an output of each of the one or more classifiers. The attributes are associated with the fields and are used for generating data processing rules and processing the data.
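The profile, classify, and attribute-with-confidence steps named in this abstract can be sketched in a few lines of Python. This is an illustrative toy, not the patented system: the regex-based classifiers, the labels `us_zip_code` and `iso_date`, the match-fraction confidence, and the `threshold` parameter are all assumptions.

```python
import re

def profile(values):
    """Summarize the values stored in one field."""
    non_empty = [v for v in values if v]
    return {
        "count": len(values),
        "distinct": len(set(non_empty)),
        "values": non_empty,
    }

def pattern_classifier(label, regex):
    """Build a classifier whose confidence is the fraction of values matching the regex."""
    compiled = re.compile(regex)

    def classify(field_profile):
        values = field_profile["values"]
        if not values:
            return label, 0.0
        hits = sum(1 for v in values if compiled.fullmatch(v))
        return label, hits / len(values)

    return classify

# Hypothetical classifiers; a real system would likely use many more signals.
CLASSIFIERS = [
    pattern_classifier("us_zip_code", r"\d{5}(-\d{4})?"),
    pattern_classifier("iso_date", r"\d{4}-\d{2}-\d{2}"),
]

def discover_attributes(values, threshold=0.8):
    """Return {attribute: confidence} for classifiers at or above the threshold."""
    field_profile = profile(values)
    results = [classify(field_profile) for classify in CLASSIFIERS]
    return {label: confidence for label, confidence in results if confidence >= threshold}
```

The attributes returned for a field could then drive rule generation, for instance attaching a format check to any field labeled `us_zip_code` with high confidence.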
-
Publication No.: US12045286B2
Publication date: 2024-07-23
Application No.: US17858605
Application date: 2022-07-06
Applicant: Ab Initio Technology LLC
Inventor: Jonah Egenolf, Marshall A. Isman, Frederic Wild
IPC: G06F16/901, G06F8/10, G06F8/34, G06F16/178, G06F16/25, G06F16/26
CPC classification number: G06F16/9024, G06F8/10, G06F8/34, G06F16/1794, G06F16/258, G06F16/26
Abstract: A method performed by a computer system including: accessing a specification that specifies a plurality of modules to be implemented by the computer program for processing the one or more values of the one or more fields in the structured data item; transforming the specification into the computer program that implements the plurality of modules, wherein the transforming includes: for each of one or more first modules of the plurality of modules: identifying one or more second modules of the plurality of modules that each receive input that is at least partly based on an output of the first module; and formatting an output data format of the first module such that the first module outputs only one or more values of one or more fields of the structured data item.
-
Publication No.: US20240152495A1
Publication date: 2024-05-09
Application No.: US18545416
Application date: 2023-12-19
Applicant: Ab Initio Technology LLC
Inventor: John Joyce, Marshall A. Isman, Sandrick Melbouci
IPC: G06F16/215, G06F16/22, G06F16/28, G06N5/04, G06N20/00
CPC classification number: G06F16/215, G06F16/2228, G06F16/285, G06N5/04, G06N20/00
Abstract: Methods and systems are configured to determine a semantic meaning for data and generate data processing rules based on the semantic meaning of the data. The semantic meaning includes syntactical or contextual meaning for the data that is determined, for example, by profiling, by the data processing system, values stored in a field included in data records of one or more datasets; applying, by the data processing system, one or more classifiers to the profiled values; identifying, based on applying the one or more classifiers, one or more attributes indicative of a logical or syntactical characteristic for the values of the field, with each of the one or more attributes having a respective confidence level that is based on an output of each of the one or more classifiers. The attributes are associated with the fields and are used for generating data processing rules and processing the data.
-
Publication No.: US20240126734A1
Publication date: 2024-04-18
Application No.: US18399522
Application date: 2023-12-28
Applicant: Ab Initio Technology LLC
Inventor: John Joyce, Marshall A. Isman, Sandrick Melbouci
IPC: G06F16/215, G06F16/22, G06F16/28, G06N5/04, G06N20/00
CPC classification number: G06F16/215, G06F16/2228, G06F16/285, G06N5/04, G06N20/00
Abstract: Methods and systems are configured to determine a semantic meaning for data and generate data processing rules based on the semantic meaning of the data. The semantic meaning includes syntactical or contextual meaning for the data that is determined, for example, by profiling, by the data processing system, values stored in a field included in data records of one or more datasets; applying, by the data processing system, one or more classifiers to the profiled values; identifying, based on applying the one or more classifiers, one or more attributes indicative of a logical or syntactical characteristic for the values of the field, with each of the one or more attributes having a respective confidence level that is based on an output of each of the one or more classifiers. The attributes are associated with the fields and are used for generating data processing rules and processing the data.