System and method for automatically validating data stored in multiple locations

    公开(公告)号:US12026172B2

    公开(公告)日:2024-07-02

    申请号:US17122307

    申请日:2020-12-15

    CPC classification number: G06F16/254 G06F16/2282 G06F16/2365 G06F16/258

    Abstract: An apparatus includes a memory and a processor. The processor receives a request to validate target data against source data. The processor identifies a disagreement between an element of source data and a corresponding element of target data. The disagreement indicates that the ordered set of alphanumeric characters representing the source element is different from the ordered set of alphanumeric characters representing the target element. In response to identifying the disagreement, the processor determines that the source element does not match the target element, where the source element includes a first value stored according to a first format, and the target element includes a second value stored according to a second format. In response, the processor extracts an element of the source data corresponding to the source element, transforms this element from the first format to the second format, and loads it into the target data.

    Data storage system conflict management

    公开(公告)号:US12026141B2

    公开(公告)日:2024-07-02

    申请号:US18169733

    申请日:2023-02-15

    Applicant: Adobe Inc.

    Abstract: Data storage system conflict management techniques are described. In one example, data is sent by client devices to both a repository system and a search index system of a data storage system. Each of the client devices maintains, locally, records of changes to data of the repository system, e.g., through use of an invertible Bloom lookup table. The records are used as a basis by the client devices to send reports to an arbiter system and used to detect conflicts When detected, a conflict request is communicated to client devices involved in the conflict as well as to the repository system and responses used to generate a correction that is communicated to update the repository system and search index system.

    Time-to-run column for database management systems

    公开(公告)号:US12019619B2

    公开(公告)日:2024-06-25

    申请号:US18094481

    申请日:2023-01-09

    Applicant: Red Hat, Inc.

    CPC classification number: G06F16/2365 G06F16/221 G06F16/2282 G06F16/2358

    Abstract: An apparatus and method of performing a time-based operation on a row in a database table includes receiving a function to be applied to a particular row in a table within the database management system. The method may also include, in response to receiving the function, adding a new column to the table. A new cell in the new column includes a timer indicating a time that the processing device will call the function, and the new cell references the function. The method may also include, upon expiration of the timer, determining that the database management system is available for execution of the function, and executing the function on the particular row in the table.

    Data quality enforcement as a service invoked using descriptive language

    公开(公告)号:US12019617B2

    公开(公告)日:2024-06-25

    申请号:US17828835

    申请日:2022-05-31

    CPC classification number: G06F16/2365 G06F16/215 G06F16/2358 G06F16/28

    Abstract: A computer-implemented method executed using a first networked computer and comprising receiving a digitally stored workflow pattern that specifies at least an input data source, a data transformation process, an output data destination, a data quality assertion and a data quality source; the workflow pattern comprising a structured plurality of name declarations and value specifications that are human readable and machine readable; the data transformation process specified in the workflow pattern including one or more references to processing logic, a processing logic source outside the workflow pattern at which the processing logic is stored, and one or more available process engines that are capable of processing the processing logic; machine parsing the workflow pattern and dividing the workflow pattern into a plurality of execution units, each execution unit being associated with a particular process engine among the one or more available process engines; accessing the input data source specified in the workflow pattern and loading at least a portion of data from the input data source into main memory; accessing the processing logic source at a second networked computer and loading a copy of the processing logic specified in the workflow pattern from the second networked computer; for each of the execution units, selecting a particular process engine among the plurality of available process engines, calling the particular process engine, programmatically providing access to the portion of data and the copy of the processing logic, and receiving output data that has been created by the particular process engine after transforming the portion of data; translating the data quality assertion into a data quality request and automatically forwarding the data quality request to the data quality source at a third computer, the data quality request comprising the data quality assertion, and receiving a response to the request that specifies whether the output data conforms to the data quality assertion.

Patent Agency Ranking