SYSTEM AND METHOD FOR ENRICHING AND NORMALIZING DATA

    公开(公告)号:US20240362194A1

    公开(公告)日:2024-10-31

    申请号:US18766437

    申请日:2024-07-08

    Applicant: KPMG LLP

    CPC classification number: G06F16/215 G06F16/2379 G06F16/254

    Abstract: A computer-implemented method for extracting bulk data by generating with a secure agent a transfer request for transfer of the bulk data; generating with a content management unit a bulk data extraction job having a job ID associated therewith in response to the transfer request and then transferring the job ID to the secure agent; generating a programmatic call using the job ID with the secure agent requesting data files including a manifest file; generating with the secure agent a search request for searching the manifest file for selected information; retrieving the manifest file with the content management unit in response to the search request; searching and parsing the manifest file with the content management unit to identify and retrieve the data files corresponding to the job ID; and transferring the data files associated with the job ID with the content management unit to a data extraction unit.

    Real-time intelligent filtering and extraction of mainframe log data

    公开(公告)号:US12130830B1

    公开(公告)日:2024-10-29

    申请号:US16352161

    申请日:2019-03-13

    CPC classification number: G06F16/254 G06F16/258

    Abstract: Systems and methods are provided that extract information from IMS log records to reduce the amount of data transmitted and input to an analysis engine. An example method includes writing IMS log records matching log types identified in an extraction list to a file within an IMS control region of a mainframe computer and outside of the IMS control region, and reading records from the file. For each record read, the method may also include extracting fields of interest from the log record based on fields of interest associated with the log type of the log record in the extraction list and a data-to-field mapping for the log type, converting the fields of interest to a predetermined format based on the data-to-field mapping, and writing at least one field of interest to an output file. The method may also include transmitting the output file to an analytics engine for processing.

    Medical ETL task dispatching method, system and apparatus based on multiple centers

    公开(公告)号:US12119108B2

    公开(公告)日:2024-10-15

    申请号:US18363701

    申请日:2023-08-01

    Applicant: ZHEJIANG LAB

    CPC classification number: G16H40/20 G06F9/4881 G06F16/254 G16H10/60

    Abstract: The present disclosure discloses a medical ETL task dispatching method, system and apparatus based on multiple centers. The method includes following steps: step S1: testing and verifying ETL tasks; step S2: deploying the ETL tasks to a hospital center, and dispatching the ETL tasks to a plurality of executors for execution; step S3: screening an executor set meeting resource demands of ETL tasks to be dispatched; step S4: calculating a current task load of each executor in the executor set; step S5: selecting the executor with a minimum current task load to execute the ETL tasks; and step S6: selecting, by the dispatching machine, the ETL tasks from executor active queues according to a priority for execution. The present disclosure selects the most suitable executor by analyzing a serving index as a task to be dispatched on a current dispatching machine.

    Computing resource autoscaling based on predicted metric behavior

    公开(公告)号:US12118396B2

    公开(公告)日:2024-10-15

    申请号:US17317571

    申请日:2021-05-11

    CPC classification number: G06F9/5027 G06F16/2465 G06F16/254

    Abstract: Methods, systems, apparatuses, and computer-readable storage mediums described herein are configured to automatically allocate or deallocate computing resources based on a prediction of performance metrics behavior. For instance, the historical behavior of compute metrics (or a time series obtained therefor) is analyzed to detect a seasonality (i.e., a seasonal pattern) and a trend therefor. A prediction of the metrics' behavior for a future time frame is determined based on the seasonality and the trend. Based on the prediction, computing resources are allocated or deallocated at or prior to the future time frame occurring. For example, if a prediction is made that a particular metric will increase, additional compute resources are allocated to handle the increase ahead of the predicted metric increase. If a prediction is made that a particular metric will decrease, compute resources are deallocated at the time the metric is predicted to decrease.

Patent Agency Ranking