Accelerating development and deployment of enterprise applications in data driven enterprise IT systems

    公开(公告)号:US11449413B2

    公开(公告)日:2022-09-20

    申请号:US17345166

    申请日:2021-06-11

    Abstract: This disclosure relates generally to accelerating development and deployment of enterprise applications where the applications involve both data driven and task driven components in data driven enterprise information technology (IT) systems. The disclosed system is capable of determining components of the application that may be task-driven and/or those components which may be data-driven using inputs such as business use case, data sources and requirements specifications. The system is capable of determining the components that may be developed using task-driven and data-drive paradigms and enables migration of components from the task driven paradigm to the data driven paradigm. Also, the system trains a reinforcement learning (RL) model for facilitating migration of the identified components from the task driven paradigm to the data driven paradigm. The system is further capable of integrating the migrated and existing components to accelerate development and deployment an integrated IT application.

    Method and system for scalable acceleration of data processing pipeline

    公开(公告)号:US12050563B2

    公开(公告)日:2024-07-30

    申请号:US18049363

    申请日:2022-10-25

    CPC classification number: G06F16/211

    Abstract: The present disclosure provides a scalable acceleration of data processing in Machine Learning pipeline which is unavailable in conventional methods. Initially, the system receives a dataset and a data processing code. A plurality of sample datasets are obtained based on the received dataset using a sampling technique. A plurality of performance parameters corresponding to each of the plurality of sample datasets are obtained based on the data processing code using a profiling technique. A plurality of scalable performance parameters corresponding to each of a plurality of larger datasets are predicted based on the plurality of performance parameters and the data processing code using a curve fitting technique. Simultaneously, a plurality of anti-patterns are located in the data processing code using a pattern matching technique. Finally, an accelerated code is recommended based on the plurality of anti-patterns and the predicted plurality of scalable performance parameters using an accelerated code recommendation technique.

    METHOD AND SYSTEM FOR ACCELERATION OF SLOWER DATA PROCESSING CODES IN MACHINE LEARNING PIPELINES

    公开(公告)号:US20240220245A1

    公开(公告)日:2024-07-04

    申请号:US18544697

    申请日:2023-12-19

    CPC classification number: G06F9/30007 G06F9/3877

    Abstract: Data processing code in machine learning pipelines is primarily done using data frame APIs provided by Pandas and similar libraries. Though, these libraries are easy to use, their temporal performance is worse than similar code written using NumPy or other high-performance libraries. Embodiments herein provide a system and method for acceleration of slower data processing code in machine learning pipelines by automatically generating an accelerated data processing code. Initially, a code is received and pre-processed based on a predefined format to get a standardized code. Further, system identifies code statements having operations that to be performed on a data frame, and an ordered list of data frame columns to generate a filtered dictionary code. Further, a data processing representation is generated using filtering dictionary code and ordered list of data frame columns. Finally, an accelerated data processing code is recommended based on the data processing representation.

    METHOD AND SYSTEM FOR GENERATING LABELED DATASET USING A TRAINING DATA RECOMMENDER TECHNIQUE

    公开(公告)号:US20220092354A1

    公开(公告)日:2022-03-24

    申请号:US17471564

    申请日:2021-09-10

    Abstract: This disclosure relates generally to a method and system for generating labelled dataset using a training data recommender technique. Recommender systems face major challenges in handling dynamic data on machine learning paradigms thereby rendering inaccurate unlabeled dataset. The method of the present disclosure is based on a training data recommender technique suitably constructed with a newly defined parameter such as the labelled data prediction threshold to determine the adequate amount of labelled training data required for training the one or more machine learning models. The method processes the received unlabeled dataset for labelling the unlabeled dataset based on a labelled data prediction threshold which is determined using a trained training data recommender technique. This labelling data threshold leads to a significant reduction in training time while performing the one or more machine learning models and thus recommender systems to quickly adapt disruptions thereby decreasing the reduction factor.

Patent Agency Ranking