Abstract:
This disclosure relates generally to accelerating the development and deployment of enterprise applications that involve both data-driven and task-driven components in data-driven enterprise information technology (IT) systems. The disclosed system is capable of determining which components of the application may be task-driven and which may be data-driven, using inputs such as the business use case, data sources, and requirements specifications. The system is capable of determining the components that may be developed using the task-driven and data-driven paradigms and enables migration of components from the task-driven paradigm to the data-driven paradigm. The system also trains a reinforcement learning (RL) model to facilitate migration of the identified components from the task-driven paradigm to the data-driven paradigm. The system is further capable of integrating the migrated and existing components to accelerate development and deployment of an integrated IT application.
Abstract:
This disclosure relates generally to methods and systems for generating a textual summary from tabular data. When a textual summary is generated using conventional end-to-end neural-network-based techniques, the numeric data present in the tables is encoded via textual embeddings. However, textual embeddings cannot reliably encode information about numeric concepts and relationships. The methods and systems generate the textual summary from the tabular data by incorporating rank information for the different records present in the tabular data. A two-stage encoder-decoder network is then used to learn correlations between the rank information and the probability of including the records, to obtain the textual summary generation model. The textual summary generation model identifies the content selection, comprising the records present in the tables that are to be included in the textual summary, and generates the textual summary from the identified content selection.
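As an illustration only, the following sketch shows one way rank information could be attached to table records before encoding. The abstract does not specify the ranking scheme; the record layout (`entity`, `column`, `value`), the function name `add_rank_information`, and the choice of descending per-column ranks are all assumptions made for this example.

```python
from collections import defaultdict


def add_rank_information(records):
    """Attach a per-column rank to each record (hypothetical record layout).

    records: list of dicts with keys 'entity', 'column', 'value'.
    Returns a new list of dicts with an added 'rank' key, where rank 1 is
    the largest value within that column. Ties keep their input order.
    """
    by_column = defaultdict(list)
    for rec in records:
        by_column[rec["column"]].append(rec)

    ranked = []
    for recs in by_column.values():
        # sorted() is stable, so equal values are ranked in input order.
        ordered = sorted(recs, key=lambda r: r["value"], reverse=True)
        for position, rec in enumerate(ordered, start=1):
            ranked.append({**rec, "rank": position})
    return ranked
```

A downstream encoder could then embed the `rank` field alongside the textual embedding of the value, so that ordering between records is available explicitly rather than having to be inferred from the embedded numbers.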
Abstract:
The present disclosure discloses a system and method for providing perceptually efficient visualization of rules and exceptions mined from a dataset. Parsing is performed on data attributes associated with the rules. The data attributes may include antecedents, consequents, ranges of the antecedents, and the syntax and statistics of the rules and exceptions. The visualization scheme of the present disclosure presents an overview first, allows semantic zooming, and then shows details on demand. Further, data attributes of the rules are mapped to visual attributes of graphical elements, such as shape, color, and opacity, to create the perceptually efficient visualization of the rules and exceptions. Initially, the visualization shows the main rule, highlighting the associated exceptions and the properties of the exceptions. A semantic zoom slider is provided to allow a user to navigate through the different exception levels. Further, an interface is provided for obtaining additional information associated with the rules and the exceptions.
Abstract:
A method for performing email analytics is described. The method includes extracting emails from a configured email repository. The emails are then grouped into mail groups based on the content similarity of the emails. A network graph is then constructed for each of the mail groups to identify associations among the emails in the mail group, based on header-level analysis of the emails. Thereafter, email analytics is performed on the mail groups by clustering the mail groups into mail clusters based on the temporal progression of emails in the mail groups. Key phrases are then determined based on a content analysis of the emails in the mail groups in the mail clusters. The key phrases are then associated with the network graphs of the mail groups.
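The header-level graph construction could look roughly like the sketch below. The abstract does not say which headers are analyzed; using `Message-ID` and `In-Reply-To` to link replies to their parents is an assumption, and `build_reply_graph` is a hypothetical name.

```python
from email import message_from_string


def build_reply_graph(raw_emails):
    """Build a reply graph for one mail group from raw RFC 5322 messages.

    Returns a dict mapping each message ID to the list of message IDs
    that reply to it directly (assumed linkage via In-Reply-To).
    """
    graph = {}
    for raw in raw_emails:
        msg = message_from_string(raw)
        msg_id = msg["Message-ID"]
        parent_id = msg["In-Reply-To"]  # None when the header is absent
        graph.setdefault(msg_id, [])
        if parent_id is not None:
            graph.setdefault(parent_id, []).append(msg_id)
    return graph
```

Other headers, such as `From`, `To`, or `References`, could be added as further edge types; the sketch keeps only the reply relation to stay short.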
Abstract:
Causality is a crucial paradigm in several domains where observational data is available. The primary goal of Causal Inference (CI) is to uncover cause-effect relationships between entities. Conventional methods face challenges in providing an accurate CI framework due to confounding and selection bias in multiple-treatment scenarios. The present disclosure computes a Propensity Score (PS) from received CI data for a plurality of subjects under test for a treatment. A Generalized Propensity Score (GPS) is computed for a plurality of treatments corresponding to the plurality of subjects by using the PS. Further, a plurality of task batches is created using the GPS and given as input to a deep neural network (DNN) for training. Errors in the factual data and in the balancing representation of the DNN are rectified using a novel loss function. The trained DNN is further used for predicting the counterfactual treatment response corresponding to the factual treatment data.
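For readers unfamiliar with propensity scores, the sketch below estimates the probability of receiving a single binary treatment given covariates, using plain logistic regression fitted by gradient descent. This is a common textbook estimator, not necessarily the one used in the disclosure; the function names, learning rate, and epoch count are illustrative assumptions, and the generalization to multiple treatments (GPS) is not shown.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_propensity_model(covariates, treated, lr=0.1, epochs=2000):
    """Fit logistic regression P(treated = 1 | covariates) by batch gradient descent.

    covariates: list of feature lists; treated: list of 0/1 labels.
    Returns (weights, bias).
    """
    n = len(covariates)
    weights = [0.0] * len(covariates[0])
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * len(weights)
        grad_b = 0.0
        for x, t in zip(covariates, treated):
            p = sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))
            err = p - t  # gradient of the log loss w.r.t. the logit
            for j, xi in enumerate(x):
                grad_w[j] += err * xi
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


def propensity_score(x, weights, bias):
    """Estimated probability that a subject with covariates x is treated."""
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))
```

Scores of this kind are typically used to group or weight subjects so that treated and untreated groups are comparable on their covariates.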
Abstract:
The disclosure herein describes a method and a system for generating hybrid learning techniques. A hybrid learning technique is a learning technique that combines a plurality of techniques, including deep learning, machine learning, and signal processing, to enable a rich feature space representation and classifier construction. The generation of the hybrid learning techniques also considers the influence of domain constraints, which include business requirements and computational constraints. Further, from the plurality of hybrid learning techniques, a single hybrid learning technique is chosen based on a performance matrix, using optimization techniques.
Abstract:
Anomaly detection from time series is one of the key components in automated monitoring of one or more entities. Domain-driven sensor selection for anomaly detection is restricted by knowledge of the important sensors and therefore captures only a certain set of anomalies from the entire set of possible anomalies. Hence, existing anomaly detection approaches are not very effective for multi-dimensional time series. Embodiments of the present disclosure depict a sparse neural network for anomaly detection in multi-dimensional time series (MDTS) corresponding to a plurality of parameters of entities. A reduced-dimensional time series is obtained from the MDTS via at least one feedforward layer by using a dimensionality reduction model. The dimensionality reduction model and a recurrent neural network (RNN) encoder-decoder model are learned simultaneously to obtain a multi-layered sparse neural network. A plurality of error vectors corresponding to at least one time instance of the MDTS is computed to obtain an anomaly score.
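The abstract does not state how the error vectors are turned into an anomaly score. One simple possibility, shown here purely as an assumption, is to standardize each error dimension using statistics estimated on normal data and sum the squared deviations (a Mahalanobis-style distance with a diagonal covariance). The names `fit_error_statistics` and `anomaly_score` are hypothetical.

```python
import statistics


def fit_error_statistics(normal_errors):
    """Estimate per-dimension mean and variance of reconstruction errors.

    normal_errors: list of error vectors computed on data assumed normal.
    """
    dims = list(zip(*normal_errors))
    means = [statistics.fmean(d) for d in dims]
    variances = [statistics.pvariance(d) for d in dims]
    return means, variances


def anomaly_score(error_vector, means, variances, eps=1e-8):
    """Sum of squared standardized errors; larger means more anomalous.

    eps guards against division by zero for constant dimensions.
    """
    return sum(
        (e - m) ** 2 / (v + eps)
        for e, m, v in zip(error_vector, means, variances)
    )
```

A time instance would then be flagged when its score exceeds a threshold chosen on held-out data.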
Abstract:
A method and a system for interpreting a dataset comprising a plurality of items are described herein. The method may include computing a rule set pertaining to the dataset; generating a rule cover; calculating a plurality of distances between the plurality of rule pairs in the rule cover and generating a distance matrix based on the calculated distances; storing the calculated distances; clustering the overlapping rules within the rule cover using the distance matrix; selecting a representative rule from each cluster; determining at least one exception for each representative rule selected from each cluster; and interpreting the dataset using the representative rules and the at least one exception determined for each representative rule.
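The distance matrix, clustering, and representative selection steps can be sketched as follows. The abstract does not define the distance measure, the clustering algorithm, or the selection criterion, so this example assumes Jaccard distance between the sets of items each rule covers, threshold-based single-linkage clustering, and the rule with the largest cover as representative. All function names are hypothetical.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """1 - |a & b| / |a | b| for two sets of covered item indices."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def distance_matrix(covers):
    """Symmetric matrix of pairwise distances between rule covers."""
    n = len(covers)
    matrix = [[0.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        matrix[i][j] = matrix[j][i] = jaccard_distance(covers[i], covers[j])
    return matrix


def cluster_rules(matrix, threshold):
    """Group rules whose distance is <= threshold (single linkage, union-find)."""
    n = len(matrix)
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(n), 2):
        if matrix[i][j] <= threshold:
            parent[find(i)] = find(j)

    clusters = {}
    for i in range(n):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


def select_representatives(clusters, covers):
    """Pick, per cluster, the rule index covering the most items."""
    return [max(cluster, key=lambda i: len(covers[i])) for cluster in clusters]
```

Storing the matrix once, as the abstract describes, avoids recomputing pairwise distances when the clustering threshold is varied.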
Abstract:
Existing semi-supervised and unsupervised approaches for intent discovery require an estimate of the number of new intents present in the user logs. The present disclosure receives labeled utterances from known intents and updates the parameters of a pre-trained language model (PLM). Representation learning and clustering are performed iteratively using labeled and unlabeled utterances from known intents and unlabeled utterances from unknown intents to fine-tune the PLM, and a plurality of clusters is generated. A cluster merger algorithm is executed iteratively on the generated plurality of clusters. A query cluster is obtained by randomly selecting one cluster from the plurality of clusters, and a corresponding plurality of nearest neighbors is obtained based on cosine similarity. A response for merging the query cluster and the corresponding plurality of nearest neighbors is obtained, and a new cluster is created. The corresponding cluster representation is recalculated, and each new cluster is interpreted as an intent.
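One iteration of the cluster merger step might be sketched as below. The abstract does not define the cluster representation; here it is assumed to be the mean of the member utterance embeddings. The function names and the dictionary layout (cluster ID mapped to a list of embedding vectors) are hypothetical, and the query cluster is passed in explicitly rather than drawn at random so the example stays deterministic.

```python
import math


def centroid(vectors):
    """Mean embedding of a cluster (assumed cluster representation)."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def cosine_similarity(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def nearest_clusters(query_id, clusters, k):
    """IDs of the k clusters whose centroids are most similar to the query's."""
    query = centroid(clusters[query_id])
    scored = [
        (cosine_similarity(query, centroid(vectors)), cid)
        for cid, vectors in clusters.items()
        if cid != query_id
    ]
    scored.sort(reverse=True)
    return [cid for _, cid in scored[:k]]


def merge_clusters(clusters, ids_to_merge, new_id):
    """Replace the given clusters with one merged cluster; return its centroid."""
    merged = [vec for cid in ids_to_merge for vec in clusters.pop(cid)]
    clusters[new_id] = merged
    return centroid(merged)
```

In the described procedure, the decision to call `merge_clusters` would come from the obtained response for the query cluster and its neighbors; the sketch leaves that decision to the caller.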
Abstract:
Conversational systems are required to handle more sophisticated interactions than providing factual answers only. Such interactions are handled by resolving abstract anaphoric references in conversational systems, which include antecedent fact references and posterior fact references. The present disclosure resolves abstract anaphoric references in conversational systems using hierarchically stacked neural networks. In the present disclosure, a deep hierarchical maxpool network based model is used to obtain a representation of each utterance received from users and a representation of one or more generated sequences of utterances. The obtained representations are further used to identify contextual dependencies within the one or more generated sequences, which helps in resolving abstract anaphoric references in conversational systems. Further, a response for an incoming sequence of utterances is retrieved based on classification of the incoming sequence of utterances into one or more pre-created responses. The proposed model takes less time to retrain.