Abstract:
Provided is a computer-implemented method, system, and computer program product for generating a goal-oriented dialogue from a grounding document. A processor may analyze a corpus of text. The processor may identify, based on the analyzing, one or more semantic structures that can be used to simulate a dialogue. The processor may generate, based on the identifying, a simulated dialogue, the simulated dialogue including one or more utterances from a simulated agent and one or more utterances from a simulated user to form a dialogue flow.
Abstract:
A method of providing real-time speech analysis to a user includes capturing a speech input, performing a real-time recognition of the speech input including converting the speech input to a text, analyzing the recognized speech input to identify an error in a voice of the user, the analyzing including comparing a voice of a correct text generated by an automated speech generation system with the captured speed input, and processing the text to extract a context dialog prompt.
Abstract:
Techniques for generating a clarification to distinguish among retrieved content in interactive systems are provided. In one aspect, a method for generating a clarification prompt in an interactive system includes: obtaining a training dataset for generating the clarification prompt from existing question-answering datasets by modifying original queries in the existing question-answering datasets to obtain training examples of under-specified queries; and training a machine learning model using the training dataset how to select latent differentiating factors in content candidates obtained from an under-specified query from a user and, based on the latent differentiating factors, generate the clarification prompt to clarify an intent of the user.
Abstract:
A computer system, product, and method are provided. The computer system includes an artificial intelligence (AI) platform operatively coupled to a processor. The AI platform includes tools in the form of a machine learning model (MLM) manager, a metric manager, and a training manager. The MLM manager accesses a plurality of pre-trained source MLMs, and inputs a plurality of data objects of a test dataset into each of the source MLMs. The test dataset includes the plurality of data objects associated with respective labels. For each source MLM, associated labels are generated from the inputted data objects and a similarity metric is calculated. The MLM manager selects a base MLM to be used for transfer learning from the plurality of source MLMs based upon the calculated similarity metric. The training manager trains the selected base MLM with a target dataset for the target domain.
Abstract:
A method of providing real-time speech analysis for a user includes capturing a speech input, performing a real-time recognition of the speech input including converting the speech input to a text using an automatic speech recognition component, analyzing the recognized speech input, by a processing unit of a computer in a speech recognition and analyzing system, to identify an error in the user's speech, and by comparing a voice of a correct text generated by a speech generation and analyzing system with the captured speech input, and providing a real-time correction to the user based on a result of the comparing the voice of the correct text with the captured speech input. The comparing the voice of the correct text with the captured speech input includes comparing a standard pronunciation of the correct text with a pronunciation of the user in the captured speech input to identify the error.
Abstract:
Mechanisms are provided for performing a cognitive operation. An input graph is received having a plurality of first nodes, where subsets of first nodes are coupled to one another via first edges and each first edge has an associated weight. A blinking graph model is generated based on the graph, where blink rates are associated with second edges and are calculated based on weights of corresponding first edges in the input graph. The blink rate specifies a fraction of time a corresponding second edge is determined to be present in the blinking graph model. A relatedness metric is calculated for a target node relative to a node of interest based on the blink rates of the second edges. The relatedness metric indicates a degree of relatedness of the target node to the node of interest. A cognitive operation is then performed based on the relatedness metric.
Abstract:
Mechanisms are provided for performing a cognitive operation. An input graph is received having a plurality of first nodes, where subsets of first nodes are coupled to one another via first edges and each first edge has an associated weight. A blinking graph model is generated based on the graph, where blink rates are associated with second edges and are calculated based on weights of corresponding first edges in the input graph. The blink rate specifies a fraction of time a corresponding second edge is determined to be present in the blinking graph model. A relatedness metric is calculated for a target node relative to a node of interest based on the blink rates of the second edges. The relatedness metric indicates a degree of relatedness of the target node to the node of interest. A cognitive operation is then performed based on the relatedness metric.
Abstract:
From metadata of a corpus of natural language text documents, a relativity matrix is constructed, a row-column intersection in the relativity matrix corresponding to a relationship between two instances of a type of metadata. An encoder model is trained, generating a trained encoder model, to compute an embedding corresponding to a token of a natural language text document within the corpus and the relativity matrix, the encoder model comprising a first encoder layer, the first encoder layer comprising a token embedding portion, a relativity embedding portion, a token self-attention portion, a metadata self-attention portion, and a fusion portion, the training comprising adjusting a set of parameters of the encoder model.
Abstract:
Techniques facilitating interpretation of deep neural model based dialogue agents are provided. In one example, a computer-implemented method comprises extracting, by a device operatively coupled to a processor, features from a dialogue model independently from the dialogue model, the features comprising input features provided to the dialogue model and output features produced via the dialogue model in response to the input features, resulting in extracted features; and analyzing, by the device, a dialogue context associated with the extracted features by identifying pairwise interactions between respective ones of the extracted features.
Abstract:
A computer system, product, and method are provided. The computer system includes an artificial intelligence (AI) platform operatively coupled to a processor. The AI platform includes tools in the form of a machine learning model (MLM) manager, a metric manager, and a training manager. The MLM manager accesses a plurality of pre-trained source MLMs, and inputs a plurality of data objects of a test dataset into each of the source MLMs. The test dataset includes the plurality of data objects associated with respective labels. For each source MLM, associated labels are generated from the inputted data objects and a similarity metric is calculated. The MLM manager selects a base MLM to be used for transfer learning from the plurality of source MLMs based upon the calculated similarity metric. The training manager trains the selected base MLM with a target dataset for the target domain.