Abstract:
Methods and a natural language processor for processing a natural language query are provided. The processor includes a classifier, a rule-based pre-processor, a rule-based post-processor, a named entity recognizer, and an output module. The method involves receiving a text representation of the natural language query, pre-processing the text representation, applying a classification statistical model to the text representation when pre-processing fails, applying a post-processing rule, and performing named entity recognition.
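The control flow described in this abstract (rules first, statistical classification only when the rules fail, then a post-processing rule and named entity recognition) can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the rule patterns, the intent labels, the `classify_statistical` stub, and the capitalized-word entity heuristic are all hypothetical stand-ins.

```python
import re

# Hypothetical pre-processing rules: regex pattern -> intent label.
PRE_RULES = [
    (re.compile(r"^\s*(call|dial)\b", re.IGNORECASE), "phone.call"),
    (re.compile(r"^\s*remind me\b", re.IGNORECASE), "reminder.create"),
]


def pre_process(text):
    """Return an intent if a rule matches, else None (pre-processing 'fails')."""
    for pattern, intent in PRE_RULES:
        if pattern.search(text):
            return intent
    return None


def classify_statistical(text):
    """Stand-in for a trained statistical classifier (assumption: keyword check)."""
    return "weather.query" if "weather" in text.lower() else "unknown"


def post_process(text, intent):
    """Hypothetical post-processing rule that refines the chosen intent."""
    if intent == "weather.query" and "tomorrow" in text.lower():
        return "weather.forecast"
    return intent


def recognize_entities(text):
    """Toy entity recognizer: capitalized tokens after the first word."""
    tokens = text.split()
    return [t.strip(".,?!") for t in tokens[1:] if t[:1].isupper()]


def process_query(text):
    intent = pre_process(text)
    if intent is None:
        # Fall back to the statistical model only when no rule matched.
        intent = classify_statistical(text)
    intent = post_process(text, intent)
    return {"intent": intent, "entities": recognize_entities(text)}
```

Calling `process_query("Call Alice")` takes the rule path, while a query such as "What is the weather in Toronto tomorrow?" falls through to the classifier stub and is then refined by the post-processing rule.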
Abstract:
Described herein are systems and methods for providing a natural language comprehension system that employs a two-stage process for machine comprehension of text. The first stage indicates words in one or more text passages that potentially answer a question. The first stage outputs a set of candidate answers for the question, along with a first probability of correctness for each candidate answer. The second stage forms one or more hypotheses by inserting each candidate answer into the question and determines whether a semantic relationship exists between each hypothesis and each sentence in the text. The second stage generates a second probability of correctness for each candidate answer and combines the first probability with the second probability to produce a score that is used to rank the candidate answers. The candidate answer with the highest score is selected as the predicted answer.
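The hypothesis-forming and score-combining steps can be illustrated with a short sketch. The abstract does not say how the two probabilities are combined or how the candidate is inserted into the question. The product rule and the `@placeholder` token used here are assumptions chosen for illustration, and the probabilities would come from trained models rather than literals.

```python
def form_hypothesis(question, candidate, placeholder="@placeholder"):
    """Insert a candidate answer into a cloze-style question (assumed format)."""
    return question.replace(placeholder, candidate)


def rank_candidates(first_stage, second_stage):
    """Combine per-candidate probabilities and sort, best score first.

    first_stage, second_stage: dicts mapping candidate -> probability.
    Assumption: the combined score is the product of the two probabilities.
    """
    scores = {
        candidate: p1 * second_stage.get(candidate, 0.0)
        for candidate, p1 in first_stage.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


def predict_answer(first_stage, second_stage):
    """Return the top-ranked candidate, or None when there are no candidates."""
    ranked = rank_candidates(first_stage, second_stage)
    return ranked[0][0] if ranked else None
```

With a product rule, a candidate that the first stage prefers can still be overtaken when the second stage finds stronger support for another hypothesis.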
Abstract:
A natural language system for recognizing geographic specific language embodied within a query received at a computing device is disclosed. A given territory such as a country may be divided into sub-territories. The data source content may be limited to a predetermined number of each type of entity, determined by establishing a radius for each type of entity from the center of the particular sub-territory and including only entities within the distance of that radius. One or more sentence templates may be gathered from common queries, and training sentences may be created by substituting entities into the sentence templates. When the natural language system receives a query, the system may apply a speech recognition module associated with the geographic location of the computing device so that geographic specific language such as businesses, streets, and cities may be recognized by the particular speech recognition module.
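The radius filter and the template substitution can be sketched as below. This is an illustrative sketch only: the entity record layout, the `{type}` slot syntax, and the use of the haversine great-circle distance are assumptions, and the step of tuning each radius until a predetermined entity count is reached is not shown.

```python
import math

EARTH_RADIUS_KM = 6371.0


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two lat/lon points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def entities_within_radius(entities, center, radius_km_by_type):
    """Keep entities whose distance from the sub-territory center is within
    the radius configured for their type (hypothetical record layout)."""
    kept = []
    for entity in entities:
        radius = radius_km_by_type.get(entity["type"])
        if radius is None:
            continue  # no radius configured for this entity type
        distance = haversine_km(center[0], center[1], entity["lat"], entity["lon"])
        if distance <= radius:
            kept.append(entity)
    return kept


def make_training_sentences(templates, entities):
    """Substitute entity names into template slots such as '{business}'."""
    sentences = []
    for template in templates:
        for entity in entities:
            slot = "{" + entity["type"] + "}"
            if slot in template:
                sentences.append(template.replace(slot, entity["name"]))
    return sentences
```

The sentences produced this way would then serve as training text for the speech recognition module of the corresponding sub-territory.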
Abstract:
A server, method, and non-transitory computer readable medium for classifying queries based on contextual information are provided. The server includes a network interface, a memory storage unit, and a processor. The method involves receiving a plurality of queries, analyzing the queries, determining a likelihood divergence, and selecting a domain. The non-transitory computer readable medium is encoded with codes to direct a processor to carry out the method.
Abstract:
A method, system, and computer program product provide a conversational agent to process natural language queries expressed by a user and perform commands according to the derived intention of the user. A natural language processing (NLP) engine derives intent using conditional random fields to identify a domain and at least one task embodied in the query. The NLP engine may further identify one or more subdomains, and one or more entities related to the identified command. A template system creates a data structure for information relevant to the derived intent and passes a template to a services manager for interfacing with one or more services capable of accomplishing the task. A dialog manager may elicit more entities from the user if required by the services manager and otherwise engage in conversation with the user. In one embodiment, the conversational agent allows a user to engage in multiple conversations simultaneously.
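The hand-off between the template system, the services manager, and the dialog manager can be pictured with a small data structure. The sketch below is hypothetical throughout: the `Template` fields, the `required` entity list, and the action names are assumptions introduced for illustration, and the conditional random field model that derives the intent is not shown.

```python
from dataclasses import dataclass, field


@dataclass
class Template:
    """Hypothetical container for the derived intent and its entities."""
    domain: str
    task: str
    required: tuple  # entity names the target service needs
    entities: dict = field(default_factory=dict)

    def missing(self):
        """Names of required entities that have not been filled yet."""
        return [name for name in self.required if name not in self.entities]


def next_action(template):
    """Decide whether to ask the user for more input or call the service."""
    missing = template.missing()
    if missing:
        # The dialog manager would elicit the first missing entity.
        return ("ask_user", missing[0])
    # The services manager would dispatch the completed template.
    return ("call_service", template.task)
```

For example, a calendar template holding only a title would yield an `ask_user` action for the start time, and a `call_service` action once that entity is supplied.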
Abstract:
A computer-implemented method and system and computer-readable medium are disclosed for linking an ontology provided by a content service (i.e., a category ontology) with a word expansion ontology (i.e., a lexical ontology). A user may provide an input such as a voice command to an application. The voice command is processed by a natural language processing (NLP) engine to derive the user's intent and to extract relevant entities embodied in the command. The NLP engine may create a composite concept set containing multiple permutations of the concepts (the extracted entities) and provide the composite concept set to a concept mapper. The concept mapper searches a mapping file and applies one or more scoring operations to determine a best match between the composite concept set and at least one category provided by the category ontology. The content service is searched using the category and the results are displayed to the user.
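One way to picture the composite concept set and the concept mapper is sketched below. It is an assumption-laden illustration: the mapping file is modeled as an in-memory dictionary from phrase to category, and the scoring operation (prefer the longest matching phrase) is a placeholder, since the abstract does not specify the actual scoring.

```python
from itertools import permutations


def composite_concept_set(concepts):
    """All ordered arrangements of every non-empty subset of the concepts,
    longest arrangements first."""
    phrases = []
    for size in range(len(concepts), 0, -1):
        for arrangement in permutations(concepts, size):
            phrases.append(" ".join(arrangement))
    return phrases


def best_category(concepts, mapping):
    """Return the category of the best-scoring phrase found in the mapping.

    mapping: dict of phrase -> category (stand-in for the mapping file).
    Assumed score: number of words in the matched phrase.
    """
    best, best_score = None, 0
    for phrase in composite_concept_set(concepts):
        category = mapping.get(phrase)
        if category is None:
            continue
        score = len(phrase.split())
        if score > best_score:
            best, best_score = category, score
    return best
```

Given the concepts `["italian", "restaurant"]`, the set contains both word orders plus each single concept, so a mapping entry for "italian restaurant" wins over a more general entry for "restaurant" alone.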
Abstract:
A system, method, and computer program are provided for generating customized text representations of audio commands. A first speech recognition module may be used for generating a first text representation of an audio command based on a general language grammar. A second speech recognition module may be used for generating a second text representation of the audio command, the second module including a custom language grammar that may include contacts for a particular user. Entity extraction is applied to the second text representation and the entities are checked against a file containing personal language. If the entities are found in the user-specific language, the two text representations may be fused into a combined text representation and named entity recognition may be performed again to extract further entities.
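The fusion step is not detailed in the abstract. The sketch below shows one plausible approach under stated assumptions: the two transcripts are aligned token by token with Python's `difflib.SequenceMatcher`, and wherever they disagree the custom-grammar tokens are used only if one of them appears in the user's personal vocabulary. The function name, the alignment strategy, and the `personal_terms` set are all hypothetical.

```python
from difflib import SequenceMatcher


def fuse_transcripts(general, custom, personal_terms):
    """Merge two transcripts of the same audio command.

    general: text from the general-grammar recognizer.
    custom: text from the custom-grammar recognizer.
    personal_terms: user-specific words (e.g. contact names).
    """
    g_tokens, c_tokens = general.split(), custom.split()
    personal = {term.lower() for term in personal_terms}
    fused = []
    matcher = SequenceMatcher(a=g_tokens, b=c_tokens, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        custom_span = c_tokens[j1:j2]
        if tag == "replace" and any(tok.lower() in personal for tok in custom_span):
            # The recognizers disagree and the custom side is a known personal term.
            fused.extend(custom_span)
        else:
            # Otherwise keep whatever the general recognizer produced.
            fused.extend(g_tokens[i1:i2])
    return " ".join(fused)
```

The fused text would then be passed back through named entity recognition, as the abstract describes, so that the recovered personal terms can be extracted as entities.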