Abstract:
Task list linking may be provided. Upon receiving an input from a user, the input may be translated into at least one actionable item. The at least one actionable item may be linked to a data source and displayed to the user.
Abstract:
Sentence simplification may be provided. A spoken phrase may be received and converted to a text phrase. An intent associated with the text phrase may be identified. The text phrase may then be reformatted according to the identified intent and a task may be performed according to the reformatted text phrase.
Abstract:
Training for a statistical dialog manager may be provided. A plurality of log data associated with an intent may be received, and at least one step associated with completing the intent according to the plurality of log data may be identified. An understanding model associated with the intent may be created, including a plurality of queries mapped to the intent. In response to receiving a natural language query from a user that is associated with the intent, a response to the user may be provided according to the understanding model.
Abstract:
Domain detection training in a spoken language understanding system may be provided. Log data from a search engine, with each log entry associated with a search query, may be received. A domain label for each search query may be identified and the domain label and link data may be provided to a training set for a spoken language understanding model.
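The abstract above does not say how the domain label is identified. As a minimal sketch, assume each log entry pairs a query with a clicked link, and that a hand-made table (the hypothetical `HOST_TO_DOMAIN`, with placeholder hosts) maps the link's host to a domain label:

```python
from typing import List, Optional, Tuple
from urllib.parse import urlparse

# Hypothetical lookup table; the hosts and labels are placeholders, not from the source.
HOST_TO_DOMAIN = {
    "movies.example.com": "movies",
    "weather.example.com": "weather",
}


def label_for_link(url: str) -> Optional[str]:
    """Return the assumed domain label for a clicked link, or None if unknown."""
    return HOST_TO_DOMAIN.get(urlparse(url).netloc)


def build_training_set(log_entries: List[Tuple[str, str]]) -> List[Tuple[str, str, str]]:
    """Turn (query, clicked_url) log entries into (query, domain_label, url) rows.

    Entries whose link has no known label are skipped rather than guessed.
    """
    training_set = []
    for query, url in log_entries:
        label = label_for_link(url)
        if label is not None:
            training_set.append((query, label, url))
    return training_set


logs = [
    ("showtimes near me", "https://movies.example.com/showtimes"),
    ("will it rain tomorrow", "https://weather.example.com/forecast"),
    ("cheap flights", "https://travel.example.org/flights"),
]
rows = build_training_set(logs)
```

Skipping unlabeled entries keeps the training set smaller but avoids adding guessed labels; a real system would need some other policy for queries whose links fall outside the table.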
Abstract:
A machine-readable medium may include a group of reusable components for building a spoken dialog system. The reusable components may include a group of previously collected audible utterances. A machine-implemented method to build a library of reusable components for use in building a natural language spoken dialog system may include storing a dataset in a database. The dataset may include a group of reusable components for building a spoken dialog system. The reusable components may further include a group of previously collected audible utterances. A second method may include storing at least one set of data. Each one of the at least one set of data may include ones of the reusable components associated with audible data collected during a different collection phase.
Abstract:
Open-domain question answering is the task of finding a concise answer to a natural language question using a large domain, such as the Internet. The use of a semantic role labeling approach to the extraction of the answers to an open domain factoid (Who/When/What/Where) natural language question that contains a predicate is described. Semantic role labeling identifies predicates and semantic argument phrases in the natural language question and the candidate sentences. When searching for an answer to a natural language question, the missing argument in the question is matched using semantic parses of the candidate answers. Such a technique may improve the accuracy of a question answering system and may decrease the length of answers for enabling voice interface to a question answering system.
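The matching of a question's missing argument against candidate parses can be illustrated with a small sketch. It assumes the semantic parses already exist (here they are written by hand, with no semantic role labeler being called), and the `Parse` class, the role names, and the exact-match rule are illustrative choices rather than the method described in the abstract:

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Parse:
    """A hand-written stand-in for a semantic parse: one predicate plus labeled arguments."""
    predicate: str
    args: Dict[str, str] = field(default_factory=dict)


def answer(question: Parse, missing_role: str, candidates: List[Parse]) -> Optional[str]:
    """Return the candidate phrase that fills the question's missing role.

    A candidate matches when it shares the question's predicate, has the
    missing role, and agrees (case-insensitively) on every argument the
    question already supplies.
    """
    for cand in candidates:
        if cand.predicate != question.predicate:
            continue
        if missing_role not in cand.args:
            continue
        if all(cand.args.get(role, "").lower() == text.lower()
               for role, text in question.args.items()):
            return cand.args[missing_role]
    return None


# "Who invented the telephone?" -> predicate "invent", known ARG1, missing ARG0.
question = Parse("invent", {"ARG1": "the telephone"})
candidates = [
    Parse("patent", {"ARG0": "Someone", "ARG1": "the telephone"}),
    Parse("invent", {"ARG0": "Alexander Graham Bell", "ARG1": "The telephone"}),
]
result = answer(question, "ARG0", candidates)  # "Alexander Graham Bell"
```

Returning only the matched argument phrase, instead of the whole candidate sentence, is what keeps the answer short enough for a voice interface.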
Abstract:
A system and a method are provided. A textual transcript of a recorded voice communication is received. Text from a non-voice communication is received. Information about the textual transcript of the recorded voice communication and the text from the non-voice communication is provided such that a user can manage a group of communications, based at least in part on the textual transcript of the recorded voice communication and the text from the non-voice communication.
Abstract:
Utterance data that includes at least a small amount of manually transcribed data is provided. Automatic speech recognition is performed on ones of the utterance data not having a corresponding manual transcription to produce automatically transcribed utterances. A model is trained using all of the manually transcribed data and the automatically transcribed utterances. A predetermined number of utterances not having a corresponding manual transcription are intelligently selected and manually transcribed. Ones of the automatically transcribed data as well as ones having a corresponding manual transcription are labeled. In another aspect of the invention, audio data is mined from at least one source, and a language model is trained for call classification from the mined audio data to produce a language model.
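The abstract does not define what "intelligently selected" means. One common reading, assumed here, is to send the utterances with the lowest recognizer confidence to manual transcription. The function name, the confidence scores, and the tie-breaking rule below are all illustrative:

```python
from typing import Dict, List


def select_for_transcription(asr_confidence: Dict[str, float], budget: int) -> List[str]:
    """Pick the `budget` utterance ids with the lowest ASR confidence.

    `asr_confidence` maps the ids of utterances that have no manual
    transcription to an assumed recognizer confidence in [0, 1].
    Ties are broken by id so the result is deterministic.
    """
    ranked = sorted(asr_confidence, key=lambda utt_id: (asr_confidence[utt_id], utt_id))
    return ranked[:max(budget, 0)]


untranscribed = {"utt-1": 0.91, "utt-2": 0.18, "utt-3": 0.52, "utt-4": 0.18}
to_transcribe = select_for_transcription(untranscribed, budget=2)  # ["utt-2", "utt-4"]
```

After the selected utterances are transcribed by hand, the model would be retrained on the enlarged manual set plus the remaining automatic transcriptions, as the abstract describes.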
Abstract:
The invention relates to a system and method for gathering data for use in a spoken dialog system. An aspect of the invention is generally referred to as an automated hidden human that performs data collection automatically at the beginning of a conversation with a user in a spoken dialog system. The method comprises presenting an initial prompt to a user, recognizing a received user utterance using an automatic speech recognition engine and classifying the recognized user utterance using a spoken language understanding module. If the recognized user utterance is not understood or classifiable to a predetermined acceptance threshold, then the method re-prompts the user. If the recognized user utterance is not classifiable to a predetermined rejection threshold, then the method transfers the user to a human as this may imply a task-specific utterance. The received and classified user utterance is then used for training the spoken dialog system.
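The two-threshold routing described above can be sketched as follows. This is one reading of the abstract: confidence at or above the acceptance threshold is accepted, confidence between the two thresholds triggers a re-prompt, and confidence below the rejection threshold transfers the caller to a human. The threshold values, the `Classification` type, and the function names are assumptions, not taken from the source:

```python
from dataclasses import dataclass
from typing import List, Tuple

ACCEPT_THRESHOLD = 0.7  # assumed value
REJECT_THRESHOLD = 0.3  # assumed value


@dataclass
class Classification:
    """Output of the (not shown) recognizer and understanding module for one utterance."""
    text: str
    label: str
    confidence: float


def route(result: Classification,
          accept: float = ACCEPT_THRESHOLD,
          reject: float = REJECT_THRESHOLD) -> str:
    """Decide what the dialog system does with a classified utterance."""
    if result.confidence >= accept:
        return "accept"
    if result.confidence >= reject:
        return "reprompt"
    return "transfer_to_human"


def collect(results: List[Classification]) -> List[Tuple[str, str, str]]:
    """Log every utterance with its label and routing decision for later training."""
    return [(r.text, r.label, route(r)) for r in results]


log = collect([
    Classification("I want to pay my bill", "billing", 0.92),
    Classification("uh the thing from last week", "billing", 0.45),
    Classification("my modem lights are blinking orange", "unknown", 0.10),
])
```

Every utterance is logged regardless of the routing decision, which matches the abstract's point that the received and classified utterances are later used for training.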