摘要:
An in-vehicle infotainment system, smart home information access and device control unit, or mobile system presents summarized information to a user based on a user preference model that is associated with the user. The system modifies the presentation of information to the user based on environmental context data about the vehicle and user context data about the activity of the user. During presentation of the information, the system modifies the content and presentation of the summarized information in response to multi-modal input requests from the user.
摘要:
In accordance with one embodiment, a method of generating language models for speech recognition includes identifying a plurality of utterances in training data corresponding to speech, generating a frequency count of each utterance in the plurality of utterances, generating a high-frequency plurality of utterances from the plurality of utterances having a frequency that exceeds a predetermined frequency threshold, generating a low-frequency plurality of utterances from the plurality of utterances having a frequency that is below the predetermined frequency threshold, generating a grammar-based language model using the high-frequency plurality of utterances as training data, and generating a statistical language model using the low-frequency plurality of utterances as training data.
摘要:
In accordance with one embodiment, a method of generating language models for speech recognition includes identifying a plurality of utterances in training data corresponding to speech, generating a frequency count of each utterance in the plurality of utterances, generating a high-frequency plurality of utterances from the plurality of utterances having a frequency that exceeds a predetermined frequency threshold, generating a low-frequency plurality of utterances from the plurality of utterances having a frequency that is below the predetermined frequency threshold, generating a grammar-based language model using the high-frequency plurality of utterances as training data, and generating a statistical language model using the low-frequency plurality of utterances as training data.
摘要:
An in-vehicle infotainment system, smart home information access and device control unit, or mobile system presents summarized information to a user based on a user preference model that is associated with the user. The system modifies the presentation of information to the user based on environmental context data about the vehicle and user context data about the activity of the user. During presentation of the information, the system modifies the content and presentation of the summarized information in response to multi-modal input requests from the user.
摘要:
Embodiments of an interface system that enables a call center agent to access and intervene in an interaction between an automated call center system and a caller whenever necessary for complex application tasks is described. The system includes a user interface that presents the agent with one or more categories of information, including the conversation flow, obtained semantic information, the recognized utterances, and access to the utterance waveforms. This information is cross-linked and attached with a confidence level for better access and navigation within the dialog system for the generation of appropriate responses to the caller.
摘要:
Embodiments of an interface system that enables a call center agent to access and intervene in an interaction between an automated call center system and a caller whenever necessary for complex application tasks is described. The system includes a user interface that presents the agent with one or more categories of information, including the conversation flow, obtained semantic information, the recognized utterances, and access to the utterance waveforms. This information is cross-linked and attached with a confidence level for better access and navigation within the dialog system for the generation of appropriate responses to the caller.
摘要:
A method of automatically generating a terminology definition knowledge base (KB) includes mapping each word in a word sequence to a real value dense vector using dense vector representations. The word sequence is then processed using a Convolutional Neural Network (CNN) model to identify whether the word sequence includes a terminology definition and to label the word sequence with a label indicating whether a terminology definition exists within the word sequence. The word sequence is then processed using a Conditional Random Field (CRF) model to identify boundaries of the terminology definition in the word sequence. The terminology definition is then extracted and added to the terminology definition KB.
摘要:
Embodiments of an ontological determination method for use in natural language processing applications are described. In one embodiment, shallow lexico-syntactic patterns are applied to identify relations by extracting term features to distinguish relation terms from non-relation terms, identifying coordinate relations for every adjacent terms; identifying short-distance ontological (e.g., hypernym or part-whole relations) for other adjacent terms based on term features and lexico-syntactic patterns; and then inferring long-distance hypernym and part-whole relations based on the identified coordinate relations and the short-distance relations.
摘要:
Representation-neutral dialogue systems and methods (“RNDS”) are described that include multi-application, multi-device spoken-language dialogue systems based on the information-state update approach. The RNDS includes representation-neutral core components of a dialogue system that provide scripted domain-specific extensions to routines such as dialogue move modeling and reference resolution, easy substitution of specific semantic representations and associated routines, and clean interfaces to external components for language-understanding (i.e., speech-recognition and parsing) and language-generation, and to domain-specific knowledge sources. The RNDS also resolves multi-device dialogue by evaluating and selecting among candidate dialogue moves based on features at multiple levels. Multiple sources of information are combined, multiple speech recognition and parsing hypotheses tested, and multiple device and moves considered to choose the highest scoring hypothesis overall. Confirmation and clarification behavior can be governed by the overall score.
摘要:
Embodiments of an ontological determination method for use in natural language processing applications are described. In one embodiment, shallow lexico-syntactic patterns are applied to identify relations by extracting term features to distinguish relation terms from non-relation terms, identifying coordinate relations for every adjacent terms; identifying short-distance ontological (e.g., hypernym or part-whole relations) for other adjacent terms based on term features and lexico-syntactic patterns; and then inferring long-distance hypernym and part-whole relations based on the identified coordinate relations and the short-distance relations.