摘要:
An in-vehicle infotainment system, smart home information access and device control unit, or mobile system presents summarized information to a user based on a user preference model that is associated with the user. The system modifies the presentation of information to the user based on environmental context data about the vehicle and user context data about the activity of the user. During presentation of the information, the system modifies the content and presentation of the summarized information in response to multi-modal input requests from the user.
摘要:
An in-vehicle infotainment system, smart home information access and device control unit, or mobile system presents summarized information to a user based on a user preference model that is associated with the user. The system modifies the presentation of information to the user based on environmental context data about the vehicle and user context data about the activity of the user. During presentation of the information, the system modifies the content and presentation of the summarized information in response to multi-modal input requests from the user.
摘要:
Embodiments of an interface system that enables a call center agent to access and intervene in an interaction between an automated call center system and a caller whenever necessary for complex application tasks is described. The system includes a user interface that presents the agent with one or more categories of information, including the conversation flow, obtained semantic information, the recognized utterances, and access to the utterance waveforms. This information is cross-linked and attached with a confidence level for better access and navigation within the dialog system for the generation of appropriate responses to the caller.
摘要:
Embodiments of an interface system that enables a call center agent to access and intervene in an interaction between an automated call center system and a caller whenever necessary for complex application tasks is described. The system includes a user interface that presents the agent with one or more categories of information, including the conversation flow, obtained semantic information, the recognized utterances, and access to the utterance waveforms. This information is cross-linked and attached with a confidence level for better access and navigation within the dialog system for the generation of appropriate responses to the caller.
摘要:
In accordance with one embodiment, a method of generating language models for speech recognition includes identifying a plurality of utterances in training data corresponding to speech, generating a frequency count of each utterance in the plurality of utterances, generating a high-frequency plurality of utterances from the plurality of utterances having a frequency that exceeds a predetermined frequency threshold, generating a low-frequency plurality of utterances from the plurality of utterances having a frequency that is below the predetermined frequency threshold, generating a grammar-based language model using the high-frequency plurality of utterances as training data, and generating a statistical language model using the low-frequency plurality of utterances as training data.
摘要:
In accordance with one embodiment, a method of generating language models for speech recognition includes identifying a plurality of utterances in training data corresponding to speech, generating a frequency count of each utterance in the plurality of utterances, generating a high-frequency plurality of utterances from the plurality of utterances having a frequency that exceeds a predetermined frequency threshold, generating a low-frequency plurality of utterances from the plurality of utterances having a frequency that is below the predetermined frequency threshold, generating a grammar-based language model using the high-frequency plurality of utterances as training data, and generating a statistical language model using the low-frequency plurality of utterances as training data.
摘要:
A method of automatically generating a terminology definition knowledge base (KB) includes mapping each word in a word sequence to a real value dense vector using dense vector representations. The word sequence is then processed using a Convolutional Neural Network (CNN) model to identify whether the word sequence includes a terminology definition and to label the word sequence with a label indicating whether a terminology definition exists within the word sequence. The word sequence is then processed using a Conditional Random Field (CRF) model to identify boundaries of the terminology definition in the word sequence. The terminology definition is then extracted and added to the terminology definition KB.
摘要:
A method for speech recognition includes providing a source of geographical information within a vehicle. The geographical information pertains to a current location of the vehicle, a planned travel route of the vehicle, a map displayed within the vehicle, and/or a gesture marked by a user on a map. Words spoken within the vehicle are recognized by use of a speech recognition module. The recognizing is dependent upon the geographical information.
摘要:
A method for speech recognition includes providing a source of geographical information within a vehicle. The geographical information pertains to a current location of the vehicle, a planned travel route of the vehicle, a map displayed within the vehicle, and/or a gesture marked by a user on a map. Words spoken within the vehicle are recognized by use of a speech recognition module. The recognizing is dependent upon the geographical information.
摘要:
A method for speech recognition includes providing a source of geographical information within a vehicle. The geographical information pertains to a current location of the vehicle, a planned travel route of the vehicle, a map displayed within the vehicle, and/or a gesture marked by a user on a map. Words spoken within the vehicle are recognized by use of a speech recognition module. The recognizing is dependent upon the geographical information.