Abstract:
Described is the automatic testing of the quality of an audio coupling between juxtaposed first and second digital telephones, e.g., VoIP telephones, such as to quantitatively determine the quality of the audio echo cancellers in those telephones. An analyzer receives timestamps from the first telephone and the second telephone during a calling session, including timestamps for when the second telephone initially provides audio (e.g., speech) to the first telephone, when the first telephone initially detects sound, when the first telephone initially provides audio to the second telephone, and when the second telephone initially detects sound. The analyzer uses the relative timing of the timestamps and the outcome of a speech recognizer to determine whether the audio coupling is experiencing interference or echo. When the audio includes speech, a confidence level corresponding to the accuracy of speech recognition may also establish the audio coupling's quality.
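As a rough illustration of how relative timing could separate interference from echo, the following Python sketch compares the four timestamps listed above. The field names, the `classify_coupling` function, and the ordering rules themselves are assumptions made for this example; the abstract does not state the actual decision logic.

```python
from dataclasses import dataclass


@dataclass
class SessionTimestamps:
    """Four timestamps (seconds) reported during one calling session.

    All field names are hypothetical; they mirror the events in the abstract.
    """
    second_sends: float    # second phone initially provides audio to the first
    first_detects: float   # first phone initially detects sound
    first_sends: float     # first phone initially provides audio to the second
    second_detects: float  # second phone initially detects sound


def classify_coupling(ts: SessionTimestamps) -> str:
    """Assumed rule set, for illustration only.

    - Sound detected at the first phone before the second phone sent
      anything is treated as interference.
    - Sound detected at the second phone before the first phone sent
      anything is treated as echo of the second phone's own audio.
    """
    if ts.first_detects < ts.second_sends:
        return "interference"
    if ts.second_detects < ts.first_sends:
        return "echo"
    return "ok"
```

A real analyzer would presumably also weigh the speech recognizer's confidence level, which this sketch leaves out.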
Abstract:
A method and system for collecting and verifying the location information of a calling party and a device of the calling party are provided. More specifically, a method and system are provided for determining whether the identity of the calling party can be confirmed, by evaluating location information, with an acceptable degree of certainty. The location information may be provided by the calling party or obtained from various sources over a digital communication channel. Provided location information that can be accidentally or intentionally altered is identified and evaluated for accuracy as part of verifying the caller's identity.
Abstract:
Generally described, aspects of the present invention are directed at software systems for responding to a received voicemail message. In one embodiment, a selection user interface is provided where a primary callee may generate an event to create a draft voicemail message that is related to a received voicemail message. In response to receiving an event from the selection user interface to create a draft voicemail message, aspects of the present invention (1) create an electronic file to store the draft voicemail message, and (2) insert metadata into the electronic file that defines the relationship between the draft voicemail message and the received voicemail message. As a result, a callee may easily create a draft voicemail message that is related to a received voicemail message and have the draft voicemail message automatically populated with contextual data.
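The two steps described above (create a file for the draft, then insert metadata tying it to the received message) might look like the following Python sketch. The JSON layout, the `create_draft_reply` function, and the `id` and `caller` keys are all hypothetical; the abstract does not specify a file format or metadata schema.

```python
import json
from pathlib import Path


def create_draft_reply(received: dict, directory: Path) -> Path:
    """Create a draft voicemail file linked to a received message.

    `received` is assumed to carry an "id" and a "caller" key.
    """
    draft = {
        # Metadata defining the relationship to the received voicemail.
        "metadata": {
            "in_reply_to": received["id"],
            "recipient": received["caller"],
        },
        # Placeholder: the audio is recorded later by the callee.
        "audio": None,
    }
    path = directory / f"draft-{received['id']}.json"
    path.write_text(json.dumps(draft, indent=2))
    return path
```

Pre-filling `recipient` from the received message is one way the draft could be "automatically populated with contextual data".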
Abstract:
Generally described, the present invention relates to the identification, extraction, and further use of content contained in a digital voice conversation, such as a Voice over Internet Protocol (VoIP) conversation. More specifically, the present invention relates to the use of “mined” data from a conversation to provide extended services, such as recommendations to individuals participating in a digital voice conversation.
Abstract:
The present invention employs user modeling to model a user's behavior patterns. The user's behavior patterns are then used to influence named entity (NE) recognition.
Abstract:
Generally described, the present invention provides the ability to process digital voice conversations to identify data packets containing content of interest and to further process the identified data packets. More specifically, mining profiles may be developed that identify particular types of content to be mined and that further identify what is to be done when data packets containing such content are located. A system may search a digital voice conversation for the data packets containing the content and perform processing on the data packets once identified.
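A mining profile, as described above, pairs a description of the content to look for with an action to take on matching packets. The Python sketch below shows one minimal way to model that. It assumes each packet already carries decoded text, and the `Packet`, `MiningProfile`, and `mine` names, as well as keyword matching as the detection method, are illustrative choices not taken from the abstract.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple


@dataclass
class Packet:
    seq: int
    text: str  # assumption: payload already decoded to text


@dataclass
class MiningProfile:
    name: str
    keywords: List[str]               # content of interest
    action: Callable[[Packet], None]  # what to do with a matching packet


def mine(packets: Iterable[Packet],
         profiles: List[MiningProfile]) -> List[Tuple[str, int]]:
    """Run each profile over the conversation; return (profile, seq) hits."""
    hits = []
    for packet in packets:
        lowered = packet.text.lower()
        for profile in profiles:
            if any(k.lower() in lowered for k in profile.keywords):
                profile.action(packet)  # further processing of the packet
                hits.append((profile.name, packet.seq))
    return hits
```

For example, a profile whose action appends packets to a list collects every packet mentioning one of its keywords.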
Abstract:
Methods and a system for authenticating a user are disclosed. The present invention includes accessing a collection of personal information related to the user. The present invention also includes performing an authentication operation that is based on the collection of personal information. The authentication operation incorporates at least one dynamic component and prompts the user to give an audible utterance. The audible utterance is compared to a stored voiceprint.
Abstract:
A system for controlling a telephone infrastructure device or other network traffic service model based device includes an object oriented based application including a device object adapted for storing information pertaining to a physical or logical device, a call object adapted for storing information pertaining to a call between at least two devices, a listener object adapted to provide speech recognition, a prompt object adapted to provide synthesized speech, and a connection object adapted for storing information pertaining to a connection between a call object and one of a device object, a listener object, and a prompt object.
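The five object types named above can be sketched as plain Python dataclasses. The attribute names (`address`, `grammar`, `text`, `call_id`) and the `devices_on_call` helper are assumptions for illustration; the abstract names the objects and their roles but not their fields.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass
class Device:
    address: str  # a physical or logical device


@dataclass
class Listener:
    grammar: str  # stands in for a speech recognition resource


@dataclass
class Prompt:
    text: str  # text to be rendered as synthesized speech


@dataclass
class Call:
    call_id: str


@dataclass
class Connection:
    """Links a call to exactly one device, listener, or prompt."""
    call: Call
    endpoint: Union[Device, Listener, Prompt]


def devices_on_call(call: Call, connections: List[Connection]) -> List[Device]:
    """Return the devices connected to the given call."""
    return [c.endpoint for c in connections
            if c.call is call and isinstance(c.endpoint, Device)]
```

Keeping the call-to-endpoint relationship in a separate `Connection` object lets one call attach to devices, listeners, and prompts uniformly.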
Abstract:
A framework for easy and accurate transcription of speech data is provided. Utterances related to a single task are grouped together and processed using combinations of associated sets of recognition results and/or context information in a manner that allows the same transcription for a selected recognition result to be assigned to each of the utterances under consideration.
Abstract:
Methods and apparatus for producing efficiently sized models suitable for pattern recognition purposes are described. Various embodiments are directed to the automated generation, evaluation, and selection of reduced size models from an initial model having a relatively large number of components, e.g., more components than can be stored for a particular intended application. To achieve model size reduction in an automated iterative manner, expectation maximization (EM) model training techniques are combined, in accordance with the present invention, with model size constraints. In one embodiment, a new reduced size model is generated using a Lagrange multiplier from an input model and input size constraints during each iteration of the size reducing model training process. The reduced size model generated during one iteration of the process serves as the input to the next iteration. Scoring, e.g., maximum likelihood scoring, and evaluation steps, in conjunction with stop criteria, are used to determine the number of model size reducing iterations performed, and which reduced size model is selected as the output of the size reduction model training process.