Abstract:
A computer detects a connected voice or video call between participants and records a brief media sample. Speech recognition is utilized to determine when the call is connected as well as to transcribe the content of the audio portion of the media sample. The recorded media sample and transcribed content is associated with the connected voice or video call such that a user may reference it at a later point. The computer additionally suggests creating or editing contact information associated with the participants of the connected voice or video call based on the transcribed content.
Abstract:
A speech recognizer on a portable computing device is trained and configured to reject machine articulations of utterances attempting access to resources on such device. The recognizer is trained with scoring feedback from human listeners identifying articulations as originating from a machine.
Abstract:
A system for automated adaptation and improvement of speaker authentication in a voice biometric system environment, comprising a speech sample collector, a target selector, a voice analyzer, a voice data modifier, and a call flow creator. The speech sample collector retrieves speech samples from a database of enrolled participants in a speaker authentication system. The target selector selects target users that will be used to test the speaker authentication system. The voice analyzer extracts a speech component data set from each of the speech samples. The call flow creator creates a plurality of call flows for testing the speaker authentication system, each call flow being either an impostor call flow or a legitimate call flow. The call flows created by the call flow creator are used to test the speaker authentication system.
Abstract:
A speech recognition capability in which speakers of spoken text are identified based on the contour of sound waves representing the spoken text. Variations in the contour of the sound waves are identified, features are assigned to those variations, and parameters of those features are grouped into predefined characteristics. The predefined characteristics are combined into voice characteristic groups. If a prior voice characteristic group is present, the voice characteristic group from the soundlet is compared to existing voice characteristic groups and, if a match is present, the sound construct is assigned to a speaker identified by the existing voice characteristic group.
Abstract:
One-to-many comparisons of callers' words and/or voice prints with known words and/or voice prints to identify any substantial matches between them. When a customer communicates with a particular entity, such as a customer service center, the system makes a recording of the real-time call including both the customer's and agent's voices. The system segments the recording to extract different words, such as words of anger. The system may also segment at least a portion of the customer's voice to create a tone profile, and it formats the segmented words and tone profiles for network transmission to a server. The server compares the customer's words and/or tone profiles with multiple known words and/or tone profiles stored on a database to determine any substantial matches. The identification of any matches may be used for a variety of purposes, such as providing representative feedback or customer follow-up.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data corresponding to an utterance, determining that the audio data corresponds to a hotword, generating a hotword audio fingerprint of the audio data that is determined to correspond to the hotword, comparing the hotword audio fingerprint to one or more stored audio fingerprints of audio data that was previously determined to correspond to the hotword, detecting whether the hotword audio fingerprint matches a stored audio fingerprint of audio data that was previously determined to correspond to the hotword based on whether the comparison indicates a similarity between the hotword audio fingerprint and one of the one or more stored audio fingerprints that satisfies a predetermined threshold, and in response to detecting that the hotword audio fingerprint matches a stored audio fingerprint, disabling access to a computing device into which the utterance was spoken.
Abstract:
Systems and methods are provided for enabling real-time synchronous communication with persons appearing in image or video files. For example, an image or video file is displayed on a display screen of a computing device. The computing device detects a user selection of a person present in the displayed image or video. A request is sent from the computing device to a service provider for profile information associated with the user-selected person. The computing device receives from the service provider profile information associated with the user-selected person, wherein the profile information includes a communications address of the user-selected person. The communications address is utilized to initiate a communications session on the computing device with the user-selected person present in the displayed image or video.
Abstract:
Systems and methods are provided for enabling real-time synchronous communication with persons appearing in image or video files. For example, an image or video file is displayed on a display screen of a computing device. The computing device detects a user selection of a person present in the displayed image or video. A request is sent from the computing device to a service provider for profile information associated with the user-selected person. The computing device receives from the service provider profile information associated with the user-selected person, wherein the profile information includes a communications address of the user-selected person. The communications address is utilized to initiate a communications session on the computing device with the user-selected person present in the displayed image or video.
Abstract:
Increasing the efficiency of phone call usage is accomplished by using strategic call forwarding techniques to analyze incoming calls and process these calls in real time to: 1) divert unwanted robocallers and/or 2) provide information about unknown human callers. Robocallers are detected by analyzing incoming calls to determine if the audio is human-based or generated by a robocaller. Information about unknown human callers is obtained by real-time look up and reporting techniques to allow the user to determine whether to answer a call.
Abstract:
A computer detects a connected voice or video call between participants and records a brief media sample. Speech recognition is utilized to determine when the call is connected as well as to transcribe the content of the audio portion of the media sample. The recorded media sample and transcribed content is associated with the connected voice or video call such that a user may reference it at a later point. The computer additionally suggests creating or editing contact information associated with the participants of the connected voice or video call based on the transcribed content.