Abstract:
Techniques are disclosed for using phonetic features for speech recognition. For example, a method comprises the steps of obtaining a first dictionary and a training data set associated with a speech recognition system, computing one or more support parameters from the training data set, transforming the first dictionary into a second dictionary, wherein the second dictionary is a function of one or more phonetic labels of the first dictionary, and using the one or more support parameters to select one or more samples from the second dictionary to create a set of one or more exemplar-based class identification features for a pattern recognition task.
Abstract:
Techniques for facilitating communication are provided. The techniques include using a machine-to-machine communication to facilitate communication between one or more human users of a communicator device and a compatible communicator device, wherein using the machine-to-machine communication to facilitate communication between one or more human users comprises initiating a machine-to-machine communication with a compatible communicator device if the device is within the geographic proximity, wherein the machine-to-machine communication incorporates one or more related items from a user profile of each device automatically extracted by the device initiating the machine-to-machine communication, and conducting the machine-to-machine communication in a manner in which the communication can be monitored by the one or more human users.
Abstract:
A simulation method and system. A computing system receives a first audio and/or video data stream. The first audio and/or video data stream includes data associated with a first person. The computing system monitors the first audio and/or video data stream. The computing system identifies emotional attributes comprised by the first audio and/or video data stream. The computing system generates a second audio and/or video data stream associated with the first audio and/or video data stream. The second audio and/or video data stream includes the data without the emotional attributes. The computing system stores the second audio and/or video data stream.
Abstract:
In a voice processing system, a multimodal request is received from a plurality of modality input devices, and the requested application is run to provide a user with feedback on the multimodal request. A multimodal aggregating unit receives a multimodal input from the plurality of modality input devices and provides an aggregated result to an application control, based on an interpretation of the interaction ergonomics of the multimodal input within the temporal constraints of that input. Thus, the multimodal input from the user is recognized within a temporal window. Interpretation of the interaction ergonomics of the multimodal input includes interpretation of interaction biometrics and interaction mechani-metrics, wherein the input of at least one modality may be used to bring meaning to at least one input of another modality.
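The temporal-window idea in this abstract can be illustrated with a minimal sketch. It assumes each input arrives as a timestamped event and that events within a fixed window of the first event in a group belong to the same multimodal request. The names `ModalityEvent` and `aggregate` and the 1.5-second window are hypothetical, chosen only for illustration, and the sketch does not model interaction biometrics or mechani-metrics.

```python
from dataclasses import dataclass


@dataclass
class ModalityEvent:
    modality: str     # e.g. "speech" or "gesture" (illustrative labels)
    payload: str      # recognized content of the input
    timestamp: float  # arrival time in seconds


def aggregate(events, window_seconds=1.5):
    """Group events that fall within window_seconds of a group's first event.

    The window length is an arbitrary placeholder, not a value from the abstract.
    """
    groups = []
    for event in sorted(events, key=lambda e: e.timestamp):
        if groups and event.timestamp - groups[-1][0].timestamp <= window_seconds:
            groups[-1].append(event)  # same temporal window: same request
        else:
            groups.append([event])    # start a new request
    return groups
```

With this grouping, a spoken "delete this" and a pointing gesture that arrive 0.4 seconds apart end up in one group, so the gesture can supply the referent the speech leaves open.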
Abstract:
Techniques are disclosed for modifying speech quality in a conversation over a voice channel. For example, a method for modifying a speech quality associated with a spoken utterance transmittable over a voice channel comprises the following steps. The spoken utterance is obtained prior to an intended recipient of the spoken utterance receiving the spoken utterance. An existing speech quality of the spoken utterance is determined. The existing speech quality of the spoken utterance is compared to at least one desired speech quality associated with at least one previously obtained spoken utterance to determine whether the existing speech quality substantially matches the desired speech quality. At least one characteristic of the spoken utterance is modified to change the existing speech quality of the spoken utterance to the desired speech quality when the existing speech quality does not substantially match the desired speech quality. The spoken utterance is presented with the desired speech quality to the intended recipient.
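The compare-then-modify steps above can be sketched as follows, under a strong simplifying assumption: the "speech quality" is reduced to a single number, the RMS level of the samples, and the modification is a plain gain change. The abstract does not say which qualities or characteristics are used, so `rms`, `match_quality`, and the 5% tolerance are hypothetical stand-ins.

```python
import math


def rms(samples):
    """Root-mean-square level of a non-empty sequence of audio samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def match_quality(samples, desired_rms, tolerance=0.05):
    """Return samples whose RMS level matches desired_rms.

    If the existing level already matches within the relative tolerance
    (an assumed placeholder), the utterance is passed through unchanged.
    """
    existing = rms(samples)
    if existing == 0 or abs(existing - desired_rms) <= tolerance * desired_rms:
        return list(samples)
    gain = desired_rms / existing  # scale factor that moves the level to the target
    return [s * gain for s in samples]
```

In the abstract's terms, `desired_rms` would be derived from a previously obtained utterance, and the returned samples are what the intended recipient hears.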
Abstract:
Techniques for managing vehicular emergencies are disclosed. For example, a method of managing a vehicular emergency includes the steps of collecting biometric data regarding at least one occupant of a vehicle, collecting data regarding at least one operational characteristic of the vehicle, and detecting vehicular emergencies through analysis of at least a portion of the biometric data and the operational characteristic data. The method may also include communicating at least one message relating to the data, wherein the content of the message is determined by a processing device based at least in part on the data, and/or controlling a function of the vehicle in response to the data. The method may also include collecting data regarding at least one operational characteristic of at least one proximate vehicle.
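As a rough illustration of detecting an emergency from both data sources together, the sketch below applies a simple rule: flag an emergency only when an occupant reading and a vehicle reading are both abnormal. The field names and every threshold are hypothetical placeholders; the abstract does not specify which biometric or operational signals are analyzed, or how.

```python
from dataclasses import dataclass


@dataclass
class OccupantReading:
    heart_rate_bpm: float
    eyes_closed_seconds: float


@dataclass
class VehicleReading:
    speed_kmh: float
    lane_deviation_m: float


def detect_emergency(occupant, vehicle):
    """Return True when biometric and operational data are both abnormal.

    All thresholds are illustrative placeholders, not values from the source.
    """
    biometric_alarm = (
        occupant.heart_rate_bpm < 40
        or occupant.heart_rate_bpm > 160
        or occupant.eyes_closed_seconds > 2.0
    )
    operational_alarm = vehicle.speed_kmh > 20 and vehicle.lane_deviation_m > 1.0
    return biometric_alarm and operational_alarm
```

Requiring both alarms is one possible design choice: it avoids flagging a resting passenger in a parked car, at the cost of missing emergencies that show up in only one data source.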
Abstract:
A method and system for nano-encoding and decoding information related to printed texts and images on paper and other surfaces are provided. The system includes a nano-encoder that encodes information related to printed texts and images and then collocates the encoded information with the related printed texts and/or images. The system also includes a nano-decoder for decoding information encoded by the nano-encoder. The nano-decoder includes a text processing database having a translator database, which in turn includes a definition database and a summary database. In addition, the system and method detect luminescent nano particles and/or magnetic nano particles and determine invariant properties of the detected nano particles. The invariant properties are then matched with predetermined coded information and analyzed for segmentation.
Abstract:
This invention involves tracking and backing up, in real time, all the information that a user generates on their computer devices (including embedded devices). A local user server records all user actions and gestures (by various means, including TV cameras). All of this information (user actions and files saved on a computer) is then sent to a remote server via the Internet. The remote server holds a virtual map of all the embedded devices on a computer that the person uses, and it immediately starts to interpret the user's actions (including gestures). In one implementation, the invention stores user actions that are related to data generation (e.g., actions that called links where data is stored, or that executed programs that generated data). In another variant, the remote server generates and downloads the same files that are downloaded to the local user's computer devices. For example, if a person begins to download a program, the server may also download the same program to a remote backup server. This way, if the user loses the program, it can be retrieved automatically through a server provided on the Internet. If the user's files are also backed up periodically by a regular backup, the corresponding data stored by the real-time backup servers can be eliminated.
Abstract:
Metadata is used to detect alteration of data. A first set of metadata characteristics, including at least one respective semantic description, is recorded for a first set of data representing original data. A second set of metadata characteristics, including at least one corresponding semantic description, is recorded for a second set of data representing data under test. The first and second sets of metadata characteristics are compared. If the two sets are not identical, they are processed to identify locations in the first set of data that have been altered. Using the at least one semantic description for the first set of data and the at least one corresponding semantic description for the second set of data, one or more metadata characteristics that have changed from the first set of data to the second set of data are identified.
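The record, compare, and locate steps can be sketched as follows. The sketch assumes the data is split into blocks, each paired with a short semantic description such as "title" or "body", and that the recorded characteristics are character length, word count, and a SHA-256 digest. These choices, and the names `record_metadata` and `altered_locations`, are hypothetical; the abstract does not specify which characteristics are recorded.

```python
import hashlib


def record_metadata(blocks):
    """Record metadata for a list of (semantic_description, text) blocks."""
    return [
        {
            "description": description,
            "length": len(text),
            "words": len(text.split()),
            "digest": hashlib.sha256(text.encode("utf-8")).hexdigest(),
        }
        for description, text in blocks
    ]


def altered_locations(original_meta, test_meta):
    """Return (block_index, changed_characteristics) for every block that differs."""
    if original_meta == test_meta:
        return []  # identical metadata: no alteration detected
    changes = []
    for index in range(max(len(original_meta), len(test_meta))):
        before = original_meta[index] if index < len(original_meta) else {}
        after = test_meta[index] if index < len(test_meta) else {}
        changed = sorted(
            key for key in before.keys() | after.keys()
            if before.get(key) != after.get(key)
        )
        if changed:
            changes.append((index, changed))
    return changes
```

Comparing per-block metadata rather than the raw data shows both where a change occurred (the block index) and which characteristics changed. An edit that preserves length and word count, for example, surfaces only in the digest.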