摘要:
A method and apparatus are provided for training parameters in a hidden conditional random field model for use in speech recognition and phonetic classification. The hidden conditional random field model uses parameterized features that are determined from a segment of speech, and those values are used to identify a phonetic unit for the segment of speech. The parameters are updated after processing of individual training samples.
摘要:
A method of creating translingual text representations takes in documents in a first language and in a second language and creates a matrix using the words in the documents to represent which words are present in which language. An algorithm is applied to each matrix such that like documents are placed close to each other and unlike documents are moved far from each other.
摘要:
A regression-based residual echo suppression (RES) system and process for suppressing the portion of the microphone signal corresponding to a playback of a speaker audio signal that was not suppressed by an acoustic echo canceller (AEC). In general, a prescribed regression technique is used between a prescribed spectral attribute of multiple past and present, fixed-length, periods (e.g., frames) of the speaker signal and the same spectral attribute of a current period (e.g., frame) of the echo residual in the output of the AEC. This automatically takes into consideration the correlation between the time periods of the speaker signal. The parameters of the regression can be easily tracked using adaptive methods. Multiple applications of RES can be used to produce better results and this system and process can be applied to stereo-RES as well.
摘要:
A system level automatic gain control (“System AGC”) automatically initializes and controls analog microphone gain in an environment where multiple independent applications simultaneously receive an input from a single analog microphone or microphone array. In one embodiment, the System AGC also prevents those applications from acting to separately control the gain by intercepting external gain control commands and responding to the corresponding application with a corresponding digital gain applied to the input signal from the microphone. Consequently, the System AGC avoids problems relating to oscillations and instability in the microphone gain resulting from multiple applications trying to simultaneously control the gain while preventing each application from adversely affecting the quality of another application's audio capture signal. Further, in one embodiment, the System AGC also acts to maximize the signal to noise (SNR) ratio of the microphone without introducing clipping as a function of a sampled background environment.
摘要:
A method of creating translingual text representations takes in documents in a first language and in a second language and creates a matrix using the words in the documents to represent which words are present in which language. An algorithm is applied to each matrix such that like documents are placed close to each other and unlike documents are moved far from each other.
摘要:
A general probabilistic formulation referred to as ‘Conditional Harmonic Mixing’ is provided, in which links between classification nodes are directed, a conditional probability matrix is associated with each link, and where the numbers of classes can vary from node to node. A posterior class probability at each node is updated by minimizing a divergence between its distribution and that predicted by its neighbors. For arbitrary graphs, as long as each unlabeled point is reachable from at least one training point, a solution generally always exists, is unique, and can be found by solving a sparse linear system iteratively. In one aspect, an automated data classification system is provided. The system includes a data set having at least one labeled category node in the data set. A semi-supervised learning component employs directed arcs to determine the label of at least one other unlabeled category node in the data set.
摘要:
Extracting features from signals for use in classification, retrieval, or identification of data represented by those signals uses a “Distortion Discriminant Analysis” (DDA) of a set of training signals to define parameters of a signal feature extractor. The signal feature extractor takes signals having one or more dimensions with a temporal or spatial structure, applies an oriented principal component analysis (OPCA) to limited regions of the signal, aggregates the output of multiple OPCAs that are spatially or temporally adjacent, and applies OPCA to the aggregate. The steps of aggregating adjacent OPCA outputs and applying OPCA to the aggregated values are performed one or more times for extracting low-dimensional noise-robust features from signals, including audio signals, images, video data, or any other time or frequency domain signal. Such extracted features are useful for many tasks, including automatic authentication or identification of particular signals, or particular elements within such signals.
摘要:
A system level automatic gain control (“System AGC”) automatically initializes and controls analog microphone gain in an environment where multiple independent applications simultaneously receive an input from a single analog microphone or microphone array. In one embodiment, the System AGC also prevents those applications from acting to separately control the gain by intercepting external gain control commands and responding to the corresponding application with a corresponding digital gain applied to the input signal from the microphone. Consequently, the System AGC avoids problems relating to oscillations and instability in the microphone gain resulting from multiple applications trying to simultaneously control the gain while preventing each application from adversely affecting the quality of another application's audio capture signal. Further, in one embodiment, the System AGC also acts to maximize the signal to noise (SNR) ratio of the microphone without introducing clipping as a function of a sampled background environment.
摘要:
The subject invention provides a unique system and method that facilitates an interactive game-powered search engine that serve the purposes of both users who may be looking for information as well as game participants who may desire to earn some reward or level of enjoyment by playing the game. More specifically, the system and method provides feedback to a user based on the user's input string or a string derived therefrom. The feedback can be a response or answer to the user's input in the form of text, an image, audio or sound, video, and/or a URL that is provided by one or more game participants when there is some degree of consistency or agreement between the responses or when individual players have demonstrated good reliability in their responses.
摘要:
The present invention relates to systems and/or methods that generate playlist(s) for a library or collection of media items via selecting a plurality of seed items, at least one of which is an undesirable seed item. Some of the seed items are desirable indicating that a user prefers additional media items similar to the desirable seed items and others are undesirable indicating that the user prefers additional media items dissimilar to the undesirable seed items. Additionally, the seed items can be weighted to establish a relative importance of the seed items. The invention compares media items in the collection with the seed items and determines which media items are added into the playlist by computation of similarity metrics or values. The playlist can be regenerated by adding desirable seed items to the playlist and removing media items from the playlist (e.g., undesirable seed items).