摘要:
Speech recognition is performed by receiving isolated speech training data indicative of a plurality of discretely spoken training words, and receiving continuous speech training data indicative of a plurality of continuously spoken training words. A plurality of speech unit models is trained based on the isolated speech training data and the continuous speech training data. Speech is recognized based on the speech unit models trained.
摘要:
A method and system for dynamically selecting words for training a speech recognition system. The speech recognition system models each phoneme using a hidden Markov model and represents each word as a sequence of phonemes. The training system ranks each phoneme for each frame according to the probability that the corresponding codeword will be spoken as part of the phoneme. The training system collects spoken utterances for which the corresponding word is known. The training system then aligns the codewords of each utterance with the phoneme that it is recognized to be part of. The training system then calculates an average rank for each phoneme using the aligned codewords for the aligned frames. Finally, the training system selects words for training that contain phonemes with a low rank.
摘要:
A method and apparatus are provided for segmenting words into component parts. Under the invention, mutual information scores for pairs of graphoneme units found in a set of words are determined. Each graphoneme unit includes at least one letter. The graphoneme units of one pair of graphoneme units are combined based on the mutual information score. This forms a new graphoneme unit. Under one aspect of the invention, a syllable n-gram model is trained based on words that have been segmented into syllables using mutual information. The syllable n-gram model is used to segment a phonetic representation of a new word into syllables. Similarly, an inventory of morphemes is formed using mutual information and a morpheme n-gram is trained that can be used to segment a new word into a sequence of morphemes.
摘要:
A method and apparatus are provided for segmenting words into component parts. Under the invention, mutual information scores for pairs of graphoneme units found in a set of words are determined. Each graphoneme unit includes at least one letter. The graphoneme units of one pair of graphoneme units are combined based on the mutual information score. This forms a new graphoneme unit. Under one aspect of the invention, a syllable n-gram model is trained based on words that have been segmented into syllables using mutual information. The syllable n-gram model is used to segment a phonetic representation of a new word into syllables. Similarly, an inventory of morphemes is formed using mutual information and a morpheme n-gram is trained that can be used to segment a new word into a sequence of morphemes.
摘要:
A speech recognition system is extensible in that new terms may be added to a list of terms that are recognized by the speech recognition system. The speech recognition system provides audio feedback when new terms are added so that a user may hear how the system expects the word to be pronounced. The user may then accept the pronunciation or provide his own pronunciation. The user may also selectively change the pronunciation of words to avoid misrecognitions by the system. The system may provide appropriate user interface elements for enabling a user to change the pronunciation of words. The system may also include intelligence for automatically changing the pronunciation of words used in recognition based upon empirically derived information.
摘要:
A method and system for editing words that have been misrecognized. The system allows a speaker to specify a number of alternative words to be displayed in a correction window by resizing the correction window. The system also displays the words in the correction window in alphabetical order. A preferred system eliminates the possibility, when a misrecognized word is respoken, that the respoken utterance will be again recognized as the same misrecognized word. This elimination occurs based on the probabilities of alternative words associated with both the misrecognized utterance and the respoken utterance. The system, when operating with a word processor, allows the speaker to specify the amount of speech that is buffered before transferring to the word processor. The system also uses a word correction metaphor or a phrase correction metaphor.
摘要:
Among other things, one or more techniques and/or systems are provided for facilitating the completion of a user task. That is, user intent (e.g., intentions of a user to perform a user task) may be identified. The user intent may comprise an entity (e.g., a movie entity) and/or an action (e.g., an order movie tickets action) that the user wants to perform on the entity. A provider list may be created based upon one or more providers capable of performing the action on the entity (e.g., a movie application may be capable of performing the order movie tickets action on the movie entity). Providers may be dynamically selected for inclusion within the provider list at run-time. For example, an open market of providers may be maintained (e.g., providers may be added, removed, and/or updated over time), such that providers may be selected from the open market to complete user tasks.
摘要:
Desirable completion zones can be identified using closure stress in combination with one or more other attributes such as porosity. One computer-based well placement method includes using the computer to: process a seismic data volume to map the spatial distribution of a seismic-based CSS attribute; acquire logs from one or more boreholes in the subsurface region; derive from the logs a relationship between CSS and a minimum in-situ stress; apply the relationship to the CSS attribute map to produce a landing map that highlights desirable completion zones; and place one or more wells in the desirable completion zones. The borehole logs may include direct measurements of minimum in-situ stress (acquired via microfracture testing), sonic tool measurements of P-wave and S-wave velocity, and density tool measurements of bulk formation density.
摘要:
This invention relates to methods and apparatus for determination of ion concentrations, particularly in downhole water from hydrocarbon wells, aquifers etc. It is useful in a wide range of applications, including predicting the formation of scale and fingerprinting waters from different sources. More particularly, the invention relates to the use of ligands whose electronic configuration is altered by the binding of the scaling ions within a water sample. These alterations are detected electrochemically by applying varying potential to electrodes and measuring current flow as potential is varied, from which is derived the concentration of scaling ions in the fluid.
摘要:
In an electronic device, an image point A on an image of an object is selected. A spectral confocal sensor is controlled to move to a position above a measuring point A′ on the object, where the measuring point A′ corresponds to the image point A, and a Z-coordinate of the measuring point A′ is computed using the spectral confocal sensor. A focal position of the measuring point A′ is computed according to the Z-coordinate of the measuring point A′, and a CCD lens is controlled to move to the focal position. The Z-coordinate of the measuring point A′ is stored into a storage unit of the electronic device.