Abstract:
A method and apparatus are provided for storing parameters of a deleted interpolation language model as parameters of a backoff language model. In particular, the parameters of the deleted interpolation language model are stored in the standard ARPA format. Under one embodiment, the deleted interpolation language model parameters are formed using fractional counts.
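One way to picture the storage idea is with a bigram model. The sketch below is illustrative only and rests on assumptions not stated in the abstract: the interpolated probability is taken to be (1 - λ(h))·f(w|h) + λ(h)·P(w), the interpolated value is stored as the bigram entry for every seen bigram, and λ(h) is stored as the backoff weight of the history h. The function names and the toy probabilities are hypothetical.

```python
import math

def interpolated_to_backoff(unigram_p, bigram_rel_freq, lam):
    """Map interpolated bigram parameters onto backoff parameters.

    unigram_p:       dict word -> P(w)
    bigram_rel_freq: dict (h, w) -> f(w|h), relative frequency of seen bigrams
    lam:             dict h -> weight given to the unigram model (assumed form)
    """
    bigram_p = {
        (h, w): (1.0 - lam[h]) * f + lam[h] * unigram_p[w]
        for (h, w), f in bigram_rel_freq.items()
    }
    # For an unseen bigram f(w|h) = 0, so the interpolated value reduces to
    # lam[h] * P(w); that is what a backoff lookup yields if lam[h] is the weight.
    backoff = dict(lam)
    return bigram_p, backoff

def backoff_prob(w, h, unigram_p, bigram_p, backoff):
    """Standard backoff lookup: explicit bigram if stored, else weight * unigram."""
    if (h, w) in bigram_p:
        return bigram_p[(h, w)]
    return backoff.get(h, 1.0) * unigram_p[w]

def to_arpa(unigram_p, bigram_p, backoff):
    """Serialize to ARPA text: log10 probability, n-gram, optional log10 backoff."""
    lines = ["\\data\\", f"ngram 1={len(unigram_p)}", f"ngram 2={len(bigram_p)}",
             "", "\\1-grams:"]
    for w in sorted(unigram_p):
        line = f"{math.log10(unigram_p[w]):.6f}\t{w}"
        if w in backoff:
            line += f"\t{math.log10(backoff[w]):.6f}"
        lines.append(line)
    lines += ["", "\\2-grams:"]
    for (h, w) in sorted(bigram_p):
        lines.append(f"{math.log10(bigram_p[(h, w)]):.6f}\t{h} {w}")
    lines += ["", "\\end\\"]
    return "\n".join(lines)

# Toy, made-up parameters.
unigram_p = {"a": 0.5, "b": 0.3, "c": 0.2}
bigram_rel_freq = {("a", "b"): 0.75, ("a", "c"): 0.25}
lam = {"a": 0.2}

bigram_p, backoff = interpolated_to_backoff(unigram_p, bigram_rel_freq, lam)
arpa_text = to_arpa(unigram_p, bigram_p, backoff)
```

Under these assumptions the backoff lookup reproduces the interpolated probability for seen and unseen bigrams alike, which is why the interpolated model can be stored in a file that any ARPA-format reader accepts.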
Abstract:
A switchable apochromatic polarization rotator is provided. A first fixed waveplate has a first principal axis fixed in a first orientation. A second fixed waveplate has a second principal axis fixed in a second orientation. First and second switchable waveplates have principal axes oriented such that broad spectral range electromagnetic radiation transmitted through all of the waveplates has a first rotated polarization, wherein, in response to one or more control signals applied to the switchable waveplates, the principal axes of the switchable waveplates rotate such that the electromagnetic radiation transmitted through all of the waveplates has a second rotated polarization. The switchable waveplates may utilize ferroelectric liquid crystal material or nematic liquid crystal material, or may be mechanically rotated, to adjust the orientation of their principal axes. Waveplates as described may be used to tune for a desired spectral range and/or to compensate for temperature dependencies.
Abstract:
In accordance with one embodiment of the present invention, unanticipated semantic intents are discovered in audio data in an unsupervised manner. For instance, the audio acoustics are clustered based on semantic intent and representative acoustics are chosen for each cluster. A human reviewer then needs to listen to only a small number of representative acoustics for each cluster (and possibly only one per cluster) in order to identify the unforeseen semantic intents.
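The step of choosing representatives can be sketched as follows. The abstract does not say how representatives are selected; this example assumes the clustering has already been done, that each utterance is described by a numeric feature vector, and that the representative is the cluster medoid (the member with the smallest total distance to the others). The identifiers and the data are hypothetical.

```python
import math

def medoid_index(vectors):
    """Index of the vector with the smallest summed Euclidean distance to the rest."""
    def total_distance(i):
        return sum(math.dist(vectors[i], other) for other in vectors)
    return min(range(len(vectors)), key=total_distance)

def choose_representatives(clusters):
    """clusters: dict label -> list of (utterance_id, feature_vector).

    Returns dict label -> utterance_id of the member chosen as representative.
    """
    representatives = {}
    for label, members in clusters.items():
        vectors = [vec for _, vec in members]
        representatives[label] = members[medoid_index(vectors)][0]
    return representatives

# Made-up clusters of utterances with two-dimensional features.
clusters = {
    "cluster_0": [("u1", (0.0, 0.0)), ("u2", (1.0, 0.0)), ("u3", (2.0, 0.0))],
    "cluster_1": [("u4", (5.0, 5.0))],
}
reps = choose_representatives(clusters)
```

A reviewer would then listen only to the utterances named in `reps` rather than to every member of each cluster.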
Abstract:
A method of modeling a speech recognition system includes decoding a speech signal produced from a training text to produce a sequence of predicted speech units. The training text comprises a sequence of actual speech units that is used with the sequence of predicted speech units to form a confusion model. In further embodiments, the confusion model is used to decode a text to identify an error rate that would be expected if the speech recognition system decoded speech based on the text.
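A minimal sketch of the idea, under simplifying assumptions that the abstract does not make: the actual and predicted unit sequences are taken to be already aligned one-to-one (substitutions only, no insertions or deletions), and the confusion model is a table of relative frequencies P(predicted | actual). The function names and the toy sequences are hypothetical.

```python
from collections import Counter, defaultdict

def build_confusion_model(actual_units, predicted_units):
    """Estimate P(predicted | actual) from aligned unit sequences."""
    counts = defaultdict(Counter)
    for actual, predicted in zip(actual_units, predicted_units):
        counts[actual][predicted] += 1
    return {
        actual: {pred: n / sum(c.values()) for pred, n in c.items()}
        for actual, c in counts.items()
    }

def expected_error_rate(model, text_units):
    """Expected fraction of units in a new text that would be misrecognized.

    Units never seen in training are skipped, since the model says nothing
    about them; the rate is averaged over the units that are covered.
    """
    covered = [u for u in text_units if u in model]
    if not covered:
        raise ValueError("no unit of the text is covered by the confusion model")
    expected_errors = sum(1.0 - model[u].get(u, 0.0) for u in covered)
    return expected_errors / len(covered)

# Toy training data: units from the training text and the decoder's output.
actual = ["a", "b", "a", "c", "a", "b"]
predicted = ["a", "b", "e", "c", "a", "d"]

model = build_confusion_model(actual, predicted)
rate = expected_error_rate(model, ["a", "b", "c"])
```

The point of the second function is that it needs only text: no speech has to be recorded or decoded to estimate how the recognizer would fare on new material.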
Abstract:
Computer-readable media, computer systems, and computing methods are provided for implicitly discovering applications that have a corresponding location. Initially, an application manifest is constructed that maintains a mapping between apps and their respective geographic boundaries. In operation, upon detecting a location signal being transmitted from a mobile device, the location signal is employed to identify a set of valid applications. Generally, the location signal represents location-based information entered into or received from the mobile device. Identifying the set of valid applications involves comparing the location signal against the application manifest and, based on the comparison, designating those apps that are mapped to geographic boundaries that meet the location signal as the set of valid applications. Upon identifying the set of valid applications, a selection of apps included within the set of valid applications is sent to the mobile device for display to the user within an operational context.
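The comparison of a location signal against the manifest can be sketched as below. The abstract does not specify how geographic boundaries are represented; this example assumes simple latitude/longitude bounding boxes that do not cross the antimeridian. The class, the function, the app names, and the coordinates are all hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class BoundingBox:
    """Rectangular boundary in degrees (assumed representation)."""
    south: float
    west: float
    north: float
    east: float

    def contains(self, lat, lon):
        return self.south <= lat <= self.north and self.west <= lon <= self.east

def valid_applications(manifest, lat, lon):
    """Return the apps whose mapped boundary contains the reported location."""
    return sorted(app for app, box in manifest.items() if box.contains(lat, lon))

# Hypothetical manifest: app name -> geographic boundary.
manifest = {
    "city_transit": BoundingBox(south=47.4, west=-122.5, north=47.8, east=-122.2),
    "stadium_guide": BoundingBox(south=40.0, west=-75.3, north=40.1, east=-75.1),
    "regional_parks": BoundingBox(south=45.0, west=-125.0, north=49.0, east=-120.0),
}

apps_here = valid_applications(manifest, lat=47.6, lon=-122.3)
```

A selection from `apps_here` is what would be sent back to the device; how that selection is ranked or trimmed is outside this sketch.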
Abstract:
Computer-readable media, computer systems, and computing methods are provided for implicitly discovering applications using a query or search results, where the query does not explicitly target a particular application. A search engine is used to receive a user-initiated query and to employ a keyword of the query to generate an initial set of search results. Characteristics of the search results are identified and applications that are relevant to the characteristics are determined. The determination of relevant applications includes: accessing an application manifest that manages a mapping between applications and predefined characteristics; comparing the characteristics of the search results against the predefined characteristics of the application manifest to determine a match; and, when a match exists between the search-result characteristics and the predefined characteristics, identifying applications mapped to the matched characteristics as being relevant to the search results. These identified applications are presented in response to the query.
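The three determination steps (access the manifest, compare characteristics, identify the mapped applications) can be sketched as a set intersection. This assumes that characteristics are plain string labels and that a single shared label counts as a match; the abstract specifies neither. The manifest contents and names are hypothetical.

```python
def relevant_applications(manifest, result_characteristics):
    """Return apps whose predefined characteristics overlap those of the results.

    manifest:               dict app name -> set of predefined characteristics
    result_characteristics: iterable of characteristics found in the search results
    """
    observed = set(result_characteristics)
    matches = {}
    for app, predefined in manifest.items():
        shared = observed & set(predefined)
        if shared:  # at least one characteristic matches
            matches[app] = sorted(shared)
    return matches

# Hypothetical manifest and search-result characteristics.
manifest = {
    "movie_tickets": {"movie", "showtimes"},
    "recipe_box": {"recipe", "ingredient"},
    "flight_tracker": {"flight", "airport"},
}
characteristics = ["movie", "review", "airport"]

found = relevant_applications(manifest, characteristics)
```

Returning the shared characteristics alongside each app keeps the reason for the match available, which could help when deciding how to present the apps with the search results.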
Abstract:
Methods are disclosed for estimating language models such that the conditional likelihood of a class given a word string, which is very well correlated with classification accuracy, is maximized. The methods comprise tuning statistical language model parameters jointly for all classes such that a classifier discriminates between the correct class and the incorrect ones for a given training sentence or utterance. Specific embodiments of the present invention pertain to implementation of the rational function growth transform in the context of a discriminative training technique for n-gram classifiers.
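The quantity being maximized can be made concrete with a small sketch. It computes the class posterior P(C | W) by Bayes' rule from class priors and one language model per class, then sums the log posteriors of the correct classes over a training set. For brevity it assumes unigram class models with no smoothing, and it does not implement the rational function growth transform; it only evaluates the objective that such training would increase. All names and probabilities are hypothetical.

```python
import math

def class_log_posteriors(words, priors, class_lms):
    """log P(C | W) for every class, with P(W | C) from a per-class unigram model."""
    joint = {
        c: math.log(priors[c]) + sum(math.log(class_lms[c][w]) for w in words)
        for c in priors
    }
    # Normalize with log-sum-exp for numerical stability.
    top = max(joint.values())
    log_norm = top + math.log(sum(math.exp(s - top) for s in joint.values()))
    return {c: s - log_norm for c, s in joint.items()}

def conditional_log_likelihood(data, priors, class_lms):
    """Sum over training pairs (words, correct_class) of log P(correct_class | words)."""
    return sum(
        class_log_posteriors(words, priors, class_lms)[label]
        for words, label in data
    )

def classify(words, priors, class_lms):
    posteriors = class_log_posteriors(words, priors, class_lms)
    return max(posteriors, key=posteriors.get)

# Toy, made-up models for two classes.
priors = {"weather": 0.5, "travel": 0.5}
class_lms = {
    "weather": {"rain": 0.6, "flight": 0.1, "today": 0.3},
    "travel": {"rain": 0.1, "flight": 0.6, "today": 0.3},
}
training = [(["rain", "today"], "weather"), (["flight", "today"], "travel")]

post = class_log_posteriors(["rain", "today"], priors, class_lms)
objective = conditional_log_likelihood(training, priors, class_lms)
```

Because every class appears in the normalizer, raising the posterior of the correct class necessarily lowers that of the incorrect ones, which is why the parameters have to be tuned jointly across classes.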
Abstract:
Audio/video (A/V) content is analyzed using speech and language analysis components. Metadata is automatically generated based upon the analysis. The metadata is used in generating user interface interaction components which allow a user to view subject matter in various segments of the A/V content and to interact with the A/V content based on the automatically generated metadata.