摘要:
A method and system for differentiating between intended user input and inadvertent or incidental contact with a touchpad is herein disclosed. When a user engages the touchpad, sensors on the touchpad are activated and generate touch sensor signals. Based on the pattern of engaged sensors, a hand pattern can be determined. From the hand pattern, a hand model may be retrieved. The hand model may indicate passive zones and active zones. Contact in the active zones may be considered intentional, while contact in the passive zones may be considered unintended or incidental. Moreover, a global shift may be calculated, and input from the active zones may be compensated for the global shift. The input from the active zones can then be used to control a graphical user interface.
摘要:
Sensors around the periphery of the remote control unit detect contact with the user's hand. A trained model-based pattern classification system analyzes the periphery sensor data and makes a probabilistic prediction of the user's hand size. The hand size is then used to control a mapping system that defines how gestures by the user's thumb upon a touchpad of the remote control unit are mapped to the control region upon a separate display screen.
摘要:
The hand held case of the remote control unit includes at least one touchpad, and other sensors, such as acceleration sensors, case perimeter sensors, pressure sensors, RF signal sensors. These sensors provide a rich array of sensory inputs that are classified by a pattern recognizer to generate control commands for both the consumer electronic equipment and the remote control unit itself. A power management system to conserve unit battery power is also responsive to the pattern recognizer to allow intelligent power management control. The control system uses the display of the consumer electronic equipment to provide instructions to the user, and the behavior of the remote control system uses what is displayed on the display as context information for pattern recognition.
摘要:
Model compression is combined with model compensation. Model compression is needed in embedded ASR to reduce the size and the computational complexity of compressed models. Model-compensation is used to adapt in real-time to changing noise environments. The present invention allows for the design of smaller ASR engines (memory consumption reduced to up to one-sixth) with reduced impact on recognition accuracy and/or robustness to noises.
摘要:
System and method for touch sensitive surface provide detection and tracking of multiple touch points on the surface by using two independent arrays of orthogonal linear capacitive sensors.
摘要:
A language model back-off system can be used with a user interface employing one or more language models to constrain navigation of selectable user interface input components. A user input interpretation module receives user input and interprets the user input to determine if a selection is made of one or more user interface input components. If a selection is not made, the user input interpretation module determines whether conditions are met for backing off one or more language models employed to constrain navigation of the user interface input components. If the conditions are met, a language model back-off module backs off the one or more language models.
摘要:
A noise robustness method operates jointly in a signal domain and a model domain. For example, energy is added in the signal domain for frequency bands where an actual noise level of an incoming signal is lower than a noise level used to train models, thus obtaining a compensated signal. Also, energy is added in the model domain for frequency bands where noise level of the incoming signal or the compensated signal is higher than the noise level used to train the models. Moreover, energy is never removed, thereby avoiding problems of higher sensitivity of energy removal to estimation errors.
摘要:
Linear approximation of the background noise is applied after feature extraction and prior to speaker adaptation to allow the speaker adaptation system to adapt the speech models to the enrolling user without distortion from background noise. The linear approximation is applied in the feature domain, such as in the cepstral domain. Any adaptation technique that is commutative in the feature domain may be used.
摘要:
An embedded device for playing media files is capable of generating a play list of media files based on input speech from a user. It includes an indexer generating a plurality of speech recognition grammars. According to one aspect of the invention, the indexer generates speech recognition grammars based on contents of a media file header of the media file. According to another aspect of the invention, the indexer generates speech recognition grammars based on categories in a file path for retrieving the media file to a user location. When a speech recognizer receives an input speech from a user while in a selection mode, a media file selector compares the input speech received while in the selection mode to the plurality of speech recognition grammars, thereby selecting the media file.
摘要:
The acoustic speech signal is decomposed into wavelets arranged in an asymmetrical tree data structure from which individual nodes may be selected to best extract local features, as needed to model specific classes of sound units. The wavelet packet transformation is smoothed through integration and compressed to apply a non-linearity prior to discrete cosine transformation. The resulting subband features such as cepstral coefficients may then be used to construct the speech recognizer's speech models. Using the local feature information extracted in this manner allows a single recognizer to be optimized for several different classes of sound units, thereby eliminating the need for parallel path recognizers.