Abstract:
Systems and methods for voice-activated commands in a digital home communication terminal are disclosed. One example method includes storing a program audio signal corresponding to a program tuned by the digital home communication terminal. The method also includes storing an incoming audio signal carrying speech and removing from the incoming audio signal the portion that corresponds to the program audio signal, thereby producing an improved version of the incoming audio signal. The method also includes selecting one of a plurality of voice-activated commands that corresponds to the improved version of the incoming audio signal, and performing a function corresponding to the selected voice-activated command.
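The removal step can be illustrated with a minimal sketch. The abstract does not say how the program audio is removed, so the approach here is an assumption: the two stored signals are taken to be time-aligned and equal in length, and a single least-squares gain is estimated before subtracting. The function name `remove_program_audio` is hypothetical.

```python
def remove_program_audio(incoming, program):
    """Subtract the best-fitting scaled copy of the program audio.

    Assumption: both signals are time-aligned sample lists of equal length.
    A real terminal would likely need delay estimation and adaptive filtering.
    """
    if len(incoming) != len(program):
        raise ValueError("signals must have the same length")
    energy = sum(p * p for p in program)
    if energy == 0:
        # Silent program audio: nothing to remove.
        return list(incoming)
    # Least-squares gain that best explains the incoming signal
    # as a scaled copy of the program audio.
    gain = sum(x * p for x, p in zip(incoming, program)) / energy
    return [x - gain * p for x, p in zip(incoming, program)]
```

The output plays the role of the "improved version" of the incoming signal, which would then be matched against the stored voice-activated commands.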
Abstract:
Systems and methods for training voice activation control of electronic equipment are disclosed. One example method includes receiving a selection corresponding to at least one command used to control the electronic equipment. The method further includes instructing a user to speak, and responsive to the instruction, receiving a digitized speech stream. The method further includes segmenting the speech stream into speech segments, storing at least one of the speech segments as an entry in a dictionary, and associating the dictionary entry with the selected command.
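The training flow described above might look roughly like the following sketch. The segmentation rule (an amplitude threshold that splits on silence), the choice to keep the longest segment, and the names `segment_speech` and `train_command` are all assumptions made for illustration; the abstract does not specify them.

```python
def segment_speech(samples, threshold=0.1, min_len=2):
    """Split a digitized speech stream into segments separated by silence.

    Assumption: a sample counts as silence when its magnitude is below threshold.
    """
    segments, current = [], []
    for s in samples:
        if abs(s) >= threshold:
            current.append(s)
        else:
            if len(current) >= min_len:
                segments.append(current)
            current = []
    if len(current) >= min_len:
        segments.append(current)
    return segments


def train_command(dictionary, command, samples):
    """Store one speech segment as the dictionary entry for the selected command."""
    segments = segment_speech(samples)
    if not segments:
        raise ValueError("no speech detected")
    # Assumption: the longest segment is taken as the spoken command.
    entry = max(segments, key=len)
    dictionary[command] = entry
    return entry
```

Here the dictionary is keyed directly by the command, so storing the entry and associating it with the command happen in one step.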
Abstract:
A method, apparatus and system that receives speech commands at a remote control device microphone, digitizes those input speech commands, compresses the digitized speech commands, multiplexes control commands with the compressed digitized speech commands, and transmits the compressed digitized speech commands to an electronic device, such as a digital home communication terminal (DHCT). The electronic device decompresses and interprets the speech commands to allow the remote control operator to control the electronic device. Because speech recognition is performed at the electronic device, rather than at the remote control device, the remote control does not have to interpret the speech and transmit infrared signals that represent user commands. This reduces the processing and voice recognition capabilities required of the remote control. Additionally, because the electronic device processes the digitized voice received from the remote control device, the electronic device can negate the effect of sounds, such as television audio, likely captured by the microphone on the remote control device. This gives the electronic device a greater capability to interpret user commands.
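One way to picture the multiplexing of control commands with compressed speech is a simple tagged-frame format. Everything about the format below is an assumption: the one-byte type tag, the two-byte length, and the use of `zlib` as a stand-in for whatever speech codec a real remote control would use.

```python
import struct
import zlib

TYPE_CONTROL = 0  # hypothetical tag for key-press style control commands
TYPE_SPEECH = 1   # hypothetical tag for compressed digitized speech


def mux(frames):
    """Remote side: interleave control and speech frames into one byte stream."""
    out = bytearray()
    for kind, payload in frames:
        if kind == TYPE_SPEECH:
            payload = zlib.compress(payload)  # stand-in for a speech codec
        if len(payload) > 0xFFFF:
            raise ValueError("payload too large for a 16-bit length field")
        out += struct.pack(">BH", kind, len(payload)) + payload
    return bytes(out)


def demux(stream):
    """Terminal side: split the stream and decompress the speech frames."""
    frames, offset = [], 0
    while offset < len(stream):
        kind, length = struct.unpack_from(">BH", stream, offset)
        offset += 3
        payload = stream[offset:offset + length]
        offset += length
        if kind == TYPE_SPEECH:
            payload = zlib.decompress(payload)
        frames.append((kind, payload))
    return frames
```

In this sketch the terminal recovers both kinds of frame from the single stream, and interpreting the speech is left entirely to the terminal.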
Abstract:
A method, apparatus and system that receives speech commands at a remote control device, digitizes those speech commands, and transmits the digitized speech commands to an electronic device, such as a digital home communication terminal (DHCT). The electronic device interprets the speech commands to allow the remote control operator to control the electronic device.
Abstract:
Receiving a video stream in a transport stream comprising a plurality of compressed pictures, wherein information in the video stream includes plural data fields comprising: a first data field corresponding to a location in the video stream of a potential splice point, wherein the first data field identifies a location in the video stream after the location of the received information; a second data field corresponding to decompressed pictures yet to be output (DPYTBO) by a video decoder at the identified potential splice point (IPSP) when the video decoder decompresses the video stream, wherein the second data field is a number corresponding to the DPYTBO by the video decoder at the IPSP; and a third data field corresponding to pictures with contiguous output times (WCOT), wherein the third field corresponds to a set of pictures WCOT of the DPYTBO by the video decoder at the IPSP.
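The three data fields can be pictured as a small record. This is only a sketch: the field names, the use of picture counts for the first and third fields, and the validation rules are assumptions, since the abstract does not give an encoding.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpliceInfo:
    """Hypothetical container for the three data fields in the abstract."""
    pictures_until_splice: int  # first field: distance ahead to the IPSP (assumed unit: pictures)
    dpytbo_count: int           # second field: number of DPYTBO at the IPSP
    wcot_count: int             # third field: assumed size of the WCOT set among the DPYTBO

    def __post_init__(self):
        if self.pictures_until_splice <= 0:
            # The splice point lies after the location of this information.
            raise ValueError("splice point must be ahead of this information")
        if self.dpytbo_count < 0 or self.wcot_count < 0:
            raise ValueError("counts must be non-negative")
        if self.wcot_count > self.dpytbo_count:
            # The WCOT pictures are a subset of the DPYTBO.
            raise ValueError("WCOT pictures cannot outnumber the DPYTBO")
```

For example, `SpliceInfo(12, 4, 3)` would describe a potential splice point 12 pictures ahead at which 4 decompressed pictures are still awaiting output, 3 of them with contiguous output times.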
Abstract:
The present invention provides a method and system for accessing services in a television system. In one implementation, a digital home communication terminal (DHCT) presents a user with a menu containing a plurality of selectable link representations corresponding to separate services or applications offered by the cable television system. The user navigates the menu with a remote device and selects a desired service by choosing the selectable link representation corresponding to the desired service or application. The DHCT receives the user input, translates the selectable link command into an executable call, and activates the service or application corresponding to the link representation the user selected from the menu.
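The translation of a selected link into an executable call can be sketched as a lookup table of callables. The table, the link identifiers, and the function `activate` are hypothetical names used only for illustration.

```python
def activate(link_id, services):
    """Translate a selected menu link into a call and run it.

    `services` maps link identifiers to zero-argument callables (an assumption).
    """
    try:
        handler = services[link_id]
    except KeyError:
        raise ValueError(f"unknown link: {link_id!r}") from None
    return handler()


# Hypothetical menu: each link representation maps to one service.
MENU = {
    "guide": lambda: "program guide started",
    "vod": lambda: "video on demand started",
}
```

Calling `activate("vod", MENU)` runs the handler registered for the video-on-demand link; an unknown link is rejected rather than ignored.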
Abstract:
An apparatus for facilitating robust data transport. In one embodiment, the apparatus includes a first mechanism for selecting plural lattices of an input video signal, processing plural decimated video signals, and time shifting corresponding portions of plural video streams in accordance with a second relative temporal order. A second mechanism changes an initial relative temporal order to the second relative temporal order.
Abstract:
In one embodiment, a method that provides plural representations of a single video signal that comprises a successive sequence of pictures, one or more of the plural representations including a respective sequence of latticed pictures, each latticed picture in the one or more plural representations originating from a corresponding respective picture of the video signal, the order of successive latticed pictures in the one or more of the plural representations of the video signal corresponding to the order of successive pictures in the video signal; processes the plural representations based on a predetermined encoding strategy, the predetermined encoding strategy targeting an appropriate respective amount of bits to each of a plurality of the processed latticed pictures, each of the plurality of the processed latticed pictures having a respective picture importance; and provides the plurality of processed latticed pictures in plural successive, non-overlapping, ordered segments in a single video stream.
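To make the idea of latticed pictures concrete, the sketch below splits one picture into several decimated pictures, each taking every `step`-th pixel at a different offset. The regular 2x2 sampling lattice and the function name `lattice_pictures` are assumptions; the abstract does not fix a particular lattice.

```python
def lattice_pictures(picture, step=2):
    """Split a picture (a list of pixel rows) into step*step latticed pictures.

    Each latticed picture samples the original at one (row, column) offset,
    so together they cover every pixel exactly once.
    """
    return [
        [row[dx::step] for row in picture[dy::step]]
        for dy in range(step)
        for dx in range(step)
    ]
```

Applying this to every picture of the video signal, in order, yields one sequence of latticed pictures per offset, which matches the description of plural representations whose picture order follows that of the original signal.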
Abstract:
Systems and methods may be provided embodying a novel approach to measuring degradation (or distortion) by analyzing disparity maps from original 3D video and reconstructed 3D video. The disparity maps may be derived using a stereo-matching algorithm exploiting 2-view stereo image disparity. An overall distortion measure may also be determined resulting from the weighted sum of plural measures of distortions, one of the plural distortion measures corresponding to a measure of disparity degradation, and another one corresponding to a measure of geometrical distortion. The measure (or overall distortion measure) is used during real-time encoding to effect various decisions, including mode decision in the coding of each corresponding stereo pair, and in rate control (including stereo pair quantization).
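The weighted-sum distortion measure can be sketched as follows. The mean absolute difference used for disparity degradation, the dictionary layout, and the weights are assumptions for illustration; the abstract names the two component measures but not how they are computed.

```python
def disparity_degradation(original_map, reconstructed_map):
    """Mean absolute difference between two disparity maps of equal shape.

    Assumption: each map is a list of rows of per-pixel disparity values.
    """
    diffs = [
        abs(o - r)
        for row_o, row_r in zip(original_map, reconstructed_map)
        for o, r in zip(row_o, row_r)
    ]
    if not diffs:
        raise ValueError("disparity maps are empty")
    return sum(diffs) / len(diffs)


def overall_distortion(measures, weights):
    """Weighted sum of named distortion measures."""
    return sum(weights[name] * value for name, value in measures.items())
```

An encoder could evaluate `overall_distortion` for each candidate coding mode of a stereo pair and prefer the mode with the lowest value, which is one plausible reading of how the measure informs mode decision.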