Recognition of voice-activated commands
    1.
    Granted invention patent
    Status: in force

    Publication (announcement) No.: US07996232B2

    Publication (announcement) date: 2011-08-09

    Application No.: US12388733

    Filing date: 2009-02-19

    IPC classification: G10L21/00 G10L15/20

    Abstract: Systems and methods for voice-activated commands in a digital home communication terminal are disclosed. One example method includes storing a program audio signal corresponding to a program tuned by the digital home communication terminal. The method also includes storing an incoming audio signal carrying speech and removing from the incoming audio signal the portion that corresponds to the program audio signal, thereby producing an improved version of the incoming audio signal. The method also includes selecting one of a plurality of voice-activated commands that corresponds to the improved version of the incoming audio signal, and performing a function corresponding to the selected voice-activated command.
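
    The removal step in this abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that the microphone picks up a delayed and attenuated copy of the stored program audio; the function name `remove_program_audio`, the `max_delay` search range, and the least-squares fit are assumptions of this sketch, not details taken from the patent.

```python
def remove_program_audio(incoming, program, max_delay=8):
    """Return `incoming` minus the best-fitting delayed, scaled copy of `program`."""
    best_score, best_delay, best_gain = 0.0, 0, 0.0
    for delay in range(max_delay + 1):
        stop = min(len(incoming), len(program) + delay)
        # Pair each microphone sample with the program sample played `delay` earlier.
        pairs = [(incoming[n], program[n - delay]) for n in range(delay, stop)]
        energy = sum(p * p for _, p in pairs)
        if energy == 0:
            continue
        corr = sum(x * p for x, p in pairs)
        score = corr * corr / energy  # signal energy that a least-squares fit removes
        if score > best_score:
            best_score, best_delay, best_gain = score, delay, corr / energy
    cleaned = list(incoming)
    stop = min(len(incoming), len(program) + best_delay)
    for n in range(best_delay, stop):
        # Subtract the estimated program-audio contribution from the microphone signal.
        cleaned[n] -= best_gain * program[n - best_delay]
    return cleaned
```

    A practical system would more likely use an adaptive filter that tracks room acoustics over time; the single gain and delay here only show the idea of subtracting a known reference before recognition.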

    Training of voice-controlled television navigation
    2.
    Granted invention patent
    Status: in force

    Publication (announcement) No.: US08849660B2

    Publication (announcement) date: 2014-09-30

    Application No.: US11956675

    Filing date: 2007-12-14

    IPC classification: G10L15/00

    Abstract: Systems and methods for training voice activation control of electronic equipment are disclosed. One example method includes receiving a selection corresponding to at least one command used to control the electronic equipment. The method further includes instructing a user to speak and, responsive to the instruction, receiving a digitized speech stream. The method further includes segmenting the speech stream into speech segments, storing at least one of the speech segments as an entry in a dictionary, and associating the dictionary entry with the selected command.
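
    The training flow described above (segment the speech stream, store a segment, associate it with the selected command) might be sketched as follows. The energy-threshold segmentation, the `frame_len` and `threshold` values, and the choice to keep the longest segment are assumptions of this sketch; the abstract does not say how segmentation is done.

```python
def segment_speech(samples, frame_len=4, threshold=0.1):
    """Split samples into runs of consecutive frames whose mean level exceeds threshold."""
    segments, current = [], []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        level = sum(abs(s) for s in frame) / len(frame)
        if level > threshold:
            current.extend(frame)      # still inside a spoken segment
        elif current:
            segments.append(current)   # a quiet frame closes the segment
            current = []
    if current:
        segments.append(current)
    return segments


def train_command(dictionary, command, samples):
    """Store one speech segment as the dictionary entry for `command`."""
    segments = segment_speech(samples)
    if not segments:
        raise ValueError("no speech detected")
    entry = max(segments, key=len)     # arbitrary choice for this sketch
    dictionary[command] = entry
    return entry
```

    Here `dictionary` is a plain Python `dict` keyed by a command name, which is one simple way to associate an entry with a command.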

    Systems and methods for TV navigation with compressed voice-activated commands
    3.
    Granted invention patent
    Status: in force

    Publication (announcement) No.: US07321857B2

    Publication (announcement) date: 2008-01-22

    Application No.: US11032438

    Filing date: 2005-01-10

    IPC classification: G10L15/22

    Abstract: A method, apparatus and system that receive speech commands at a remote control device microphone, digitize those input speech commands, compress the digitized speech commands, multiplex control commands with the compressed digitized speech commands, and transmit the compressed digitized speech commands to an electronic device, such as a digital home communication terminal (DHCT). The electronic device decompresses and interprets the speech commands to allow the remote control operator to control the electronic device. Because speech recognition is performed at the electronic device, rather than at the remote control device, the remote control does not have to interpret and transmit infrared signals that represent user commands. This simplifies the processing and voice recognition capabilities required by the remote control. Additionally, because the electronic device processes the digitized voice received from the remote control device, the electronic device can negate the effect of sounds, such as television audio, likely captured by the microphone on the remote control device. This results in a greater capability of the electronic device to interpret user commands.
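
    As a rough illustration of the compress-and-multiplex path on the remote control side, the sketch below compands speech samples with mu-law and tags each packet with a type byte. Mu-law is chosen only as a familiar example of speech compression, and the packet layout (`SPEECH`/`CONTROL` tags plus a length byte) is hypothetical; the abstract does not specify either.

```python
import math

MU = 255
SPEECH, CONTROL = 0x01, 0x02  # hypothetical packet type tags


def mulaw_encode(x):
    """Compand a sample in [-1, 1] into one byte."""
    x = max(-1.0, min(1.0, x))
    y = math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)
    return int(round((y + 1) / 2 * 255))


def mulaw_decode(b):
    """Expand one companded byte back to a sample in [-1, 1]."""
    y = b / 255 * 2 - 1
    return math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y)


def pack(kind, payload):
    """Frame a payload as: type byte, length byte, data."""
    if len(payload) > 255:
        raise ValueError("payload too long for a one-byte length field")
    return bytes([kind, len(payload)]) + payload


def multiplex(speech_samples, control_codes):
    """Interleave compressed speech and control commands in one byte stream."""
    speech = bytes(mulaw_encode(s) for s in speech_samples)
    return pack(SPEECH, speech) + pack(CONTROL, bytes(control_codes))


def demultiplex(stream):
    """Receiver side: split the stream back into (type, payload) packets."""
    packets, i = [], 0
    while i < len(stream):
        kind, length = stream[i], stream[i + 1]
        packets.append((kind, stream[i + 2:i + 2 + length]))
        i += 2 + length
    return packets
```

    The receiving device would decode the `SPEECH` payloads with `mulaw_decode` before running recognition, and handle `CONTROL` payloads as ordinary key codes.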

    Recognition of Voice-Activated Commands
    5.
    Invention patent application
    Status: in force

    Publication (announcement) No.: US20090299752A1

    Publication (announcement) date: 2009-12-03

    Application No.: US12388733

    Filing date: 2009-02-19

    IPC classification: G10L15/00

    Abstract: Systems and methods for voice-activated commands in a digital home communication terminal are disclosed. One example method includes storing a program audio signal corresponding to a program tuned by the digital home communication terminal. The method also includes storing an incoming audio signal carrying speech and removing from the incoming audio signal the portion that corresponds to the program audio signal, thereby producing an improved version of the incoming audio signal. The method also includes selecting one of a plurality of voice-activated commands that corresponds to the improved version of the incoming audio signal, and performing a function corresponding to the selected voice-activated command.

    MANAGING SPLICE POINTS FOR NON-SEAMLESS CONCATENATED BITSTREAMS
    6.
    Invention patent application
    Status: in force

    Publication (announcement) No.: US20140351854A1

    Publication (announcement) date: 2014-11-27

    Application No.: US14457236

    Filing date: 2014-08-12

    Abstract: Receiving a video stream in a transport stream comprising a plurality of compressed pictures, wherein information in the video stream includes plural data fields comprising: a first data field corresponding to a location in the video stream of a potential splice point, wherein the first data field identifies a location in the video stream after the location of the received information; a second data field corresponding to decompressed pictures yet to be output (DPYTBO) by a video decoder at the identified potential splice point (IPSP) when the video decoder decompresses the video stream, wherein the second data field is a number corresponding to the DPYTBO by the video decoder at the IPSP; and a third data field corresponding to pictures with contiguous output times (WCOT), wherein the third data field corresponds to a set of pictures WCOT of the DPYTBO by the video decoder at the IPSP.

    Configuration of presentations of selectable TV services according to usage
    7.
    Granted invention patent
    Status: in force

    Publication (announcement) No.: US08739212B2

    Publication (announcement) date: 2014-05-27

    Application No.: US13596689

    Filing date: 2012-08-28

    IPC classification: H04N5/445

    Abstract: The present invention provides a method and system for accessing services in a television system. In one implementation, a DHCT presents the user with a menu containing a plurality of selectable link representations corresponding to separate services or applications offered by the cable television system. The user navigates the menu with a remote device and selects a desired service by choosing the selectable link representation corresponding to the desired service or application. The DHCT receives the user input, translates the selectable link command into an executable call, and activates the service or application corresponding to the selected link representation from the menu chosen by the user.
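
    The translation from a selected link to an executable call can be pictured as a simple dispatch table. The `ServiceMenu` class and its method names below are hypothetical and only illustrate the mapping; they do not reflect how a DHCT actually implements it.

```python
class ServiceMenu:
    """Menu whose selectable links map to callables that start a service."""

    def __init__(self):
        self._links = {}  # link label -> callable that activates the service

    def add_link(self, label, launch):
        self._links[label] = launch

    def labels(self):
        """Return the link labels in the order they would be presented."""
        return list(self._links)

    def select(self, label):
        """Translate the selected link into a call and activate the service."""
        try:
            launch = self._links[label]
        except KeyError:
            raise ValueError(f"no such menu link: {label!r}") from None
        return launch()
```

    For example, registering `menu.add_link("Guide", start_guide)` and later calling `menu.select("Guide")` would invoke the hypothetical `start_guide` function.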

    Targeted bit appropriations based on picture importance
    9.
    Granted invention patent
    Status: lapsed

    Publication (announcement) No.: US08681876B2

    Publication (announcement) date: 2014-03-25

    Application No.: US12617043

    Filing date: 2009-11-12

    Abstract: In one embodiment, a method that provides plural representations of a single video signal that comprises a successive sequence of pictures, one or more of the plural representations including a respective sequence of latticed pictures, each latticed picture in the one or more plural representations originating from a corresponding respective picture of the video signal, the order of successive latticed pictures in the one or more of the plural representations of the video signal corresponding to the order of successive pictures in the video signal; processes the plural representations based on a predetermined encoding strategy, the predetermined encoding strategy targeting an appropriate respective amount of bits to each of a plurality of the processed latticed pictures, each of the plurality of the processed latticed pictures having a respective picture importance; and provides the plurality of processed latticed pictures in plural successive, non-overlapping, ordered segments in a single video stream.
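
    Two ideas in this abstract lend themselves to a small sketch: deriving several latticed pictures from one picture, and targeting bits according to picture importance. The code assumes that a latticed picture is a regular subsampling of the original (every `step`-th pixel in each direction) and that bits are split in proportion to integer importance weights; both are assumptions of this sketch, since the abstract defines neither.

```python
def lattice(picture, step=2):
    """Split a picture (list of pixel rows) into step*step subsampled pictures."""
    return [
        [row[dx::step] for row in picture[dy::step]]
        for dy in range(step)
        for dx in range(step)
    ]


def allocate_bits(importances, budget):
    """Split `budget` bits across pictures in proportion to integer importances."""
    total = sum(importances)
    shares = [budget * w // total for w in importances]
    # Integer division leaves a few bits over; give them to the most important pictures.
    leftover = budget - sum(shares)
    order = sorted(range(len(importances)), key=lambda i: importances[i], reverse=True)
    for i in order[:leftover]:
        shares[i] += 1
    return shares
```

    With `step=2`, each picture yields four latticed pictures, one per pixel position in a 2x2 sampling grid, and `allocate_bits` always returns shares that sum exactly to `budget`.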

    Stereo Matching for 3D Encoding and Quality Assessment
    10.
    Invention patent application
    Status: in force

    Publication (announcement) No.: US20140015923A1

    Publication (announcement) date: 2014-01-16

    Application No.: US13549945

    Filing date: 2012-07-16

    IPC classification: H04N13/00

    Abstract: Systems and methods may be provided embodying a novel approach to measuring degradation (or distortion) by analyzing disparity maps from original 3D video and reconstructed 3D video. The disparity maps may be derived using a stereo-matching algorithm exploiting 2-view stereo image disparity. An overall distortion measure may also be determined as the weighted sum of plural distortion measures, one corresponding to a measure of disparity degradation and another corresponding to a measure of geometrical distortion. The measure (or overall distortion measure) is used during real-time encoding to effect various decisions, including mode decision in the coding of each corresponding stereo pair, and in rate control (including stereo pair quantization).
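
    The weighted-sum measure can be written down directly. In the sketch below, disparity degradation is taken to be the mean absolute difference between the disparity maps of the original and reconstructed video; that specific metric, the function names, and the example weights are assumptions for illustration only.

```python
def disparity_degradation(original_map, reconstructed_map):
    """Mean absolute difference between two equally sized disparity maps."""
    diffs = [
        abs(a - b)
        for row_o, row_r in zip(original_map, reconstructed_map)
        for a, b in zip(row_o, row_r)
    ]
    return sum(diffs) / len(diffs)


def overall_distortion(measures, weights):
    """Weighted sum of individual distortion measures."""
    if len(measures) != len(weights):
        raise ValueError("one weight is required per distortion measure")
    return sum(w * m for w, m in zip(weights, measures))
```

    An encoder could then compare `overall_distortion` across candidate coding modes for a stereo pair and pick the mode with the lowest value at an acceptable bit cost.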
