Adaptive quantization for enhancement layer video coding
    81.
    发明授权
    Adaptive quantization for enhancement layer video coding 有权
    增强层视频编码的自适应量化

    公开(公告)号:US08897359B2

    公开(公告)日:2014-11-25

    申请号:US12156864

    申请日:2008-06-03

    Abstract: Techniques and tools for encoding enhancement layer video with quantization that varies spatially and/or between color channels are presented, along with corresponding decoding techniques and tools. For example, an encoding tool determines whether quantization varies spatially over a picture, and the tool also determines whether quantization varies between color channels in the picture. The tool signals quantization parameters for macroblocks in the picture in an encoded bit stream. In some implementations, to signal the quantization parameters, the tool predicts the quantization parameters, and the quantization parameters are signaled with reference to the predicted quantization parameters. A decoding tool receives the encoded bit stream, predicts the quantization parameters, and uses the signaled information to determine the quantization parameters for the macroblocks of the enhancement layer video. The decoding tool performs inverse quantization that can vary spatially and/or between color channels.

    Abstract translation: 提出了用于在空间上和/或颜色通道之间变化的具有量化的增强层视频编码的技术和工具,以及相应的解码技术和工具。 例如,编码工具确定量化是否在图像上空间上变化,并且该工具还确定量化是否在图像中的颜色通道之间变化。 该工具在编码比特流中针对图像中的宏块发送量化参数。 在一些实现中,为了发信号通知量化参数,该工具预测量化参数,参考预测量化参数来发信号通知量化参数。 解码工具接收编码比特流,预测量化参数,并使用信号信息确定增强层视频的宏块的量化参数。 解码工具执行在空间上和/或颜色通道之间变化的逆量化。

    Noise robust speech classifier ensemble
    82.
    发明授权
    Noise robust speech classifier ensemble 有权
    噪声鲁棒的语音分类器集合

    公开(公告)号:US08412525B2

    公开(公告)日:2013-04-02

    申请号:US12433143

    申请日:2009-04-30

    Abstract: Embodiments for implementing a speech recognition system that includes a speech classifier ensemble are disclosed. In accordance with one embodiment, the speech recognition system includes a classifier ensemble to convert feature vectors that represent a speech vector into log probability sets. The classifier ensemble includes a plurality of classifiers. The speech recognition system includes a decoder ensemble to transform the log probability sets into output symbol sequences. The speech recognition system further includes a query component to retrieve one or more speech utterances from a speech database using the output symbol sequences.

    Abstract translation: 公开了实现包括语音分类器集合的语音识别系统的实施例。 根据一个实施例,语音识别系统包括将表示语音向量的特征向量转换为对数概率集的分类器集合。 分类器集合包括多个分类器。 语音识别系统包括将对数概率集合变换为输出符号序列的解码器集合。 该语音识别系统还包括一个查询组件,用于使用输出符号序列从语音数据库中检索一个或多个语音话语。

    SOUND SOURCE LOCALIZATION USING PHASE SPECTRUM
    83.
    发明申请
    SOUND SOURCE LOCALIZATION USING PHASE SPECTRUM 有权
    使用相位谱的声源定位

    公开(公告)号:US20130016852A1

    公开(公告)日:2013-01-17

    申请号:US13182449

    申请日:2011-07-14

    CPC classification number: G01S3/8083 G01S3/8006 G01S3/82 H04R3/005

    Abstract: An array of microphones placed on a mobile robot provides multiple channels of audio signals. A received set of audio signals is called an audio segment, which is divided into multiple frames. A phase analysis is performed on a frame of the signals from each pair of microphones. If both microphones are in an active state during the frame, a candidate angle is generated for each such pair of microphones. The result is a list of candidate angles for the frame. This list is processed to select a final candidate angle for the frame. The list of candidate angles is tracked over time to assist in the process of selecting the final candidate angle for an audio segment.

    Abstract translation: 放置在移动机器人上的麦克风阵列提供多个音频信号通道。 接收到的一组音频信号被称为音频片段,其被分成多个帧。 对来自每对麦克风的信号的帧进行相位分析。 如果在帧期间两个麦克风处于活动状态,则为每个这样的麦克风生成候选角度。 结果是帧的候选角度列表。 处理该列表以选择帧的最终候选角度。 候选角度的列表随着时间被跟踪以帮助选择音频片段的最终候选角度的过程。

    Technique to enable efficient adaptive streaming and transcoding of video and other signals
    85.
    发明授权
    Technique to enable efficient adaptive streaming and transcoding of video and other signals 有权
    技术能够实现视频和其他信号的高效自适应流和代码转换

    公开(公告)号:US07639739B2

    公开(公告)日:2009-12-29

    申请号:US10286364

    申请日:2002-11-01

    Abstract: A method is described for efficiently determining total end-to-end distortion of a pre-compressed data stream, such as video streams or other media streams, at the time of delivery over a lossy-network, and for providing adaptive error-resilient delivery schemes based on distortion estimates. The methods can be utilized with single or multilayer packet streams and are particularly well suited for video streams. By way of example, distortion estimates are performed by generating side-information at the time of data stream compression, wherein the side-information is used in conjunction with information about the network status to determine an estimated distortion for the group of packets when the data stream is transported over the network to a destination end. This estimation may be utilized within described resiliency techniques in which the error correction mechanism is selected in response to the estimated distortion, which may be additionally refined in reference to cost factors.

    Abstract translation: 描述了一种用于在通过有损网络传送时有效地确定预压缩数据流(例如视频流或其他媒体流)的总端对端失真的方法,并且用于提供自适应的错误弹性传递 基于失真估计的方案。 这些方法可以用于单个或多个分组流,并且特别适合于视频流。 作为示例,通过在数据流压缩时生成旁侧信息来执行失真估计,其中侧信息结合关于网络状态的信息使用,以确定数据组的估计失真 流通过网络传输到目的地端。 可以在所述的弹性技术中使用该估计,其中响应于估计的失真来选择纠错机制,这可以参考成本因素进一步细化。

    Signaling coding and display options in entry point headers
    86.
    发明申请
    Signaling coding and display options in entry point headers 有权
    入口点标题中的信号编码和显示选项

    公开(公告)号:US20050123274A1

    公开(公告)日:2005-06-09

    申请号:US10989845

    申请日:2004-11-15

    Abstract: A decoder receives an entry point header comprising plural control parameters for an entry point segment corresponding to the entry point header. The entry point header is in an entry point layer of a bitstream comprising plural layers. The decoder decodes the entry point header. The plural control parameters can include various combinations of control parameters such as a pan scan on/off parameter, a reference frame distance on/off parameter, a loop filtering on/off parameter, a fast chroma motion compensation on/off parameter, an extended range motion vector on/off parameter, a variable sized transform on/off parameter, an overlapped transform on/off parameter, a quantization decision parameter, and an extended differential motion vector coding on/off parameter, a broken link parameter, a closed entry parameter, one or more coded picture size parameters, one or more range mapping parameters, a hypothetical reference decoder buffer parameter, and/or other parameter(s).

    Abstract translation: 解码器接收包括与入口点标题相对应的入口点段的多个控制参数的入口点标题。 入口点头在包含多个层的比特流的入口点层中。 解码器解码入口点标题。 多个控制参数可以包括控制参数的各种组合,例如平移扫描开启/关闭参数,参考帧距离开/关参数,循环滤波开/关参数,快速色度运动补偿开/关参数,扩展 范围运动矢量开/关参数,可变大小的变换开/关参数,重叠变换开/关参数,量化决定参数和扩展差分运动矢量编码开/关参数,断链接参数,关闭条目 参数,一个或多个编码图像大小参数,一个或多个范围映射参数,假设参考解码器缓冲器参数和/或其他参数。

Patent Agency Ranking