CHANNEL EXTENSION CODING FOR MULTI-CHANNEL SOURCE
    1.
    发明申请
    CHANNEL EXTENSION CODING FOR MULTI-CHANNEL SOURCE 有权
    多通道源的通道扩展编码

    公开(公告)号:US20090112606A1

    公开(公告)日:2009-04-30

    申请号:US11925733

    申请日:2007-10-26

    IPC分类号: G10L19/00

    CPC分类号: G10L19/008

    摘要: A multi-channel audio decoder reconstructs multi-channel audio of more than two physical channels from a reduced set of coded channels based on correlation parameters that specify a full power cross-correlation matrix of the physical channels, or merely preserve a partial correlation matrix (such as power of the physical channels, and some subset of cross-correlations between the physical channels, or cross-correlations of the physical channels with coded or virtual channels).

    摘要翻译: 多声道音频解码器基于指定物理信道的全功率互相关矩阵的相关参数来重建来自缩减编码信道集合的多于两个物理信道的多声道音频,或仅保留部分相关矩阵( 例如物理信道的功率,以及物理信道之间的互相关的一些子集,或者物理信道与编码或虚拟信道的交叉相关性)。

    Optimized client side rate control and indexed file layout for streaming media
    2.
    发明授权
    Optimized client side rate control and indexed file layout for streaming media 有权
    针对流媒体优化客户端速率控制和索引文件布局

    公开(公告)号:US08379851B2

    公开(公告)日:2013-02-19

    申请号:US12119364

    申请日:2008-05-12

    IPC分类号: H04N7/167 G06F15/16

    摘要: An indexed file layout, comprising index information, is defined for segmented streaming of multimedia content. The index information can comprise program description information and streaming segment index information. In addition, the layout can comprise files containing streaming segments of the program, where the streaming segments are each encoded at one or more bitrates independently of other streaming segments of the program. The layout supports client switching between different bitrates at segment boundaries. Optimized client-side rate control of streaming content can be provided by defining a plurality of states, selecting available paths based on constraint conditions, and selecting a best path through the states (e.g., based on a distortion measure). In one client-side rate control solution states correspond to a specific bitrate of a specific streaming segment, and in another client-side rate control solution states correspond to a measure of client buffer fullness.

    摘要翻译: 包括索引信息的索引文件布局被定义用于多媒体内容的分段流。 索引信息可以包括节目描述信息和流分片索引信息。 此外,布局可以包括包含程序的流片段的文件,其中流片段每个以独立于节目的其他流片段的一个或多个比特率进行编码。 该布局支持在段边界处的不同比特率之间的客户端切换。 可以通过定义多个状态,基于约束条件选择可用路径以及选择通过状态的最佳路径(例如,基于失真度量)来提供流内容的优化客户端速率控制。 在一个客户端速率控制解决方案中,状态对应于特定流分段的特定比特率,并且在另一客户端速率控制解决方案状态对应于客户端缓冲区充满度的度量。

    Channel extension coding for multi-channel source
    3.
    发明授权
    Channel extension coding for multi-channel source 有权
    用于多通道源的通道扩展编码

    公开(公告)号:US08249883B2

    公开(公告)日:2012-08-21

    申请号:US11925733

    申请日:2007-10-26

    IPC分类号: G10L19/00

    CPC分类号: G10L19/008

    摘要: A multi-channel audio decoder reconstructs multi-channel audio of more than two physical channels from a reduced set of coded channels based on correlation parameters that specify a full power cross-correlation matrix of the physical channels, or merely preserve a partial correlation matrix (such as power of the physical channels, and some subset of cross-correlations between the physical channels, or cross-correlations of the physical channels with coded or virtual channels).

    摘要翻译: 多声道音频解码器基于指定物理信道的全功率互相关矩阵的相关参数来重建来自缩减编码信道集合的多于两个物理信道的多声道音频,或仅保留部分相关矩阵( 例如物理信道的功率,以及物理信道之间的互相关的一些子集,或者物理信道与编码或虚拟信道的交叉相关性)。

    OPTIMIZED CLIENT SIDE RATE CONTROL AND INDEXED FILE LAYOUT FOR STREAMING MEDIA
    4.
    发明申请
    OPTIMIZED CLIENT SIDE RATE CONTROL AND INDEXED FILE LAYOUT FOR STREAMING MEDIA 有权
    优化的客户端速率控制和用于流媒体的索引文件布局

    公开(公告)号:US20090282162A1

    公开(公告)日:2009-11-12

    申请号:US12119364

    申请日:2008-05-12

    IPC分类号: G06F15/173

    摘要: An indexed file layout, comprising index information, is defined for segmented streaming of multimedia content. The index information can comprise program description information and streaming segment index information. In addition, the layout can comprise files containing streaming segments of the program, where the streaming segments are each encoded at one or more bitrates independently of other streaming segments of the program. The layout supports client switching between different bitrates at segment boundaries. Optimized client-side rate control of streaming content can be provided by defining a plurality of states, selecting available paths based on constraint conditions, and selecting a best path through the states (e.g., based on a distortion measure). In one client-side rate control solution states correspond to a specific bitrate of a specific streaming segment, and in another client-side rate control solution states correspond to a measure of client buffer fullness.

    摘要翻译: 包括索引信息的索引文件布局被定义用于多媒体内容的分段流。 索引信息可以包括节目描述信息和流分片索引信息。 此外,布局可以包括包含程序的流片段的文件,其中流片段每个以独立于节目的其他流片段的一个或多个比特率进行编码。 该布局支持在段边界处的不同比特率之间的客户端切换。 可以通过定义多个状态,基于约束条件选择可用路径以及选择通过状态的最佳路径(例如,基于失真度量)来提供流内容的优化的客户端速率控制。 在一个客户端速率控制解决方案中,状态对应于特定流分段的特定比特率,并且在另一客户端速率控制解决方案状态对应于客户端缓冲区充满度的度量。

    Efficient coding of digital media spectral data using wide-sense perceptual similarity
    6.
    发明授权
    Efficient coding of digital media spectral data using wide-sense perceptual similarity 有权
    使用广义感知相似性对数字媒体光谱数据进行高效编码

    公开(公告)号:US08645127B2

    公开(公告)日:2014-02-04

    申请号:US12324689

    申请日:2008-11-26

    IPC分类号: G10L11/04

    摘要: Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.

    摘要翻译: 传统的音频编码器可以通过编码少于所有频谱系数来节省编码比特率,这可以在重建中产生模糊的低通声音。 使用广义感知相似性的音频编码器通过编码被忽略的频谱系数的感知相似版本来提高质量,表示为已编码频谱的缩放版本。 省略的频谱系数被划分为多个子带。 子带被编码为两个参数:比例因子,其可以表示频带中的能量; 以及形状参数,其可以表示带的形状。 形状参数可以是指向已编码频谱的一部分的运动矢量的形式,固定码本中的频谱形状的索引或随机噪声向量。 因此,编码有效地表示在解码时要复制的类似形状的频谱部分的缩放版本。

    ADAPTIVE BANDWIDTH ESTIMATION
    7.
    发明申请
    ADAPTIVE BANDWIDTH ESTIMATION 有权
    自适应带宽估计

    公开(公告)号:US20130114421A1

    公开(公告)日:2013-05-09

    申请号:US13288968

    申请日:2011-11-04

    IPC分类号: H04L12/26

    摘要: It can be determined whether relative one way delay for data packets in a data stream exceeds a delay threshold. If so, then a delay congestion signal indicating that the relative one way delay exceeds the delay threshold can be generated. The delay congestion signal can be used in calculating an adaptive bandwidth estimate for the data stream. A packet loss rate congestion signal may also be used in calculating the bandwidth estimate. It can be determined whether a data stream of data packets is in a contention state. If the data stream is in the contention state, then an adaptive bandwidth estimate can be calculated for the data stream using a first bandwidth estimation technique. If the data stream is not in the contention state, then the bandwidth estimate for the data stream can be calculated using a second bandwidth estimation technique.

    摘要翻译: 可以确定数据流中的数据分组的相对单向延迟是否超过延迟阈值。 如果是,则可以产生指示相对单向延迟超过延迟阈值的延迟拥塞信号。 延迟拥塞信号可用于计算数据流的自适应带宽估计。 丢包率拥塞信号也可用于计算带宽估计。 可以确定数据包的数据流是否处于争用状态。 如果数据流处于竞争状态,则可以使用第一带宽估计技术对数据流计算自适应带宽估计。 如果数据流不处于竞争状态,则可以使用第二带宽估计技术来计算数据流的带宽估计。

    Encoding streaming media as a high bit rate layer, a low bit rate layer, and one or more intermediate bit rate layers
    8.
    发明授权
    Encoding streaming media as a high bit rate layer, a low bit rate layer, and one or more intermediate bit rate layers 有权
    将流媒体编码为高比特率层,低比特率层和一个或多个中间比特率层

    公开(公告)号:US08325800B2

    公开(公告)日:2012-12-04

    申请号:US12116878

    申请日:2008-05-07

    IPC分类号: H04N7/12 H04N11/02 H04N11/04

    摘要: A method of encoding an input video stream comprising a video component and an audio component is disclosed. The input video stream is split into a plurality of segments, each comprising a plurality of frames. Each of the segments is encoded as a low bit rate layer, a high bit rate layer, and one or more intermediate bit rate layers. The bit rate of the low bit rate layer is selected such that a network streaming the segment will always be able to stream the segment encoded as the low bit rate layer. The bit rate of the high bit rate layer is selected such that the segment is able to be decoded and played back at or above a quality threshold. The bit rates of the intermediate bit rate layers are produced by applying a bit rate factor to another bit rate.

    摘要翻译: 公开了一种编码包括视频分量和音频分量的输入视频流的方法。 输入视频流被分割成多个段,每个段包括多个帧。 每个段被编码为低比特率层,高比特率层和一个或多个中间比特率层。 选择低比特率层的比特率,使得流分段的网络将总是能够将编码为低比特率层的段流传输。 选择高比特率层的比特率使得该片段能够在质量阈值以上或高于质量阈值时被解码和回放。 中间比特率层的比特率通过将比特率因子应用于另一个比特率来产生。

    Low complexity decoder for complex transform coding of multi-channel sound
    9.
    发明授权
    Low complexity decoder for complex transform coding of multi-channel sound 有权
    低复杂度解码器,用于多声道声音的复杂变换编码

    公开(公告)号:US08046214B2

    公开(公告)日:2011-10-25

    申请号:US11767457

    申请日:2007-06-22

    IPC分类号: G10L19/00

    CPC分类号: G10L19/008

    摘要: A multi-channel audio decoder provides a reduced complexity processing to reconstruct multi-channel audio from an encoded bitstream in which the multi-channel audio is represented as a coded subset of the channels along with a complex channel correlation matrix parameterization. The decoder translates the complex channel correlation matrix parameterization to a real transform that satisfies the magnitude of the complex channel correlation matrix. The multi-channel audio is derived from the coded subset of channels via channel extension processing using a real value effect signal and real number scaling.

    摘要翻译: 多声道音频解码器提供了一种降低复杂度的处理,从编码比特流重建多声道音频,其中多声道音频被表示为频道的编码子集以及复信道相关矩阵参数化。 解码器将复信道相关矩阵参数化转换为满足复信道相关矩阵幅度的实数变换。 多声道音频通过使用实数值效应信号和实数缩放的信道扩展处理从编码的信道子集导出。