AUDIO DISCONTINUITY DETECTION AND CORRECTION
    11.
    发明申请

    公开(公告)号:US20180218749A1

    公开(公告)日:2018-08-02

    申请号:US15745824

    申请日:2016-07-26

    摘要: Methods for detecting whether a rendered version of a specified seamless connection (“SSC”) at a connection point between two audio segment sequences results in an audible discontinuity, and methods for analyzing at least one SSC between audio segment sequences to determine whether a renderable version of each SSC would have an audible discontinuity at the connection point when rendered, and in appropriate cases, for a SSC having a renderable version which is determined to have an audible discontinuity when rendered, correcting at least one audio segment of at least one segment sequence to be connected in accordance with the SSC in an effort to ensure that rendering of the SSC will result in seamless connection without an audible discontinuity. Other aspects are editing systems configured to implement any of the methods, and storage media and rendering systems which store audio data generated in accordance with any of the methods.

    Matrix Decomposition for Rendering Adaptive Audio Using High Definition Audio Codecs
    12.
    发明申请
    Matrix Decomposition for Rendering Adaptive Audio Using High Definition Audio Codecs 有权
    使用高清音频编解码器渲染自适应音频的矩阵分解

    公开(公告)号:US20170048639A1

    公开(公告)日:2017-02-16

    申请号:US15306454

    申请日:2015-04-23

    IPC分类号: H04S3/02 G10L19/008

    摘要: A method of decomposing a matrix of dimension L-by-N, where L is less than or equal to N, into a sequence of N-by-N unit primitive matrices and a permutation matrix comprising a sequence that is the product of the primitive matrices and the permutation matrix, containing L rows that are substantially close to the provided L-by-N matrix, where the choice of the permutation matrix and the indices of the non-trivial rows in the primitive matrices are chosen to limit the coefficient values in the primitive matrices.

    摘要翻译: 将L尺寸小于或等于N的尺寸为L-by-N的矩阵分解成N×N单位原始矩阵的序列和包括作为原语的乘积的序列的置换矩阵的方法 矩阵和排列矩阵,其包含基本上接近所提供的L乘N矩阵的L行,其中选择基本矩阵中的排列矩阵和非平凡行的索引以限制系数值 在原始矩阵中。

    Audio Segmentation Based on Spatial Metadata
    14.
    发明申请
    Audio Segmentation Based on Spatial Metadata 审中-公开
    基于空间元数据的音频分割

    公开(公告)号:US20170047071A1

    公开(公告)日:2017-02-16

    申请号:US15306051

    申请日:2015-04-23

    摘要: A method of encoding adaptive audio, comprising receiving N objects and associated spatial metadata that describes the continuing motion of these objects, and partitioning the audio into segments based on the spatial metadata. The method encodes adaptive audio having objects and channel beds by capturing a continuing motion of a number N objects in a time-varying matrix trajectory comprising a sequence of matrices, coding coefficients of the time-varying matrix trajectory in spatial metadata to be transmitted via a high-definition audio format for rendering the adaptive audio through a number M output channels, and segmenting the sequence of matrices into a plurality of sub-segments based on the spatial metadata, wherein the plurality of sub segments are configured to facilitate coding of one or more characteristics of the adaptive audio.

    摘要翻译: 一种编码自适应音频的方法,包括接收N个对象和描述这些对象的持续运动的相关联的空间元数据,以及基于空间元数据将音频分割成段。 该方法通过捕获包括矩阵序列的时变矩阵轨迹中的N个对象的持续运动来编码具有对象和信道床的自适应音频,将空间元数据中的时变矩阵轨迹的编码系数经由 高清晰度音频格式,用于通过M个输出通道渲染自适应音频,以及基于空间元数据将矩阵序列分割成多个子段,其中多个子段被配置为便于编码一个或多个 更多特点的自适应音频。

    Signal Decorrelation in an Audio Processing System
    15.
    发明申请
    Signal Decorrelation in an Audio Processing System 有权
    音频处理系统中的信号解相关

    公开(公告)号:US20150380000A1

    公开(公告)日:2015-12-31

    申请号:US14766371

    申请日:2014-01-22

    摘要: Audio processing methods may involve receiving audio data corresponding to a plurality of audio channels. The audio data may include a frequency domain representation corresponding to filterbank coefficients of an audio encoding or processing system. A decorrelation process may be performed with the same filterbank coefficients used by the audio encoding or processing system. The decorrelation process may be performed without converting coefficients of the frequency domain representation to another frequency domain or time domain representation. The decorrelation process may involve selective or signal-adaptive decorrelation of specific channels and/or specific frequency bands. The decorrelation process may involve applying a decorrelation filter to a portion of the received audio data to produce filtered audio data. The decorrelation process may involve using a non-hierarchal mixer to combine a direct portion of the received audio data with the filtered audio data according to spatial parameters.

    摘要翻译: 音频处理方法可以涉及接收对应于多个音频频道的音频数据。 音频数据可以包括对应于音频编码或处理系统的滤波器组系数的频域表示。 解相关处理可以用音频编码或处理系统使用的相同的滤波器组系数执行。 可以在不将频域表示的系数转换到另一频域或时域表示的情况下执行去相关处理。 解相关过程可以涉及特定信道和/或特定频带的选择性或信号自适应去相关。 解相关过程可以包括将去相关滤波器应用于所接收的音频数据的一部分以产生滤波后的音频数据。 去相关处理可以包括使用非层级混合器来根据空间参数将接收的音频数据的直接部分与经滤波的音频数据组合。

    RENDERING OF MULTICHANNEL AUDIO USING INTERPOLATED MATRICES
    16.
    发明申请
    RENDERING OF MULTICHANNEL AUDIO USING INTERPOLATED MATRICES 有权
    使用插值矩阵渲染多通道音频

    公开(公告)号:US20160241981A1

    公开(公告)日:2016-08-18

    申请号:US15024925

    申请日:2014-09-26

    摘要: Methods which uses interpolated primitive matrices to decode encoded audio to recover (losslessly) content of a multichannel audio program and/or to recover at least one downmix of such content, and encoding methods for generating such encoded audio. In some embodiments, a decoder performs interpolation on a set of seed primitive matrices to determine interpolated matrices for use in rendering channels of the program. Other aspects are a system or device configured to implement any embodiment of the method.

    摘要翻译: 使用内插原语矩阵来解码编码音频以恢复(无损耗)多声道音频节目的内容和/或恢复这样的内容的至少一个缩混的方法,以及用于生成这种编码音频的编码方法。 在一些实施例中,解码器对一组种子基元矩阵执行插值,以确定用于呈现节目的频道的内插矩阵。 其他方面是被配置为实现该方法的任何实施例的系统或设备。

    Multi-Stage Quantization of Parameter Vectors from Disparate Signal Dimensions
    17.
    发明申请
    Multi-Stage Quantization of Parameter Vectors from Disparate Signal Dimensions 审中-公开
    参数矢量的多阶段量化从不同的信号尺寸

    公开(公告)号:US20160133266A1

    公开(公告)日:2016-05-12

    申请号:US14898211

    申请日:2014-06-17

    摘要: A first vector quantization process may be applied to two or more parameter values along a first dimension of the N-dimensional parameter set to produce a first set of quantized values. Two or more parameter prediction values may be calculated for a second dimension of the N-dimensional parameter set based, at least in part, on one or more values of the first set of quantized values. Prediction residual values may be calculated based, at least in part, on the parameter prediction values. A second vector quantization process may be applied to the prediction residual values to produce a second set of quantized values. These processes may be extended to any number of dimensions. Corresponding inverse vector quantization processes may be performed.

    摘要翻译: 第一矢量量化处理可以沿着N维参数集的第一维应用于两个或更多个参数值,以产生第一组量化值。 可以至少部分地基于第一组量化值的一个或多个值来计算N维参数集的第二维度的两个或更多个参数预测值。 可以至少部分地基于参数预测值来计算预测残差值。 可以将第二矢量量化处理应用于预测残差值以产生第二组量化值。 这些过程可以扩展到任何数量的维度。 可以执行相应的逆矢量量化处理。

    Methods for Controlling the Inter-Channel Coherence of Upmixed Audio Signals
    18.
    发明申请
    Methods for Controlling the Inter-Channel Coherence of Upmixed Audio Signals 有权
    用于控制上混合音频信号的信道间相干性的方法

    公开(公告)号:US20160005406A1

    公开(公告)日:2016-01-07

    申请号:US14767279

    申请日:2014-01-22

    IPC分类号: G10L19/008 H04S3/00

    摘要: Audio characteristics of audio data corresponding to a plurality of audio channels may be determined. The audio characteristics may include spatial parameter data. Decorrelation filtering processes for the audio data may be based, at least in part, on the audio characteristics. The decorrelation filtering processes may cause a specific inter-decorrelation signal coherence (“IDC”) between channel-specific decorrelation signals for at least one pair of channels. The channel-specific decorrelation signals may be received and/or determined. Inter-channel coherence (“ICC”) between a plurality of audio channel pairs may be controlled. Controlling ICC may involve at receiving an ICC value and/or determining an ICC value based, at least partially, on the spatial parameter data. A set of IDC values may be based, at least partially, on the set of ICC values. A set of channel-specific decorrelation signals, corresponding with the set of IDC values, may be synthesized by performing operations on the filtered audio data.

    摘要翻译: 可以确定与多个音频通道对应的音频数据的音频特性。 音频特征可以包括空间参数数据。 音频数据的解相关滤波处理可以至少部分地基于音频特性。 去相关滤波处理可以在至少一对信道之间引起信道特定解相关信号之间的特定解相关信号相干性(“IDC”)。 信道特定的去相关信号可以被接收和/或确定。 可以控制多个音频通道对之间的通道间相干(“ICC”)。 控制ICC可以涉及至少部分地基于空间参数数据接收ICC值和/或确定ICC值。 一组IDC值可以至少部分地基于ICC值集合。 可以通过对经滤波的音频数据执行操作来合成与集合的IDC值相对应的一组通道特定的解相关信号。

    Methods for Audio Signal Transient Detection and Decorrelation Control
    19.
    发明申请
    Methods for Audio Signal Transient Detection and Decorrelation Control 有权
    音频信号瞬态检测和相关控制方法

    公开(公告)号:US20160005405A1

    公开(公告)日:2016-01-07

    申请号:US14766957

    申请日:2014-01-22

    IPC分类号: G10L19/008 H04S5/00

    摘要: Some audio processing methods may involve receiving audio data corresponding to a plurality of audio channels and determining audio characteristics of the audio data, which may include transient information. An amount of decorrelation for the audio data may be based, at least in part, on the audio characteristics. If a definite transient event is determined, a decorrelation process may be temporarily halted or slowed. Determining transient information may involve evaluating the likelihood and/or the severity of a transient event. In some implementations, determining transient information may involve evaluating a temporal power variation in the audio data. Explicit transient information may or may not be received with the audio data, depending on the implementation. Explicit transient information may include a transient control value corresponding to a definite transient event, a definite non-transient event or an intermediate transient control value.

    摘要翻译: 一些音频处理方法可以包括接收对应于多个音频信道的音频数据并确定音频数据的音频特性,其可以包括瞬时信息。 用于音频数据的去相关的量可以至少部分地基于音频特性。 如果确定了一个确定的瞬时事件,则解相关过程可能暂时停止或减慢。 确定瞬时信息可能涉及评估瞬态事件的可能性和/或严重性。 在一些实现中,确定瞬时信息可以包括评估音频数据中的时间功率变化。 取决于实现情况,显式瞬时信息可能会或可能不会与音频数据一起接收。 显式瞬态信息可以包括对应于确定的瞬态事件,确定的非瞬态事件或中间瞬态控制值的瞬态控制值。

    Audio Signal Enhancement Using Estimated Spatial Parameters
    20.
    发明申请
    Audio Signal Enhancement Using Estimated Spatial Parameters 有权
    使用预估空间参数的音频信号增强

    公开(公告)号:US20160005413A1

    公开(公告)日:2016-01-07

    申请号:US14767565

    申请日:2014-01-22

    摘要: Received audio data may include a first set of frequency coefficients and a second set of frequency coefficients. Spatial parameters for at least part of the second set of frequency coefficients may be estimated, based at least in part on the first set of frequency coefficients. The estimated spatial parameters may be applied to the second set of frequency coefficients to generate a modified second set of frequency coefficients. The first set of frequency coefficients may correspond to a first frequency range (for example, an individual channel frequency range) and the second set of frequency coefficients may correspond to a second frequency range (for example, a coupled channel frequency range). Combined frequency coefficients of a composite coupling channel may be based on frequency coefficients of two or more channels. Cross-correlation coefficients, between frequency coefficients of a first channel and the combined frequency coefficients, may be computed.

    摘要翻译: 接收的音频数据可以包括第一组频率系数和第二组频率系数。 可以至少部分地基于第一组频率系数来估计第二组频率系数的至少一部分的空间参数。 估计的空间参数可以应用于第二组频率系数,以产生经修改的第二组频率系数。 第一组频率系数可以对应于第一频率范围(例如,单个信道频率范围),并且第二组频率系数可以对应于第二频率范围(例如,耦合的信道频率范围)。 复合耦合信道的组合频率系数可以基于两个或更多个信道的频率系数。 可以计算第一通道的频率系数与组合频率系数之间的互相关系数。