HYBRID SPATIAL AUDIO DECODER
    1.
    Invention application

    Publication No.: WO2020247033A1

    Publication date: 2020-12-10

    Application No.: PCT/US2020/018447

    Filing date: 2020-02-14

    Applicant: DTS, INC.

    Abstract: A spatial audio signal decoder is provided that includes a processor and storage media that includes instructions that when executed cause the processor to: receive input spatial audio signals including a set of channels having an input spatial format; partition the set of channels into at least a first channel subset and a second channel subset; determine an estimate of a number and directions of arrival of directional audio sources represented in at least a portion of the set of channels; determine one of the active and passive components of the first channel subset signals, based at least in part on the estimated number and directions of arrival of directional audio sources; determine the other of the active and passive components of the first channel subset signals, based upon the determined one of the active and passive components of the first channel subset signals; decode the components to an output signal.
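
    Illustrative sketch (not taken from the patent): one way to read "determine one component, then derive the other from it" is to project the channel signals onto the subspace spanned by steering vectors for the estimated source directions and treat the remainder as the passive part. The `steering` matrix, the function name, and the use of a pseudo-inverse projection are all assumptions made for illustration.

```python
import numpy as np

def split_active_passive(x, steering):
    """Split channel signals into active and passive parts.

    x:        array of shape (channels, samples)
    steering: array of shape (channels, sources); hypothetical matrix whose
              columns encode the estimated directions of arrival
    """
    # Orthogonal projector onto the span of the steering vectors (assumption).
    projector = steering @ np.linalg.pinv(steering)
    active = projector @ x
    # The other component follows from the first one.
    passive = x - active
    return active, passive
```

    By construction the two parts sum back to the input, so whichever component is estimated first fixes the other.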

    SIGNAL TRANSFORMATION BASED ON UNIQUE KEY-BASED NETWORK GUIDANCE AND CONDITIONING

    Publication No.: WO2022025923A1

    Publication date: 2022-02-03

    Application No.: PCT/US2020/044522

    Filing date: 2020-07-31

    Applicant: DTS, INC.

    Abstract: A method comprises receiving input audio and target audio having a target audio characteristic. The method includes estimating key parameters that represent the target audio characteristic based on one or more of the target audio and the input audio. The method further comprises configuring a neural network, trained to be configured by the key parameters, with the key parameters to cause the neural network to perform a signal transformation of the input audio, to produce output audio having an output audio characteristic that corresponds to and matches the target audio characteristic.

    DIRECT-DIFFUSE DECOMPOSITION
    3.
    Invention application
    Status: under examination (published)

    Publication No.: WO2013040172A1

    Publication date: 2013-03-21

    Application No.: PCT/US2012/055103

    Filing date: 2012-09-13

    CPC classification number: H04R5/04 G10L19/008 G10L21/0308 G10L25/06 H04S3/00

    Abstract: There are disclosed methods and apparatus for decomposing a signal having a plurality of channels into direct and diffuse components. The correlation coefficient between each pair of signals from the plurality of signals may be estimated. A linear system of equations relating the estimated correlation coefficients and direct energy fractions of each of the plurality of channels may be constructed. The linear system may be solved to estimate the direct energy fractions. A direct component output signal and a diffuse component output signal may be generated based in part on the direct energy fractions.
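
    Illustrative sketch (not taken from the patent): the abstract does not state the form of the linear system. The code below assumes a simple signal model in which the direct parts of all channels are fully correlated and the diffuse parts are mutually uncorrelated, so that |rho_ij| = sqrt(phi_i * phi_j) for direct energy fractions phi. Taking logarithms makes this linear in log(phi), which is then solved by least squares. The function name and the model are assumptions.

```python
import itertools
import numpy as np

def estimate_direct_energy_fractions(x, eps=1e-12):
    """x: array of shape (channels, samples), at least 3 channels.

    Returns one estimated direct energy fraction per channel.
    """
    n_ch = x.shape[0]
    rho = np.corrcoef(x)  # pairwise correlation coefficients
    pairs = list(itertools.combinations(range(n_ch), 2))
    A = np.zeros((len(pairs), n_ch))
    b = np.zeros(len(pairs))
    for row, (i, j) in enumerate(pairs):
        # Assumed model: log|rho_ij| = 0.5*log(phi_i) + 0.5*log(phi_j)
        A[row, i] = 0.5
        A[row, j] = 0.5
        b[row] = np.log(max(abs(rho[i, j]), eps))
    log_phi, *_ = np.linalg.lstsq(A, b, rcond=None)
    # Fractions are energies ratios, so keep them in [0, 1].
    return np.clip(np.exp(log_phi), 0.0, 1.0)
```

    With three channels the system is square; with more channels it is overdetermined and the least-squares solution averages out estimation noise in the individual correlation coefficients.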


    SIGNALS IN TRANSFORM-BASED AUDIO CODECS
    5.
    Invention application

    Publication No.: WO2019173195A1

    Publication date: 2019-09-12

    Application No.: PCT/US2019/020514

    Filing date: 2019-03-04

    Applicant: DTS, INC.

    Abstract: Systems and methods include audio encoders having improved coding of harmonic signals. The audio encoders can be implemented as transform-based codecs with frequency coefficients quantized using spectral weights. The frequency coefficients can be quantized by use of the generated spectral weights applied to the frequency coefficients prior to the quantization or by use of the generated spectral weights in computation of error within a vector quantization that performs the quantization. Additional apparatus, systems, and methods are disclosed.
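
    Illustrative sketch (not taken from the patent): the second option in the abstract, using spectral weights in the error computation of a vector quantizer, can be pictured as a weighted nearest-codeword search. How the weights are generated is not described in the abstract, so `weights` is simply an input here; the function name and codebook layout are assumptions.

```python
import numpy as np

def weighted_vq_search(coeffs, codebook, weights):
    """Return the index of the codeword with the smallest weighted error.

    coeffs:   array of shape (dim,), frequency coefficients to quantize
    codebook: array of shape (entries, dim)
    weights:  array of shape (dim,), hypothetical per-coefficient weights
    """
    # Weighted squared error of every codeword against the input vector.
    errors = np.sum(weights * (codebook - coeffs) ** 2, axis=1)
    return int(np.argmin(errors))
```

    Raising the weight of a coefficient makes the search favor codewords that match that coefficient closely, at the expense of the others.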

    BIT RATE CONTROL OVER GROUPS OF FRAMES
    6.
    Invention application

    Publication No.: WO2018200822A1

    Publication date: 2018-11-01

    Application No.: PCT/US2018/029584

    Filing date: 2018-04-26

    Applicant: DTS, INC.

    Abstract: Systems and methods are described for processing data from a sequential series of groups of frames to achieve a target average processing bit rate for a particular group of frames in the series. In an example, a look-ahead buffer circuit can be populated with a number of frames from a particular group of frames, and a bit allocation can be determined for a frame in the look-ahead buffer circuit using bit request information about all of the frames in the buffer. The look-ahead buffer circuit can be populated with streaming frame information in a first-in-first-out manner, and bit allocation processing can be performed for each frame, in a particular group of frames, based on a frame position in the look-ahead buffer circuit and further based on bit requests associated with other frames in the look-ahead buffer circuit.
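
    Illustrative sketch (not taken from the patent): a first-in-first-out look-ahead buffer in which the oldest frame receives a share of the bit budget proportional to its bit request relative to all requests currently buffered. The proportional rule, the running group budget, and every name below are assumptions; the abstract does not specify the allocation formula.

```python
from collections import deque

def allocate_group(bit_requests, target_bits_per_frame, lookahead=4):
    """Allocate bits to each frame of one group of frames.

    bit_requests: per-frame bit requests for the group, in stream order
    Returns a list of per-frame allocations whose sum equals the group budget.
    """
    requests = list(bit_requests)
    remaining_bits = target_bits_per_frame * len(requests)
    buffer = deque()
    allocations = []
    next_in = 0
    for frame_index in range(len(requests)):
        # Fill the look-ahead buffer in first-in-first-out order.
        while len(buffer) < lookahead and next_in < len(requests):
            buffer.append(requests[next_in])
            next_in += 1
        frames_left = len(requests) - frame_index
        # Budget notionally available to the frames currently buffered.
        window_budget = remaining_bits * len(buffer) / frames_left
        total = sum(buffer)
        share = buffer[0] / total if total > 0 else 1 / len(buffer)
        bits = window_budget * share
        allocations.append(bits)
        remaining_bits -= bits
        buffer.popleft()  # the oldest frame leaves the buffer
    return allocations
```

    For example, requests of 300, 100, 100, 100 with a target of 100 bits per frame give the first frame 200 bits and spread the remaining 200 evenly, so the group average still meets the target.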

    POST-ENCODING BITRATE REDUCTION OF MULTIPLE OBJECT AUDIO
    7.
    Invention application
    Status: under examination (published)

    Publication No.: WO2015134272A1

    Publication date: 2015-09-11

    Application No.: PCT/US2015/017732

    Filing date: 2015-02-26

    Applicant: DTS, INC.

    Inventor: FEJZO, Zoran

    Abstract: A post-encoding bitrate reduction system and method for generating one or more scaled compressed bitstreams from a single encoded plenary file. The plenary file contains multiple audio object files that were encoded separately using a scalable encoding process having fine-grained scalability. Activity in the data frames of the encoded audio object files at a given time period is compared across the files to obtain a data frame activity comparison. Bits from an available bitpool are assigned to all of the data frames based on the data frame activity comparison and corresponding hierarchical metadata. The plenary file is scaled down by truncating bits in the data frames to conform to the bit allocation. In some embodiments, frame activity is compared to a silence threshold; if the frame activity is less than or equal to the threshold, the data frame is treated as containing silence and minimal bits are used to represent the silent frame.
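
    Illustrative sketch (not taken from the patent): bits from a shared bitpool are split across the frames of several audio objects for one time period in proportion to frame activity, silent frames get a fixed minimum, and each frame is then truncated to its allocation. The hierarchical metadata mentioned in the abstract is ignored here, and the proportional rule, `min_bits`, and the function names are assumptions.

```python
def allocate_bitpool(frame_activity, bitpool_bits, silence_threshold=0.0, min_bits=8):
    """Return a per-object bit allocation for one time period."""
    silent = [a <= silence_threshold for a in frame_activity]
    # Silent frames are represented with a minimal number of bits.
    reserved = min_bits * sum(silent)
    available = max(bitpool_bits - reserved, 0)
    active_total = sum(a for a, s in zip(frame_activity, silent) if not s)
    allocation = []
    for activity, is_silent in zip(frame_activity, silent):
        if is_silent:
            allocation.append(min_bits)
        else:
            # Share of the remaining pool proportional to activity (assumption).
            allocation.append(int(available * activity / active_total))
    return allocation

def truncate_frame(frame_bytes, allocated_bits):
    """Scale a frame down by dropping trailing bytes beyond its allocation."""
    return frame_bytes[: allocated_bits // 8]
```

    Truncation alone is enough to scale a frame only because the abstract states that the object files were encoded with fine-grained scalability.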


    ROOM CHARACTERIZATION AND CORRECTION FOR MULTI-CHANNEL AUDIO
    9.
    Invention application
    Status: under examination (published)

    Publication No.: WO2012154823A1

    Publication date: 2012-11-15

    Application No.: PCT/US2012/037081

    Filing date: 2012-05-09

    Abstract: Devices and methods are adapted to characterize a multi-channel loudspeaker configuration, to correct loudspeaker room delay, gain and frequency response, or to configure sub-band domain correction filters. In an embodiment for characterizing a multi-channel loudspeaker configuration, a broadband probe signal is supplied to each audio output of a preamplifier, of which a plurality are coupled to loudspeakers in a multi-channel configuration in a listening environment. The loudspeakers convert the probe signal to acoustic responses that are transmitted in non-overlapping time slots separated by silent periods as sound waves into the listening environment. For each audio output that is probed, sound waves are received by a multi-microphone array that converts the acoustic responses to broadband electric response signals.


    SYSTEM AND METHOD FOR LONG-TERM PREDICTION IN AUDIO CODECS
    10.
    Invention application
    Status: under examination (published)

    Publication No.: WO2018049279A1

    Publication date: 2018-03-15

    Application No.: PCT/US2017/050845

    Filing date: 2017-09-08

    Applicant: DTS, INC.

    Abstract: A frequency-domain long-term prediction system and method for estimating and applying an optimum long-term predictor. Embodiments of the system and method include determining parameters of a single-tap predictor using a frequency-domain analysis having an optimality criterion based on a spectral flatness measure. Embodiments of the system and method also include determining parameters of the long-term predictor by accounting for the performance of the vector quantizer in quantizing the various subbands. In some embodiments other encoder metrics (such as signal tonality) are used as well. Other embodiments of the system and method include determining the optimal parameters of the long-term predictor by accounting for some of the decoder operation. Other embodiments of the system and method include extending a 1-tap predictor to a k-th order predictor by convolving the 1-tap predictor with a pre-set filter and selecting from a table of such pre-set filters based on a minimum energy criterion.
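
    Illustrative sketch (not taken from the patent): spectral flatness is commonly computed as the ratio of the geometric mean to the arithmetic mean of the power spectrum. The search below tries candidate lags and gains for a single-tap predictor and keeps the pair whose prediction residual has the flattest spectrum. The exhaustive search, the choice to maximize residual flatness, and the function names are assumptions; the abstract only says the criterion is based on a spectral flatness measure.

```python
import numpy as np

def spectral_flatness(signal, eps=1e-12):
    """Geometric mean over arithmetic mean of the power spectrum (0 to 1)."""
    power = np.abs(np.fft.rfft(signal)) ** 2 + eps
    return float(np.exp(np.mean(np.log(power))) / np.mean(power))

def pick_single_tap(x, lags, gains):
    """Return the (lag, gain) whose residual x[n] - gain*x[n-lag] is flattest."""
    best = None
    for lag in lags:  # lags must be positive integers
        for gain in gains:
            residual = x[lag:] - gain * x[:-lag]
            flatness = spectral_flatness(residual)
            if best is None or flatness > best[0]:
                best = (flatness, lag, gain)
    return best[1], best[2]
```

    For a strongly periodic input, the lag equal to the period removes the tonal peaks from the residual, which is what raises its flatness.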

