Patent search ap:("DTS Page INC.") AND inv:"FEJZO

1.

发明申请
HYBRID SPATIAL AUDIO DECODER 审中-公开

公开(公告)号：WO2020247033A1

公开(公告)日：2020-12-10

申请号：PCT/US2020/018447

申请日：2020-02-14

Applicant: DTS, INC.

Inventor： GOODWIN, Michael M. , FEJZO, Zoran

IPC: G10L19/008 , H04S3/00

Abstract: A spatial audio signal decoder is provided that includes a processor and storage media that includes instructions that when executed cause the processor to: receive input spatial audio signals including a set of channels having an input spatial format; partition the set of channels into at least a first channel subset and a second channel subset; determine an estimate of a number and directions of arrival of directional audio sources represented in at least a portion of the set of channels; determine one of the active and passive components of the first channel subset signals, based at least in part on the estimated number and directions of arrival of directional audio sources; determine the other of the active and passive components of the first channel subset signals, based upon the determined one of the active and passive components of the first channel subset signals; decode the components to an output signal.

2.

发明申请
SIGNAL TRANSFORMATION BASED ON UNIQUE KEY-BASED NETWORK GUIDANCE AND CONDITIONING 审中-公开

公开(公告)号：WO2022025923A1

公开(公告)日：2022-02-03

申请号：PCT/US2020/044522

申请日：2020-07-31

Applicant: DTS, INC.

Inventor： VENKATRAMAN, Atti , FEJZO, Zoran , KALKER, Antonius

IPC: G10L21/003 , G06N3/02 , G10L25/12 , G10L25/18

Abstract: A method comprises receiving input audio and target audio having a target audio characteristic. The method includes estimating key parameters that represent the target audio characteristic based on one or more of the target audio and the input audio. The method further comprises configuring a neural network, trained to be configured by the key parameters, with the key parameters to cause the neural network to perform a signal transformation of the input audio, to produce output audio having an output audio characteristic corresponding to and that matches the target audio characteristic.

3.

发明申请
DIRECT-DIFFUSE DECOMPOSITION 审中-公开
Title translation: 直接差分分解

公开(公告)号：WO2013040172A1

公开(公告)日：2013-03-21

申请号：PCT/US2012/055103

申请日：2012-09-13

Applicant: DTS, INC. , THOMPSON, Jeff , SMITH, Brandon , WARNER, Aaron , FEJZO, Zoran , JOT, Jean-Mar

Inventor： THOMPSON, Jeff , SMITH, Brandon , WARNER, Aaron , FEJZO, Zoran , JOT, Jean-Mar

IPC: H04B15/00

CPC classification number: H04R5/04 , G10L19/008 , G10L21/0308 , G10L25/06 , H04S3/00

Abstract: There is disclosed methods and apparatus for decomposing a signal having a plurality of channels into direct and diffuse components. The correlation coefficient between each pair of signals from the plurality of signals may be estimated. A linear system of equations relating the estimated correlation coefficients and direct energy fractions of each of the plurality of channels may be constructed. The linear system may be solved to estimate the direct energy fractions. A direct component output signal and a diffuse component output signal may be generated based in part on the direct energy fractions.

Abstract translation: 公开了用于将具有多个通道的信号分解成直接和漫射部件的方法和装置。可以估计来自多个信号的每对信号之间的相关系数。可以构建将估计的相关系数和多个通道中的每一个的直接能量分数相关联的方程式的线性系统。可以解决线性系统以估计直接能量分数。可以部分地基于直接能量分数来生成直接分量输出信号和漫反射分量输出信号。

4.

发明申请
NEURAL NETWORK-BASED KEY GENERATION FOR KEY-GUIDED NEURAL-NETWORK-BASED AUDIO SIGNAL TRANSFORMATION 审中-公开

公开(公告)号：WO2022025922A1

公开(公告)日：2022-02-03

申请号：PCT/US2020/044518

申请日：2020-07-31

Applicant: DTS, INC.

Inventor： FEJZO, Zoran , KALKER, Antonius , VENKATRAMAN, Atti

IPC: G10L25/30 , G06N3/04 , G10L19/04 , G06N3/08 , G10L21/003 , G10L21/0316 , G10L21/038

Abstract: A method comprise: receiving input audio and target audio having a target audio characteristic; using a first neural network, trained to generate key parameters that represent the target audio characteristic based on one or more of the target audio and the input audio, generating the key parameters; and configuring a second neural network, trained to be configured by the key parameters, with the key parameters to cause the second neural network to perform a signal transformation of the input audio, to produce output audio having an output audio characteristic corresponding to and that matches the target audio characteristic.

5.

发明申请
SIGNALS IN TRANSFORM-BASED AUDIO CODECS 审中-公开

公开(公告)号：WO2019173195A1

公开(公告)日：2019-09-12

申请号：PCT/US2019/020514

申请日：2019-03-04

Applicant: DTS, INC.

Inventor： NEMER, Elias , FEJZO, Zoran

IPC: G10L19/02 , G10L19/03

Abstract: Systems and methods include audio encoders having improved coding of harmonic signals. The audio encoders can be implemented as transform-based codecs with frequency coefficients quantized using spectral weights. The frequency coefficients can be quantized by use of the generated spectral weights applied to the frequency coefficients prior to the quantization or by use of the generated spectral weights in computation of error within a vector quantization that performs the quantization. Additional apparatus, systems, and methods are disclosed.

6.

发明申请
BIT RATE CONTROL OVER GROUPS OF FRAMES 审中-公开

公开(公告)号：WO2018200822A1

公开(公告)日：2018-11-01

申请号：PCT/US2018/029584

申请日：2018-04-26

Applicant: DTS, INC.

Inventor： STACHURSKI, Jacek , FEJZO, Zoran

IPC: G06F15/16 , H04L12/18 , H04L12/26 , H04N7/26 , H04N19/14 , H04N19/40

Abstract: Systems and methods are described for processing data from a sequential series of groups of frames to achieve a target average processing bit rate for a particular group of frames in the series. In an example, a look-ahead buffer circuit can be populated with a number of frames from a particular group of frames, and a bit allocation can be determined for a frame in the look-ahead buffer circuit using bit request information about all of the frames in the buffer. The look-ahead buffer circuit can be populated with streaming frame information in a first-in-first-out manner, and bit allocation processing can be performed for each frame, in a particular group of frames, based on a frame position in the look-ahead buffer circuit and further based on bit requests associated with other frames in the look-ahead buffer circuit.

7.

发明申请
POST-ENCODING BITRATE REDUCTION OF MULTIPLE OBJECT AUDIO 审中-公开
Title translation: 后编码双极化减少多个对象音频

公开(公告)号：WO2015134272A1

公开(公告)日：2015-09-11

申请号：PCT/US2015/017732

申请日：2015-02-26

Applicant: DTS, INC.

Inventor： FEJZO, Zoran

IPC: G10L19/02 , G10L19/008 , G10L19/022

CPC classification number: G10L19/002 , G10L19/008 , G10L19/24 , H04N19/132 , H04N19/40 , H04N21/2662 , H04N21/434

Abstract: A post-encoding bitrate reduction system and method for generating one more scaled compressed bitstreams from a single encoded plenary file. The plenary file contains multiple audio object files that were encoded separately using a scalable encoding process having fine-grained scalability. Activity in the data frames of the encoded audio object files at a time period are compared with each other to obtain a data frame activity comparison. Bits from an available bitpool are assigned to all of the data frames based on the data frame activity comparison and corresponding hierarchical metadata. The plenary file is scaled down by truncating bits in the data frames to conform to the bit allocation. In some embodiments frame activity is compared to a silence threshold and the data frame contains silence if the frame activity is less than or equal to the threshold and minimal bits are used to represent the silent frame.

Abstract translation: 一种后编码比特率降低系统和方法，用于从单个编码的全局文件生成一个更多的缩放的压缩比特流。全文档包含多个音频对象文件，它们使用具有细粒度可扩展性的可扩展编码过程单独编码。在一段时间将编码音频对象文件的数据帧中的活动进行比较，以获得数据帧活动比较。基于数据帧活动比较和相应的分层元数据，可用位池的位被分配给所有的数据帧。通过截断数据帧中的比特来缩小全局文件以符合比特分配。在一些实施例中，将帧活动与静默阈值进行比较，并且如果帧活动小于或等于阈值，则数据帧包含静音，并且使用最小比特来表示无声帧。

8.

发明申请
AUTOMATIC MULTI-CHANNEL MUSIC MIX FROM MULTIPLE AUDIO STEMS 审中-公开
Title translation: 多声道音乐自动多通道音乐混合

公开(公告)号：WO2014151092A1

公开(公告)日：2014-09-25

申请号：PCT/US2014/024962

申请日：2014-03-12

Applicant: DTS, INC. , FEJZO, Zoran , MAHER, Fred

Inventor： FEJZO, Zoran , MAHER, Fred

IPC: H03F99/00

CPC classification number: G10H1/46 , G10H1/125 , G10H2210/125 , G10H2210/295 , G10H2210/301 , G10H2250/055 , H04S3/00 , H04S3/008 , H04S2400/07 , H04S2400/15 , H04S2420/03

Abstract: There are disclosed automatic mixers and methods for creating a surround audio mix. A set of rules may be stored in a rule base. A rule engine may select a subset of the set of rules based, at least in part, on metadata associated with a plurality of stems. A mixing matrix may mix the plurality of stems in accordance with the selected subset of rules to provide three or more output channels.

Abstract translation: 公开了用于创建环绕音频混合的自动混音器和方法。一组规则可以存储在规则库中。至少部分地，规则引擎可以选择所述规则集合的子集，所述元数据与多个词干相关联的元数据。混合矩阵可以根据所选择的规则子集来混合多个茎杆以提供三个或更多个输出通道。

9.

发明申请
ROOM CHARACTERIZATION AND CORRECTION FOR MULTI-CHANNEL AUDIO 审中-公开
Title translation: 多通道音频的房间特征和校正

公开(公告)号：WO2012154823A1

公开(公告)日：2012-11-15

申请号：PCT/US2012/037081

申请日：2012-05-09

Applicant: DTS, INC. , FEJZO, Zoran , JOHNSTON, James, D.

Inventor： FEJZO, Zoran , JOHNSTON, James, D.

IPC: H04R5/02

CPC classification number: H04S7/303 , H04R3/005 , H04R5/02 , H04S3/008 , H04S7/301 , H04S2400/01 , H04S2420/01

Abstract: Devices and methods are adapted to characterize a multi-channel loudspeaker configuration, to correct loudspeaker room delay, gain and frequency response or to configure sub-band domain correction fillers. In an embodiment for characterizing a multi-channel loudspeaker configuration, a broadband probe signal is supplied to each audio output of an preamplifier of which a plurality are coupled to loudspeakers in a multi-channel configuration in a listening environment. The loudspeakers convert the probe signal to acoustic responses that are transmitted in non-overlapping time slots separated by silent periods as sound waves into the listening environment. For each audio output that is probed, sound waves are received by a multi-microphone array that converts the acoustic responses to broadband electric response signals.

Abstract translation: 设备和方法适用于表征多声道扬声器配置，以校正扬声器室延迟，增益和频率响应或配置子带域校正填充。在用于表征多声道扬声器配置的实施例中，宽带探测信号被提供给前置放大器的每个音频输出，在前置放大器中，多个音频输出以多通道配置在听音环境中耦合到扬声器。扬声器将探测信号转换为声音响应，这些声音响应以非沉默时段分隔的非重叠时隙作为声波传输到听音环境中。对于被探测的每个音频输出，声波由多麦克风阵列接收，多声道阵列将声响应转换为宽带电响应信号。

10.

发明申请
SYSTEM AND METHOD FOR LONG-TERM PREDICTION IN AUDIO CODECS 审中-公开
Title translation: 用于音频编解码器中的长期预测的系统和方法

公开(公告)号：WO2018049279A1

公开(公告)日：2018-03-15

申请号：PCT/US2017/050845

申请日：2017-09-08

Applicant: DTS, INC.

Inventor： NEMER, Elias , FEJZO, Zoran , STACHURSKI, Jacek , KALKER, Antonius

IPC: G10L19/02

Abstract: A frequency domain long-term prediction system and method for estimating and applying an optimum long term predictor. Embodiments of the system and method include determining parameters of a single-tap predictor using a frequency- domain analysis having an optimality criteria based on spectral flatness measure. Embodiments of the system and method also include determining parameters of the long-term predictor by accounting for the performance of the vector quantizer in quantizing the various subbands. In some embodiments other encoder metrics (such as signal tonality) are used as well. Other embodiments of the system and method include determining the optimal parameters of the long-term predictor by accounting for some of the decoder operation. Other embodiments of the system and method include extending a 1-tap predictor to a k-th order predictor by convolving the 1-tap predictor with a pre-set filter and selecting from a table of such pre-set filters based on a minimum energy criteria.

Abstract translation: 用于估计和应用最优长期预测器的频域长期预测系统和方法。该系统和方法的实施例包括使用具有基于频谱平坦度量度的最优性准则的频域分析来确定单抽头预测器的参数。该系统和方法的实施例还包括通过考虑矢量量化器在量化各个子带中的性能来确定长期预测器的参数。在一些实施例中，也使用其他编码器度量（诸如信号音调）。该系统和方法的其他实施例包括通过考虑一些解码器操作来确定长期预测器的最佳参数。该系统和方法的其他实施例包括通过将1抽头预测器与预设滤波器进行卷积并且基于最小能量从这样的预设滤波器的表中选择来将1抽头预测器扩展到第k阶预测器标准。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification