专利检索 ap:("Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.") AND inv:"Hölzer, Andreas" 第 2 页

11.

发明公开
Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals 审中-公开
标题翻译：音频编码器，音频解码器，方法和计算机程序使用联合编码的残差信号

公开(公告)号：EP2830051A2

公开(公告)日：2015-01-28

申请号：EP13189305.9

申请日：2013-10-18

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

发明人： Dick, Sascha , Ertel, Christian , Helmrich, Christian , Hilpert, Johannes , Hölzer, Andreas , Kuntz, Achim

IPC分类号： G10L19/008 , G10L21/038

CPC分类号： G10L19/008 , G10L19/0017 , G10L21/038 , H04S3/008 , H04S7/30 , H04S2400/01 , H04S2400/03 , H04S2420/03

摘要： An audio decoder for providing at least four audio channel signals on the basis of an encoded representation is configured to provide a first residual signal and a second residual signal on the basis of a jointly encoded representation of the first residual signal and of the second residual signal using a multi-channel decoding. The audio decoder is configured to provide a first audio channel signal and a second audio channel signal on the basis of a first downmix signal and the first residual signal using a residual-signal-assisted multi-channel decoding. The audio decoder is configured to provide a third audio channel signal and a fourth audio channel signal on the basis of a second downmix signal and the second residual signal using a residual-signal-assisted multi-channel decoding. An audio encoder is based on corresponding considerations.

摘要翻译： 用于基于编码表示提供至少四个音频信道信号的音频解码器被配置为基于第一残差信号和第二残差信号的联合编码表示来提供第一残差信号和第二残差信号使用多通道解码。音频解码器被配置为基于第一缩混信号和使用残余信号辅助的多通道解码的第一残留信号来提供第一音频通道信号和第二音频通道信号。音频解码器被配置为基于第二缩混信号和第二残余信号使用残余信号辅助的多通道解码来提供第三音频通道信号和第四音频通道信号。音频编码器基于相应的考虑。

12.

发明授权
MPEG-SAOC AUDIO SIGNAL DECODER, MPEG-SAOC AUDIO SIGNAL ENCODER, METHOD FOR PROVIDING AN UPMIX SIGNAL REPRESENTATION USING MPEG-SAOC DECODING, METHOD FOR PROVIDING A DOWNMIX SIGNAL REPRESENTATION USING MPEG-SAOC DECODING, AND COMPUTER PROGRAM USING A TIME/FREQUENCY-DEPENDENT COMMON INTER-OBJECT-CORRELATION PARAMETER VALUE 有权

公开(公告)号：EP3093843B1

公开(公告)日：2020-12-02

申请号：EP16176048.3

申请日：2010-09-28

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. , Dolby International AB

发明人： Hölzer, Andreas , Herre, Jürgen , Hilpert, Johannes , Engdegard, Jonas , Purnhagen, Heiko

IPC分类号： G10L19/005 , G10L19/20 , H04S3/02 , H04S5/00

13.

发明公开
MULTI-CHANNEL AUDIO DECODER, MULTI-CHANNEL AUDIO ENCODER, METHODS AND COMPUTER PROGRAM USING A RESIDUAL-SIGNAL-BASED ADJUSTMENT OF A CONTRIBUTION OF A DECORRELATED SIGNAL 审中-公开

公开(公告)号：EP3425633A1

公开(公告)日：2019-01-09

申请号：EP18182535.7

申请日：2014-07-17

申请人： FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V.

发明人： Dick, Sascha , Helmrich, Christian , Hilpert, Johannes , Hölzer, Andreas

IPC分类号： G10L19/008 , G10L19/20

摘要： A multi-channel audio decoder for providing at least two output audio signals on the basis of an encoded representation is configured to perform a weighted combination of a downmix signal, a decorrelated signal and a residual signal, to obtain one of the output audio signals. The multi-channel audio decoder is configured to determine a weight describing a contribution of the decorrelated signal in the weighted combination in dependence on the residual signal. A multi-channel audio encoder for providing an encoded representation of a multi-channel audio signal is configured to obtain a downmix signal on the basis of the multi-channel audio signal, to provide parameters describing dependencies between the channels of the multi-channe! audio signal, and to provide a residual signal. The multi-channel audio encoder is configured to vary an amount of residual signal included into the encoded representation in dependence on the multi-channel audio signal.

14.

发明公开
FREQUENCY-DOMAIN AUDIO CODING SUPPORTING TRANSFORM LENGTH SWITCHING 审中-公开
标题翻译：频域音频编码支持转换长度切换

公开(公告)号：EP3312836A1

公开(公告)日：2018-04-25

申请号：EP17189418.1

申请日：2014-07-15

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

发明人： Dick, Sascha , Helmrich, Christian , Hölzer, Andreas

IPC分类号： G10L19/022 , G10L19/03 , G10L19/008 , G10L19/028

CPC分类号： G10L19/022 , G10L19/008 , G10L19/028 , G10L19/03

摘要： A frequency-domain audio codec is provided with the ability to additionally support a certain transform length in a backward-compatible manner, by the following: the frequency-domain coefficients of a respective frame are transmitted in an interleaved manner irrespective of the signalization signaling for the frames as to which transform length actually applies, and additionally the frequency-domain coefficient extraction and the scale factor extraction operate independent from the signalization. By this measure, old-fashioned frequency-domain audio coders/decoders, insensitive for the signalization, would be able to nevertheless operate without faults and with reproducing a reasonable quality. Concurrently, frequency-domain audio coders/decoders able to support the additional transform length would offer even better quality despite the backward compatibility. As far as coding efficiency penalties due to the coding of the frequency domain coefficients in a manner transparent for older decoders are concerned, same are of comparatively minor nature due to the interleaving.

摘要翻译： 频域音频编解码器具有以向后兼容的方式另外支持特定变换长度的能力，通过以下方式：相应帧的频域系数以交织方式传输，而不考虑信号化信令关于实际应用变换长度的帧，并且另外频域系数提取和比例因子提取独立于信号化进行操作。通过这种措施，对信号不敏感的老式频域音频编码器/解码器将仍然能够无故障地工作并且重现合理的质量。同时，尽管具有向后兼容性，但能够支持额外变换长度的频域音频编码器/解码器可以提供更好的质量。就由于以较老的解码器而言透明的方式对频域系数进行编码而导致的编码效率处罚来说，由于交织相同，其性质相对较小。

15.

发明授权
Audio signal encoder, audio bitstream, method and computer program using an object-related parametric information 有权
标题翻译：音频信号编码器，音频比特流，使用对象相关参数信息的方法和计算机程序

公开(公告)号：EP2816555B1

公开(公告)日：2016-03-23

申请号：EP14180279.3

申请日：2010-04-28

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. , Dolby International AB

发明人： Herre, Jürgen , Hölzer, Andreas , Terentiv, Leon , Falch, Cornelia , Purnhagen, Heiko , Engdegard, Jonas , Ridderbusch, Falko , Kastner, Thorsten

IPC分类号： G10L19/008 , G10L19/20

CPC分类号： G10L19/008 , G10L19/20

摘要： An audio signal encoder (600) for providing a downmix signal representation (614) and an object-related parametric information (616) on the basis of a plurality of object signals (x 1 to x N ) comprises a downmixer (620) configured to provide one or more downmix signals in dependence on downmix coefficients (d 1 to d N ) associated with the object signals (x 1 to x N ), such that the one or more downmix signals comprise a superposition of a plurality of object signals, and a side information provider (630) configured to provide an inter-object-relationship side information (OLD, IOC) describing level differences and correlation characteristics of object signals (x 1 to x N ) and an individual-object side information describing one or more individual properties of the individual object signals (x 1 to x N ).

16.

发明公开
APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING 审中-公开
标题翻译： VORRICHTUNG UND VERFAHRENFÜRBILDSCHIRMBEZOGENE AUDIOOBJEKT-NEUABBILDUNG

公开(公告)号：EP2928216A1

公开(公告)日：2015-10-07

申请号：EP14196769.5

申请日：2014-12-08

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

发明人： Füg, Simone , Plogsties, Jan , Dick, Sascha , Hilpert, Johannes , Robilliard, Julien , Kuntz, Achim , Hölzer, Andreas

IPC分类号： H04S7/00 , H04S3/00

CPC分类号： G10L19/20 , G10L19/008 , G10L19/167 , H04N21/233 , H04N21/4318 , H04N21/439 , H04N21/4516 , H04N21/8106 , H04N21/84 , H04S3/008 , H04S7/00 , H04S7/30 , H04S7/308 , H04S2400/11

摘要： An apparatus for generating loudspeaker signals is provided. The apparatus comprises an object metadata processor (110) and an object renderer (120). The object renderer (120) is configured to receive an audio object. The object metadata processor (110) is configured to receive metadata, comprising an indication on whether the audio object is screen-related, and further comprising a first position of the audio object. The object metadata processor (110) is configured to calculate a second position of the audio object depending on the first position of the audio object and depending on a size of a screen, if the audio object is indicated in the metadata as being screen-related. The object renderer (120) is configured to generate the loudspeaker signals depending on the audio object and depending on position information. The object metadata processor (110) is configured to feed the first position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being not screen-related. The object metadata processor (110) is configured to feed the second position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being screen-related.

摘要翻译： 提供一种用于产生扬声器信号的装置。该装置包括对象元数据处理器（110）和对象渲染器（120）。对象渲染器（120）被配置为接收音频对象。对象元数据处理器（110）被配置为接收元数据，其包括关于音频对象是否是屏幕相关的指示，还包括音频对象的第一位置。对象元数据处理器（110）被配置为根据音频对象的第一位置并根据屏幕的大小来计算音频对象的第二位置，如果音频对象在元数据中被指示为屏幕相关。对象渲染器（120）被配置为根据音频对象并根据位置信息产生扬声器信号。如果音频对象在元数据中被指示为不与屏幕相关，则对象元数据处理器（110）被配置为将音频对象的第一位置作为位置信息馈送到对象渲染器（120）。如果音频对象在元数据中被指示为与屏幕相关的话，对象元数据处理器（110）被配置为将音频对象的第二位置作为位置信息馈送到对象渲染器（120）。

17.

发明公开
Concept for audio encoding and decoding for audio channels and audio objects 审中-公开
标题翻译： Konzept zur Audiocodierung und AudiodecodierungfürAudiokanäleund Audioobjekte

公开(公告)号：EP2830045A1

公开(公告)日：2015-01-28

申请号：EP13177378.0

申请日：2013-07-22

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. , Friedrich-Alexander-Universität Erlangen-Nürnberg

发明人： Adami, Alexander , Borss, Christian , Dick, Sascha , Ertel, Christian , Füg, Simone , Herre, Jürgen , Hilpert, Johannes , Hölzer, Andreas , Kratschmer, Michael , Küch, Fabian , Kuntz, Achim , Murtaza, Adrian , Plogsties, Jan , Silzle, Andreas , Stenzel, Hanne

IPC分类号： G10L19/008

CPC分类号： G10L19/20 , G10L19/008 , G10L19/028 , G10L19/18 , G10L19/22 , H04S3/008 , H04S2400/03 , H04S2400/11

摘要： Audio encoder for encoding audio input data (101) to obtain audio output data (501) comprises an input interface (100) for receiving a plurality of audio channels, a plurality of audio objects and metadata related to one or more of the plurality of audio objects; a mixer (200) for mixing the plurality of objects and the plurality of channels to obtain a plurality of pre-mixed channels, each pre-mixed channel comprising audio data of a channel and audio data of at least one object; a core encoder (300) for core encoding core encoder input data; and a metadata compressor (400) for compressing the metadata related to the one or more of the plurality of audio objects, wherein the audio encoder is configured to operate in at least one mode of the group of two modes comprising a first mode, in which the core encoder is configured to encode the plurality of audio channels and the plurality of audio objects received by the input interface as core encoder input data, and a second mode, in which the core encoder (300) is configured for receiving, as the core encoder input data, the plurality of pre-mixed channels generated by the mixer (200).

摘要翻译： 用于编码音频输入数据（101）以获得音频输出数据（501）的音频编码器包括用于接收多个音频通道的输入接口（100），与多个音频中的一个或多个音频相关的多个音频对象和元数据对象; 混合器（200），用于混合多个对象和多个通道以获得多个预混合通道，每个预混合通道包括通道的音频数据和至少一个对象的音频数据; 核心编码器（300），用于核心编码核心编码器输入数据; 以及用于压缩与所述多个音频对象中的一个或多个音频对象有关的元数据的元数据压缩器（400），其中所述音频编码器被配置为在包括第一模式的两种模式的组的至少一种模式中操作，其中核心编码器被配置为将由输入接口接收的多个音频频道和多个音频对象编码为核心编码器输入数据，以及第二模式，其中核心编码器（300）被配置为接收作为核心编码器输入数据，由混合器（200）产生的多个预混频道。

18.

发明公开
APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING 审中-公开

公开(公告)号：EP4254988A3

公开(公告)日：2023-11-01

申请号：EP23167354.2

申请日：2015-03-25

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

发明人： Neukam, Simone , Plogsties, Jan , Dick, Sascha , Hilpert, Johannes , Robilliard, Julien , Kuntz, Achim , Hölzer, Andreas

IPC分类号： H04S7/00 , H04S3/00

摘要： An apparatus for generating loudspeaker signals is provided. The apparatus comprises an object metadata processor (110) and an object renderer (120). The object renderer (120) is configured to receive an audio object. The object metadata processor (110) is configured to receive metadata, comprising an indication on whether the audio object is screen-related, and further comprising a first position of the audio object. The object metadata processor (110) is configured to calculate a second position of the audio object depending on the first position of the audio object and depending on a size of a screen, if the audio object is indicated in the metadata as being screen-related. The object renderer (120) is configured to generate the loudspeaker signals depending on the audio object and depending on position information. The object metadata processor (110) is configured to feed the first position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being not screen-related. The object metadata processor (110) is configured to feed the second position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being screen-related.

19.

发明公开
CONCEPT FOR AUDIO DECODING FOR AUDIO CHANNELS AND AUDIO OBJECTS 审中-公开

公开(公告)号：EP4033485A1

公开(公告)日：2022-07-27

申请号：EP22159568.9

申请日：2014-07-16

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

发明人： Adami, Alexander , Borss, Christian , Dick, Sascha , Ertel, Christian , Neukam, Simone , Herre, Jürgen , Hilpert, Johannes , Hölzer, Andreas , Kratschmer, Michael , Küch, Fabian , Kuntz, Achim , Murtaza, Adrian , Plogsties, Jan , Silzle, Andreas , Stenzel, Hanne

IPC分类号： G10L19/008 , G10L19/18 , G10L19/20 , G10L19/22 , H04S3/00

摘要： Audio decoder for decoding encoded audio data, comprising: an input interface (1100) for receiving the encoded audio data, the encoded audio data comprising a plurality of encoded channels or a plurality of encoded objects or compress metadata related to the plurality of objects; a core decoder (1300) for decoding the plurality of encoded channels and the plurality of encoded objects; a metadata decompressor (1400) for decompressing the compressed metadata; an object processor (1200) for processing the plurality of decoded objects using the decompressed metadata to obtain a number of output channels (1205) comprising audio data from the objects and the decoded channels; and a post-processor (1700) for converting the number of output channels (1205) into an output format, wherein the audio decoder is configured to bypass the object processor and to feed a plurality of decoded channels into the post-processor (1700), when the encoded audio data does not contain any audio objects and to feed the plurality of decoded objects and the plurality of decoded channels into the object processor (1200), when the encoded audio data comprises encoded channels and encoded objects..

20.

发明公开
APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING 审中-公开

公开(公告)号：EP3487189A1

公开(公告)日：2019-05-22

申请号：EP18248305.7

申请日：2015-03-25

申请人： FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V.

发明人： Neukam, Simone , Plogsties, Jan , Dick, Sascha , Hilpert, Johannes , Robilliard, Julien , Kuntz, Achim , Hölzer, Andreas

IPC分类号： H04S7/00 , H04S3/00

摘要： An apparatus for generating loudspeaker signals is provided. The apparatus comprises an object metadata processor (110) and an object renderer (120). The object renderer (120) is configured to receive an audio object. The object metadata processor (110) is configured to receive metadata, comprising an indication on whether the audio object is screen-related, and further comprising a first position of the audio object. The object metadata processor (110) is configured to calculate a second position of the audio object depending on the first position of the audio object and depending on a size of a screen, if the audio object is indicated in the metadata as being screen-related. The object renderer (120) is configured to generate the loudspeaker signals depending on the audio object and depending on position information. The object metadata processor (110) is configured to feed the first position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being not screen-related. The object metadata processor (110) is configured to feed the second position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being screen-related.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类