专利检索 ap:("Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.") AND inv:"Dick, Sascha" 第 1 页

1.

发明公开
Frequency-domain audio coding supporting transform length switching 审中-公开
标题翻译：常见问题解答

公开(公告)号：EP2830058A1

公开(公告)日：2015-01-28

申请号：EP13189334.9

申请日：2013-10-18

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

发明人： Dick, Sascha , Helmrich, Christian , Hölzer, Andreas

IPC分类号： G10L19/022

CPC分类号： G10L19/022 , G10L19/008 , G10L19/028 , G10L19/03

摘要： A frequency-domain audio codec is is provided with the ability to additionally support a certain transform length in a backward-compatible manner, by the following: the frequency-domain coefficients of a respective frame are transmitted in an interleaved manner irrespective of the signalization signaling for the frames as to which transform length actually applies, and additionally the frequency-domain coefficient extraction and the scale factor extraction operate independent from the signalization. By this measure, old-fashioned frequency-domain audio coders/decoders, insensitive for the signalization, would be able to nevertheless operate without faults and with reproducing a reasonable quality. Concurrently, frequency-domain audio coders/decoders able to support the additional transform length would offer even better quality despite the backward compatibility. As far as coding efficiency penalties due to the coding of the frequency domain coefficients in a manner transparent for older decoders are concerned, same are of comparatively minor nature due to the interleaving.

摘要翻译： 通过以下方式，向频域音频编解码器提供额外支持特定变换长度的能力：通过交织方式发送相应帧的频域系数，而不管信令信令如何对于实际应用哪个变换长度的帧，并且频域系数提取和比例因子提取独立于信号化操作。通过这种措施，对信号化不敏感的老式频域音频编码器/解码器将能够无故障地运行并且再现合理的质量。同时，即使向后兼容，能够支持附加变换长度的频域音频编码器/解码器将提供更好的质量。关于由于对于旧解码器而言以透明的方式对频域系数的编码所引起的编码效率的惩罚是由于交织而相同的。

2.

发明公开
Audio decoder, audio encoder, method for providing at least four audio channel signals on the basis of an encoded representation, method for providing an encoded representation on the basis of at least four audio channel signals and computer program using a bandwidth extension 审中-公开
标题翻译：音频解码器，音频编码器，提供基于一个编码表示至少四个音频信道信号的方法，基于至少四个音频信道信号和计算机程序有带宽扩展提供了一个编码表示的方法

公开(公告)号：EP2830052A1

公开(公告)日：2015-01-28

申请号：EP13189306.7

申请日：2013-10-18

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

发明人： Dick, Sascha , Ertel, Christian , Helmrich, Christian , Hilpert, Johannes , Hölzer, Andreas , Kuntz, Achim

IPC分类号： G10L19/008 , G10L21/038

CPC分类号： G10L19/008 , G10L19/0017 , G10L21/038 , H04S3/008 , H04S7/30 , H04S2400/01 , H04S2400/03 , H04S2420/03

摘要： An audio decoder for providing at least four bandwidth-extended channel signals on the basis of an encoded representation is configured to provide a first downmix signal and a second downmix signal on the basis of a jointly encoded representation of the first downmix signal and the second downmix signal using a multi-channel decoding. The audio decoder is configured to provide at least a first audio channel signal and a second audio channel signal on the basis of the first downmix signal using a multi-channel decoding. The audio decoder is configured to provide at least a third audio channel signal and a fourth audio channel signal on the basis of the second downmix signal using a multi-channel decoding. The audio decoder is configured to perform a multi-channel bandwidth extension on the basis of the first audio channel signal and the third audio channel signal, to obtain a first bandwidth-extended channel signal and a third bandwidth-extended channel signal. The audio decoder is configured to perform a multi-channel bandwidth extension on the basis of the second audio channel signal and the fourth audio channel signal, to obtain a second bandwidth extended channel signal and a fourth bandwidth extended channel signal. An audio encoder uses a related concept.

摘要翻译： 用于编码表示的基础上，提供至少四个带宽扩展信道信号的音频解码器被配置为提供一个第一混合信号和所述第一混合信号的联合编码表示的基础上，第二缩混信号和所述第二下混利用信号的多声道解码。音频解码器被配置为提供至少一个第一音频信道信号，并使用多通道解码所述第一缩混信号的基础上的第二音频信道信号。音频解码器被配置为提供至少一个第三通道的音频信号，并使用多通道解码所述第二缩混信号的基础上的第四信道的音频信号。音频解码器被配置为将第一音频信道信号和第三音频信道信号的基础上执行多通道带宽扩展，以获得第一带宽扩展信道信号和第三带宽扩展信道信号。音频解码器被配置为将第二音频信道信号和所述第四音频信道信号的基础上执行多通道带宽扩展，以获得第二带宽扩展信道信号和第四信道带宽扩展信号。音频编码器使用相关的概念。

3.

发明公开
DIRECTIONAL LOUDNESS MAP BASED AUDIO PROCESSING 审中-公开

公开(公告)号：EP4213147A1

公开(公告)日：2023-07-19

申请号：EP23159427.6

申请日：2019-10-28

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

发明人： Herre, Jürgen , Delgado, Pablo Manuel , Dick, Sascha

IPC分类号： G10L25/03 , G10L25/60 , G10L19/16 , G10L19/008 , G10L25/69

摘要： An audio analyzer configured to obtain spectral domain representations of two or more input audio signals. Additionally the audio analyzer is configured to obtain directional information associated with spectral bands of the spectral domain representations and to obtain loudness information associated with different directions as an analysis result. Contributions to the loudness information are determined in dependence on the directional information.

4.

发明公开
MULTI-CHANNEL AUDIO DECODER, MULTI-CHANNEL AUDIO ENCODER, METHODS AND COMPUTER PROGRAM USING A RESIDUAL-SIGNAL-BASED ADJUSTMENT OF A CONTRIBUTION OF A DECORRELATED SIGNAL 审中-公开

公开(公告)号：EP3425633A1

公开(公告)日：2019-01-09

申请号：EP18182535.7

申请日：2014-07-17

申请人： FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V.

发明人： Dick, Sascha , Helmrich, Christian , Hilpert, Johannes , Hölzer, Andreas

IPC分类号： G10L19/008 , G10L19/20

摘要： A multi-channel audio decoder for providing at least two output audio signals on the basis of an encoded representation is configured to perform a weighted combination of a downmix signal, a decorrelated signal and a residual signal, to obtain one of the output audio signals. The multi-channel audio decoder is configured to determine a weight describing a contribution of the decorrelated signal in the weighted combination in dependence on the residual signal. A multi-channel audio encoder for providing an encoded representation of a multi-channel audio signal is configured to obtain a downmix signal on the basis of the multi-channel audio signal, to provide parameters describing dependencies between the channels of the multi-channe! audio signal, and to provide a residual signal. The multi-channel audio encoder is configured to vary an amount of residual signal included into the encoded representation in dependence on the multi-channel audio signal.

5.

发明公开
FREQUENCY-DOMAIN AUDIO CODING SUPPORTING TRANSFORM LENGTH SWITCHING 审中-公开
标题翻译：频域音频编码支持转换长度切换

公开(公告)号：EP3312836A1

公开(公告)日：2018-04-25

申请号：EP17189418.1

申请日：2014-07-15

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

发明人： Dick, Sascha , Helmrich, Christian , Hölzer, Andreas

IPC分类号： G10L19/022 , G10L19/03 , G10L19/008 , G10L19/028

CPC分类号： G10L19/022 , G10L19/008 , G10L19/028 , G10L19/03

摘要： A frequency-domain audio codec is provided with the ability to additionally support a certain transform length in a backward-compatible manner, by the following: the frequency-domain coefficients of a respective frame are transmitted in an interleaved manner irrespective of the signalization signaling for the frames as to which transform length actually applies, and additionally the frequency-domain coefficient extraction and the scale factor extraction operate independent from the signalization. By this measure, old-fashioned frequency-domain audio coders/decoders, insensitive for the signalization, would be able to nevertheless operate without faults and with reproducing a reasonable quality. Concurrently, frequency-domain audio coders/decoders able to support the additional transform length would offer even better quality despite the backward compatibility. As far as coding efficiency penalties due to the coding of the frequency domain coefficients in a manner transparent for older decoders are concerned, same are of comparatively minor nature due to the interleaving.

摘要翻译： 频域音频编解码器具有以向后兼容的方式另外支持特定变换长度的能力，通过以下方式：相应帧的频域系数以交织方式传输，而不考虑信号化信令关于实际应用变换长度的帧，并且另外频域系数提取和比例因子提取独立于信号化进行操作。通过这种措施，对信号不敏感的老式频域音频编码器/解码器将仍然能够无故障地工作并且重现合理的质量。同时，尽管具有向后兼容性，但能够支持额外变换长度的频域音频编码器/解码器可以提供更好的质量。就由于以较老的解码器而言透明的方式对频域系数进行编码而导致的编码效率处罚来说，由于交织相同，其性质相对较小。

6.

发明公开
APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING 审中-公开
标题翻译： VORRICHTUNG UND VERFAHRENFÜRBILDSCHIRMBEZOGENE AUDIOOBJEKT-NEUABBILDUNG

公开(公告)号：EP2928216A1

公开(公告)日：2015-10-07

申请号：EP14196769.5

申请日：2014-12-08

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

发明人： Füg, Simone , Plogsties, Jan , Dick, Sascha , Hilpert, Johannes , Robilliard, Julien , Kuntz, Achim , Hölzer, Andreas

IPC分类号： H04S7/00 , H04S3/00

CPC分类号： G10L19/20 , G10L19/008 , G10L19/167 , H04N21/233 , H04N21/4318 , H04N21/439 , H04N21/4516 , H04N21/8106 , H04N21/84 , H04S3/008 , H04S7/00 , H04S7/30 , H04S7/308 , H04S2400/11

摘要： An apparatus for generating loudspeaker signals is provided. The apparatus comprises an object metadata processor (110) and an object renderer (120). The object renderer (120) is configured to receive an audio object. The object metadata processor (110) is configured to receive metadata, comprising an indication on whether the audio object is screen-related, and further comprising a first position of the audio object. The object metadata processor (110) is configured to calculate a second position of the audio object depending on the first position of the audio object and depending on a size of a screen, if the audio object is indicated in the metadata as being screen-related. The object renderer (120) is configured to generate the loudspeaker signals depending on the audio object and depending on position information. The object metadata processor (110) is configured to feed the first position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being not screen-related. The object metadata processor (110) is configured to feed the second position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being screen-related.

摘要翻译： 提供一种用于产生扬声器信号的装置。该装置包括对象元数据处理器（110）和对象渲染器（120）。对象渲染器（120）被配置为接收音频对象。对象元数据处理器（110）被配置为接收元数据，其包括关于音频对象是否是屏幕相关的指示，还包括音频对象的第一位置。对象元数据处理器（110）被配置为根据音频对象的第一位置并根据屏幕的大小来计算音频对象的第二位置，如果音频对象在元数据中被指示为屏幕相关。对象渲染器（120）被配置为根据音频对象并根据位置信息产生扬声器信号。如果音频对象在元数据中被指示为不与屏幕相关，则对象元数据处理器（110）被配置为将音频对象的第一位置作为位置信息馈送到对象渲染器（120）。如果音频对象在元数据中被指示为与屏幕相关的话，对象元数据处理器（110）被配置为将音频对象的第二位置作为位置信息馈送到对象渲染器（120）。

7.

发明公开
Concept for audio encoding and decoding for audio channels and audio objects 审中-公开
标题翻译： Konzept zur Audiocodierung und AudiodecodierungfürAudiokanäleund Audioobjekte

公开(公告)号：EP2830045A1

公开(公告)日：2015-01-28

申请号：EP13177378.0

申请日：2013-07-22

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. , Friedrich-Alexander-Universität Erlangen-Nürnberg

发明人： Adami, Alexander , Borss, Christian , Dick, Sascha , Ertel, Christian , Füg, Simone , Herre, Jürgen , Hilpert, Johannes , Hölzer, Andreas , Kratschmer, Michael , Küch, Fabian , Kuntz, Achim , Murtaza, Adrian , Plogsties, Jan , Silzle, Andreas , Stenzel, Hanne

IPC分类号： G10L19/008

CPC分类号： G10L19/20 , G10L19/008 , G10L19/028 , G10L19/18 , G10L19/22 , H04S3/008 , H04S2400/03 , H04S2400/11

摘要： Audio encoder for encoding audio input data (101) to obtain audio output data (501) comprises an input interface (100) for receiving a plurality of audio channels, a plurality of audio objects and metadata related to one or more of the plurality of audio objects; a mixer (200) for mixing the plurality of objects and the plurality of channels to obtain a plurality of pre-mixed channels, each pre-mixed channel comprising audio data of a channel and audio data of at least one object; a core encoder (300) for core encoding core encoder input data; and a metadata compressor (400) for compressing the metadata related to the one or more of the plurality of audio objects, wherein the audio encoder is configured to operate in at least one mode of the group of two modes comprising a first mode, in which the core encoder is configured to encode the plurality of audio channels and the plurality of audio objects received by the input interface as core encoder input data, and a second mode, in which the core encoder (300) is configured for receiving, as the core encoder input data, the plurality of pre-mixed channels generated by the mixer (200).

摘要翻译： 用于编码音频输入数据（101）以获得音频输出数据（501）的音频编码器包括用于接收多个音频通道的输入接口（100），与多个音频中的一个或多个音频相关的多个音频对象和元数据对象; 混合器（200），用于混合多个对象和多个通道以获得多个预混合通道，每个预混合通道包括通道的音频数据和至少一个对象的音频数据; 核心编码器（300），用于核心编码核心编码器输入数据; 以及用于压缩与所述多个音频对象中的一个或多个音频对象有关的元数据的元数据压缩器（400），其中所述音频编码器被配置为在包括第一模式的两种模式的组的至少一种模式中操作，其中核心编码器被配置为将由输入接口接收的多个音频频道和多个音频对象编码为核心编码器输入数据，以及第二模式，其中核心编码器（300）被配置为接收作为核心编码器输入数据，由混合器（200）产生的多个预混频道。

8.

发明公开
APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING 审中-公开

公开(公告)号：EP4254988A3

公开(公告)日：2023-11-01

申请号：EP23167354.2

申请日：2015-03-25

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

发明人： Neukam, Simone , Plogsties, Jan , Dick, Sascha , Hilpert, Johannes , Robilliard, Julien , Kuntz, Achim , Hölzer, Andreas

IPC分类号： H04S7/00 , H04S3/00

摘要： An apparatus for generating loudspeaker signals is provided. The apparatus comprises an object metadata processor (110) and an object renderer (120). The object renderer (120) is configured to receive an audio object. The object metadata processor (110) is configured to receive metadata, comprising an indication on whether the audio object is screen-related, and further comprising a first position of the audio object. The object metadata processor (110) is configured to calculate a second position of the audio object depending on the first position of the audio object and depending on a size of a screen, if the audio object is indicated in the metadata as being screen-related. The object renderer (120) is configured to generate the loudspeaker signals depending on the audio object and depending on position information. The object metadata processor (110) is configured to feed the first position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being not screen-related. The object metadata processor (110) is configured to feed the second position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being screen-related.

9.

发明公开
CONCEPT FOR AUDIO DECODING FOR AUDIO CHANNELS AND AUDIO OBJECTS 审中-公开

公开(公告)号：EP4033485A1

公开(公告)日：2022-07-27

申请号：EP22159568.9

申请日：2014-07-16

申请人： Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

发明人： Adami, Alexander , Borss, Christian , Dick, Sascha , Ertel, Christian , Neukam, Simone , Herre, Jürgen , Hilpert, Johannes , Hölzer, Andreas , Kratschmer, Michael , Küch, Fabian , Kuntz, Achim , Murtaza, Adrian , Plogsties, Jan , Silzle, Andreas , Stenzel, Hanne

IPC分类号： G10L19/008 , G10L19/18 , G10L19/20 , G10L19/22 , H04S3/00

摘要： Audio decoder for decoding encoded audio data, comprising: an input interface (1100) for receiving the encoded audio data, the encoded audio data comprising a plurality of encoded channels or a plurality of encoded objects or compress metadata related to the plurality of objects; a core decoder (1300) for decoding the plurality of encoded channels and the plurality of encoded objects; a metadata decompressor (1400) for decompressing the compressed metadata; an object processor (1200) for processing the plurality of decoded objects using the decompressed metadata to obtain a number of output channels (1205) comprising audio data from the objects and the decoded channels; and a post-processor (1700) for converting the number of output channels (1205) into an output format, wherein the audio decoder is configured to bypass the object processor and to feed a plurality of decoded channels into the post-processor (1700), when the encoded audio data does not contain any audio objects and to feed the plurality of decoded objects and the plurality of decoded channels into the object processor (1200), when the encoded audio data comprises encoded channels and encoded objects..

10.

发明公开
APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING 审中-公开

公开(公告)号：EP3487189A1

公开(公告)日：2019-05-22

申请号：EP18248305.7

申请日：2015-03-25

申请人： FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V.

发明人： Neukam, Simone , Plogsties, Jan , Dick, Sascha , Hilpert, Johannes , Robilliard, Julien , Kuntz, Achim , Hölzer, Andreas

IPC分类号： H04S7/00 , H04S3/00

摘要： An apparatus for generating loudspeaker signals is provided. The apparatus comprises an object metadata processor (110) and an object renderer (120). The object renderer (120) is configured to receive an audio object. The object metadata processor (110) is configured to receive metadata, comprising an indication on whether the audio object is screen-related, and further comprising a first position of the audio object. The object metadata processor (110) is configured to calculate a second position of the audio object depending on the first position of the audio object and depending on a size of a screen, if the audio object is indicated in the metadata as being screen-related. The object renderer (120) is configured to generate the loudspeaker signals depending on the audio object and depending on position information. The object metadata processor (110) is configured to feed the first position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being not screen-related. The object metadata processor (110) is configured to feed the second position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being screen-related.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类