-
Publication number: US20180130480A1
Publication date: 2018-05-10
Application number: US15563936
Filing date: 2016-04-01
Applicant: Dolby International AB
Inventor: Heiko PURNHAGEN , Per EKSTRAND , Harald MUNDT , Klaus PEICHL
IPC: G10L19/035 , G10L19/02
CPC classification number: G10L19/035 , G10L19/0204 , G10L21/0316 , G10L2019/0002 , G10L2019/0004
Abstract: Disclosed are some examples of systems, apparatus, methods and computer program products implementing techniques for extending the range of a set of decoded parameter values for a sequence of frequency bands in an identifiable time frame of an audio signal. In some implementations, the parameter values vary in relation to a sequence of time frames of the audio signal and in relation to a sequence of frequency bands in each time frame. In some implementations, it is determined that a decoded value corresponds to a minimum of a first range of values of a first coding protocol of a set of coding protocols. The determined value is modified to be below the minimum of the first range of values to produce an extended value. A modified set of decoded values including one or more extended values can thus be provided.
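As a rough illustration of the range extension described in this abstract, the sketch below replaces every decoded band value that sits at the minimum of a coding range with a value below that minimum. The function name `extend_minimum`, the dB-like example values, and the choice of a single fixed extended value are assumptions made for illustration; the abstract does not specify them.

```python
def extend_minimum(decoded, range_min, extended_value):
    """Return a copy of `decoded` in which values equal to `range_min`
    are replaced by `extended_value`, which must lie below `range_min`.

    Hypothetical helper: the names and the fixed replacement value are
    illustrative assumptions, not taken from the patent.
    """
    if extended_value >= range_min:
        raise ValueError("extended_value must lie below range_min")
    return [extended_value if v == range_min else v for v in decoded]


# Decoded parameter values for four frequency bands in one time frame.
bands = [-45.0, -12.0, -45.0, 3.0]
print(extend_minimum(bands, range_min=-45.0, extended_value=-90.0))
# -> [-90.0, -12.0, -90.0, 3.0]
```

Values that are not at the range minimum pass through unchanged, so the modified set contains extended values only where the decoder had reached the bottom of the first range.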
-
Publication number: US20180130479A1
Publication date: 2018-05-10
Application number: US15867318
Filing date: 2018-01-10
Applicant: DOLBY INTERNATIONAL AB
Inventor: Heiko PURNHAGEN , Leif SEHLSTROM , Lars VILLEMOES , Glenn N. DICKINS , Mark S. VINTON
IPC: G10L19/03 , G10L19/26 , H04M3/56 , G10L19/16 , G10L19/002
Abstract: An audio communication endpoint receives a bitstream containing spectral components representing spectral content of an audio signal, wherein the spectral components relate to a first range extending up to a first break frequency, above which any spectral components are unassigned. The endpoint adapts the received bitstream in accordance with a second range extending up to a second break frequency by removing spectral components or adding neutral-valued spectral components relating to a range between the first and second break frequencies. The endpoint then attenuates spectral content in a neighbourhood of the lesser of the first and second break frequencies, thereby achieving a gradual spectral decay. After this, the audio signal is reconstructed by an inverse transform operating on spectral components relating to said second range in the adapted and attenuated received bitstream. At small computational expense, the endpoint may adapt to different sample rates in received bitstreams.
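The adaptation step in this abstract can be sketched roughly as follows. The sketch assumes that spectral components are a plain list of coefficients, that the neutral value is zero, and that the gradual decay is a linear taper over a few bins below the lower break point; `adapt_spectrum` and `taper_len` are hypothetical names, and the inverse transform is not shown.

```python
def adapt_spectrum(coeffs, target_len, taper_len=4):
    """Fit `coeffs` (first range) to `target_len` bins (second range).

    Assumptions for illustration only: zero is the neutral value, and
    the decay near the lower break point is a linear ramp.
    """
    first_len = len(coeffs)
    if target_len < first_len:
        # Second break frequency is lower: remove the upper components.
        adapted = list(coeffs[:target_len])
    else:
        # Second break frequency is higher: pad with neutral values.
        adapted = list(coeffs) + [0.0] * (target_len - first_len)

    # Attenuate the bins just below the lesser of the two break points.
    edge = min(first_len, target_len)
    n = min(taper_len, edge)
    for i in range(n):
        gain = (n - i) / (n + 1)  # decreases toward the edge
        adapted[edge - n + i] *= gain
    return adapted


widened = adapt_spectrum([1.0] * 8, target_len=12)
narrowed = adapt_spectrum([1.0] * 8, target_len=6)
print(len(widened), len(narrowed))
```

Because the adaptation only truncates, pads, and scales coefficients, it avoids a time-domain resampling stage, which is consistent with the low computational cost claimed in the abstract.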
-
Publication number: US20180033441A1
Publication date: 2018-02-01
Application number: US15730652
Filing date: 2017-10-11
Applicant: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V. , Dolby International AB
Inventor: Juergen HERRE , Johannes HILPERT , Andreas HOELZER , Jonas ENGDEGARD , Heiko PURNHAGEN
IPC: G10L19/008 , G10L19/20 , G10L19/005 , H04S3/02 , H04S5/00
Abstract: An audio signal decoder for providing an upmix signal representation on the basis of a downmix signal representation and an object-related parametric information and in dependence on a rendering information has an object parameter determinator. The object parameter determinator is configured to obtain inter-object-correlation values for a plurality of pairs of audio objects. The object parameter determinator is configured to evaluate a bitstream signaling parameter in order to decide whether to evaluate individual inter-object-correlation bitstream parameter values to obtain inter-object-correlation values for a plurality of pairs of related audio objects, or to obtain inter-object-correlation values for a plurality of pairs of related audio objects using a common inter-object-correlation bitstream parameter value. The audio signal decoder also has a signal processor configured to obtain the upmix signal representation on the basis of the downmix signal representation and using the inter-object-correlation values for a plurality of pairs of related objects and the rendering information.
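The decision described in this abstract, between individual inter-object-correlation values and one common value, might look roughly like the sketch below. The function `build_ioc_matrix`, its arguments, and the conventions of a unit diagonal and zero correlation for unrelated pairs are assumptions for illustration; the abstract does not define the bitstream syntax.

```python
def build_ioc_matrix(num_objects, related_pairs, use_common,
                     common_value=None, individual_values=None):
    """Build a symmetric inter-object-correlation matrix.

    `use_common` stands in for the bitstream signaling parameter.
    Assumed conventions (not from the abstract): diagonal entries are
    1.0 and unrelated pairs get 0.0.
    """
    ioc = [[1.0 if i == j else 0.0 for j in range(num_objects)]
           for i in range(num_objects)]
    for (i, j) in related_pairs:
        if use_common:
            value = common_value            # one value shared by all pairs
        else:
            value = individual_values[(i, j)]  # one value per pair
        ioc[i][j] = value
        ioc[j][i] = value
    return ioc


pairs = [(0, 1), (2, 3)]
shared = build_ioc_matrix(4, pairs, use_common=True, common_value=0.5)
separate = build_ioc_matrix(4, pairs, use_common=False,
                            individual_values={(0, 1): 0.9, (2, 3): 0.2})
print(shared[0][1], separate[2][3])
```

Sending one common value instead of one value per pair reduces side information when the related objects are similarly correlated.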
-
Publication number: US20170301362A1
Publication date: 2017-10-19
Application number: US15641033
Filing date: 2017-07-03
Applicant: DOLBY INTERNATIONAL AB
Inventor: Kristofer KJOERLING , Heiko PURNHAGEN , Harald MUNDT , Karl Jonas ROEDEN , Leif SEHLSTROM
CPC classification number: G10L19/20 , G10L19/008 , G10L19/0212 , G10L19/167 , G10L25/18 , H04S3/008 , H04S2400/03 , H04S2420/03
Abstract: A method for decoding an encoded audio bitstream in an audio processing system is disclosed. The method includes extracting from the encoded audio bitstream a first waveform-coded signal comprising spectral coefficients corresponding to frequencies up to a first cross-over frequency for a time frame and performing parametric decoding at a second cross-over frequency for the time frame to generate a reconstructed signal. The second cross-over frequency is above the first cross-over frequency and the parametric decoding uses reconstruction parameters derived from the encoded audio bitstream to generate the reconstructed signal. The method also includes extracting from the encoded audio bitstream a second waveform-coded signal comprising spectral coefficients corresponding to a subset of frequencies above the first cross-over frequency for the time frame and interleaving the second waveform-coded signal with the reconstructed signal to produce an interleaved signal for the time frame.
-
Publication number: US20170238112A1
Publication date: 2017-08-17
Application number: US15498376
Filing date: 2017-04-26
Applicant: DOLBY INTERNATIONAL AB
Inventor: Heiko PURNHAGEN , Lars VILLEMOES , Jonas ENGDEGARD , Jonas ROEDEN , Kristofer KJOERLING
IPC: H04S5/00 , G10L19/16 , G10L19/02 , G10L19/008
CPC classification number: H04R5/00 , G10L19/008 , G10L19/0204 , G10L19/032 , G10L19/167 , G10L19/26 , H04S3/02 , H04S5/00 , H04S2400/01 , H04S2400/03
Abstract: A method performed by an audio decoder for reconstructing N audio channels from an audio signal containing M audio channels is disclosed. The method includes receiving a bitstream containing an encoded audio signal having M audio channels and a set of spatial parameters, the set of spatial parameters including an inter-channel intensity difference parameter and an inter-channel coherence parameter. The encoded audio bitstream is then decoded to obtain a decoded frequency domain representation of the M audio channels, and at least a portion of the frequency domain representation is decorrelated with an all-pass filter having a fractional delay. The all-pass filter is attenuated at locations of a transient. A matrixed version of the decorrelated signals is summed with a matrixed version of the decoded frequency domain representation to obtain N audio signals that collectively have N audio channels, where M is less than N.
-
Publication number: US20170229129A1
Publication date: 2017-08-10
Application number: US15498401
Filing date: 2017-04-26
Applicant: DOLBY INTERNATIONAL AB
Inventor: Heiko PURNHAGEN , Lars VILLEMOES , Jonas ENGDEGARD , Jonas ROEDEN , Kristofer KJOERLING
IPC: G10L19/008 , H04S3/02 , G10L19/02 , G10L19/26 , G10L19/032
CPC classification number: H04R5/00 , G10L19/008 , G10L19/0204 , G10L19/032 , G10L19/167 , G10L19/26 , H04S3/02 , H04S5/00 , H04S2400/01 , H04S2400/03
Abstract: A method performed by an audio decoder for reconstructing N audio channels from an audio signal containing M audio channels is disclosed. The method includes receiving a bitstream containing an encoded audio signal having M audio channels and a set of spatial parameters, the set of spatial parameters including an inter-channel intensity difference parameter and an inter-channel coherence parameter. The encoded audio bitstream is then decoded to obtain a decoded frequency domain representation of the M audio channels, and at least a portion of the frequency domain representation is decorrelated with an all-pass filter having a fractional delay. The all-pass filter is attenuated at locations of a transient. A matrixed version of the decorrelated signals is summed with a matrixed version of the decoded frequency domain representation to obtain N audio signals that collectively have N audio channels, where M is less than N.
-
Publication number: US20170180905A1
Publication date: 2017-06-22
Application number: US15300159
Filing date: 2015-03-31
Applicant: DOLBY INTERNATIONAL AB
Inventor: Heiko PURNHAGEN , Janusz KLEJSA
IPC: H04S7/00 , G10L19/008 , H04S3/00
CPC classification number: H04S7/302 , G10L19/008 , H04S3/008 , H04S2400/01 , H04S2400/03 , H04S2400/11
Abstract: Encoding and decoding methods are provided for object-based audio. An exemplary decoding method reconstructs audio objects based on a data stream, wherein the data stream corresponds to a plurality of time frames, wherein the data stream comprises a plurality of side information instances, wherein the data stream further comprises, for each side information instance, transition data including two independently assignable portions which in combination define a point in time to begin a transition from a current reconstruction setting to a desired reconstruction setting specified by the side information instance, and a point in time to complete the transition.
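A minimal sketch of how a decoder could use the two transition points in this abstract, assuming the reconstruction setting is a single scalar and the transition is a linear interpolation. The abstract only states that the transition data defines a start point and a completion point; the interpolation shape and the name `reconstruction_setting` are illustrative assumptions.

```python
def reconstruction_setting(t, start, end, current, desired):
    """Return the setting in effect at time `t`.

    Before `start` the current setting applies, after `end` the desired
    setting applies, and in between the two are blended linearly
    (the linear blend is an assumption, not stated in the abstract).
    """
    if end <= start:
        raise ValueError("end must be later than start")
    if t <= start:
        return current
    if t >= end:
        return desired
    weight = (t - start) / (end - start)
    return current + weight * (desired - current)


# Transition from 0.0 to 1.0 that begins at sample 10 and completes at 20.
for t in (5, 10, 15, 20, 25):
    print(t, reconstruction_setting(t, 10, 20, 0.0, 1.0))
```

Because the start and completion points are assigned independently, the length of the transition can vary from one side information instance to the next.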
-
Publication number: US20170133025A1
Publication date: 2017-05-11
Application number: US15410377
Filing date: 2017-01-19
Applicant: DOLBY INTERNATIONAL AB
Inventor: Heiko PURNHAGEN , Kristofer KJOERLING
CPC classification number: G10L19/06 , G10L19/008 , G10L19/02 , G10L19/0204 , G10L19/0212 , G10L19/167 , G10L19/265 , G10L25/06 , H04S1/007 , H04S2400/03 , H04S2420/03
Abstract: The present disclosure provides methods, devices and computer program products for encoding and decoding a stereo audio signal based on an input signal. According to the disclosure, a hybrid approach of using both parametric stereo coding and a discrete representation of the stereo audio signal is used which may improve the quality of the encoded and decoded audio for certain bitrates.
-
Publication number: US20170084285A1
Publication date: 2017-03-23
Application number: US15344170
Filing date: 2016-11-04
Applicant: Dolby International AB
Inventor: Jonas ENGDEGARD , Lars VILLEMOES , Heiko PURNHAGEN , Barbara RESCH
IPC: G10L19/20 , H04S3/00 , G10L19/008 , H04S3/02
CPC classification number: G10L19/20 , G10L19/008 , H04S3/008 , H04S3/02 , H04S5/00 , H04S7/30 , H04S2400/03 , H04S2400/11 , H04S2420/03
Abstract: An audio object coder for generating an encoded object signal using a plurality of audio objects includes a downmix information generator for generating downmix information indicating a distribution of the plurality of audio objects into at least two downmix channels, an audio object parameter generator for generating object parameters for the audio objects, and an output interface for generating the imported audio output signal using the downmix information and the object parameters. An audio synthesizer uses the downmix information for generating output data usable for creating a plurality of output channels of the predefined audio output configuration.
-
Publication number: US20170013387A1
Publication date: 2017-01-12
Application number: US15114383
Filing date: 2015-04-01
Applicant: Dolby International AB
Inventor: Christof FERSCH , Heiko PURNHAGEN , Jens POPP , Martin WOLTERS
IPC: H04S7/00 , H04S3/00 , G10L19/008
CPC classification number: H04S7/30 , G10L19/008 , H04S3/008 , H04S2400/03 , H04S2400/11 , H04S2400/13
Abstract: The present document relates to the field of encoding and decoding of audio. In particular, the present document relates to encoding and decoding of an audio scene comprising audio objects. A method (400) for encoding metadata relating to a plurality of audio objects (106a) of an audio scene (102) is described. The metadata comprises a first set (114, 314) of metadata and a second set (104) of metadata. The first and second sets (104, 114, 314) of metadata comprise one or more data elements which are indicative of a property of an audio object (106a) from the plurality of audio objects (106a) and/or of a downmix signal (112) derived from the plurality of audio objects (106a). The method (400) comprises identifying (401) a redundant data element which is common to the first and second sets (104, 114, 314) of metadata. Furthermore, the method comprises encoding (402) the redundant data element of the first set (114, 314) of metadata by referring to a redundant data element of a set (104) of metadata external to the first set (114, 314) of metadata.
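The redundancy handling described in this abstract could be sketched as below, assuming each metadata set is a flat dictionary of data elements and that a reference is simply the key of the matching element in the external set. The dictionary layout and the names `encode_with_references` and `decode_with_references` are hypothetical.

```python
def encode_with_references(first_set, external_set):
    """Encode `first_set`, replacing elements that also appear with the
    same value in `external_set` by a reference to that element.

    The {"ref": ...} / {"value": ...} layout is an illustrative
    assumption, not the bitstream format of the patent.
    """
    encoded = {}
    for key, value in first_set.items():
        if key in external_set and external_set[key] == value:
            encoded[key] = {"ref": key}      # redundant: point to external set
        else:
            encoded[key] = {"value": value}  # unique: carry the value itself
    return encoded


def decode_with_references(encoded, external_set):
    """Resolve references against `external_set` to rebuild the first set."""
    return {key: external_set[entry["ref"]] if "ref" in entry else entry["value"]
            for key, entry in encoded.items()}


object_metadata = {"gain": 0.8, "azimuth": 30, "width": 2}
downmix_metadata = {"gain": 0.8, "azimuth": 45}
encoded = encode_with_references(object_metadata, downmix_metadata)
print(encoded)
print(decode_with_references(encoded, downmix_metadata) == object_metadata)
```

Only the `gain` element is stored as a reference here, since `azimuth` differs between the two sets and `width` exists only in the first set.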