-
公开(公告)号:US11817108B2
公开(公告)日:2023-11-14
申请号:US17975955
申请日:2022-10-28
Applicant: DOLBY INTERNATIONAL AB
Inventor: Tobias Friedrich , Alexander Mueller , Karsten Linzmeier , Claus-Christian Spenger , Tobias R. Wagenblass
IPC: G10L19/008 , G10L19/16 , H04S3/00
CPC classification number: G10L19/008 , G10L19/167 , H04S3/008 , H04S2400/01 , H04S2400/03 , H04S2420/03
Abstract: The present document relates to audio coding systems. In particular, the present document relates to efficient methods and systems for parametric multi-channel audio coding. An audio encoding system configured to generate a bitstream indicative of a downmix signal and spatial metadata for generating a multi-channel upmix signal from the downmix signal is described. The system comprises a downmix processing unit configured to generate the downmix signal from a multi-channel input signal; wherein the downmix signal comprises m channels and wherein the multi-channel input signal comprises n channels; n, m being integers with m
-
72.
公开(公告)号:US20230362575A1
公开(公告)日:2023-11-09
申请号:US18352115
申请日:2023-07-13
Applicant: DOLBY INTERNATIONAL AB
Inventor: Leon TERENTIV , Christof FERSCH , Daniel FISCHER
CPC classification number: H04S7/303 , H04S3/008 , H04S2400/11 , H04S2400/01 , H04S2400/13
Abstract: A method (910) for rendering an audio signal in a virtual reality rendering environment (180) is described. The method (910) comprises rendering (911) an origin audio signal of an audio source (311, 312, 313) from an origin source position on an origin sphere (114) around an origin listening position (301) of a listener (181). Furthermore, the method (900) comprises determining (912) that the listener (181) moves from the origin listening position (301) to a destination listening position (302). In addition, the method (900) comprises determining (913) a destination source position of the audio source (311, 312, 313) on a destination sphere (114) around the destination listening position (302) based on the origin source position, and determining (914) a destination audio signal of the audio source (311, 312, 313) based on the origin audio signal. Furthermore, the method (900) comprises rendering (915) the destination audio signal of the audio source (311, 312, 313) from the destination source position on the destination sphere (114) around the destination listening position (302).
-
公开(公告)号:US20230360659A1
公开(公告)日:2023-11-09
申请号:US18351769
申请日:2023-07-13
Inventor: Dirk Jeroen Breebaart , David Matthew Cooper , Leif Jonas Samuelsson
IPC: G10L19/02 , H04S7/00 , G10L19/008
CPC classification number: G10L19/0212 , H04S7/308 , G10L19/008 , G10L19/0204 , H04S2420/01 , H04S2420/03 , H04S2420/07 , H04S2400/01 , H04R2460/03
Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.
-
74.
公开(公告)号:US20230353781A1
公开(公告)日:2023-11-02
申请号:US18219036
申请日:2023-07-06
Applicant: DOLBY INTERNATIONAL AB
Inventor: Felix Henry , Stephane Pateux
IPC: H04N19/593 , H04N19/50 , H04N19/13 , H04N19/61 , H04N19/91 , H04N19/174 , H04N19/436 , H04N19/25 , H04N19/184 , H04N19/51
CPC classification number: H04N19/593 , H04N19/50 , H04N19/13 , H04N19/61 , H04N19/91 , H04N19/174 , H04N19/436 , H04N19/25 , H04N19/184 , H04N19/51
Abstract: A method of coding at least one image comprising the steps of splitting the image into a plurality of blocks, of grouping said blocks into a predetermined number of subsets of blocks, of coding each of said subsets of blocks in parallel, the blocks of a subset considered being coded according to a predetermined sequential order of traversal. The coding step comprises, for a current block of a subset considered, the sub-step of predictive coding of said current block with respect to at least one previously coded and decoded block, and the sub-step of entropy coding of said current block on the basis of at least one probability of appearance of a symbol.
-
公开(公告)号:US20230319190A1
公开(公告)日:2023-10-05
申请号:US17628732
申请日:2020-07-29
Inventor: Glenn N. DICKINS , Christopher Graham HINES , David GUNAWAN , Richard J. CARTWRIGHT , Alan J. SEEFELDT , Daniel Arteaga , Mark R.P. THOMAS , Joshua B. LANDO
CPC classification number: H04M9/082 , G10L2015/223 , G10L15/22
Abstract: An audio processing method may involve receiving output signals from each microphone of a plurality of microphones in an audio environment, the output signals corresponding to a current utterance of a person and determining, based on the output signals, one or more aspects of context information relating to the person, including an estimated current proximity of the person to one or more microphone locations. The method may involve selecting two or more loudspeaker-equipped audio devices based, at least in part, on the one or more aspects of the context information, determining one or more types of audio processing changes to apply to audio data being rendered to loudspeaker feed signals for the audio devices and causing one or more types of audio processing changes to be applied. In some examples, the audio processing changes have the effect of increasing a speech to echo ratio at one or more microphones.
-
公开(公告)号:US11769514B2
公开(公告)日:2023-09-26
申请号:US17679693
申请日:2022-02-24
Inventor: Sripal S. Mehta , Thomas Ziegler , Giles Baker , Jeffrey Riedmiller , Prinyar Saungsomboon
CPC classification number: G10L19/008 , G06F3/165 , G10L19/20 , H04S3/008 , H04S7/30 , G10L19/167 , H04S2400/01 , H04S2400/13 , H04S2400/15 , H04S2420/03
Abstract: Methods for generating an object based audio program, renderable in a personalizable manner, and including a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.
-
77.
公开(公告)号:US11756559B2
公开(公告)日:2023-09-12
申请号:US17677608
申请日:2022-02-22
Applicant: DOLBY INTERNATIONAL AB
Inventor: Kristofer Kjoerling , Lars Villemoes , Heiko Purnhagen , Per Ekstrand
CPC classification number: G10L19/02 , G10L19/167 , G10L19/26 , H03M7/6005 , H03M7/6011
Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.
-
公开(公告)号:US11749288B2
公开(公告)日:2023-09-05
申请号:US17854947
申请日:2022-06-30
Applicant: Dolby International AB
Inventor: Kristofer Kjoerling , Harald Mundt , Heiko Purnhagen
IPC: G10L19/008
CPC classification number: G10L19/008
Abstract: Encoding and decoding devices for encoding the channels of an audio system having at least four channels are disclosed. The decoding device has a first stereo decoding component which subjects a first pair of input channels to a first stereo decoding, and a second stereo decoding component which subjects a second pair of input channels to a second stereo decoding. The results of the first and second stereo decoding components are crosswise coupled to a third and a fourth stereo decoding component which each performs stereo decoding on one channel resulting from the first stereo decoding component, and one channel resulting from the second stereo decoding component.
-
公开(公告)号:US11743674B2
公开(公告)日:2023-08-29
申请号:US17467112
申请日:2021-09-03
Inventor: Nicolas R. Tsingos , David S. McGrath , Freddie Sanchez , Antonio Mateos Sole
IPC: H04S7/00
CPC classification number: H04S7/308 , H04S2400/11 , H04S2400/13
Abstract: The positions of a plurality of speakers at a media consumption site are determined. Audio information in an object-based format is received. Gain adjustment value for a sound content portion in the object-based format may be determined based on the position of the sound content portion and the positions of the plurality of speakers. Audio information in a ring-based channel format is received. Gain adjustment value for each ring-based channel in a set of ring-based channels may be determined based on the ring to which the ring-based channel belongs and the positions of the speakers at a media consumption site.
-
公开(公告)号:US20230269551A1
公开(公告)日:2023-08-24
申请号:US18099658
申请日:2023-01-20
Inventor: Antonio MATEOS SOLE , Nicolas R. TSINGOS
CPC classification number: H04S7/30 , H04S3/008 , H04S5/005 , H04S2400/01 , H04S2400/11 , H04S2400/13 , H04S2400/15
Abstract: Multiple virtual source locations may be defined for a volume within which audio objects can move. A set-up process for rendering audio data may involve receiving reproduction speaker location data and pre-computing gain values for each of the virtual sources according to the reproduction speaker location data and each virtual source location. The gain values may be stored and used during “run time,” during which audio reproduction data are rendered for the speakers of the reproduction environment. During run time, for each audio object, contributions from virtual source locations within an area or volume defined by the audio object position data and the audio object size data may be computed. A set of gain values for each output channel of the reproduction environment may be computed based, at least in part, on the computed contributions. Each output channel may correspond to at least one reproduction speaker of the reproduction environment.
-
-
-
-
-
-
-
-
-