-
31.
Publication No.: US20240031768A1
Publication date: 2024-01-25
Application No.: US18353063
Filing date: 2023-07-15
IPC classification: H04S7/00
CPC classification: H04S7/308, H04S2400/11, H04S2400/13
Abstract: The positions of a plurality of speakers at a media consumption site are determined. Audio information in an object-based format is received. A gain adjustment value for a sound content portion in the object-based format may be determined based on the position of the sound content portion and the positions of the plurality of speakers. Audio information in a ring-based channel format is received. A gain adjustment value for each ring-based channel in a set of ring-based channels may be determined based on the ring to which the ring-based channel belongs and the positions of the speakers at the media consumption site.
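The abstract does not say how the gain adjustment value is computed from the object position and the speaker positions. As an illustration only, the sketch below assumes a simple inverse-distance weighting with power normalization; the function name `object_gains`, the `rolloff` parameter, and the 2-D coordinates are all hypothetical and not taken from the publication.

```python
import math

def object_gains(object_pos, speaker_positions, rolloff=1.0):
    """Return one gain per speaker for a sound object at object_pos.

    Assumption: gain falls off with distance (inverse-distance law) and the
    gains are normalized so that their squares sum to 1 (constant power).
    """
    weights = []
    for speaker in speaker_positions:
        distance = math.dist(object_pos, speaker)
        # Small offset avoids division by zero when the object sits on a speaker.
        weights.append(1.0 / (distance + 1e-6) ** rolloff)
    norm = math.sqrt(sum(w * w for w in weights))
    return [w / norm for w in weights]

# Hypothetical four-speaker layout (x, y) and an object near the front-right speaker.
speakers = [(-1.0, 1.0), (1.0, 1.0), (-1.0, -1.0), (1.0, -1.0)]
gains = object_gains((0.8, 0.9), speakers)
```

With this layout the speaker closest to the object receives the largest gain, and the overall power stays constant as the object moves.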
-
32.
Publication No.: US20240029747A1
Publication date: 2024-01-25
Application No.: US18357679
Filing date: 2023-07-24
CPC classification: G10L19/02, G10L19/167, G10L19/26, H03M7/6005, H03M7/6011
Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.
-
33.
Publication No.: US20240022869A1
Publication date: 2024-01-18
Application No.: US18255554
Filing date: 2021-12-02
CPC classification: H04S7/301, H04R5/02, H04R3/005, H04S7/303, H04R2430/23, H04S2400/15
Abstract: A method may involve: receiving direction of arrival (DOA) data corresponding to sound emitted by at least a first smart audio device of the audio environment that includes a first audio transmitter and a first audio receiver, the DOA data corresponding to sound received by at least a second smart audio device of the audio environment that includes a second audio transmitter and a second audio receiver, the DOA data corresponding to sound emitted by at least the second smart audio device and received by at least the first smart audio device; receiving one or more configuration parameters corresponding to the audio environment, to one or more audio devices, or both; and minimizing a cost function based at least in part on the DOA data and the configuration parameter(s), to estimate a position and an orientation of at least the first smart audio device and the second smart audio device.
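The abstract does not define the cost function. One plausible form, shown here purely as an assumption, sums the squared differences between each measured DOA and the DOA predicted from candidate device poses. The sketch works in 2-D, leaves out the configuration parameters and the minimizer itself, and uses hypothetical names (`doa_cost`, `poses`, `measurements`).

```python
import math

def wrap(angle):
    """Wrap an angle in radians to the interval (-pi, pi]."""
    return math.atan2(math.sin(angle), math.cos(angle))

def doa_cost(poses, measurements):
    """Sum of squared DOA errors for candidate poses.

    poses:        {device: (x, y, heading)} with heading in radians
    measurements: {(receiver, emitter): DOA measured in the receiver's frame}
    """
    total = 0.0
    for (receiver, emitter), measured in measurements.items():
        xr, yr, heading = poses[receiver]
        xe, ye, _ = poses[emitter]
        # DOA the receiver would observe if the candidate poses were correct.
        predicted = math.atan2(ye - yr, xe - xr) - heading
        total += wrap(predicted - measured) ** 2
    return total

# Hypothetical ground truth and the DOA measurements it would produce.
true_poses = {"a": (0.0, 0.0, 0.0), "b": (2.0, 1.0, math.pi / 2)}
measurements = {
    ("a", "b"): math.atan2(1.0, 2.0),
    ("b", "a"): math.atan2(-1.0, -2.0) - math.pi / 2,
}
# A candidate whose heading for device "a" is off by 0.3 rad.
guess = {"a": (0.0, 0.0, 0.3), "b": (2.0, 1.0, math.pi / 2)}
```

The cost is zero at the true poses and grows as the candidate headings or positions drift away from them. DOA data alone leaves translation and scale undetermined, which is one reason additional parameters would be needed in practice.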
-
34.
Publication No.: US20240018844A1
Publication date: 2024-01-18
Application No.: US18355168
Filing date: 2023-07-19
Inventors: Jeffrey RIEDMILLER, Karl J. ROEDEN, Kristofer KJOERLING, Heiko PURNHAGEN, Vinay MELKOTE, Leif SEHLSTROM
IPC classification: E21B33/138, E21B41/00, E21B21/00, G10L19/008, G10L19/16, G10L19/18, G10L19/24
CPC classification: E21B33/138, E21B41/00, E21B21/003, G10L19/008, G10L19/167, G10L19/18, G10L19/24
Abstract: On the basis of a bitstream (P), an n-channel audio signal (X) is reconstructed by deriving an m-channel core signal (Y) and multichannel coding parameters (a) from the bitstream, where 1≤m
-
35.
Publication No.: US11875805B2
Publication date: 2024-01-16
Application No.: US17495184
Filing date: 2021-10-06
IPC classification: G10L19/00, G10L19/02, G10L21/038, G10L21/0388, G10L19/26
CPC classification: G10L19/0208, G10L19/02, G10L19/0212, G10L19/26, G10L21/038, G10L21/0388
Abstract: There are provided methods and apparatuses for decoding and encoding audio signals. In particular, a method for decoding includes receiving a waveform-coded signal having a spectral content corresponding to a subset of the frequency range above a cross-over frequency. The waveform-coded signal is interleaved with a parametric high frequency reconstruction of the audio signal above the cross-over frequency. In this way, an improved reconstruction of the high frequency bands of the audio signal is achieved.
-
36.
Publication No.: US20240013793A1
Publication date: 2024-01-11
Application No.: US18255232
Filing date: 2021-12-02
Inventors: Stefan BRUHN, Harald MUNDT, David S. MCGRATH, Stefanie BROWN
IPC classification: G10L19/008, G10L19/002, G10L19/032
CPC classification: G10L19/008, G10L19/002, G10L19/032
Abstract: A method for encoding scene-based audio is provided. In some implementations, the method involves determining, by an encoder, a spatial direction of a dominant sound component in a frame of an input audio signal. In some implementations, the method involves determining rotation parameters based on the determined spatial direction and a direction preference of a coding scheme to be used to encode the input audio signal. In some implementations, the method involves rotating sound components of the frame based on the rotation parameters such that, after being rotated, the dominant sound component has a spatial direction that aligns with the direction preference of the coding scheme. In some implementations, the method involves encoding the rotated sound components of the frame of the input audio signal using the coding scheme in connection with an indication of the rotation parameters or an indication of the spatial direction of the dominant sound component.
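The rotation step can be illustrated with a simplified horizontal case. The sketch assumes a first-order scene with an omnidirectional component `w` and two horizontal dipole components `x` and `y`, estimates the dominant azimuth from the time-averaged products of `w` with `x` and `y`, and applies a yaw rotation. These choices, and the names `dominant_azimuth`, `rotate_yaw`, and `align_frame`, are assumptions for illustration; the publication's actual direction estimator and rotation parameters may differ.

```python
import math

def dominant_azimuth(w, x, y):
    """Estimate the dominant azimuth (radians) from an intensity-like vector."""
    ix = sum(a * b for a, b in zip(w, x))
    iy = sum(a * b for a, b in zip(w, y))
    return math.atan2(iy, ix)

def rotate_yaw(x, y, angle):
    """Rotate the horizontal dipole components by `angle` radians about the vertical axis."""
    c, s = math.cos(angle), math.sin(angle)
    x_rot = [c * a - s * b for a, b in zip(x, y)]
    y_rot = [s * a + c * b for a, b in zip(x, y)]
    return x_rot, y_rot

def align_frame(w, x, y, preferred_azimuth=0.0):
    """Rotate a frame so its dominant direction matches the coder's preferred azimuth."""
    rotation = preferred_azimuth - dominant_azimuth(w, x, y)
    x_rot, y_rot = rotate_yaw(x, y, rotation)
    # The rotation value would be signalled alongside the coded frame.
    return rotation, x_rot, y_rot

# Hypothetical frame: a single source at 60 degrees azimuth.
signal = [1.0, -0.5, 0.25, 0.8]
azimuth = math.pi / 3
w = signal
x = [v * math.cos(azimuth) for v in signal]
y = [v * math.sin(azimuth) for v in signal]
rotation, x_rot, y_rot = align_frame(w, x, y)
```

A decoder that receives `rotation` could undo the alignment by calling `rotate_yaw` with `-rotation`.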
-
37.
Publication No.: US20240005942A1
Publication date: 2024-01-04
Application No.: US18248801
Filing date: 2021-10-13
Inventors: Xiaoyu LIU, Jordi PONS PUIG
IPC classification: G10L21/028
CPC classification: G10L21/028
Abstract: Described is a method of training a deep-learning-based system for sound source separation. The system comprises a separation stage for frame-wise extraction of representations of sound sources from a representation of an audio signal, and a clustering stage for generating, for each frame, a vector indicative of an assignment permutation of extracted frames of representations of sound sources to respective sound sources. The representation of the audio signal is a waveform-based representation. The separation stage is trained using frame-level permutation invariant training. Further, the clustering stage is trained to generate embedding vectors for the frames of the audio signal that make it possible to estimate the respective assignment permutations between extracted sound signals and the sound source labels that had been used for the frames. Also described is a method of using the deep-learning-based system for sound source separation.
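Frame-level permutation invariant training scores each frame against every possible assignment of outputs to reference sources and keeps the best one. The sketch below shows that selection for a single frame using mean squared error; the loss choice and the names `mse` and `frame_pit` are assumptions, since the abstract does not specify them.

```python
from itertools import permutations

def mse(a, b):
    """Mean squared error between two equal-length sample lists."""
    return sum((p - q) ** 2 for p, q in zip(a, b)) / len(a)

def frame_pit(estimates, targets):
    """Return (best_permutation, best_loss) for one frame.

    best_permutation[i] is the index of the estimate assigned to target i.
    """
    n = len(targets)
    best_perm, best_loss = None, float("inf")
    for perm in permutations(range(n)):
        loss = sum(mse(estimates[perm[i]], targets[i]) for i in range(n)) / n
        if loss < best_loss:
            best_perm, best_loss = perm, loss
    return best_perm, best_loss

# Hypothetical frame in which the separator emitted the two sources in swapped order.
targets = [[1.0, 0.0, -1.0], [0.5, 0.5, 0.5]]
estimates = [[0.5, 0.5, 0.5], [1.0, 0.0, -1.0]]
perm, loss = frame_pit(estimates, targets)
```

The per-frame permutation found this way is the kind of assignment that the clustering stage in the abstract is trained to recover at inference time, when no reference targets are available. Exhaustive search costs n! evaluations per frame, so it is only practical for a small number of sources.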
-
38.
Publication No.: USRE49787E1
Publication date: 2024-01-02
Application No.: US17150967
Filing date: 2021-01-15
Inventors: Jiuhuai Lu, Tao Chen, Yoshiichiro Kashiwagi, Shinya Kadono, Chong Soon Lim
IPC classification: H04N19/159
CPC classification: H04N19/159
Abstract: A moving picture coding apparatus (1) includes: a quantization matrix holding unit (112) that holds a quantization matrix (WM), which has already been transmitted in a parameter set, and a matrix ID for identifying the quantization matrix (WM), the two being associated with each other; and a variable length coding unit (111) that obtains the matrix ID corresponding to the quantization matrix (WM) used for quantization from the quantization matrix holding unit (112) and places the matrix ID in a coded stream (Str).
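The idea of signalling a short matrix ID instead of retransmitting the matrix can be sketched as a lookup table. The class `QuantMatrixStore`, the function `code_picture`, and the flattened four-entry matrix are hypothetical; the sketch also skips the variable length coding itself and only shows which values would be placed in the stream.

```python
class QuantMatrixStore:
    """Holds quantization matrices that were already sent, keyed to their IDs."""

    def __init__(self):
        self._id_by_matrix = {}

    def register(self, matrix_id, matrix):
        # Matrices are stored as tuples so they can serve as dictionary keys.
        self._id_by_matrix[tuple(matrix)] = matrix_id

    def id_for(self, matrix):
        return self._id_by_matrix[tuple(matrix)]

def code_picture(store, matrix, coefficients):
    """Quantize coefficients and emit the matrix ID rather than the matrix."""
    levels = [round(c / q) for c, q in zip(coefficients, matrix)]
    return {"matrix_id": store.id_for(matrix), "levels": levels}

# Hypothetical matrix (flattened) previously transmitted in a parameter set with ID 1.
store = QuantMatrixStore()
matrix = [16, 16, 20, 24]
store.register(1, matrix)
coded = code_picture(store, matrix, [32, 45, -60, 7])
```

A decoder holding the same ID-to-matrix association can look up the matrix from `matrix_id` and rescale the levels.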
-
39.
Publication No.: US20230394287A1
Publication date: 2023-12-07
Application No.: US18248805
Filing date: 2021-10-12
Inventors: Cong Zhou, Mark S. Vinton, Grant A. Davidson, Lars Villemoes
IPC classification: G06N3/0475, G06N3/044, G06N3/045
CPC classification: G06N3/0475, G06N3/044, G06N3/045
Abstract: A neural network system for predicting frequency coefficients of a media signal, the neural network system comprising a time predicting portion including at least one neural network trained to predict a first set of output variables representing a specific frequency band of a current time frame given coefficients of one or several previous time frames, and a frequency predicting portion including at least one neural network trained to predict a second set of output variables representing a specific frequency band given coefficients of one or several frequency bands adjacent to the specific frequency band in said current time frame. Such a neural network system forms a predictor capable of capturing both temporal and frequency dependencies occurring in time-frequency tiles of a media signal.
-
40.
Publication No.: US11830508B2
Publication date: 2023-11-28
Application No.: US17544959
Filing date: 2021-12-08
Inventors: Stephan Schreiner, Christof Fersch
IPC classification: G10L19/16, G10L19/008, H04N21/426, H04N21/434, H04N21/4363, H04N21/439
CPC classification: G10L19/167, G10L19/008, H04N21/42615, H04N21/434, H04N21/4363, H04N21/4394
Abstract: The disclosure relates to methods, apparatus and systems for side load processing of packetized media streams. In an embodiment, the apparatus comprises: a receiver for receiving a bitstream, and a splitter for identifying a packet type in the bitstream and splitting the bitstream, based on the identification of a value of the packet type, into a main stream and an auxiliary stream.
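The splitter can be sketched as a routing step keyed on the packet type. The packet representation (a `(packet_type, payload)` tuple), the type value `0x10`, and the names `AUX_PACKET_TYPES` and `split_stream` are hypothetical; the abstract does not specify the packet syntax or which values mark auxiliary packets.

```python
# Hypothetical set of packet type values that mark auxiliary (side load) packets.
AUX_PACKET_TYPES = {0x10}

def split_stream(packets, aux_types=AUX_PACKET_TYPES):
    """Split (packet_type, payload) tuples into a main and an auxiliary stream."""
    main, aux = [], []
    for packet_type, payload in packets:
        if packet_type in aux_types:
            aux.append((packet_type, payload))
        else:
            main.append((packet_type, payload))
    return main, aux

# Hypothetical bitstream with one auxiliary packet between two main packets.
bitstream = [(0x01, b"frame-0"), (0x10, b"side-load"), (0x01, b"frame-1")]
main_stream, aux_stream = split_stream(bitstream)
```

Packet order is preserved within each output stream, so each stream can be handed to its own decoder path.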
-