-
11.
公开(公告)号:US20230230606A1
公开(公告)日:2023-07-20
申请号:US18188883
申请日:2023-03-23
Inventor: Adrian TOMASEK , Ralph SPERSCHNEIDER , Jan BÜTHE , Alexander TSCHEKALINSKIJ , Manfred LUTZKY
IPC: G10L19/035 , H04B17/309 , G10L21/0324 , G10L19/022 , H03M13/07 , H03M13/15 , H04L1/00
CPC classification number: G10L19/035 , H04B17/309 , G10L21/0324 , G10L19/022 , H03M13/07 , H03M13/1515 , H04L1/0009 , H04L1/0032 , H04L1/0046 , H04L1/0042 , H04L1/0045 , H04L1/0084
Abstract: An audio transmitter processor for generating an error protected frame using encoded audio data of an audio frame, the encoded audio data for the audio frame having a first amount of information units and a second amount of information units, has: a frame builder for building a codeword frame having a codeword raster, wherein the frame builder is configured to determine a border between a first amount of information units and a second amount of information units so that a starting information unit of the second amount of information units coincides with a codeword border; and an error protection coder to obtain a plurality of processed codewords representing the error protected frame.
-
公开(公告)号:US11694701B2
公开(公告)日:2023-07-04
申请号:US17396526
申请日:2021-08-06
Inventor: Martin Dietz , Guillaume Fuchs , Christian Helmrich , Goran Markovic
IPC: G10L25/00 , G10L19/00 , G10L19/035 , G10L19/02 , G10H1/06 , G10L25/18 , G10L25/21 , G10L25/45 , G10L25/03
CPC classification number: G10L19/035 , G10H1/06 , G10L19/0212 , G10L25/18 , G10L25/21 , G10L25/45 , G10H2210/561 , G10L25/03
Abstract: The invention provides an audio encoder for encoding an audio signal so as to produce therefrom an encoded signal, the audio encoder including: a framing device configured to extract frames from the audio signal; a quantizer configured to map spectral lines of a spectrum signal derived from the frame of the audio signal to quantization indices, wherein the quantizer has a dead-zone, in which the input spectral lines are mapped to quantization index zero; and a control device configured to modify the dead-zone; wherein the control device includes a tonality calculating device configured to calculate at least one tonality indicating value for at least one spectrum line or for at least one group of spectral lines, wherein the control device is configured to modify the dead-zone for the at least one spectrum line or the at least one group of spectrum lines depending on the respective tonality indicating value.
-
公开(公告)号:US11616954B2
公开(公告)日:2023-03-28
申请号:US17030466
申请日:2020-09-24
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Ho-sang Sung , Ki-hyun Choo , Eun-mi Oh
IPC: G10L19/032 , H04N19/124 , H04N19/196 , H04N19/40 , G10L19/16 , G10L19/24 , H03M13/05 , H03M13/15 , H03M13/31 , G10L19/035 , G10L19/038 , G10L19/02
Abstract: A spectrum coding method includes quantizing spectral data of a current band based on a first quantization scheme, generating a lower bit of the current band using the spectral data and the quantized spectral data, quantizing a sequence of lower bits including the lower bit of the current band based on a second quantization scheme, and generating a bitstream based on a upper bit excluding N bits, where N is 1 or greater, from the quantized spectral data and the quantized sequence of lower bits.
-
14.
公开(公告)号:US11580999B2
公开(公告)日:2023-02-14
申请号:US17331416
申请日:2021-05-26
Inventor: Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Woo-taek Lim , Inseon Jang
IPC: G10L19/022 , G10L19/06 , G10L19/16 , G10L19/035
Abstract: An audio signal encoding method performed by an encoder includes identifying an audio signal of a time domain in units of a block, generating a combined block by combining i) a current original block of the audio signal and ii) a previous original block chronologically adjacent to the current original block, extracting a first residual signal of a frequency domain from the combined block using linear predictive coding of a time domain, overlapping chronologically adjacent first residual signals among first residual signals converted into a time domain, and quantizing a second residual signal of a time domain extracted from the overlapped first residual signal by converting the second residual signal of the time domain into a frequency domain using linear predictive coding of a frequency domain.
-
公开(公告)号:US11501787B2
公开(公告)日:2022-11-15
申请号:US16548146
申请日:2019-08-22
Applicant: Google LLC
Inventor: Beat Gfeller , Dominik Roblek , Félix de Chaumont Quitry , Marco Tagliasacchi
IPC: G10L19/035 , G06N20/00 , G10L19/038 , G10L25/18
Abstract: Systems and methods for training a machine-learned model are provided. A method can include can include obtaining an unlabeled audio signal, sampling the unlabeled audio signal to select one or more sampled slices, inputting the one or more sampled slices into a machine-learned model, receiving, as an output of the machine-learned model, one or more determined characteristics associated with the audio signal, determining a loss function for the machine-learned model based at least in part on a difference between the one or more determined characteristics and one or more corresponding ground truth characteristics of the audio signal, and training the machine-learned model from end to end based at least in part on the loss function. The one or more determined characteristics can include one or more reconstructed portions of the audio signal temporally adjacent to the one or more sampled slices or an estimated distance between two sampled slices.
-
公开(公告)号:US20220335956A1
公开(公告)日:2022-10-20
申请号:US17635593
申请日:2020-07-27
Applicant: Nokia Technologies Oy
Inventor: Adriana VASILACHE
IPC: G10L19/008 , G10L19/035
Abstract: A method for spatial audio signal encoding comprising: obtaining, for a first frame, a plurality of audio direction parameters, wherein each parameter comprises an elevation value and an azimuth value and wherein each parameter has an ordered position; determining whether, for a preceding frame, any of the plurality of audio direction parameters was differentially encoded based on a difference between the preceding frame parameter elevation value and a further preceding frame parameter elevation value and the preceding frame parameter azimuth value and a further preceding frame parameter azimuth value; generating, for any audio direction parameter which was not differentially encoded in the considered preceding frame, a differential parameter value based on a difference between the frame parameter elevation value and a preceding frame parameter elevation value and a difference between the frame parameter azimuth value and a preceding frame parameter azimuth value; generating for each of the plurality of audio direction parameters a difference parameter value based on a difference between the audio direction parameter and a rotated derived audio direction parameter; quantizing the difference between the audio direction parameter and a rotated derived audio direction parameter and the differential parameter value; and selecting for each of the plurality of audio direction parameters, either of the quantized difference or differential parameter value.
-
17.
公开(公告)号:US20220293115A1
公开(公告)日:2022-09-15
申请号:US17831080
申请日:2022-06-02
Applicant: DOLBY INTERNATIONAL AB
Inventor: Lars Villemoes , Heiko Purnhagen , Per Ekstrand
IPC: G10L19/16 , G10L19/035 , G10L19/24 , G10L21/038
Abstract: Embodiments relate to an audio processing unit that includes a buffer, bitstream payload deformatter, and a decoding subsystem. The buffer stores at least one block of an encoded audio bitstream. The block includes a fill element that begins with an identifier followed by fill data. The fill data includes at least one flag identifying whether enhanced spectral band replication (eSBR) processing is to be performed on audio content of the block. A corresponding method for decoding an encoded audio bitstream is also provided.
-
公开(公告)号:US20220279299A1
公开(公告)日:2022-09-01
申请号:US17628792
申请日:2020-06-15
Applicant: Nokia Technologies Oy
Inventor: Adriana VASILACHE
IPC: H04S7/00 , G10L19/035 , G10L19/008 , H04S3/00
Abstract: There is disclosed inter alia an apparatus for spatial audio signal encoding configured to derive for each of a plurality of audio direction parameters a corresponding derived audio direction parameter comprising an elevation value and an azimuth value. Each derived audio direction parameter is rotated by the azimuth value of an audio direction parameter in the first position of the plurality of audio direction parameters. The position of some of the audio direction parameters are changed followed by determining for each of the plurality audio direction parameters a difference between each audio direction parameter and a corresponding rotated derived audio direction parameter. The difference for each of the plurality of audio direction parameters is then quantised.
-
公开(公告)号:US11355129B2
公开(公告)日:2022-06-07
申请号:US17111893
申请日:2020-12-04
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Ki-hyun Choo , Eun-mi Oh
IPC: G10L19/00 , G10L19/032 , G10L21/00 , G10L19/035 , G10L19/02
Abstract: A lossless encoding method is provided that includes determining a lossless encoding mode of a quantization coefficient as one of an infinite-range lossless encoding mode and a finite-range lossless encoding mode; encoding the quantization coefficient in the infinite-range lossless encoding mode in correspondence with a result of the lossless encoding mode determination; and encoding the quantization coefficient in the finite-range lossless encoding mode in correspondence with a result of the lossless encoding mode determination.
-
公开(公告)号:US20220130402A1
公开(公告)日:2022-04-28
申请号:US17573360
申请日:2022-01-11
Inventor: Srikanth Nagisetty , Zong Xian Liu , Hiroyuki Ehara
IPC: G10L19/02 , G10L19/028 , G10L19/035
Abstract: An encoding device according to the disclosure includes a first encoder, which in operation, encodes a low-band signal from a voice or audio input signal to generate a first encoded signal; a decoder, which in operation, decodes the first encoded signal to generate a low-band decoded signal; a second encoder, which in operation, encodes, on the basis of the low-band decoded signal, a high-band signal comprising a band from the voice or audio input signal, the band being higher than that of the low-band signal to generate a high-band encoded signal; an energy calculator, which in operation, calculates an energy of the voice or audio input signal for each subband of a plurality of subbands of the voice or audio input signal to acquire a calculated energy for each subband of the plurality of subbands of the voice or audio input signal, quantizes the calculated energy for each subband of the plurality of subbands of the voice or audio input signal to acquire a quantized band energy for each subband of the plurality of subbands of the voice or audio input signal and outputs the quantized band energy for each subband of the plurality of subbands of the voice or audio input signal; and a multiplexer, which in operation, multiplexes the quantized band energy for each subband of the plurality of subbands of the voice or audio input signal, the first encoded signal, and the high-band encoded signal to generate and output an encoded signal.
-
-
-
-
-
-
-
-
-