Patent search ipc:"G10L19/035" Page 2

11.

发明公开
AUDIO TRANSMITTER PROCESSOR, AUDIO RECEIVER PROCESSOR AND RELATED METHODS AND COMPUTER PROGRAMS 审中-公开

公开(公告)号：US20230230606A1

公开(公告)日：2023-07-20

申请号：US18188883

申请日：2023-03-23

Applicant: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventor： Adrian TOMASEK , Ralph SPERSCHNEIDER , Jan BÜTHE , Alexander TSCHEKALINSKIJ , Manfred LUTZKY

IPC: G10L19/035 , H04B17/309 , G10L21/0324 , G10L19/022 , H03M13/07 , H03M13/15 , H04L1/00

CPC classification number: G10L19/035 , H04B17/309 , G10L21/0324 , G10L19/022 , H03M13/07 , H03M13/1515 , H04L1/0009 , H04L1/0032 , H04L1/0046 , H04L1/0042 , H04L1/0045 , H04L1/0084

Abstract: An audio transmitter processor for generating an error protected frame using encoded audio data of an audio frame, the encoded audio data for the audio frame having a first amount of information units and a second amount of information units, has: a frame builder for building a codeword frame having a codeword raster, wherein the frame builder is configured to determine a border between a first amount of information units and a second amount of information units so that a starting information unit of the second amount of information units coincides with a codeword border; and an error protection coder to obtain a plurality of processed codewords representing the error protected frame.

12.

发明授权
Low-complexity tonality-adaptive audio signal quantization 有权

公开(公告)号：US11694701B2

公开(公告)日：2023-07-04

申请号：US17396526

申请日：2021-08-06

Applicant: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventor： Martin Dietz , Guillaume Fuchs , Christian Helmrich , Goran Markovic

IPC: G10L25/00 , G10L19/00 , G10L19/035 , G10L19/02 , G10H1/06 , G10L25/18 , G10L25/21 , G10L25/45 , G10L25/03

CPC classification number: G10L19/035 , G10H1/06 , G10L19/0212 , G10L25/18 , G10L25/21 , G10L25/45 , G10H2210/561 , G10L25/03

Abstract: The invention provides an audio encoder for encoding an audio signal so as to produce therefrom an encoded signal, the audio encoder including: a framing device configured to extract frames from the audio signal; a quantizer configured to map spectral lines of a spectrum signal derived from the frame of the audio signal to quantization indices, wherein the quantizer has a dead-zone, in which the input spectral lines are mapped to quantization index zero; and a control device configured to modify the dead-zone; wherein the control device includes a tonality calculating device configured to calculate at least one tonality indicating value for at least one spectrum line or for at least one group of spectral lines, wherein the control device is configured to modify the dead-zone for the at least one spectrum line or the at least one group of spectrum lines depending on the respective tonality indicating value.

13.

发明授权
Signal encoding method and apparatus and signal decoding method and apparatus 有权

公开(公告)号：US11616954B2

公开(公告)日：2023-03-28

申请号：US17030466

申请日：2020-09-24

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor： Ho-sang Sung , Ki-hyun Choo , Eun-mi Oh

IPC: G10L19/032 , H04N19/124 , H04N19/196 , H04N19/40 , G10L19/16 , G10L19/24 , H03M13/05 , H03M13/15 , H03M13/31 , G10L19/035 , G10L19/038 , G10L19/02

Abstract: A spectrum coding method includes quantizing spectral data of a current band based on a first quantization scheme, generating a lower bit of the current band using the spectral data and the quantized spectral data, quantizing a sequence of lower bits including the lower bit of the current band based on a second quantization scheme, and generating a bitstream based on a upper bit excluding N bits, where N is 1 or greater, from the quantized spectral data and the quantized sequence of lower bits.

14.

发明授权
Method and apparatus for encoding and decoding audio signal to reduce quantization noise 有权

公开(公告)号：US11580999B2

公开(公告)日：2023-02-14

申请号：US17331416

申请日：2021-05-26

Applicant: Electronics and Telecommunications Research Institute

Inventor： Seung Kwon Beack , Jongmo Sung , Mi Suk Lee , Tae Jin Lee , Woo-taek Lim , Inseon Jang

IPC: G10L19/022 , G10L19/06 , G10L19/16 , G10L19/035

Abstract: An audio signal encoding method performed by an encoder includes identifying an audio signal of a time domain in units of a block, generating a combined block by combining i) a current original block of the audio signal and ii) a previous original block chronologically adjacent to the current original block, extracting a first residual signal of a frequency domain from the combined block using linear predictive coding of a time domain, overlapping chronologically adjacent first residual signals among first residual signals converted into a time domain, and quantizing a second residual signal of a time domain extracted from the overlapped first residual signal by converting the second residual signal of the time domain into a frequency domain using linear predictive coding of a frequency domain.

15.

发明授权
Self-supervised audio representation learning for mobile devices 有权

公开(公告)号：US11501787B2

公开(公告)日：2022-11-15

申请号：US16548146

申请日：2019-08-22

Applicant: Google LLC

Inventor： Beat Gfeller , Dominik Roblek , Félix de Chaumont Quitry , Marco Tagliasacchi

IPC: G10L19/035 , G06N20/00 , G10L19/038 , G10L25/18

Abstract: Systems and methods for training a machine-learned model are provided. A method can include can include obtaining an unlabeled audio signal, sampling the unlabeled audio signal to select one or more sampled slices, inputting the one or more sampled slices into a machine-learned model, receiving, as an output of the machine-learned model, one or more determined characteristics associated with the audio signal, determining a loss function for the machine-learned model based at least in part on a difference between the one or more determined characteristics and one or more corresponding ground truth characteristics of the audio signal, and training the machine-learned model from end to end based at least in part on the loss function. The one or more determined characteristics can include one or more reconstructed portions of the audio signal temporally adjacent to the one or more sampled slices or an estimated distance between two sampled slices.

16.

发明申请
QUANTIZATION OF SPATIAL AUDIO DIRECTION PARAMETERS 有权

公开(公告)号：US20220335956A1

公开(公告)日：2022-10-20

申请号：US17635593

申请日：2020-07-27

Applicant: Nokia Technologies Oy

Inventor： Adriana VASILACHE

IPC: G10L19/008 , G10L19/035

Abstract: A method for spatial audio signal encoding comprising: obtaining, for a first frame, a plurality of audio direction parameters, wherein each parameter comprises an elevation value and an azimuth value and wherein each parameter has an ordered position; determining whether, for a preceding frame, any of the plurality of audio direction parameters was differentially encoded based on a difference between the preceding frame parameter elevation value and a further preceding frame parameter elevation value and the preceding frame parameter azimuth value and a further preceding frame parameter azimuth value; generating, for any audio direction parameter which was not differentially encoded in the considered preceding frame, a differential parameter value based on a difference between the frame parameter elevation value and a preceding frame parameter elevation value and a difference between the frame parameter azimuth value and a preceding frame parameter azimuth value; generating for each of the plurality of audio direction parameters a difference parameter value based on a difference between the audio direction parameter and a rotated derived audio direction parameter; quantizing the difference between the audio direction parameter and a rotated derived audio direction parameter and the differential parameter value; and selecting for each of the plurality of audio direction parameters, either of the quantized difference or differential parameter value.

17.

发明申请
DECODING AUDIO BITSTREAMS WITH ENHANCED SPECTRAL BAND REPLICATION METADATA IN AT LEAST ONE FILL ELEMENT 有权

公开(公告)号：US20220293115A1

公开(公告)日：2022-09-15

申请号：US17831080

申请日：2022-06-02

Applicant: DOLBY INTERNATIONAL AB

Inventor： Lars Villemoes , Heiko Purnhagen , Per Ekstrand

IPC: G10L19/16 , G10L19/035 , G10L19/24 , G10L21/038

Abstract: Embodiments relate to an audio processing unit that includes a buffer, bitstream payload deformatter, and a decoding subsystem. The buffer stores at least one block of an encoded audio bitstream. The block includes a fill element that begins with an identifier followed by fill data. The fill data includes at least one flag identifying whether enhanced spectral band replication (eSBR) processing is to be performed on audio content of the block. A corresponding method for decoding an encoded audio bitstream is also provided.

18.

发明申请
QUANTIZATION OF SPATIAL AUDIO DIRECTION PARAMETERS 有权

公开(公告)号：US20220279299A1

公开(公告)日：2022-09-01

申请号：US17628792

申请日：2020-06-15

Applicant: Nokia Technologies Oy

Inventor： Adriana VASILACHE

IPC: H04S7/00 , G10L19/035 , G10L19/008 , H04S3/00

Abstract: There is disclosed inter alia an apparatus for spatial audio signal encoding configured to derive for each of a plurality of audio direction parameters a corresponding derived audio direction parameter comprising an elevation value and an azimuth value. Each derived audio direction parameter is rotated by the azimuth value of an audio direction parameter in the first position of the plurality of audio direction parameters. The position of some of the audio direction parameters are changed followed by determining for each of the plurality audio direction parameters a difference between each audio direction parameter and a corresponding rotated derived audio direction parameter. The difference for each of the plurality of audio direction parameters is then quantised.

19.

发明授权
Energy lossless-encoding method and apparatus, audio encoding method and apparatus, energy lossless-decoding method and apparatus, and audio decoding method and apparatus 有权

公开(公告)号：US11355129B2

公开(公告)日：2022-06-07

申请号：US17111893

申请日：2020-12-04

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor： Ki-hyun Choo , Eun-mi Oh

IPC: G10L19/00 , G10L19/032 , G10L21/00 , G10L19/035 , G10L19/02

Abstract: A lossless encoding method is provided that includes determining a lossless encoding mode of a quantization coefficient as one of an infinite-range lossless encoding mode and a finite-range lossless encoding mode; encoding the quantization coefficient in the infinite-range lossless encoding mode in correspondence with a result of the lossless encoding mode determination; and encoding the quantization coefficient in the finite-range lossless encoding mode in correspondence with a result of the lossless encoding mode determination.

20.

发明申请
ENCODING DEVICE, DECODING DEVICE, ENCODING METHOD, DECODING METHOD, AND NON-TRANSITORY COMPUTER-READABLE RECORDING MEDIUM 有权

公开(公告)号：US20220130402A1

公开(公告)日：2022-04-28

申请号：US17573360

申请日：2022-01-11

Applicant: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventor： Srikanth Nagisetty , Zong Xian Liu , Hiroyuki Ehara

IPC: G10L19/02 , G10L19/028 , G10L19/035

Abstract: An encoding device according to the disclosure includes a first encoder, which in operation, encodes a low-band signal from a voice or audio input signal to generate a first encoded signal; a decoder, which in operation, decodes the first encoded signal to generate a low-band decoded signal; a second encoder, which in operation, encodes, on the basis of the low-band decoded signal, a high-band signal comprising a band from the voice or audio input signal, the band being higher than that of the low-band signal to generate a high-band encoded signal; an energy calculator, which in operation, calculates an energy of the voice or audio input signal for each subband of a plurality of subbands of the voice or audio input signal to acquire a calculated energy for each subband of the plurality of subbands of the voice or audio input signal, quantizes the calculated energy for each subband of the plurality of subbands of the voice or audio input signal to acquire a quantized band energy for each subband of the plurality of subbands of the voice or audio input signal and outputs the quantized band energy for each subband of the plurality of subbands of the voice or audio input signal; and a multiplexer, which in operation, multiplexes the quantized band energy for each subband of the plurality of subbands of the voice or audio input signal, the first encoded signal, and the high-band encoded signal to generate and output an encoded signal.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification