Patent search: cpc:"G10L19/008", page 1

1.

Published application
Creating Spatial Audio Stream from Audio Objects with Spatial Extent (Under examination, published)

Publication No.: US20240355341A1

Publication date: 2024-10-24

Application No.: US18574918

Filing date: 2022-06-16

Applicant: Nokia Technologies Oy

Inventors: Mikko-Ville LAITINEN, Tapani PIHLAJAKUJA

IPC classification: G10L19/008

CPC classification: G10L19/008

Abstract: An apparatus for spatial audio encoding including circuitry configured to: obtain a first spatial audio stream of a first spatial audio format configured to be encoded with a low bitrate, wherein the first spatial audio stream includes an audio signal and a first metadata; obtain a second and different spatial audio stream of a second spatial audio format, wherein the second spatial audio stream includes a second audio signal and a second metadata; convert the second spatial audio format into the first spatial audio format to encode a converted second spatial audio stream with the low bitrate, wherein the converted spatial audio stream represents spatial audio properties of the second spatial audio stream; combine the first spatial audio stream and the converted second spatial audio stream to generate a combined spatial audio stream; and encode the combined spatial audio stream.

2.

Granted patent
Methods, apparatus and systems for 6DOF audio rendering and data representations and bitstream structures for 6DOF audio rendering (In force)

Publication No.: US12126985B2

Publication date: 2024-10-22

Application No.: US17896005

Filing date: 2022-08-25

Applicant: DOLBY INTERNATIONAL AB

Inventors: Leon Terentiv, Christof Fersch, Daniel Fischer

IPC classification: H04S7/00, G10L19/008, G10L19/16, H04S3/00

CPC classification: H04S7/303, G10L19/008, G10L19/167, H04S3/008, H04S2400/01, H04S2400/11

Abstract: The present disclosure relates to methods, apparatus and systems for encoding an audio signal into a bitstream, in particular at an encoder, comprising: encoding or including audio signal data associated with 3DoF audio rendering into one or more first bitstream parts of the bitstream, and encoding or including metadata associated with 6DoF audio rendering into one or more second bitstream parts of the bitstream. The present disclosure further relates to methods, apparatus and systems for decoding an audio signal and audio rendering based on the bitstream.

3.

Granted patent
Method and system for decoding left and right channels of a stereo sound signal (In force)

Publication No.: US12125492B2

Publication date: 2024-10-22

Application No.: US17071299

Filing date: 2020-10-15

Applicant: VOICEAGE CORPORATION

Inventors: Tommy Vaillancourt, Milan Jelinek

IPC classification: G10L19/008

CPC classification: G10L19/008

Abstract: A stereo sound decoding method and system decode left and right channels of a stereo sound signal, using received encoding parameters comprising encoding parameters of a primary channel, encoding parameters of a secondary channel, and a factor β. The primary channel encoding parameters comprise LP filter coefficients of the primary channel. The primary channel is decoded in response to the primary channel encoding parameters. The secondary channel is decoded using one of a plurality of coding models, wherein at least one of the coding models uses the primary channel LP filter coefficients to decode the secondary channel. The decoded primary and secondary channels are time domain up-mixed using the factor β to produce the decoded left and right channels of the stereo sound signal, wherein the factor β determines respective contributions of the primary and secondary channels upon production of the left and right channels.
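
Illustrative sketch (not from the patent text): the abstract says the factor β sets the contributions of the primary and secondary channels during the time-domain up-mix, but it does not give the mixing equations. The Python below assumes one possible convention, primary = β·L + (1−β)·R and secondary = (1−β)·L − β·R, and inverts it in closed form. The function names `downmix` and `upmix` are hypothetical.

```python
def downmix(left, right, beta):
    """ASSUMED encoder-side mixing; the abstract does not state this formula."""
    primary = [beta * l + (1.0 - beta) * r for l, r in zip(left, right)]
    secondary = [(1.0 - beta) * l - beta * r for l, r in zip(left, right)]
    return primary, secondary


def upmix(primary, secondary, beta):
    """Invert the assumed mixing; beta weights each decoded channel's contribution."""
    # beta^2 + (1 - beta)^2 is at least 0.5 for any real beta, so it is never zero.
    norm = beta * beta + (1.0 - beta) ** 2
    left = [(beta * p + (1.0 - beta) * s) / norm
            for p, s in zip(primary, secondary)]
    right = [((1.0 - beta) * p - beta * s) / norm
             for p, s in zip(primary, secondary)]
    return left, right
```

Under this assumed convention, β = 0.5 reduces to a scaled mid/side pair, and the up-mix recovers the original channels exactly when the decoded channels are lossless.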

4.

Granted patent
Acoustic reproduction method, acoustic reproduction device, and recording medium (In force)

Publication No.: US12120500B2

Publication date: 2024-10-15

Application No.: US17939114

Filing date: 2022-09-07

Applicant: Panasonic Intellectual Property Corporation of America

Inventors: Seigo Enomoto, Tomokazu Ishikawa

IPC classification: H04R5/02, G10L19/008, H04S3/00, H04S7/00

CPC classification: H04S7/304, G10L19/008, H04S3/008, H04S2400/01, H04S2400/11, H04S2400/15, H04S2420/01

Abstract: An acoustic reproduction method includes: localizing a first sound image at a first position in a target space in which a user is present; and localizing, at a second position in the target space, a second sound image that represents an anchor sound for indicating a reference position.

5.

Granted patent
Sound signal downmixing method, sound signal coding method, sound signal downmixing apparatus, sound signal coding apparatus, program and recording medium (In force)

Publication No.: US12119009B2

Publication date: 2024-10-15

Application No.: US17909698

Filing date: 2021-02-08

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Ryosuke Sugiura, Takehiro Moriya, Yutaka Kamamoto

IPC classification: G10L19/008, G10L19/24, H04S1/00, H04S7/00

CPC classification: G10L19/008, G10L19/24, H04S1/007, H04S7/30, H04S2400/03

Abstract: A sound signal downmix method includes an inter-channel relationship information obtaining step of obtaining an inter-channel correlation value and an inter-channel time difference in an approximate manner, and a downmix step of obtaining a downmix signal based on the obtained information. In the inter-channel relationship information obtaining step, multiple channel signals are sorted such that signals of adjacent channels are similar to each other, the inter-channel correlation value and the inter-channel time difference are determined only between adjacent channels after the sorting, the inter-channel correlation value between non-adjacent channels is obtained by determining a value that has a monotonically non-decreasing relationship with the inter-channel correlation between the adjacent channels, and the inter-channel time difference between non-adjacent channels is obtained by adding up the inter-channel time differences of adjacent channels.
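
Illustrative sketch (not from the patent text): the abstract describes deriving values for non-adjacent channel pairs from values measured only between adjacent sorted channels. Summing the adjacent time differences follows the abstract directly. For the correlation, the abstract only requires a monotonically non-decreasing relationship; the product used below is an assumption that satisfies it when all correlations lie in [0, 1]. The function name `pairwise_relationships` is hypothetical.

```python
def pairwise_relationships(adjacent_itd, adjacent_corr):
    """Derive time difference and correlation for every sorted channel pair (i, j), i < j.

    adjacent_itd[k] and adjacent_corr[k] describe sorted channels k and k + 1.
    Correlations are assumed to lie in [0, 1].
    """
    n = len(adjacent_itd) + 1
    itd, corr = {}, {}
    for i in range(n - 1):
        total_itd, running_corr = 0, 1.0
        for j in range(i + 1, n):
            # Per the abstract: add up the adjacent inter-channel time differences.
            total_itd += adjacent_itd[j - 1]
            # ASSUMPTION: a product is one monotonically non-decreasing combiner.
            running_corr *= adjacent_corr[j - 1]
            itd[(i, j)] = total_itd
            corr[(i, j)] = running_corr
    return itd, corr
```

With N channels this needs only N − 1 measured pairs instead of N(N − 1)/2, which is the approximation the abstract refers to.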

6.

Granted patent
Apparatus and method for encoding or decoding directional audio coding parameters using quantization and entropy coding (In force)

Publication No.: US12106763B2

Publication date: 2024-10-01

Application No.: US17571970

Filing date: 2022-01-10

Applicant: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

Inventors: Guillaume Fuchs, Jürgen Herre, Fabian Küch, Stefan Döhla, Markus Multrus, Oliver Thiergart, Oliver Wübbolt, Florin Ghido, Stefan Bayer, Wolfgang Jaegers

IPC classification: G10L19/008, G10L19/02, G10L19/032, G10L19/038, G10L19/16, G10L19/26, H03M7/30

CPC classification: G10L19/008, G10L19/0204, G10L19/032, G10L19/038, G10L19/167, G10L19/26, H03M7/3082, H03M7/6005, H03M7/6011

Abstract: An apparatus for encoding directional audio coding parameters comprising diffuseness parameters and direction parameters having a parameter calculator (100) for calculating the diffuseness parameters with a first time or frequency resolution and for calculating the direction parameters with a second time or frequency resolution; and a quantizer and encoder processor (200) for generating a quantized and encoded representation of the diffuseness parameters and the direction parameters.

7.

Published application
Multi-Channel Speech Compression System and Method (Under examination, published)

Publication No.: US20240323630A1

Publication date: 2024-09-26

Application No.: US18676347

Filing date: 2024-05-28

Applicant: Microsoft Technology Licensing, LLC

Inventors: Dushyant Sharma, Patrick A. Naylor, Uwe Helmut Jost

IPC classification: H04S7/00, G06T7/70, G10L15/06, G10L15/22, G10L19/00, G10L19/008, G10L19/16, G10L21/0208, G10L21/0216, H04R1/40, H04R3/00, H04R5/027, H04S3/00

CPC classification: H04S7/30, G06T7/70, G10L15/063, G10L15/22, G10L19/008, G10L19/167, G10L21/0208, H04R1/406, H04R3/005, H04R5/027, H04S3/008, G10L2019/0001, G10L2019/0002, G10L2021/02166, H04R2201/401, H04S2400/01, H04S2400/15

Abstract: A method, computer program product, and computing system for encoding audio encounter information of a reference audio acquisition device of a plurality of audio acquisition devices of an audio recording system, thus defining encoded reference audio encounter information. Location information may be estimated, via a machine vision system, for an acoustic source within an acoustic environment. One or more acoustic relative transfer functions may be selected from a plurality of acoustic relative transfer functions for the plurality of audio acquisition devices of the audio recording system based upon, at least in part, the location information. The encoded reference audio encounter information and a representation of the selected one or more acoustic relative transfer function may be transmitted.

8.

Published application
METHOD AND DEVICE FOR UNIFIED TIME-DOMAIN / FREQUENCY DOMAIN CODING OF A SOUND SIGNAL (Under examination, published)

Publication No.: US20240321285A1

Publication date: 2024-09-26

Application No.: US18259971

Filing date: 2022-01-05

Applicant: VOICEAGE CORPORATION

Inventors: Tommy VAILLANCOURT, Vladimir MALENOVSKY

IPC classification: G10L19/12, G10L19/008, G10L19/22, G10L21/0232

CPC classification: G10L19/12, G10L19/008, G10L19/22, G10L21/0232

Abstract: A unified time-domain/frequency-domain coding method and device for coding an input sound signal comprise a classifier of the input sound signal into one of a plurality of sound signal categories comprising an unclear signal type category showing that the nature of the input sound signal is unclear. One of a plurality of coding sub-modes is selected for coding the input sound signal if the input sound signal is classified in the unclear signal type category. A mixed time-domain/frequency-domain encoder codes the input sound signal using the selected coding sub-mode. The mixed time-domain/frequency-domain encoder comprises a selector of frequency bands and allocator of bits for selecting frequency bands to quantize and for distributing a bit budget available to quantization between the selected frequency bands. Corresponding sound signal decoder and decoding method are also provided.

9.

Granted patent
Sound signal downmixing method, sound signal coding method, sound signal downmixing apparatus, sound signal coding apparatus, program and recording medium (In force)

Publication No.: US12100403B2

Publication date: 2024-09-24

Application No.: US17909666

Filing date: 2020-11-04

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Ryosuke Sugiura, Takehiro Moriya, Yutaka Kamamoto

IPC classification: G10L19/008, G10L19/24, H04S1/00, H04S7/00

CPC classification: G10L19/008, G10L19/24, H04S1/007, H04S7/30, H04S2400/03

Abstract: A sound signal downmix device for obtaining a downmix signal that is a signal obtained by mixing a left channel input sound signal and a right channel input sound signal includes a left-right relationship information acquisition unit 185 that obtains preceding channel information that is information indicating which of the left channel input sound signal and the right channel input sound signal is preceding and a left-right correlation coefficient that is a correlation coefficient between the left channel input sound signal and the right channel input sound signal and a downmix unit 112 that obtains the downmix signal by weighted averaging the left channel input sound signal and the right channel input sound signal to include a larger amount of an input sound signal of a preceding channel among the left channel input sound signal and the right channel input sound signal as the left-right correlation coefficient is greater, based on the preceding channel information and the left-right correlation coefficient.
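
Illustrative sketch (not from the patent text): the abstract describes a weighted average that includes more of the preceding channel as the left-right correlation coefficient grows, without stating the weights. The Python below assumes weights of (1 + c)/2 for the preceding channel and (1 − c)/2 for the other, with the correlation coefficient c in [0, 1]. The function name `downmix_preceding` and its parameters are hypothetical.

```python
def downmix_preceding(left, right, left_precedes, corr):
    """Weighted average that favours the preceding channel as corr grows.

    ASSUMPTION: weights (1 + corr) / 2 and (1 - corr) / 2, with corr in [0, 1];
    the abstract does not specify the weighting function.
    """
    w_lead = (1.0 + corr) / 2.0   # weight of the preceding channel
    w_lag = 1.0 - w_lead          # weight of the following channel
    if left_precedes:
        w_left, w_right = w_lead, w_lag
    else:
        w_left, w_right = w_lag, w_lead
    return [w_left * l + w_right * r for l, r in zip(left, right)]
```

With these assumed weights, uncorrelated inputs (c = 0) give a plain average, and fully correlated inputs (c = 1) pass only the preceding channel, which avoids the comb filtering that averaging two time-shifted copies would cause.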

10.

Granted patent
Apparatus and method for downmixing or upmixing a multichannel signal using phase compensation (In force)

Publication No.: US12100402B2

Publication date: 2024-09-24

Application No.: US17824297

Filing date: 2022-05-25

Applicant: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V.

Inventors: Jan Buethe, Guillaume Fuchs, Wolfgang Jagers, Franz Reutelhuber, Juergen Herre, Eleni Fotopoulou, Markus Multrus, Srikanth Korse

IPC classification: G10L19/008, H04S1/00, H04S3/00, H04S3/02, H04S7/00

CPC classification: G10L19/008, H04S1/007, H04S3/008, H04S7/30, H04S2400/01, H04S2420/03

Abstract: An apparatus for downmixing a multi-channel signal having at least two channels, has: a downmixer for calculating a downmix signal from the multi-channel signal, wherein the downmixer is configured to calculate the downmix using an absolute phase compensation, so that a channel having a lower energy among the at least two channels is only rotated or is rotated stronger than a channel having a greater energy in calculating the downmix signal; and an output interface for generating an output signal, the output signal having information on the downmix signal.
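
Illustrative sketch (not from the patent text): the abstract describes a downmix in which the lower-energy channel is the one rotated in phase. The Python below shows the simplest case, where only the weaker channel is rotated, for one band of complex spectral bins. Estimating the rotation from the phase of the summed cross-spectrum and averaging the aligned channels are assumptions, and the function name `phase_compensated_downmix` is hypothetical.

```python
import cmath


def phase_compensated_downmix(left, right):
    """Downmix one band of complex spectral bins, rotating only the weaker channel."""
    energy_l = sum(abs(x) ** 2 for x in left)
    energy_r = sum(abs(x) ** 2 for x in right)
    # ASSUMPTION: one inter-channel phase difference per band, taken from the
    # summed cross-spectrum (phase of left relative to right).
    cross = sum(l * r.conjugate() for l, r in zip(left, right))
    ipd = cmath.phase(cross) if cross != 0 else 0.0
    if energy_r <= energy_l:
        # Right is weaker: rotate it onto the phase of left.
        right = [r * cmath.exp(1j * ipd) for r in right]
    else:
        # Left is weaker: rotate it onto the phase of right.
        left = [l * cmath.exp(-1j * ipd) for l in left]
    return [0.5 * (l + r) for l, r in zip(left, right)]
```

For out-of-phase inputs such as `[2+0j]` and `[-1+0j]`, a plain average would partly cancel, while this sketch aligns the weaker channel first so the energies add constructively.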
