Methods and Apparatus for Rendering Audio Objects

    公开(公告)号:US20240334145A1

    公开(公告)日:2024-10-03

    申请号:US18623762

    申请日:2024-04-01

    Abstract: Multiple virtual source locations may be defined for a volume within which audio objects can move. A set-up process for rendering audio data may involve receiving reproduction speaker location data and pre-computing gain values for each of the virtual sources according to the reproduction speaker location data and each virtual source location. The gain values may be stored and used during “run time,” during which audio reproduction data are rendered for the speakers of the reproduction environment. During run time, for each audio object, contributions from virtual source locations within an area or volume defined by the audio object position data and the audio object size data may be computed. A set of gain values for each output channel of the reproduction environment may be computed based, at least in part, on the computed contributions. Each output channel may correspond to at least one reproduction speaker of the reproduction environment.

    SYSTEMS AND METHODS FOR DETERMINING AUDIO CHANNELS IN AUDIO DATA

    公开(公告)号:US20240196148A1

    公开(公告)日:2024-06-13

    申请号:US18080663

    申请日:2022-12-13

    Inventor: Harvey Landy

    CPC classification number: H04S5/005 H04R5/04

    Abstract: The current embodiments relate to an audio processing system that may determine the identity or type of audio channel of audio channels present in audio data. For instance, the audio processing system may include one or more processors that receive audio data that includes a plurality of audio channels, determine a respective type of audio channel for each respective audio channel of the plurality of audio channels, and generate characterized audio data indicative of the respective type of audio channel for each respective audio channel of the plurality of audio channels.

    Video-informed Spatial Audio Expansion
    4.
    发明公开

    公开(公告)号:US20230305800A1

    公开(公告)日:2023-09-28

    申请号:US18327134

    申请日:2023-06-01

    Applicant: GOOGLE LLC

    CPC classification number: G06F3/165 G10L25/51 G06V20/41 H04S5/005

    Abstract: First video frames that include a visual object and a non-spatialized first audio segment that includes an auditory event are received. If that second video frames do not include the visual object and a first time difference between the first video frames and the second video frames does not exceed a certain time, a motion vector of the visual object is used to assign a spatial location to the auditory event in at least one of the second video frames. A second audio segment that includes the auditory event and third video frames are received. If the third video frames do not include the visual object and a second time difference between the first video frames and the third video frames exceeds the certain time, the auditory event is assigned to a diffuse sound field. An audio output that conveys spatial locations of the visual object is output.

    Methods and Apparatus for Rendering Audio Objects

    公开(公告)号:US20230269551A1

    公开(公告)日:2023-08-24

    申请号:US18099658

    申请日:2023-01-20

    Abstract: Multiple virtual source locations may be defined for a volume within which audio objects can move. A set-up process for rendering audio data may involve receiving reproduction speaker location data and pre-computing gain values for each of the virtual sources according to the reproduction speaker location data and each virtual source location. The gain values may be stored and used during “run time,” during which audio reproduction data are rendered for the speakers of the reproduction environment. During run time, for each audio object, contributions from virtual source locations within an area or volume defined by the audio object position data and the audio object size data may be computed. A set of gain values for each output channel of the reproduction environment may be computed based, at least in part, on the computed contributions. Each output channel may correspond to at least one reproduction speaker of the reproduction environment.

    Systems and methods for providing augmented audio

    公开(公告)号:US11700497B2

    公开(公告)日:2023-07-11

    申请号:US17085574

    申请日:2020-10-30

    CPC classification number: H04S5/005 H04R3/12

    Abstract: A system for providing augmented spatialized audio in a vehicle, including a plurality of speakers disposed in a perimeter of a cabin of the vehicle; and a controller configured to receive a position signal indicative of the position of a first user's head in the vehicle and to output to a first binaural device, according to the first position signal, a first spatial audio signal, such that the first binaural device produces a first spatial acoustic signal perceived by the first user as originating from a first virtual source location within the vehicle cabin, wherein the first spatial audio signal comprises at least an upper range of a first content signal, wherein the controller is further configured to drive the plurality of speakers with a driving signal such that a first bass content of the first content signal is produced in the vehicle cabin.

    APPARATUS AND METHOD FOR LOW DELAY OBJECT METADATA CODING

    公开(公告)号:US20190222949A1

    公开(公告)日:2019-07-18

    申请号:US16360776

    申请日:2019-03-21

    Abstract: An apparatus for generating one or more audio channels is provided. The apparatus comprises a metadata decoder for generating one or more reconstructed metadata signals from one or more processed metadata signals depending on a control signal, wherein each of the one or more reconstructed metadata signals indicates information associated with an audio object signal of one or more audio object signals, wherein the metadata decoder is configured to generate the one or more reconstructed metadata signals by determining a plurality of reconstructed metadata samples for each of the one or more reconstructed metadata signals. The apparatus comprises an audio channel generator for generating the one or more audio channels depending on the one or more audio object signals and depending on the one or more reconstructed metadata signals. The metadata decoder is configured to receive a plurality of processed metadata samples of each of the one or more processed metadata signals. The metadata decoder is configured to receive the control signal.

    SOUND PROCESSING APPARATUS AND METHOD, AND PROGRAM

    公开(公告)号:US20190149935A1

    公开(公告)日:2019-05-16

    申请号:US16248739

    申请日:2019-01-15

    CPC classification number: H04S5/005 H03G3/301 H04S7/30 H04S7/302 H04S2400/11

    Abstract: The present technology relates to a sound processing apparatus and method, and a program for enabling more stable localization of a sound image.A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker. The values obtained by multiplying these gains by the gain of the virtual speaker are set as the gains of the lower right and lower left speakers for fixing a sound image at the target sound image position. The present technology can be applied to sound processing apparatuses.

Patent Agency Ranking