APPARATUS AND METHOD FOR IMPLEMENTING VERSATILE AUDIO OBJECT RENDERING

    公开(公告)号:US20240365077A1

    公开(公告)日:2024-10-31

    申请号:US18764318

    申请日:2024-07-04

    CPC classification number: H04S7/00 H04S2400/11

    Abstract: An apparatus for rendering according to an embodiment is provided. The apparatus is configured to generate an audio output signal for a loudspeaker of a loudspeaker setup from one or more audio objects. Each of the one or more audio objects comprises an audio object signal and exhibits a position. The apparatus comprises an interface configured to receive information on the position of each of the one or more audio objects. Moreover, the apparatus comprises a gain determiner configured to determine gain information for each audio object of the one or more audio objects for the loudspeaker depending on a distance between the position of said audio object and a position of the loudspeaker and depending on distance attenuation information and/or loudspeaker emphasis information. Furthermore, the apparatus comprises a signal processor configured to generate an audio output signal for the loudspeaker depending on the audio object signal of each of the one or more audio objects and depending on the gain information for each of the one or more audio objects for the loudspeaker.

    METHOD AND SYSTEM FOR DETERMINING INDIVIDUALIZED HEAD RELATED TRANSFER FUNCTIONS

    公开(公告)号:US20240349001A1

    公开(公告)日:2024-10-17

    申请号:US18580344

    申请日:2022-07-18

    Abstract: There is provided a system and method for determining individualized head related transfer functions (HRTF) for a user. The method including: receiving measurement data from the user, the measurement data generated by repeatedly emitting an audible reference sound at positions in space around the user and, during each emission, recording sounds received near each ear of the user, the measurement data including, for each emission, the recorded sounds and positional information of the emission; determining the individualized HRTF by updating a decoder of a trained generative artificial neural network model, the decoder receives the measurement data as input, the trained generative artificial neural network model including an encoder and the decoder, the generative artificial neural network model is trained using data gathered from a plurality of test subjects with known spectral representations and directions for associated HRTFs at different positions in space; and outputting the individualized HRTF.

    Minimizing Echo Caused by Stereo Audio Via Position-Sensitive Acoustic Echo Cancellation

    公开(公告)号:US20240340608A1

    公开(公告)日:2024-10-10

    申请号:US18297299

    申请日:2023-04-07

    Applicant: Google LLC

    Abstract: A stereo audio output signal is obtained based on position information indicative of a position of a participant of a teleconference relative to a plurality of audio output devices, wherein both the participant and the plurality of audio devices are located within a teleconferencing space. Playback of the stereo audio output signal is caused at the plurality of audio output devices located within the teleconferencing space. An audio input signal captured at an audio capture device located within the teleconferencing space is received, wherein at least a portion of the audio input signal comprises audio caused by playback of the stereo audio output signal by the plurality of audio output devices. The position information is used to perform an Acoustic Echo Cancellation (AEC) process to the at least the portion of the audio input signal.

    HEADPHONE RENDERING METADATA-PRESERVING SPATIAL CODING

    公开(公告)号:US20240334146A1

    公开(公告)日:2024-10-03

    申请号:US18690133

    申请日:2022-09-08

    CPC classification number: H04S7/302 H04S2400/11 H04S2420/01

    Abstract: Systems and methods for preserving headphone rendering mode (HRM) in object clustering are described. In an embodiment, an object-based audio data processing system includes a processor configured to receive a plurality of audio objects, wherein an audio object of the plurality of audio objects is associated with respective object metadata that indicates respective spatial position information and an HRM; determine a plurality of cluster positions by applying an extended hybrid distance metric to a spatial coding algorithm to calculate a partial loudness for each of the audio objects; render the audio objects to the cluster positions to form a plurality of clusters by applying the extended hybrid distance metric to the spatial coding algorithm to calculate object-to-cluster gains; and transmit the clusters to a spatial reproduction system.

    SOUND PROCESSING APPARATUS AND SOUND PROCESSING SYSTEM

    公开(公告)号:US20240323628A1

    公开(公告)日:2024-09-26

    申请号:US18668392

    申请日:2024-05-20

    Abstract: The present technology relates to a sound processing apparatus and a sound processing system for enabling more stable localization of a sound image.
    A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker. The values obtained by multiplying these gains by the gain of the virtual speaker are set as the gains of the lower right and lower left speakers for fixing a sound image at the target sound image position. The present technology can be applied to sound processing apparatuses.

    Audio processing device and method therefor

    公开(公告)号:US12096201B2

    公开(公告)日:2024-09-17

    申请号:US18302120

    申请日:2023-04-18

    Abstract: An input unit receives input of an assumed listening position of sound of an object, which is a sound source, and outputs assumed listening position information indicating the assumed listening position. A position information correction unit corrects position information of each object on the basis of the assumed listening position information to obtain corrected position information. A gain/frequency characteristic correction unit performs gain correction and frequency characteristic correction on a waveform signal of an object on the basis of the position information and the corrected position information. A spatial acoustic characteristic addition unit further adds a spatial acoustic characteristic to the waveform signal resulting from the gain correction and the frequency characteristic correction on the basis of the position information of the object and the assumed listening position information. The present technology is applicable to an audio processing device.

Patent Agency Ranking