Ambisonics sound field navigation using directional decomposition and path distance estimation

    Publication No.: US10715944B2

    Publication date: 2020-07-14

    Application No.: US16189400

    Application date: 2018-11-13

    Applicant: Google LLC

    Inventor: Andrew Allen

    Abstract: Techniques of rendering sound for a listener involve determining, at a first position, a set of directions from which some specified fraction of the sound field energy originates and reconstructing the sound field at a second position based on the set of directions. Along these lines, a sound rendering computer may receive sound data representing sound emanating from positions on a sphere centered on a microphone. Nevertheless, only a small fraction of these points on the sphere contribute much of the sound field energy directed at the listener. The sound rendering computer determines such directions using a directional decomposition in which those directions having the highest fraction of the sound energy are identified. When the listener moves away from the microphone's position, the sound rendering computer, after identifying such directions, applies an amplitude and phase factor to the sound fields corresponding to those directions.
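The two steps named in this abstract (keeping only the directions that carry a specified fraction of the energy, then applying an amplitude and phase factor) can be illustrated with a minimal sketch. The function names `dominant_directions` and `translation_factor` are hypothetical, and the factor assumes a free-field point source with an `exp(+i k d)` phase convention; neither is taken from the patent, which may define these quantities differently.

```python
import cmath


def dominant_directions(energies, fraction):
    """Return indices of the fewest directions whose summed energy
    reaches at least `fraction` of the total (greedy, largest first)."""
    total = sum(energies)
    if total <= 0:
        return []
    order = sorted(range(len(energies)), key=lambda i: energies[i], reverse=True)
    chosen, accumulated = [], 0.0
    for i in order:
        chosen.append(i)
        accumulated += energies[i]
        if accumulated >= fraction * total:
            break
    return chosen


def translation_factor(d_old, d_new, wavenumber):
    """Amplitude and phase factor for one direction.

    Assumption (not from the patent): the sound behaves like a free-field
    point source, so amplitude scales as d_old / d_new and phase advances
    by wavenumber * (d_new - d_old).
    """
    return (d_old / d_new) * cmath.exp(1j * wavenumber * (d_new - d_old))


# Four sampled directions; keep those carrying 80% of the energy.
energies = [0.05, 0.6, 0.1, 0.25]
kept = dominant_directions(energies, 0.8)  # -> [1, 3]

# Listener moves so the path length doubles from 2 m to 4 m.
factor = translation_factor(2.0, 4.0, wavenumber=2.0)
```

Doubling the path length halves the amplitude in this model; the phase term depends on frequency through the wavenumber, so in practice the factor would be applied per frequency bin.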

    Fast and memory efficient encoding of sound objects using spherical harmonic symmetries

    Publication No.: US10674301B2

    Publication date: 2020-06-02

    Application No.: US16108385

    Application date: 2018-08-22

    Applicant: GOOGLE LLC

    Abstract: A method of encoding sound objects includes receiving a set of monophonic sound inputs. Each of the set of monophonic sound inputs includes position and orientation information of a sound object relative to a source position. The set of monophonic sound inputs are encoded into a higher order ambisonic (HOA) sound field in a spherical harmonics domain based on a spherical harmonics dataset including a subset of all spherical harmonic coefficients for a given subset of azimuth and elevation angles. Some embodiments include decoding the HOA sound field to generate a set of loudspeaker signals.
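As a rough illustration of why a subset of angles can suffice, the sketch below stores first-order spherical harmonic gains only for non-negative elevations and recovers negative elevations from mirror symmetry (only the Z gain changes sign). The table layout, the integer-degree grid, the AmbiX-style channel order (W, Y, Z, X) with SN3D normalization, and the names `lookup` and `encode` are all assumptions made for this example, not details taken from the patent.

```python
import math


def first_order_sh(az_deg, el_deg):
    """First-order gains in (W, Y, Z, X) order, SN3D normalization."""
    az, el = math.radians(az_deg), math.radians(el_deg)
    return (
        1.0,
        math.sin(az) * math.cos(el),
        math.sin(el),
        math.cos(az) * math.cos(el),
    )


# Precompute gains for the upper hemisphere only (elevation 0..90 degrees).
TABLE = {
    (az, el): first_order_sh(az, el)
    for az in range(360)
    for el in range(0, 91)
}


def lookup(az_deg, el_deg):
    """Fetch gains for integer angles, mirroring negative elevations."""
    w, y, z, x = TABLE[(az_deg % 360, abs(el_deg))]
    # Mirroring across the horizontal plane flips the sign of Z only.
    return (w, y, -z if el_deg < 0 else z, x)


def encode(samples, az_deg, el_deg):
    """Encode a mono signal into four first-order ambisonic channels."""
    gains = lookup(az_deg, el_deg)
    return [[g * s for s in samples] for g in gains]


channels = encode([1.0, 0.5], az_deg=30, el_deg=-45)
```

The table here holds about half the entries a full-sphere table would need. Higher orders have further symmetries in azimuth, which this first-order sketch does not exploit.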

    AMBISONICS SOUND FIELD NAVIGATION USING DIRECTIONAL DECOMPOSITION AND PATH DISTANCE ESTIMATION

    Publication No.: US20210160643A1

    Publication date: 2021-05-27

    Application No.: US16946148

    Application date: 2020-06-08

    Applicant: Google LLC

    Inventor: Andrew Allen

    Abstract: Techniques of rendering sound for a listener involve determining, at a first position, a set of directions from which some specified fraction of the sound field energy originates and reconstructing the sound field at a second position based on the set of directions. Along these lines, a sound rendering computer may receive sound data representing sound emanating from positions on a sphere centered on a microphone. Nevertheless, only a small fraction of these points on the sphere contribute much of the sound field energy directed at the listener. The sound rendering computer determines such directions using a directional decomposition in which those directions having the highest fraction of the sound energy are identified. When the listener moves away from the microphone's position, the sound rendering computer, after identifying such directions, applies an amplitude and phase factor to the sound fields corresponding to those directions.

    AMBISONICS SOUND FIELD NAVIGATION USING DIRECTIONAL DECOMPOSITION AND PATH DISTANCE ESTIMATION

    Publication No.: US20190373392A1

    Publication date: 2019-12-05

    Application No.: US16189400

    Application date: 2018-11-13

    Applicant: Google LLC

    Inventor: Andrew Allen

    Abstract: Techniques of rendering sound for a listener involve determining, at a first position, a set of directions from which some specified fraction of the sound field energy originates and reconstructing the sound field at a second position based on the set of directions. Along these lines, a sound rendering computer may receive sound data representing sound emanating from positions on a sphere centered on a microphone. Nevertheless, only a small fraction of these points on the sphere contribute much of the sound field energy directed at the listener. The sound rendering computer determines such directions using a directional decomposition in which those directions having the highest fraction of the sound energy are identified. When the listener moves away from the microphone's position, the sound rendering computer, after identifying such directions, applies an amplitude and phase factor to the sound fields corresponding to those directions.

    OBJECTIVE QUALITY METRICS FOR AMBISONIC SPATIAL AUDIO

    Publication No.: US20190341060A1

    Publication date: 2019-11-07

    Application No.: US15973287

    Application date: 2018-05-07

    Applicant: GOOGLE LLC

    Abstract: A computing device includes a processor and a memory. The processor is configured to generate spectrograms, for example, using short-time Fourier transform, for a plurality of channels of reference and test ambisonic signals. In some implementations, the test ambisonic signal may be generated by decoding an encoded version of the reference ambisonic signal. The processor is further configured to compare, for each of the plurality of channels of a reference ambisonic signal, at least a patch associated with a channel of the reference ambisonic signal with at least a corresponding patch of a corresponding channel of the test ambisonic signal and determine a localization accuracy of the test ambisonic signal based on the comparison. In some implementations, the comparing may be based on phaseograms of the reference and test ambisonic signals.
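The per-channel comparison described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say how patches are compared, so cosine similarity is used here as a stand-in metric, the whole spectrogram is treated as a single patch, and the helper names (`stft_magnitudes`, `patch_similarity`, `channel_scores`) are hypothetical. A naive DFT keeps the example dependency-free; a real implementation would use an FFT library.

```python
import cmath
import math


def stft_magnitudes(signal, frame_len, hop):
    """Magnitude spectrogram from a naive Hann-windowed short-time DFT."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        spectrum = []
        for k in range(frame_len // 2 + 1):
            acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            spectrum.append(abs(acc))
        frames.append(spectrum)
    return frames


def patch_similarity(ref_patch, test_patch):
    """Cosine similarity of two equally sized patches (stand-in metric)."""
    a = [v for row in ref_patch for v in row]
    b = [v for row in test_patch for v in row]
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0


def channel_scores(ref_channels, test_channels, frame_len=16, hop=8):
    """Compare each reference channel with the matching test channel."""
    return [
        patch_similarity(stft_magnitudes(r, frame_len, hop),
                         stft_magnitudes(t, frame_len, hop))
        for r, t in zip(ref_channels, test_channels)
    ]


# One channel that matches its reference and one that does not.
tone_a = [math.sin(2 * math.pi * 2 * n / 16) for n in range(64)]
tone_b = [math.sin(2 * math.pi * 6 * n / 16) for n in range(64)]
scores = channel_scores([tone_a, tone_a], [tone_a, tone_b])
```

A score near 1 means the test channel's spectrogram closely matches the reference. How such per-channel scores would map to a localization accuracy is not specified in the abstract and is left out of this sketch.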

    MUTUAL INFORMATION BASED INTELLIGIBILITY ENHANCEMENT

    Publication No.: US20180212690A1

    Publication date: 2018-07-26

    Application No.: US15926808

    Application date: 2018-03-20

    Applicant: GOOGLE LLC

    CPC classification number: H04B15/00 G10L21/0364

    Abstract: Provided are methods and systems for improving the intelligibility of speech in a noisy environment. A communication model is developed that includes noise inherent in the message production and message interpretation processes, and considers that these noises have fixed signal-to-noise ratios. The communication model forms the basis of an algorithm designed to optimize the intelligibility of speech in a noisy environment. The intelligibility optimization algorithm acts (e.g., manipulates the audio signal) only when needed; if no noise is present, it does not alter or otherwise interfere with the audio signals, thereby preventing any speech distortion. The algorithm is also very fast and efficient in comparison to most existing approaches for speech intelligibility enhancement, and therefore lends itself to easy implementation in an appropriate device (e.g., cellular phone or smartphone).

    Objective quality metrics for ambisonic spatial audio

    Publication No.: US10672405B2

    Publication date: 2020-06-02

    Application No.: US15973287

    Application date: 2018-05-07

    Applicant: GOOGLE LLC

    Abstract: A computing device includes a processor and a memory. The processor is configured to generate spectrograms, for example, using short-time Fourier transform, for a plurality of channels of reference and test ambisonic signals. In some implementations, the test ambisonic signal may be generated by decoding an encoded version of the reference ambisonic signal. The processor is further configured to compare, for each of the plurality of channels of a reference ambisonic signal, at least a patch associated with a channel of the reference ambisonic signal with at least a corresponding patch of a corresponding channel of the test ambisonic signal and determine a localization accuracy of the test ambisonic signal based on the comparison. In some implementations, the comparing may be based on phaseograms of the reference and test ambisonic signals.
