Distributed speech enhancement using generalized eigenvalue decomposition

    公开(公告)号:US12039991B1

    公开(公告)日:2024-07-16

    申请号:US17532720

    申请日:2021-11-22

    摘要: An artificial reality headset enhances audio signals from a target sound source using information from other devices in the local area. A primary headset broadcasts a location of a target sound source to secondary headsets in a local area. The secondary headsets transmit audio signals to the primary headset to enhance the audio content presented by the primary headset to a user. The secondary headset may select an array transfer function for the location of the target sound source. The secondary headsets correlate known transfer functions in the target direction with estimated transfer functions. The secondary headset may perform beamforming on the target sound source and transmit the output audio signal to the primary headset. In some embodiments, the secondary headset may transmit the array transfer function and a raw audio signal to the primary headset. The primary headset generates audio content based on the received audio signal.

    Audio source localization
    5.
    发明授权

    公开(公告)号:US11671756B2

    公开(公告)日:2023-06-06

    申请号:US16997778

    申请日:2020-08-19

    摘要: An electronic device localizes an audio source by normalizing an amplitude of an audio signal over a time period. The electronic device receives, from one or more microphones of the electronic device, signal(s) representative of audio emitted by an audio source over a time period. The electronic device estimates amplitudes of the signal(s) at a first time within the time period and at a second time within the time period, where the second time is different from the first time. The electronic device normalizes the amplitudes associated with the first and second times to generate normalized amplitudes. The electronic device determines a combined amplitude representative of the audio emitted by the audio source by combining the normalized amplitudes. The electronic device determines, based at least in part on the combined amplitude and motion of the electronic device, an estimated position of the audio source relative to the electronic device.

    Customized sound field for increased privacy

    公开(公告)号:US11611826B1

    公开(公告)日:2023-03-21

    申请号:US17584326

    申请日:2022-01-25

    摘要: An audio system for customizing sound fields for increased user privacy. A microphone array of a headset detects sounds from one or more sound sources in a local area of the headset. The audio system estimates array transfer functions (ATFs) associated with the sounds, and determines determining sound field reproduction filters for a loudspeaker array of the headset using the ATFs. The audio system presents audio content, via the loudspeaker array, based in part on the sound field reproduction filters. The presented audio content has a sound field that has a reduced amplitude in a first damped region of the local area that includes a first sound source of the one or more sound sources.

    Memory recall of historical data samples bucketed in discrete poses for audio beamforming

    公开(公告)号:US11659324B1

    公开(公告)日:2023-05-23

    申请号:US17503565

    申请日:2021-10-18

    IPC分类号: H04R1/40

    CPC分类号: H04R1/406

    摘要: A system and method for storing data samples in discrete poses and recalling the stored data samples for updating a sound filter. The system determines that a microphone array at a first time period is in a first discrete pose of a plurality of discrete poses, wherein the plurality of discrete poses discretizes a pose space. The pose space includes at least an orientation component and may further include a translation component. The system retrieves one or more historical data samples associated with the first discrete pose, generated from sound captured by the microphone array before the first time period, and stored in a memory cache (e.g., for memorization). The system updates a sound filter for the first discrete pose using the retrieved one or more historical data samples. The system generates and presents audio content using the updated sound filter.

    Systems and methods for classifying beamformed signals for binaural audio playback

    公开(公告)号:US11638111B2

    公开(公告)日:2023-04-25

    申请号:US16905411

    申请日:2020-06-18

    摘要: The disclosed computer-implemented method may include receiving a signal for each channel of an audio transducer array on a wearable device. The method may also include calculating a beamformed signal for each beam direction of a set of beamforming filters for the wearable device. Additionally, the method may include classifying a first beamformed signal from the calculated beamformed signals into a first class of sound and a second beamformed signal from the calculated beamformed signals into a second class of sound. The method may also include adjusting, based on the classifying, a gain of the first beamformed signal relative to the second beamformed signal. Furthermore, the method may include converting the beamformed signals into spatialized binaural audio based on a position of a user. Finally, the method may include transmitting the spatialized binaural audio to a playback device. Various other methods, systems, and computer-readable media are also disclosed.

    Determination of composite acoustic parameter value for presentation of audio content

    公开(公告)号:US11638110B1

    公开(公告)日:2023-04-25

    申请号:US17557425

    申请日:2021-12-21

    IPC分类号: H04S7/00 H04R1/40

    摘要: Determination of a composite acoustic parameter value for a headset is presented herein. A directionally enhanced audio signal is generated based on audio signals from an acoustic sensor array and a spatial signal enhancement filter that is directed for enhancement of a sound source. A SNR improvement value is determined based on a SNR value of the directionally enhanced audio signal and a SNR value of an audio signal from an acoustic sensor of the acoustic sensor array. The SNR improvement value is input into a model that maps SNR improvement values to spatial acoustic parameters to determine a spatial acoustic parameter. A temporal acoustic parameter is determined based on the audio signals. The composite acoustic parameter value is determined based on the spatial acoustic parameter and a temporal acoustic parameter value. Audio content presented to a user is adjusted based in part on the composite acoustic parameter value.

    Wearer identification based on personalized acoustic transfer functions

    公开(公告)号:US11526589B2

    公开(公告)日:2022-12-13

    申请号:US16526498

    申请日:2019-07-30

    摘要: A wearable device includes an audio system. In one embodiment, the audio system includes a sensor array that includes a plurality of acoustic sensors. When a user wears the wearable device, the audio system determines an acoustic transfer function for the user based upon detected sounds within a local area surrounding the sensor array. Because the acoustic transfer function is based upon the size, shape, and density of the user's body (e.g., the user's head), different acoustic transfer functions will be determined for different users. The determined acoustic transfer functions are compared with stored acoustic transfer functions of known users in order to authenticate the user of the wearable device.