Microphone control based on speech direction

    Publication No.: US11601750B2

    Publication date: 2023-03-07

    Application No.: US17252314

    Filing date: 2018-12-17

    Abstract: According to examples, an apparatus may include a processor and a non-transitory computer readable medium on which are stored instructions that the processor may execute to access an audio signal of a user's speech captured by a microphone while the microphone is in a muted state. The processor may also execute the instructions to analyze a spectral or frequency content of the accessed audio signal to determine whether the user was facing the microphone while the user spoke. In addition, based on a determination that the user was facing the microphone while the user spoke, the processor may execute the instructions to unmute the microphone.
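
    The abstract does not say which spectral feature is analyzed. As a minimal sketch only, the following assumes that speech directed at the microphone keeps a larger share of its energy at high frequencies, and unmutes when that share passes a threshold. The names `high_band_ratio`, `should_unmute` and `two_tone`, the 2000 Hz split and the 0.2 threshold are all hypothetical choices, not taken from the patent.

```python
import cmath
import math


def high_band_ratio(samples, sample_rate, split_hz):
    """Fraction of spectral energy at or above split_hz, via a naive DFT."""
    n = len(samples)
    total = 0.0
    high = 0.0
    for k in range(1, n // 2):  # positive-frequency bins, DC excluded
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * i / n)
            for i, x in enumerate(samples)
        )
        energy = abs(coeff) ** 2
        total += energy
        if k * sample_rate / n >= split_hz:
            high += energy
    return high / total if total else 0.0


def should_unmute(samples, sample_rate, split_hz=2000.0, threshold=0.2):
    """Hypothetical rule: unmute when enough energy lies in the high band."""
    return high_band_ratio(samples, sample_rate, split_hz) >= threshold


def two_tone(low_amp, high_amp, sample_rate=8000, n=256):
    """Synthetic stand-in for speech: a 500 Hz tone plus a 3000 Hz tone."""
    return [
        low_amp * math.sin(2 * math.pi * 500 * i / sample_rate)
        + high_amp * math.sin(2 * math.pi * 3000 * i / sample_rate)
        for i in range(n)
    ]
```

    With these assumed settings, a signal with a strong 3000 Hz component (`two_tone(1.0, 0.8)`) triggers an unmute, while one with a weak high band (`two_tone(1.0, 0.1)`) does not.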

    Loudness enhancement based on multiband range compression

    Publication No.: US11176958B2

    Publication date: 2021-11-16

    Application No.: US16487126

    Filing date: 2017-04-28

    Inventor: Sunil Bharitkar

    Abstract: In some examples, loudness enhancement based on multiband range compression may include determining, based on variations in compression parameters that include compression thresholds and compression ratios, corresponding variations in loudness levels for a specified loudness standard. A learning model may be trained based on the variations in the compression parameters and the corresponding variations in the loudness levels. A specified loudness level for a device may be ascertained, for example, from a user of the device. The compression parameters for the specified loudness level may be determined based on the trained learning model. Further, sub-band compression of an input audio signal may be performed, based on the determined compression parameters, by processing the input audio signal using a perfect reconstruction filterbank.
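
    To make the sub-band compression step concrete, here is a rough sketch that assumes a two-band Haar filterbank (one simple perfect reconstruction filterbank) and a static, sample-wise compressor defined by a threshold in dB and a ratio. The learning model that selects the parameters is not shown, and the function names and the `params` layout are hypothetical.

```python
import math


def compress_sample(x, threshold_db, ratio):
    """Static compressor on one sample's instantaneous level (no smoothing)."""
    if x == 0.0:
        return 0.0
    level_db = 20.0 * math.log10(abs(x))
    if level_db <= threshold_db:
        return x  # below threshold: unchanged
    out_db = threshold_db + (level_db - threshold_db) / ratio
    return math.copysign(10.0 ** (out_db / 20.0), x)


def haar_analysis(signal):
    """Split an even-length signal into low and high sub-bands."""
    pairs = range(0, len(signal) - 1, 2)
    low = [(signal[i] + signal[i + 1]) / 2.0 for i in pairs]
    high = [(signal[i] - signal[i + 1]) / 2.0 for i in pairs]
    return low, high


def haar_synthesis(low, high):
    """Inverse of haar_analysis: rebuilds the original samples."""
    out = []
    for lo, hi in zip(low, high):
        out.extend((lo + hi, lo - hi))
    return out


def multiband_compress(signal, params):
    """params maps "low"/"high" to (threshold_db, ratio); assumed layout."""
    low, high = haar_analysis(signal)
    low = [compress_sample(v, *params["low"]) for v in low]
    high = [compress_sample(v, *params["high"]) for v in high]
    return haar_synthesis(low, high)
```

    Because the filterbank reconstructs perfectly, any change in the output comes only from the per-band gain, which is the property that makes independent thresholds and ratios per band safe to tune.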

    FLUID CLASSIFICATION
    Patent application

    Publication No.: US20210199643A1

    Publication date: 2021-07-01

    Application No.: US16761829

    Filing date: 2018-01-16

    Abstract: Fluid classification may include: receiving sensed data for the fluid; modeling the sensed data in a frequency domain; synthesizing a model of the sensed data from the frequency domain to a time domain response and converting the time domain response to a time frequency graphical representation in the form of a color map. Predetermined characteristics of the time frequency graphical representation are identified through computer vision and compared to at least one corresponding signature characteristic of a predetermined fluid type to identify the fluid as a fluid type.
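
    The pipeline in this abstract ends with a time frequency map that is compared against stored signatures. The sketch below is a loose illustration under strong assumptions: it builds the map with a naive short-time DFT and replaces the computer vision step with a plain squared-distance comparison of time-averaged spectra. The function names and the signature dictionary are hypothetical.

```python
import cmath
import math


def spectrogram(samples, frame_len):
    """Magnitude per (frame, frequency bin) using non-overlapping frames."""
    grid = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        row = []
        for k in range(frame_len // 2):
            coeff = sum(
                x * cmath.exp(-2j * math.pi * k * i / frame_len)
                for i, x in enumerate(frame)
            )
            row.append(abs(coeff))
        grid.append(row)
    return grid


def mean_spectrum(grid):
    """Average the time frequency map over time: one value per bin."""
    return [sum(row[k] for row in grid) / len(grid) for k in range(len(grid[0]))]


def classify(samples, signatures, frame_len=16):
    """Return the signature name closest to the sample's mean spectrum."""
    feature = mean_spectrum(spectrogram(samples, frame_len))

    def distance(name):
        return sum((a - b) ** 2 for a, b in zip(feature, signatures[name]))

    return min(signatures, key=distance)
```

    A real system would compare richer image features than a mean spectrum; the point here is only the order of operations: sensed data, time frequency map, comparison with a per-fluid signature.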

    SPATIAL CHARACTERISTICS OF MULTI-CHANNEL SOURCE AUDIO

    Publication No.: US20210191685A1

    Publication date: 2021-06-24

    Application No.: US17047333

    Filing date: 2018-08-30

    Inventor: Sunil Bharitkar

    Abstract: In some examples, an audio control system can include a first set of resources, a second set of resources and a controller. The first set of resources can generate a frequency energy band representation of a multi-channel source audio input. Additionally, the second set of resources can determine at least a value representing a strength of correlation between multiple channels of the multi-channel source audio input. Moreover, the controller can determine a set of control parameters for tuning sound creation from an audio signal generator to reflect a set of spatial characteristics of the source audio input, based on the frequency energy band representation and the value.
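
    One common way to express the strength of correlation between two channels is the normalized correlation coefficient at zero lag. The sketch below assumes that measure; the abstract does not name one, and the function name is hypothetical. The frequency energy band representation is not covered here.

```python
import math


def interchannel_correlation(left, right):
    """Normalized zero-lag correlation in [-1, 1]; 0.0 for a silent channel."""
    n = min(len(left), len(right))
    left, right = left[:n], right[:n]
    mean_l = sum(left) / n
    mean_r = sum(right) / n
    num = sum((l - mean_l) * (r - mean_r) for l, r in zip(left, right))
    den = math.sqrt(
        sum((l - mean_l) ** 2 for l in left)
        * sum((r - mean_r) ** 2 for r in right)
    )
    return num / den if den else 0.0
```

    Values near 1 indicate nearly identical channels (a narrow, centered image), values near 0 indicate unrelated channels (a wide or diffuse image), and values near -1 indicate phase-inverted channels.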

    Combined audio signal output
    Granted patent

    Publication No.: US10659877B2

    Publication date: 2020-05-19

    Application No.: US16473345

    Filing date: 2017-03-08

    Abstract: According to examples, an apparatus may include a processor and a memory on which are stored machine readable instructions that are to cause the processor to determine a reference frame from a plurality of frames received at multiple different times, in which each of the plurality of frames includes audio signal data, and in which the reference frame includes audio signal data that identifies a highest audio signal level among audio signals identified in the plurality of frames. The reference frame may be time-aligned with each of the plurality of frames other than the reference frame to obtain respective time-aligned frames. The audio signals identified in each of the respective time-aligned frames may be added together to generate respective added audio signals. The respective added audio signals may be combined together to obtain a combined audio signal and the combined audio signal may be output.
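
    The steps in this abstract (pick the loudest frame as the reference, time-align the others to it, add, combine) can be sketched as follows. This is a simplified reading, not the claimed method: it assumes alignment by the lag that maximizes cross-correlation, and it folds the adding and combining steps into a single average. All function names and the `max_lag` parameter are hypothetical.

```python
def peak_level(frame):
    """Highest absolute sample value in a frame."""
    return max(abs(s) for s in frame)


def best_lag(reference, frame, max_lag):
    """Lag in samples that maximizes cross-correlation with the reference.

    A positive result means the frame is delayed relative to the reference.
    """
    def corr(lag):
        return sum(
            reference[i] * frame[i + lag]
            for i in range(len(reference))
            if 0 <= i + lag < len(frame)
        )

    return max(range(-max_lag, max_lag + 1), key=corr)


def shift(frame, lag):
    """Advance a frame by lag samples, zero-padding the vacated positions."""
    n = len(frame)
    return [frame[i + lag] if 0 <= i + lag < n else 0.0 for i in range(n)]


def combine_frames(frames, max_lag=8):
    """Align every frame to the loudest one, then average them."""
    ref_index = max(range(len(frames)), key=lambda i: peak_level(frames[i]))
    reference = frames[ref_index]
    combined = list(reference)
    for index, frame in enumerate(frames):
        if index == ref_index:
            continue
        aligned = shift(frame, best_lag(reference, frame, max_lag))
        combined = [c + a for c, a in zip(combined, aligned)]
    return [c / len(frames) for c in combined]
```

    Aligning before adding matters because summing copies of the same sound that arrive at different times would otherwise smear or partially cancel it.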

    CROSSTALK CANCELLATION FOR SPEAKER-BASED SPATIAL RENDERING

    Publication No.: US20200029155A1

    Publication date: 2020-01-23

    Application No.: US16471893

    Filing date: 2017-04-14

    Inventor: Sunil Bharitkar

    Abstract: In some examples, crosstalk cancellation for speaker-based spatial rendering may include perceptually smoothing head-related transfer functions (HRTFs) corresponding to ipsilateral and contralateral transfer paths of sound emitted from first and second speakers to corresponding first and second destinations. The crosstalk cancellation may further include inserting an inter-aural time difference in the perceptually smoothed HRTFs corresponding to the contralateral transfer paths. A crosstalk canceller may be generated by inverting the perceptually smoothed HRTFs corresponding to the ipsilateral transfer paths and the perceptually smoothed HRTFs corresponding to the contralateral transfer paths including the inserted inter-aural time difference.
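
    The inversion step can be illustrated at a single frequency. The sketch below assumes a symmetric two-speaker setup, so the transfer matrix is [[Hi, Hc], [Hc, Hi]] with ipsilateral response Hi and contralateral response Hc, and it models the inserted inter-aural time difference as a pure delay on Hc. Perceptual smoothing is not shown, and all names are hypothetical.

```python
import cmath
import math


def insert_itd(h_contra, freq_hz, itd_seconds):
    """Apply a pure delay of itd_seconds to the contralateral response."""
    return h_contra * cmath.exp(-2j * math.pi * freq_hz * itd_seconds)


def crosstalk_canceller(h_ipsi, h_contra):
    """Invert the symmetric 2x2 matrix [[Hi, Hc], [Hc, Hi]] at one frequency."""
    det = h_ipsi * h_ipsi - h_contra * h_contra
    if abs(det) < 1e-12:
        raise ValueError("transfer matrix is ill-conditioned at this frequency")
    return (
        (h_ipsi / det, -h_contra / det),
        (-h_contra / det, h_ipsi / det),
    )


def matmul2(a, b):
    """Product of two 2x2 matrices given as nested tuples."""
    return tuple(
        tuple(sum(a[r][k] * b[k][c] for k in range(2)) for c in range(2))
        for r in range(2)
    )
```

    Multiplying the transfer matrix by the canceller should give the identity matrix, meaning each ear receives only its intended channel. A full canceller would repeat this per frequency bin and would usually regularize bins where the determinant is small.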

    COMBINED AUDIO SIGNAL OUTPUT
    Patent application

    Publication No.: US20190387313A1

    Publication date: 2019-12-19

    Application No.: US16473345

    Filing date: 2017-03-08

    Abstract: According to examples, an apparatus may include a processor and a memory on which are stored machine readable instructions that are to cause the processor to determine a reference frame from a plurality of frames received at multiple different times, in which each of the plurality of frames includes audio signal data, and in which the reference frame includes audio signal data that identifies a highest audio signal level among audio signals identified in the plurality of frames. The reference frame may be time-aligned with each of the plurality of frames other than the reference frame to obtain respective time-aligned frames. The audio signals identified in each of the respective time-aligned frames may be added together to generate respective added audio signals. The respective added audio signals may be combined together to obtain a combined audio signal and the combined audio signal may be output.

    MULTI-CHANNEL DECOMPOSITION AND HARMONIC SYNTHESIS

    Publication No.: US20230085013A1

    Publication date: 2023-03-16

    Application No.: US17795193

    Filing date: 2020-01-28

    Inventor: Sunil Bharitkar

    Abstract: In one example in accordance with the present disclosure, a system is described. The system includes a decompose device to decompose a multi-channel audio stream into at least a first portion and a second portion. A synthesis device of the system independently synthesizes harmonics in each of the first portion and the second portion using different harmonic models. An audio generator of the system combines synthesized harmonics from the first portion and the second portion with the multi-channel audio stream to generate a synthesized audio output.
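
    The abstract leaves both the decomposition and the harmonic models unspecified. Purely as an illustration, the sketch below assumes a stereo stream split into mid and side portions, a squaring nonlinearity for the mid portion (which adds a second harmonic to a sinusoid) and a cubing nonlinearity for the side portion (which adds a third harmonic). Every name and the `mix` parameter are hypothetical.

```python
def decompose(left, right):
    """Assumed decomposition: mid (common) and side (difference) portions."""
    mid = [(l + r) / 2.0 for l, r in zip(left, right)]
    side = [(l - r) / 2.0 for l, r in zip(left, right)]
    return mid, side


def even_harmonics(portion):
    """Squaring a sinusoid yields a component at twice its frequency, plus DC."""
    return [v * v for v in portion]


def odd_harmonics(portion):
    """Cubing a sinusoid yields a component at three times its frequency."""
    return [v ** 3 for v in portion]


def synthesize(left, right, mix=0.1):
    """Add separately generated harmonics back onto the original channels."""
    mid, side = decompose(left, right)
    mid_h = even_harmonics(mid)
    side_h = odd_harmonics(side)
    out_left = [l + mix * (m + s) for l, m, s in zip(left, mid_h, side_h)]
    out_right = [r + mix * (m - s) for r, m, s in zip(right, mid_h, side_h)]
    return out_left, out_right
```

    Setting `mix` to 0 returns the input unchanged, which makes it easy to compare the processed and unprocessed streams.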
