Dominant frequency processing of audio signals

    公开(公告)号:US10390137B2

    公开(公告)日:2019-08-20

    申请号:US16075642

    申请日:2016-11-04

    Inventor: Sunil Bharitkar

    Abstract: An example non-transitory computer-readable medium includes instructions. When executed by a processor, the instructions cause the processor to remove nondominant frequencies from a low frequency portion of an audio signal. The instructions also cause the processor to apply non-linear processing to a remainder of the low frequency portion to generate a plurality of harmonics. The instructions cause the processor to insert the plurality of harmonics into an audio output corresponding to a high frequency portion of the audio signal. The audio output is to be provided to an audio output device.

    DISABLING SPATIAL AUDIO PROCESSING
    22.
    发明申请

    公开(公告)号:US20230130930A1

    公开(公告)日:2023-04-27

    申请号:US17798104

    申请日:2020-03-13

    Abstract: In one example in accordance with the present disclosure, a system is described. The system includes a processor to perform spatial audio processing on a received audio signal and an audio interface to connect an audio output device to a computing device. The system also includes a controller. The controller determines a spatial audio processing capability of the audio output device and disables spatial audio processing on one of the audio output device and the processor based on a determination of the spatial audio processing capability of the audio output device.

    ELIMINATING SPATIAL COLLISIONS DUE TO ESTIMATED DIRECTIONS OF ARRIVAL OF SPEECH

    公开(公告)号:US20220201417A1

    公开(公告)日:2022-06-23

    申请号:US17691394

    申请日:2022-03-10

    Abstract: A communication system may include, in an example, a first computing device communicatively coupled, via a network, to at least a second computing device maintained at a geographically distinct location than the first computing device; the first computing device including: an array of audio output devices and a processor to receive transmitted speech data and metadata describing an estimated direction of arrival (DOA) of speech from a plurality of speakers at an array of microphones at the second computing device and render audio at the array of audio output devices associated with the first computing device by eliminating spatial collision during rendering; said spatial collision arising due to the low angular separation of the estimated DOA of a plurality of speakers.

    IMAGE-BASED SOUNDFIELD RENDERING
    24.
    发明申请

    公开(公告)号:US20220159401A1

    公开(公告)日:2022-05-19

    申请号:US17433017

    申请日:2019-06-21

    Abstract: An audio control system may include an imaging sensor to capture an image of an environment containing loudspeakers connected to the audio control system. A listening position subsystem may process the captured image to identify a listening position within the environment. A speaker position subsystem may process the captured image to determine a physical location of each loudspeaker relative to the identified user listening position. A signal processing subsystem may modify an output signal driving the loudspeakers to steer a soundfield generated by the loudspeakers. The audio control system may include a processor, memory, and/or hardware components to implement the various subsystems such that, at the identified user listening position, a perceived location of one of the loudspeakers is mapped to a location that is different than its physical location.

    AUDIO CLASSIFCATION WITH MACHINE LEARNING MODEL USING AUDIO DURATION

    公开(公告)号:US20210294845A1

    公开(公告)日:2021-09-23

    申请号:US16473284

    申请日:2017-04-28

    Abstract: An audio signal classifier including a feature extractor to extract metadata from an audio signal, the metadata defining a plurality of features of the audio signal, the feature extractor to generate a feature vector including selected features of the audio signal, the selected features including a duration of the audio signal, and each selected feature having a feature value. A machine learning model trained to classify the audio signal as one of a plurality of audio signal classes based on the feature vector. The machine learning model to provide a plurality of class values based on the feature values, each class value corresponding to one of the plurality of audio signal classes, the plurality of class values together indicating the class of the audio signal.

    MATRIX DECOMPOSITION OF AUDIO SIGNAL PROCESSING FILTERS FOR SPATIAL RENDERING

    公开(公告)号:US20200045493A1

    公开(公告)日:2020-02-06

    申请号:US16471124

    申请日:2017-04-26

    Inventor: Sunil Bharitkar

    Abstract: In some examples, matrix decomposition of audio signal processing filters for spatial rendering may include determining first and second spatial synthesis filters respectively as a sum and a difference of ipsilateral and contralateral spatial synthesis filters, and determining first and second crosstalk cancellation filters respectively as a sum and a difference of ipsilateral and contralateral crosstalk cancellation filters. A combined spatial synthesizer and crosstalk canceller that includes a first combined filter and a second combined filter may be determined based on application of matrix decomposition to the first and second spatial synthesis filters and the first and second crosstalk cancellation filters. Further, spatial synthesis and crosstalk cancellation on first and second input audio signals may be performed based on application of the combined spatial synthesizer and crosstalk canceller.

    Dominant sub-band determination
    28.
    发明授权

    公开(公告)号:US10524052B2

    公开(公告)日:2019-12-31

    申请号:US15972069

    申请日:2018-05-04

    Inventor: Sunil Bharitkar

    Abstract: An example system includes a filter bank of sub-octave filters to separate a lower frequency portion of an audio input stream into a number of sub-bands. A detector bank of detectors coupled with the filter bank determines an audio power level in each of the sub-bands. A sub-band selection engine coupled with the detector bank determines a dominant sub-band. A first filter engine isolates the dominant sub-band from the audio input stream and a harmonic engine coupled with the first filter generates harmonics of the dominant sub-band. A second filter engine coupled with the harmonic engine selects a sub-set of the harmonics to combine with a higher frequency portion of the audio input stream.

    Applying directionality to audio
    29.
    发明授权

    公开(公告)号:US10397725B1

    公开(公告)日:2019-08-27

    申请号:US16037127

    申请日:2018-07-17

    Inventor: Sunil Bharitkar

    Abstract: A system for creating a perception of directionality to an audio signal, the system including: a processor with an associated memory, the associated memory containing instructions, which when executed cause the processor to: identify an audio signal and an orientation to be applied to the audio signal; calculate intermediate values to reduce the dimensions of the audio signal and orientation; provide the intermediate values into a neural network, to produce a first and second orienting audio outputs; and provide the first orienting audio output to a first speaker and the second orienting audio output to a second speaker.

    Eliminating spatial collisions due to estimated directions of arrival of speech

    公开(公告)号:US11317232B2

    公开(公告)日:2022-04-26

    申请号:US16605195

    申请日:2017-10-17

    Abstract: A communication system may include, in an example, a first computing device communicatively coupled, via a network, to at least a second computing device maintained at a geographically distinct location than the first computing device; the first computing device including: an array of audio output devices and a processor to receive transmitted speech data and metadata describing an estimated direction of arrival (DOA) of speech from a plurality of speakers at an array of microphones at the second computing device and render audio at the array of audio output devices associated with the first computing device by eliminating spatial collision during rendering; said spatial collision arising due to the low angular separation of the estimated DOA of a plurality of speakers.

Patent Agency Ranking