-
公开(公告)号:US11586411B2
公开(公告)日:2023-02-21
申请号:US17047333
申请日:2018-08-30
Applicant: Hewlett-Packard Development Company, L.P.
Inventor: Sunil Bharitkar
IPC: G06F3/16 , G10L19/008 , H04S3/00
Abstract: In some examples, an audio control system can include a first set of resources, a second set of resources and a controller. The first set of resources can generate a frequency energy band representation of a multi-channel source audio input. Additionally, the second set of resources can determine at least a value representing a strength of correlation between multiple channels of the multi-channel source audio input. Moreover, the audio output controller can determine a set of control parameters for tuning sound creation from an audio signal generator to reflect a set of spatial characteristics of the source audio input, based on the frequency energy band representation and the first value.
-
公开(公告)号:US11457329B2
公开(公告)日:2022-09-27
申请号:US17084319
申请日:2020-10-29
Applicant: Hewlett-Packard Development Company, L.P.
Inventor: Sunil Bharitkar
Abstract: In some examples, immersive audio rendering may include determining whether an audio signal includes a first content format including stereo content, or a second content format including multichannel or object-based content. In response to a determination that the audio signal includes the first content format, the audio signal may be routed to a first block that includes a low-frequency extension and a stereo to multichannel upmix to generate a resulting audio signal. Alternatively, the audio signal may be routed to another low-frequency extension to generate the resulting audio signal. The audio signal may be further processed by performing spatial synthesis on the resulting audio signal, and crosstalk cancellation on the spatial synthesized audio signal. Further, multiband-range compression may be performed on the crosstalk cancelled audio signal, and an output stereo signal may be generated based on the multiband-range compressed audio signal.
-
公开(公告)号:US20220114995A1
公开(公告)日:2022-04-14
申请号:US17419057
申请日:2019-07-03
Applicant: Hewlett-Packard Development Company, L.P.
Inventor: Srikanth Kuthuru , Sunil Bharitkar , Madhu Sudan Athreya
IPC: G10K11/16 , G06V20/50 , G06V10/764 , G01H7/00
Abstract: Audio signal dereverberation can be carried out in accordance instructions on a machine readable storage medium, using a processor. In an example, a location of a person in a room can be determined. An audio signal received from the location of the person can be captured using beamforming. Room properties can be determined based in part on a signal sweep of the room. A dereverberation parameter can be determined based in part on the location of the person and the room properties. The dereverberation parameter can be applied to the audio signal.
-
公开(公告)号:US20210166715A1
公开(公告)日:2021-06-03
申请号:US16770724
申请日:2018-02-16
Applicant: Hewlett-Packard Development Company, L.P.
Inventor: Sunil Bharitkar
IPC: G10L21/043 , G10L17/04 , G10L17/24 , G10L17/18
Abstract: In some examples, with respect to encoded features and rate-based augmentation based speech authentication, a plurality of features of a registration speech signal for a user that is to be registered may be extracted. A speech rate of the registration speech signal may be modified to generate a rate-adjusted speech signal, and a plurality of features of the rate-adjusted speech signal may be extracted. The user may be registered by training, based on the plurality of extracted features of the registration speech signal and the plurality of extracted features of the rate-adjusted speech signal, a machine learning model. Further, based on the trained machine learning model, a determination may be made as to whether an authentication speech signal is authentic to authenticate the registered user.
-
公开(公告)号:US10771896B2
公开(公告)日:2020-09-08
申请号:US16471893
申请日:2017-04-14
Applicant: Hewlett-Packard Development Company, L.P.
Inventor: Sunil Bharitkar
Abstract: In some examples, crosstalk cancellation for speaker-based spatial rendering may include perceptually smoothing head-related transfer functions (HRTFs) corresponding to ipsilateral and contralateral transfer paths of sound emitted from first and second speakers to corresponding first and second destinations. The crosstalk cancellation may further include inserting an inter-aural time difference in the perceptually smoothed HRTFs corresponding to the contralateral transfer paths. A crosstalk canceller may be generated by inverting the perceptually smoothed HRTFs corresponding to the ipsilateral transfer paths and the perceptually smoothed HRTFs corresponding to the contralateral transfer paths including the inserted inter-aural time difference.
-
公开(公告)号:US10750307B2
公开(公告)日:2020-08-18
申请号:US16603260
申请日:2017-04-14
Applicant: Hewlett-Packard Development Company, L.P.
Inventor: Wensen Liu , Sunil Bharitkar
Abstract: Described herein is a technology related to a mobile device with a pair of stereo speakers. The mobile device has an orientation detection system that detects the orientation of the mobile device and a crosstalk cancellation system that performs crosstalk cancellation with complementary cardioid beams in response to the detected orientation of the mobile device. The mobile device also has an audio system that emits complementary cardioid sound beams from the speakers.
-
公开(公告)号:US20200236488A1
公开(公告)日:2020-07-23
申请号:US16487882
申请日:2017-04-28
Applicant: Hewlett-Packard Development Company, L.P.
Inventor: Sunil Bharitkar
Abstract: In some examples, immersive audio rendering may include determining whether an audio signal includes a first content format including stereo content, or a second content format including multichannel or object-based content. In response to a determination that the audio signal includes the first content format, the audio signal may be routed to a first block that includes a low-frequency extension and a stereo to multichannel upmix to generate a resulting audio signal. Alternatively, the audio signal may be routed to another low-frequency extension to generate the resulting audio signal. The audio signal may be further processed by performing spatial synthesis on the resulting audio signal, and crosstalk cancellation on the spatial synthesized audio signal. Further, multiband-range compression may be performed on the crosstalk cancelled audio signal, and an output stereo signal may be generated based on the multiband-range compressed audio signal.
-
公开(公告)号:US20230171346A1
公开(公告)日:2023-06-01
申请号:US17919059
申请日:2020-04-15
Applicant: Hewlett-Packard Development Company, L.P.
Inventor: Srikanth Kuthuru , Sunil Bharitkar
Abstract: In example implementations, an apparatus is provided. The apparatus includes an adaptive filter and a double talk detector in communication with the adaptive filter. The adaptive filter is to calculate a transfer function with coefficients for a particular time that is applied to an output signal of a microphone to cancel echoes caused by a reference signal in the output signal of the microphone. The double talk detector is to determine a peak of the coefficients, detect double talk based on a location of the peak of the coefficients, and transmit a pause signal to the adaptive filter in response to detection of the double talk, wherein the pause signal is to pause a calculation of updates to the coefficients by the adaptive filter.
-
公开(公告)号:US11380347B2
公开(公告)日:2022-07-05
申请号:US16076272
申请日:2017-02-01
Applicant: HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P.
Inventor: Sunil Bharitkar , Wensen Liu , Madhu Sudan Athreya , Richard Sweet
IPC: G10L21/0364 , G10L21/02 , G10L21/0316 , G06T7/70 , G10L21/034 , G10L25/60 , G10L25/84 , H04R1/32 , H04R17/00
Abstract: In some examples, adaptive speech intelligibility control for speech privacy may include determining, based on background noise at a near-end of a speaker, a noise estimate associated with speech emitted from the speaker, and comparing, by using a specified factor, the noise estimate to a speech level estimate for the speech emitted from the speaker. Adaptive speech intelligibility control for speech privacy may further include determining, based on the comparison, a gain value to be applied to the speaker to produce the speech at a specified level to maintain on-axis intelligibility with respect to the speaker, and applying the gain value to the speaker.
-
公开(公告)号:US20220101126A1
公开(公告)日:2022-03-31
申请号:US17426678
申请日:2019-02-14
Applicant: Hewlett-Packard Development Company, L.P.
Inventor: Sunil Bharitkar
IPC: G06N3/08 , G06N3/04 , H04S7/00 , G10L19/008 , A63F13/54
Abstract: The present disclosure describes techniques for adding a perception of directionality to audio. The method includes receiving a set of head related transfer functions (HRTFs). The method also includes training an artificial neural network based on the HRTFs to generate a trained artificial neural network, wherein the trained artificial neural network represents a subspace reconstruction model for generating interpolated HRTFs. The trained artificial neural network is generated using Bayesian optimization to determine a number of layers and a number of neurons per layer of the trained artificial neural network. The method also includes storing the trained artificial neural network, wherein the trained artificial neural network is used to reconstruct a new head related transfer function for a specified direction. The new head related transfer function is used to process an audio signal to produce a perception of directionality.
-
-
-
-
-
-
-
-
-