Abstract:
An improved methodology for binaural rendering of audio signals that are perceived by a user to originate from a real-world spatial location is disclosed. Embodiments enable personalized HRTF selection from among a data store containing a plurality of candidate HRTFs using an evaluation-based personalization strategy. One or more relational models personalize the selection. These relational models can relate candidate HRTFs to each other and a particular user to other users so that only a subset of the candidate HRTFs require evaluation. Candidate HRTFs can be evaluated according to one or more selection policies, and relational models can be updated based on actual responses from a user to virtual audio signals that are rendered by a candidate HRTF.
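The evaluation loop described above can be sketched as follows. This is only an illustration under stated assumptions, not the disclosed method: the relational model is reduced to a hypothetical similarity matrix between candidate HRTFs, the selection policy is a simple greedy rule, and the names `select_hrtf`, `similarity`, and `rate` are invented for this example. The user's response to audio rendered with a candidate is stood in for by a callback that returns a numeric score.

```python
import numpy as np

def select_hrtf(similarity, rate, n_evals):
    """Pick a candidate HRTF while evaluating only a subset of candidates.

    similarity : (n, n) array, assumed relational model between candidates
    rate       : callable(index) -> float, stands in for the user's response
    n_evals    : maximum number of candidates the user is asked to evaluate
    """
    n = similarity.shape[0]
    ratings = {}               # observed user responses, keyed by candidate index
    predicted = np.zeros(n)    # model's current score estimate for every candidate

    for _ in range(n_evals):
        unevaluated = [i for i in range(n) if i not in ratings]
        if not unevaluated:
            break
        # Assumed selection policy: evaluate the most promising unevaluated candidate.
        nxt = max(unevaluated, key=lambda i: predicted[i])
        ratings[nxt] = rate(nxt)

        # Assumed model update: similarity-weighted average of observed ratings.
        idx = list(ratings)
        weights = similarity[:, idx]
        observed = np.array([ratings[i] for i in idx])
        predicted = weights @ observed / (weights.sum(axis=1) + 1e-9)

    best = max(ratings, key=ratings.get)
    return best, ratings

# Toy data: four candidates in two similar pairs, with made-up user scores.
similarity = np.array([
    [1.0, 0.9, 0.1, 0.1],
    [0.9, 1.0, 0.1, 0.1],
    [0.1, 0.1, 1.0, 0.9],
    [0.1, 0.1, 0.9, 1.0],
])
true_scores = [0.2, 0.3, 0.8, 0.9]
best, ratings = select_hrtf(similarity, lambda i: true_scores[i], n_evals=3)
```

With `n_evals=3`, only three of the four candidates are rated, which mirrors the abstract's point that only a subset of the candidate HRTFs requires evaluation. A real system would likely use a richer model and policy than the ones assumed here.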
Abstract:
A method of generating a virtual audio signal for a listener. The method includes estimating spherical harmonic coefficients based on an individual character of the listener. The estimated spherical harmonic coefficients are compared to a distribution of known spherical harmonic coefficients. The estimated spherical harmonic coefficients are iteratively updated and compared to the distribution of known spherical harmonic coefficients until convergence. The individual character and the converged spherical harmonic coefficients are then applied to a mono-channel sound.
Abstract:
A Head-Related Transfer Function. The Head-Related Transfer Function includes listener-specific and general components. The listener-specific component includes listener-specific, vertical variations in the Head-Related Transfer Function. The general component includes non-listener-specific, lateral variations in the Head-Related Transfer Function.
Abstract:
A method of enhancing vertical polar localization of a head-related transfer function (HRTF). The method includes splitting an audio signal and generating left and right output signals by determining a log lateral component of the respective frequency-dependent audio gain that is equal to a median log frequency-dependent audio gain for all audio signals of that channel having a desired perceived source location. A vertical magnitude of the respective audio signal is enhanced by determining a log vertical component of the respective frequency-dependent audio gain that is equal to a product of a first enhancement factor and a difference between the respective frequency-dependent audio gain at the desired perceived source location and the lateral magnitude of the respective audio signal. The output signals are time delayed according to an interaural time.
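The lateral and vertical decomposition in this abstract can be illustrated with a short numerical sketch. It rests on assumptions the abstract does not state: the median is taken across the vertical positions that share one lateral angle, the log gains for one ear are stored as an array of shape (positions, frequency bins), and the enhanced log gain is the sum of the lateral and vertical components. The function name `enhance_vertical_cues` and the toy values are hypothetical.

```python
import numpy as np

def enhance_vertical_cues(log_gain, target_index, enhancement_factor):
    """Split one ear's log gain into lateral and enhanced vertical parts.

    log_gain           : (n_positions, n_freqs) log gains at one lateral angle
    target_index       : row of the desired perceived source location
    enhancement_factor : scale applied to the vertical component
    """
    log_gain = np.asarray(log_gain, dtype=float)

    # Log lateral component: per-frequency median over the positions (assumed grouping).
    log_lateral = np.median(log_gain, axis=0)

    # Log vertical component: factor times (gain at target minus lateral component).
    log_vertical = enhancement_factor * (log_gain[target_index] - log_lateral)

    # Assumed recombination: the two log components are summed.
    return log_lateral, log_vertical, log_lateral + log_vertical

# Toy data: three vertical positions, two frequency bins.
log_gain = [[0.0, 1.0],
            [2.0, 3.0],
            [4.0, 5.0]]
lateral, vertical, enhanced = enhance_vertical_cues(
    log_gain, target_index=2, enhancement_factor=2.0
)
```

Under these assumptions, an enhancement factor of 1 reproduces the original log gain at the target position, and factors above 1 exaggerate its deviation from the median. The interaural time delay applied to the output signals is not modeled here.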