Abstract:
A method and apparatus are described for outputting audio based on the orientation of an electronic device, or of video shown by the electronic device. The audio may be mapped to a set of speakers using either or both of the device and video orientation to determine which speakers receive certain audio channels.
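The orientation-based mapping can be illustrated with a minimal sketch. The abstract does not specify a speaker layout or orientation names, so the two-speaker device (speaker "A" at the end that is uppermost in portrait, speaker "B" at the opposite end), the orientation labels, and the function name `map_channels` are all assumptions made for illustration.

```python
def map_channels(orientation):
    """Return a {speaker: channel} mapping for a hypothetical two-speaker device.

    Assumed layout: speaker "A" sits at the end that is uppermost in portrait,
    speaker "B" at the opposite end. Orientation names are illustrative only.
    """
    if orientation == "landscape_left":
        # Assumed: end "A" is on the viewer's left in this orientation.
        return {"A": "left", "B": "right"}
    if orientation == "landscape_right":
        # The device is rotated the other way, so the channels swap.
        return {"A": "right", "B": "left"}
    if orientation in ("portrait", "portrait_upside_down"):
        # The speakers are stacked vertically, so no left/right separation
        # is available; this sketch sends a mono downmix to both.
        return {"A": "mono", "B": "mono"}
    raise ValueError(f"unknown orientation: {orientation!r}")


mapping = map_channels("landscape_right")
```

A real device could combine this with the orientation of the displayed video, which the abstract mentions as a second possible input.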
Abstract:
Image analysis of a video signal is performed to produce first metadata, and audio analysis of a multi-channel sound track associated with the video signal is performed to produce second metadata. A number of time segments of the sound track are processed, wherein each time segment is processed by either (i) spatial filtering of the audio signals or (ii) spatial rendering of the audio signals, not both, wherein for each time segment a decision was made to select between the spatial filtering or the spatial rendering, in accordance with the first and second metadata. A mix of the processed sound track and the video signal is generated. Other embodiments are also described and claimed.
Abstract:
An audio system is provided that efficiently detects speaker arrays and configures the speaker arrays to output sound. In this system, a computing device may record the addresses and/or types of speaker arrays on a shared network while a camera captures video of a listening area, including the speaker arrays. The captured video may be analyzed to determine the location of the speaker arrays, one or more users, and/or the audio source in the listening area. While capturing the video, the speaker arrays may be driven to sequentially emit a series of test sounds into the listening area and a user may be prompted to select which speaker arrays in the captured video emitted each of the test sounds. Based on these inputs from the user, the computing device may determine an association between the speaker arrays on the shared network and the speaker arrays in the captured video.
Abstract:
A system and method are described for generating a confidence level for data generated by a beamforming acoustic beacon system. The system may include an audio emission device to emit a set of sounds corresponding to a set of predefined modal patterns into a listening area. The sounds may be detected by an audio capture device to produce a set of impulse responses corresponding to the modal patterns. The impulse responses may be processed to produce a set of window synthesized impulse responses for various angles. These window synthesized impulse responses may (1) be formed based on a weighted set of the modal patterns that were originally used to emanate sound and (2) seek to emulate a target beam, which is also composed of the same weighted modal patterns. A confidence level may be computed based on the difference between the window synthesized impulse responses and the target beam pattern.
Abstract:
Loudspeakers are described that may reduce comb filtering effects perceived by a listener by either 1) moving transducers closer to a sound reflective surface (e.g., a baseplate, a tabletop or a floor) through vertical (height) or rotational adjustments of the transducers or 2) guiding sound produced by the transducers to be released into the listening area proximate to the reflective surface through the use of horns and openings that are at a prescribed distance from the reflective surface. The reduction of this distance between the reflective surface and the point at which sound emitted by the transducers is released into the listening area may lead to a shorter reflected path that reduces comb filtering effects caused by reflected sounds that are delayed relative to the direct sound. Accordingly, the loudspeakers shown and described may be placed on reflective surfaces without severe audio coloration caused by reflected sounds.
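The relationship between source height and comb filtering can be checked with a short calculation. The sketch below assumes an idealized geometry that the abstract does not specify: a point source and a listener above a flat, perfectly reflective surface, with the reflection modeled as a mirror-image source. Under that assumption the first cancellation notch falls where the extra path length equals half a wavelength. The function name and the example dimensions are hypothetical.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, approximate value for air at room temperature


def first_notch_hz(source_height, listener_height, distance):
    """Frequency of the first comb-filter notch for an idealized floor bounce.

    All lengths are in meters. The reflected path is computed with the
    image-source method (source mirrored below the surface).
    """
    direct = math.hypot(distance, listener_height - source_height)
    reflected = math.hypot(distance, listener_height + source_height)
    extra_path = reflected - direct
    if extra_path == 0.0:
        # Source on the surface: direct and reflected paths coincide.
        return math.inf
    # Cancellation first occurs when the extra path is half a wavelength.
    return SPEED_OF_SOUND / (2.0 * extra_path)


# Hypothetical example: listener 1 m high and 2 m away, two source heights.
high_source = first_notch_hz(0.30, 1.0, 2.0)
low_source = first_notch_hz(0.05, 1.0, 2.0)
```

In this model, lowering the source shortens the extra reflected path, which pushes the first notch upward in frequency, consistent with the reduction in coloration the abstract describes.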
Abstract:
A system and method for measuring the performance of a plurality of transducers integrated in one or more loudspeakers are described. The method simultaneously drives each transducer to emit sounds corresponding to distinct orthogonal test signals. A listening device senses sounds produced by the orthogonal test signals and analyzes the sensed audio signal to determine the performance of each transducer. By using orthogonal test signals, the multiple transducers may be measured and/or characterized simultaneously and with limited effect from extraneous noises.
Abstract:
An audio receiver that performs crosstalk cancellation using a speaker array is described. The audio receiver detects the location of a listener in a room and processes a piece of sound program content to be output through the speaker array using one or more beam pattern matrices. The beam pattern matrices are generated according to one or more constraints. The constraints may include increasing a right channel and decreasing a left channel at the right ear of the listener, increasing a left channel and decreasing a right channel at the left ear of the listener, and decreasing sound in all other areas of the room. These constraints cause the audio receiver to beam sound primarily towards the listener and not in other areas of the room such that crosstalk cancellation is achieved with minimal effects due to changes to the frequency response of the room. Other embodiments are also described.
Abstract:
A system for enhancing audio includes a plurality of sensors, an output device, and a processor in communication with the plurality of sensors and the output device. The processor is configured to process data captured by the plurality of sensors and, based on that data, modify an output of the output device. The processor is also configured to determine whether a plurality of users is associated with a video conferencing session, determine which user of the plurality of users is speaking, and enhance the audio or video output of the speaking user on the output device.
Abstract:
Loudspeakers are described that may reduce comb filtering effects perceived by a listener by either 1) moving transducers closer to a sound reflective surface (e.g., a baseplate, a tabletop or a floor) through vertical (height) or rotational adjustments of the transducers or 2) guiding sound produced by the transducers to be released into the listening area proximate to the reflective surface through the use of horns and openings that are at a prescribed distance from the reflective surface. The reduction of this distance between the reflective surface and the point at which sound emitted by the transducers is released into the listening area may lead to a shorter reflected path that reduces comb filtering effects caused by reflected sounds that are delayed relative to the direct sound. Accordingly, the loudspeakers shown and described may be placed on reflective surfaces without severe audio coloration caused by reflected sounds.
Abstract:
Virtualizing speakers for a headphone set can include determining a location of a display. Locations of one or more virtual speakers can be assigned based on the location of the display. A first virtual speaker can be located at the display. A position of a head of a user can be tracked. Audio content can be spatialized with a spatial renderer to generate spatialized audio signals, based on the tracked position of the head and the locations of the virtual speakers. Other aspects are also described and claimed.
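One step a spatial renderer would need is the direction of each virtual speaker relative to the tracked head. The sketch below is a simplified 2D illustration, not the renderer the abstract describes. The coordinate convention (yaw 0 faces the +y axis, positive yaw turns toward +x), the function name, and the example positions are all assumptions.

```python
import math


def relative_azimuth_deg(head_pos, head_yaw_deg, speaker_pos):
    """Azimuth of a virtual speaker relative to where the head is facing.

    Positions are (x, y) tuples in meters. The result is in [-180, 180),
    where 0 means straight ahead and positive means toward the listener's
    right under the assumed convention.
    """
    dx = speaker_pos[0] - head_pos[0]
    dy = speaker_pos[1] - head_pos[1]
    # Bearing of the speaker in room coordinates (0 degrees along +y).
    bearing = math.degrees(math.atan2(dx, dy))
    # Subtract the head yaw and wrap the result into [-180, 180).
    return (bearing - head_yaw_deg + 180.0) % 360.0 - 180.0


# Hypothetical setup: a virtual speaker placed at a display 2 m in front.
display = (0.0, 2.0)
facing_display = relative_azimuth_deg((0.0, 0.0), 0.0, display)
turned_right = relative_azimuth_deg((0.0, 0.0), 30.0, display)
```

When the head turns 30 degrees to the right, the virtual speaker at the display moves 30 degrees to the listener's left, which is what keeps it anchored to the display as the head moves.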