Intelligent audio rendering for video recording

    公开(公告)号:US10178490B1

    公开(公告)日:2019-01-08

    申请号:US15639191

    申请日:2017-06-30

    Applicant: Apple Inc.

    Abstract: Image analysis of a video signal is performed to produce first metadata, and audio analysis of a multi-channel sound track associated with the video signal is performed to produce second metadata. A number of time segments of the sound track are processed, wherein each time segment is processed by either (i) spatial filtering of the audio signals or (ii) spatial rendering of the audio signals, not both, wherein for each time segment a decision was made to select between the spatial filtering or the spatial rendering, in accordance with the first and second metadata. A mix of the processed sound track and the video signal is generated. Other embodiments are also described and claimed.

    Optimizing the performance of an audio playback system with a linked audio/video feed

    公开(公告)号:US10104490B2

    公开(公告)日:2018-10-16

    申请号:US15499829

    申请日:2017-04-27

    Applicant: Apple Inc.

    Abstract: An audio system is provided that efficiently detects speaker arrays and configures the speaker arrays to output sound. In this system, a computing device may record the addresses and/or types of speaker arrays on a shared network while a camera captures video of a listening area, including the speaker arrays. The captured video may be analyzed to determine the location of the speaker arrays, one or more users, and/or the audio source in the listening area. While capturing the video, the speaker arrays may be driven to sequentially emit a series of test sounds into the listening area and a user may be prompted to select which speaker arrays in the captured video emitted each of the test sounds. Based on these inputs from the user, the computing device may determine an association between the speaker arrays on the shared network and the speaker arrays in the captured video.

    Robust confidence measure for beamformed acoustic beacon for device tracking and localization

    公开(公告)号:US10061009B1

    公开(公告)日:2018-08-28

    申请号:US14867998

    申请日:2015-09-28

    Applicant: Apple Inc.

    CPC classification number: G01S3/80 G01S1/72 G01S1/74 G01S3/801 G10K11/34

    Abstract: A system and method is described for generating a confidence level for data generated by a beamforming acoustic beacon system. The system may include an audio emission device to emit a set of sounds corresponding to a set of predefined modal patterns into a listening area. The sounds may be detected by an audio capture device to produce a set of impulse responses corresponding to the modal patterns. The impulse responses may be processed to produce a set of window synthesized impulse responses for various angles. These window synthesized impulse responses may (1) be formed based on a weighted set of the modal patterns that were originally used to emanate sound and (2) seek to emulate a target beam, which is also composed of the same weighted modal patterns. A confidence level may be computed based on the difference between the window synthesized impulse responses and the target beam pattern.

    ROBUST CROSSTALK CANCELLATION USING A SPEAKER ARRAY
    107.
    发明申请
    ROBUST CROSSTALK CANCELLATION USING A SPEAKER ARRAY 有权
    使用扬声器阵列的稳定的CROSSTALK CANCELLATION

    公开(公告)号:US20160021480A1

    公开(公告)日:2016-01-21

    申请号:US14773280

    申请日:2014-03-13

    Applicant: APPLE INC.

    Abstract: An audio receiver that performs crosstalk cancellation using a speaker array is described. The audio receiver detects the location of a listener in a room and processes a piece of sound program content to be output through the speaker array using one or more beam pattern matrices. The beam pattern matrices are generated according to one or more constraints. The constraints may include increasing a right channel and decreasing a left channel at the right ear of the listener, increasing a left channel and decreasing a right channel at the left ear of the listener, and decreasing sound in all other areas of the room. These constraints cause the audio receiver to beam sound primarily towards the listener and not in other areas of the room such that crosstalk cancellation is achieved with minimal effects due to changes to the frequency response of the room. Other embodiments are also described.

    Abstract translation: 描述使用扬声器阵列执行串扰消除的音频接收器。 音频接收器检测房间中的收听者的位置,并使用一个或多个波束图案矩阵处理要通过扬声器阵列输出的一段声音节目内容。 根据一个或多个约束生成波束图案矩阵。 约束可以包括增加右声道并减少收听者右耳的左声道,增加左声道并减少收听者左耳的右声道,并减少房间所有其他区域的声音。 这些约束使得音频接收器主要向收听者发出声音,而不是在房间的其他区域中发出声音,使得由于对房间的频率响应的改变而以最小的效果来实现串扰消除。 还描述了其它实施例。

    Loudspeaker with reduced audio coloration caused by reflections from a surface

    公开(公告)号:US12192698B2

    公开(公告)日:2025-01-07

    申请号:US18377261

    申请日:2023-10-05

    Applicant: APPLE INC.

    Abstract: Loudspeakers are described that may reduce comb filtering effects perceived by a listener by either 1) moving transducers closer to a sound reflective surface (e.g., a baseplate, a tabletop or a floor) through vertical (height) or rotational adjustments of the transducers or 2) guiding sound produced by the transducers to be released into the listening area proximate to the reflective surface through the use of horns and openings that are at a prescribed distance from the reflective surface. The reduction of this distance between the reflective surface and the point at which sound emitted by the transducers is released into the listening area may lead to shorter reflected path that reduces comb filtering effects caused by reflected sounds that are delayed relative to the direct sound. Accordingly, the loudspeakers shown and described may be placed on reflective surfaces without severe audio coloration caused by reflected sounds.

Patent Agency Ranking