Variable audio for audio-visual content

    公开(公告)号:US11729363B2

    公开(公告)日:2023-08-15

    申请号:US17826546

    申请日:2022-05-27

    Applicant: Apple Inc.

    CPC classification number: H04N9/802 G10L25/51 G11B27/10 H04S7/302

    Abstract: Various implementations disclosed herein include devices, systems, and methods that that modify audio of played back AV content based on context in accordance with some implementations. In some implementations audio-visual content of a physical environment is obtained, and the audio-visual content includes visual content and audio content that includes a plurality of audio portions corresponding to the visual content. In some implementations, a context for presenting the audio-visual content is determined, and a temporal relationship between one or more audio portions of the plurality of audio portions and the visual content is determined based on the context. Then, synthesized audio-visual content is presented based on the temporal relationship.

    Head-tracked spatial audio
    33.
    发明授权

    公开(公告)号:US11546687B1

    公开(公告)日:2023-01-03

    申请号:US17392069

    申请日:2021-08-02

    Applicant: Apple Inc.

    Abstract: Spatial filters are generated that map response of an audio capture device to head related transfer functions (HRTFs) for different positions of the audio capture device relative to the HRTFs. A current set of spatial filters are determined based on the plurality of spatial filters and a head position of a user. The microphone signals are convolved with the current set of spatial filters, resulting in a left audio channel and right audio channel that form output binaural audio channels. The binaural audio channels can be used to drive speakers of a headphone set to generate sound that is perceived to have a spatial quality. Other aspects are described and claimed.

    VARIABLE AUDIO FOR AUDIO-VISUAL CONTENT

    公开(公告)号:US20220368875A1

    公开(公告)日:2022-11-17

    申请号:US17826546

    申请日:2022-05-27

    Applicant: Apple Inc.

    Abstract: Various implementations disclosed herein include devices, systems, and methods that that modify audio of played back AV content based on context in accordance with some implementations. In some implementations audio-visual content of a physical environment is obtained, and the audio-visual content includes visual content and audio content that includes a plurality of audio portions corresponding to the visual content. In some implementations, a context for presenting the audio-visual content is determined, and a temporal relationship between one or more audio portions of the plurality of audio portions and the visual content is determined based on the context. Then, synthesized audio-visual content is presented based on the temporal relationship.

    Time domain neural networks for spatial audio reproduction

    公开(公告)号:US11490218B1

    公开(公告)日:2022-11-01

    申请号:US17134097

    申请日:2020-12-24

    Applicant: Apple Inc.

    Abstract: A device for reproducing spatial audio using a machine learning model may include at least one processor configured to receive multiple audio signals corresponding to a sound scene captured by respective microphones of a device. The at least one processor may be further configured to provide the multiple audio signals to a machine learning model, the machine learning model having been trained based at least in part on a target rendering configuration. The at least one processor may be further configured to provide, responsive to providing the multiple audio signals to the machine learning model, multichannel audio signals that comprise a spatial reproduction of the sound scene in accordance with the target rendering configuration.

    Variable audio for audio-visual content

    公开(公告)号:US11381797B2

    公开(公告)日:2022-07-05

    申请号:US17358263

    申请日:2021-06-25

    Applicant: Apple Inc.

    Abstract: Various implementations disclosed herein include devices, systems, and methods that that modify audio of played back AV content based on context in accordance with some implementations. In some implementations audio-visual content of a physical environment is obtained, and the audio-visual content includes visual content and audio content that includes a plurality of audio portions corresponding to the visual content. In some implementations, a context for presenting the audio-visual content is determined, and a temporal relationship between one or more audio portions of the plurality of audio portions and the visual content is determined based on the context. Then, synthesized audio-visual content is presented based on the temporal relationship.

    VARIABLE AUDIO FOR AUDIO-VISUAL CONTENT

    公开(公告)号:US20210321070A1

    公开(公告)日:2021-10-14

    申请号:US17358263

    申请日:2021-06-25

    Applicant: Apple Inc.

    Abstract: Various implementations disclosed herein include devices, systems, and methods that that modify audio of played back AV content based on context in accordance with some implementations. In some implementations audio-visual content of a physical environment is obtained, and the audio-visual content includes visual content and audio content that includes a plurality of audio portions corresponding to the visual content. In some implementations, a context for presenting the audio-visual content is determined, and a temporal relationship between one or more audio portions of the plurality of audio portions and the visual content is determined based on the context. Then, synthesized audio-visual content is presented based on the temporal relationship.

    System and method for performing panning for an arbitrary loudspeaker setup

    公开(公告)号:US10609485B2

    公开(公告)日:2020-03-31

    申请号:US16000281

    申请日:2018-06-05

    Applicant: Apple Inc.

    Abstract: Placement of one or two placed virtual loudspeakers within a loudspeaker setup that includes a real loudspeakers is determined and vector base amplitude panning (VBAP) gains including the gains of the real loudspeakers and placed one or two virtual loudspeakers are also then determined. Gains of one or two placed virtual loudspeakers are redistributed to the real loudspeakers to ensure preservation of total energy. Real loudspeakers in the loudspeaker setup have redistributed gains of one or two placed virtual loudspeakers. Loudspeaker outputs are generated and transmitted to the real loudspeakers to be played back. When received audio content is ambisonics content, a predetermined grid is generated and HOA content is projected to the grid. Other aspects are also described.

Patent Agency Ranking