Machine learning and user driven selective hearing

    公开(公告)号:US12141347B1

    公开(公告)日:2024-11-12

    申请号:US18055600

    申请日:2022-11-15

    Applicant: Apple Inc.

    Abstract: An audio processing device may generate a plurality of microphone signals from a plurality of microphones of the audio processing device. The audio processing device may determine a gaze of a user who is wearing a playback device that is separate from the audio processing device, the gaze of the user being determined relative to the audio processing device. The audio processing device may extract speech that correlates to the gaze of the user, from the plurality of microphone signals of the audio processing device by applying the plurality of microphone signals of the audio processing device and the gaze of the user to a machine learning model. The extracted speech may be played to the user through the playback device.

    Audio Capture with Multiple Devices
    25.
    发明公开

    公开(公告)号:US20240007816A1

    公开(公告)日:2024-01-04

    申请号:US18212488

    申请日:2023-06-21

    Applicant: Apple Inc.

    CPC classification number: H04S7/302

    Abstract: In one implementation, a method of visualizing a combined audio pick-up pattern is performed at a first device in a physical environment, the first device including a display, one or more processors, and non-transitory memory. The method includes determining a first audio pick-up pattern of the first device. The method includes determining one or more second audio pick-up patterns of a respective one or more second devices. The method includes determining a combined audio pick-up pattern of the first device and the one or more second devices based on the first audio pick-up pattern and the one or more second audio pick-up patterns. The method includes displaying, on the display, a representation of the combined audio pick-up pattern.
    In one implementation, a method of determining an audio emission pattern is performed at a first device at a first location, the first device having a microphone, one or more processors, and non-transitory memory. The method includes obtaining, via the microphone, first audio of a sound source. The method includes receiving, from one or more second devices, one or more second audio of the sound source. The method includes determining one or more second locations of the one or more second devices. The method includes determining an audio emission pattern of the sound source based on the first audio data, the one or more second audio data, and the one or more second locations, wherein the audio emission pattern of the sound source indicates a sound level at various locations relative to the sound source.

    VARIABLE AUDIO FOR AUDIO-VISUAL CONTENT
    28.
    发明公开

    公开(公告)号:US20230344973A1

    公开(公告)日:2023-10-26

    申请号:US18215371

    申请日:2023-06-28

    Applicant: Apple Inc.

    CPC classification number: H04N9/802 G10L25/51 G11B27/10 H04S7/302

    Abstract: Various implementations disclosed herein include devices, systems, and methods that that modify audio of played back AV content based on context in accordance with some implementations. In some implementations audio-visual content of a physical environment is obtained, and the audio-visual content includes visual content and audio content that includes a plurality of audio portions corresponding to the visual content. In some implementations, a context for presenting the audio-visual content is determined, and a temporal relationship between one or more audio portions of the plurality of audio portions and the visual content is determined based on the context. Then, synthesized audio-visual content is presented based on the temporal relationship.

    Method and System For Spatial Audio Processing Using Multiple Orders Of Ambisonics

    公开(公告)号:US20250106579A1

    公开(公告)日:2025-03-27

    申请号:US18476280

    申请日:2023-09-27

    Applicant: Apple Inc.

    Abstract: A method that includes receiving a higher-order ambisonics (HOA) representation of a sound field that includes a first plurality of audio signals, separating a second plurality of audio signals from the first plurality of audio signals that are associated with a first-order ambisonics (FOA) representation of the sound field, determining a plurality of adaptive filters based on at least some of the second plurality of audio signals, producing a plurality of output audio signals based on the first plurality of audio signals and the plurality of adaptive filters, each output audio signal having at least a portion of the sound field, and driving a plurality of speakers using the plurality of output audio signals.

Patent Agency Ranking