-
Publication No.: US11508388B1
Publication Date: 2022-11-22
Application No.: US17100802
Filing Date: 2020-11-20
Applicant: Apple Inc.
Inventor: Mehrez Souden , Symeon Delikaris Manias , Joshua D. Atkins , Ante Jukic , Ramin Pishehvar
IPC: G10L21/0232 , H04R1/40 , G10L25/30 , G06N3/08 , H04R3/00 , G10L21/0216
Abstract: A device for processing audio signals in a time-domain includes a processor configured to receive multiple audio signals corresponding to respective microphones of at least two or more microphones of the device, at least one of the multiple audio signals comprising speech of a user of the device. The processor is configured to provide the multiple audio signals to a machine learning model, the machine learning model having been trained based at least in part on an expected position of the user of the device and expected positions of the respective microphones on the device. The processor is configured to provide an audio signal that is enhanced with respect to the speech of the user relative to the multiple audio signals, wherein the audio signal is a waveform output from the machine learning model.
-
Publication No.: US20220059123A1
Publication Date: 2022-02-24
Application No.: US17514694
Filing Date: 2021-10-29
Applicant: Apple Inc.
Inventor: Jonathan D. Sheaffer , Joshua D. Atkins , Mehrez Souden , Symeon Delikaris Manias , Sean A. Ramprashad
IPC: G10L25/78 , G10L21/0272 , G06T7/246 , H04R3/00
Abstract: Processing of ambience and speech can include extracting from audio signals, ambience and speech signals. One or more spatial parameters can be generated that define spatial characteristics of ambience sound in the one or more ambience audio signals. The primary speech signal, the one or more ambience audio signals, and the spatial parameters can be encoded into one or more encoded data streams. Other aspects are described and claimed.
-
Publication No.: US20210211821A1
Publication Date: 2021-07-08
Application No.: US17128910
Filing Date: 2020-12-21
Applicant: Apple Inc.
Inventor: Gaetan R. Lorho , Jonathan D. Sheaffer , Symeon Delikaris Manias , Frank Baumgarte
Abstract: Transfer functions can describe responses of microphones or ears to sounds at different locations on a sphere. The transfer functions can be compressed by determining, based on transfer functions, a) one or more basis transfer functions, and b) spherical harmonics coefficients that describe variations of the transfer functions with respect to spherical coordinates. Other aspects are described and claimed.
-
Publication No.: US12284508B2
Publication Date: 2025-04-22
Application No.: US18079669
Filing Date: 2022-12-12
Applicant: APPLE INC.
Inventor: Shai Messingher Lang , Alexandre Da Veiga , Spencer H. Ray , Symeon Delikaris Manias
Abstract: Various implementations disclosed herein include devices, systems, and methods that display visual content as part of a 3D environment and add audio corresponding to the visual content. The audio may be spatialized to be from one or more audio source locations within the 3D environment. For example, a video may be presented on a virtual surface within an extended reality (XR) environment while audio associated with the video is spatialized to sound as if it is produced from an audio source location corresponding to that virtual surface. How the audio is provided may be determined based on the position of the viewer (e.g., the user or his/her device) relative to the presented visual content.
-
Publication No.: US20250080911A1
Publication Date: 2025-03-06
Application No.: US18803081
Filing Date: 2024-08-13
Applicant: Apple Inc.
Inventor: Symeon Delikaris Manias , Afrooz Family , Shai Messingher Lang , Ronald J. Guglielmone, JR.
Abstract: A method that includes driving a first speaker of a first electronic device using a mix of a first audio signal and a second audio signal. The method determines that the first electronic device is within a threshold distance of a second electronic device within an environment in which the first electronic device is located, where the second electronic device includes a second speaker. Responsive to determining that the first electronic device is within the threshold distance, causing the second electronic device to playback the second audio signal through the speaker and driving the first speaker using the first audio signal instead of the mix.
-
Publication No.: US20240098442A1
Publication Date: 2024-03-21
Application No.: US18458077
Filing Date: 2023-08-29
Applicant: Apple Inc.
Inventor: Shai Messingher Lang , Joshua D. Atkins , Scott A. Wardle , Symeon Delikaris Manias
IPC: H04S7/00
CPC classification number: H04S7/302 , H04S2400/11
Abstract: An audio processing system may obtain a size of a visual object to present to a display. The audio processing system may determine a virtual placement for each of a plurality of virtual speakers at least based on the size of the visual object. Each of the plurality of virtual speakers may be spatially rendered at each virtual placement through binaural audio, for playback through head-worn speakers. Other aspects are also described and claimed.
-
Publication No.: US20240096335A1
Publication Date: 2024-03-21
Application No.: US18454409
Filing Date: 2023-08-23
Applicant: Apple Inc.
Inventor: Sina Zamani , Moo Young Kim , Dipanjan Sen , Sang Uk Ryu , Juha O. Merimaa , Symeon Delikaris Manias
IPC: G10L19/008 , G10L25/03
CPC classification number: G10L19/008 , G10L25/03
Abstract: In one aspect, a computer-implemented method, includes obtaining object audio and metadata that spatially describes the object audio, converting the object audio to time-frequency domain Ambisonics audio based on the metadata, and encoding the time-frequency domain Ambisonics audio and a subset of the metadata as one or more bitstreams to be stored in computer-readable memory or transmitted to a remote device.
-
Publication No.: US11841899B2
Publication Date: 2023-12-12
Application No.: US16899019
Filing Date: 2020-06-11
Applicant: Apple Inc.
Inventor: Jonathan D. Sheaffer , Symeon Delikaris Manias , Gaetan R. Lorho , Peter A. Raffensperger , Eric A. Allamanche , Frank Baumgarte , Dipanjan Sen , Joshua D. Atkins , Juha O. Merimaa
IPC: G06F16/683 , G06F16/174 , H04R1/40 , H04R3/00
CPC classification number: G06F16/683 , G06F16/1744 , H04R1/406 , H04R3/005 , H04R2410/00
Abstract: A device with microphones can generate microphone signals during an audio recording. The device can store, in an electronic audio data file, the microphone signals, and metadata that includes impulse responses of the microphones. Other aspects are described and claimed.
-
Publication No.: US11758348B1
Publication Date: 2023-09-12
Application No.: US17570251
Filing Date: 2022-01-06
Applicant: Apple Inc.
Inventor: Jared King , Shai Messingher Lang , Symeon Delikaris Manias
CPC classification number: H04S7/303 , H04S5/00 , H04S2420/11
Abstract: Each of a plurality of virtual loudspeaker arrays and their channels are produced, based on a corresponding microphone array and microphone signals thereof. Channels of a hallucinated loudspeaker array are determined based on the channels of the plurality of virtual loudspeaker arrays. The plurality of virtual loudspeaker arrays and the hallucinated loudspeaker array share a common geometry and orientation. Spatial audio is rendered based on the channels of the hallucinated loudspeaker array.
-
Publication No.: US20230262406A1
Publication Date: 2023-08-17
Application No.: US18079669
Filing Date: 2022-12-12
Applicant: APPLE INC.
Inventor: Shai Messingher Lang , Alexandre Da Veiga , Spencer H. Ray , Symeon Delikaris Manias
CPC classification number: H04S7/303 , H04S3/008 , G06F3/011 , H04S2400/11 , H04S2400/01
Abstract: Various implementations disclosed herein include devices, systems, and methods that display visual content as part of a 3D environment and add audio corresponding to the visual content. The audio may be spatialized to be from one or more audio source locations within the 3D environment. For example, a video may be presented on a virtual surface within an extended reality (XR) environment while audio associated with the video is spatialized to sound as if it is produced from an audio source location corresponding to that virtual surface. How the audio is provided may be determined based on the position of the viewer (e.g., the user or his/her device) relative to the presented visual content.