Patent search ap:("APPLE INC.") AND inv:"Symeon Delikaris Manias" Page 2

11.

发明授权
Microphone array based deep learning for time-domain speech signal extraction 有权

公开(公告)号：US11508388B1

公开(公告)日：2022-11-22

申请号：US17100802

申请日：2020-11-20

Applicant: Apple Inc.

Inventor： Mehrez Souden , Symeon Delikaris Manias , Joshua D. Atkins , Ante Jukic , Ramin Pishehvar

IPC: G10L21/0232 , H04R1/40 , G10L25/30 , G06N3/08 , H04R3/00 , G10L21/0216

Abstract: A device for processing audio signals in a time-domain includes a processor configured to receive multiple audio signals corresponding to respective microphones of at least two or more microphones of the device, at least one of the multiple audio signals comprising speech of a user of the device. The processor is configured to provide the multiple audio signals to a machine learning model, the machine learning model having been trained based at least in part on an expected position of the user of the device and expected positions of the respective microphones on the device. The processor is configured to provide an audio signal that is enhanced with respect to the speech of the user relative to the multiple audio signals, wherein the audio signal is a waveform output from the machine learning model.

12.

发明申请
SEPARATING AND RENDERING VOICE AND AMBIENCE SIGNALS 有权

公开(公告)号：US20220059123A1

公开(公告)日：2022-02-24

申请号：US17514694

申请日：2021-10-29

Applicant: Apple Inc.

Inventor： Jonathan D. Sheaffer , Joshua D. Atkins , Mehrez Souden , Symeon Delikaris Manias , Sean A. Ramprashad

IPC: G10L25/78 , G10L21/0272 , G06T7/246 , H04R3/00

Abstract: Processing of ambience and speech can include extracting from audio signals, ambience and speech signals. One or more spatial parameters can be generated that define spatial characteristics of ambience sound in the one or more ambience audio signals. The primary speech signal, the one or more ambience audio signals, and the spatial parameters can be encoded into one or more encoded data streams. Other aspects are described and claimed.

13.

发明申请
COMPRESSING SPATIAL ACOUSTIC TRANSFER FUNCTIONS 有权

公开(公告)号：US20210211821A1

公开(公告)日：2021-07-08

申请号：US17128910

申请日：2020-12-21

Applicant: Apple Inc.

Inventor： Gaetan R. Lorho , Jonathan D. Sheaffer , Symeon Delikaris Manias , Frank Baumgarte

IPC: H04S7/00 , H04R1/40 , H04R3/00 , H04R5/027 , H04R3/04 , H04R29/00

Abstract: Transfer functions can describe responses of microphones or ears to sounds at different locations on a sphere. The transfer functions can be compressed by determining, based on transfer functions, a) one or more basis transfer functions, and b) spherical harmonics coefficients that describe variations of the transfer functions with respect to spherical coordinates. Other aspects are described and claimed.

14.

发明授权
Visual content presentation with viewer position-based audio 有权

公开(公告)号：US12284508B2

公开(公告)日：2025-04-22

申请号：US18079669

申请日：2022-12-12

Applicant: APPLE INC.

Inventor： Shai Messingher Lang , Alexandre Da Veiga , Spencer H. Ray , Symeon Delikaris Manias

IPC: H04S7/00 , G06F3/01 , H04S3/00

Abstract: Various implementations disclosed herein include devices, systems, and methods that display visual content as part of a 3D environment and add audio corresponding to the visual content. The audio may be spatialized to be from one or more audio source locations within the 3D environment. For example, a video may be presented on a virtual surface within an extended reality (XR) environment while audio associated with the video is spatialized to sound as if it is produced from an audio source location corresponding to that virtual surface. How the audio is provided may be determined based on the position of the viewer (e.g., the user or his/her device) relative to the presented visual content.

15.

发明申请
Method and System for Selective Audio Playback on A Loudspeaker and A Headset 有权

公开(公告)号：US20250080911A1

公开(公告)日：2025-03-06

申请号：US18803081

申请日：2024-08-13

Applicant: Apple Inc.

Inventor： Symeon Delikaris Manias , Afrooz Family , Shai Messingher Lang , Ronald J. Guglielmone, JR.

IPC: H04R5/033 , H04R3/12

Abstract: A method that includes driving a first speaker of a first electronic device using a mix of a first audio signal and a second audio signal. The method determines that the first electronic device is within a threshold distance of a second electronic device within an environment in which the first electronic device is located, where the second electronic device includes a second speaker. Responsive to determining that the first electronic device is within the threshold distance, causing the second electronic device to playback the second audio signal through the speaker and driving the first speaker using the first audio signal instead of the mix.

16.

发明公开
Spatial Blending of Audio 审中-公开

公开(公告)号：US20240098442A1

公开(公告)日：2024-03-21

申请号：US18458077

申请日：2023-08-29

Applicant: Apple Inc.

Inventor： Shai Messingher Lang , Joshua D. Atkins , Scott A. Wardle , Symeon Delikaris Manias

IPC: H04S7/00

CPC classification number: H04S7/302 , H04S2400/11

Abstract: An audio processing system may obtain a size of a visual object to present to a display. The audio processing system may determine a virtual placement for each of a plurality of virtual speakers at least based on the size of the visual object. Each of the plurality of virtual speakers may be spatially rendered at each virtual placement through binaural audio, for playback through head-worn speakers. Other aspects are also described and claimed.

17.

发明公开
Object Audio Coding 审中-公开

公开(公告)号：US20240096335A1

公开(公告)日：2024-03-21

申请号：US18454409

申请日：2023-08-23

Applicant: Apple Inc.

Inventor： Sina Zamani , Moo Young Kim , Dipanjan Sen , Sang Uk Ryu , Juha O. Merimaa , Symeon Delikaris Manias

IPC: G10L19/008 , G10L25/03

CPC classification number: G10L19/008 , G10L25/03

Abstract: In one aspect, a computer-implemented method, includes obtaining object audio and metadata that spatially describes the object audio, converting the object audio to time-frequency domain Ambisonics audio based on the metadata, and encoding the time-frequency domain Ambisonics audio and a subset of the metadata as one or more bitstreams to be stored in computer-readable memory or transmitted to a remote device.

18.

发明授权
Spatial audio file format for storing capture metadata 有权

公开(公告)号：US11841899B2

公开(公告)日：2023-12-12

申请号：US16899019

申请日：2020-06-11

Applicant: Apple Inc.

Inventor： Jonathan D. Sheaffer , Symeon Delikaris Manias , Gaetan R. Lorho , Peter A. Raffensperger , Eric A. Allamanche , Frank Baumgarte , Dipanjan Sen , Joshua D. Atkins , Juha O. Merimaa

IPC: G06F16/683 , G06F16/174 , H04R1/40 , H04R3/00

CPC classification number: G06F16/683 , G06F16/1744 , H04R1/406 , H04R3/005 , H04R2410/00

Abstract: A device with microphones can generate microphone signals during an audio recording. The device can store, in an electronic audio data file, the microphone signals, and metadata that includes impulse responses of the microphones. Other aspects are described and claimed.

19.

发明授权
Auditory origin synthesis 有权

公开(公告)号：US11758348B1

公开(公告)日：2023-09-12

申请号：US17570251

申请日：2022-01-06

Applicant: Apple Inc.

Inventor： Jared King , Shai Messingher Lang , Symeon Delikaris Manias

IPC: H04S7/00 , H04S5/00

CPC classification number: H04S7/303 , H04S5/00 , H04S2420/11

Abstract: Each of a plurality of virtual loudspeaker arrays and their channels are produced, based on a corresponding microphone array and microphone signals thereof. Channels of a hallucinated loudspeaker array are determined based on the channels of the plurality of virtual loudspeaker arrays. The plurality of virtual loudspeaker arrays and the hallucinated loudspeaker array share a common geometry and orientation. Spatial audio is rendered based on the channels of the hallucinated loudspeaker array.

20.

发明公开
VISUAL CONTENT PRESENTATION WITH VIEWER POSITION-BASED AUDIO 审中-公开

公开(公告)号：US20230262406A1

公开(公告)日：2023-08-17

申请号：US18079669

申请日：2022-12-12

Applicant: APPLE INC.

Inventor： Shai Messingher Lang , Alexandre Da Veiga , Spencer H. Ray , Symeon Delikaris Manias

IPC: H04S7/00 , H04S3/00 , G06F3/01

CPC classification number: H04S7/303 , H04S3/008 , G06F3/011 , H04S2400/11 , H04S2400/01

Abstract: Various implementations disclosed herein include devices, systems, and methods that display visual content as part of a 3D environment and add audio corresponding to the visual content. The audio may be spatialized to be from one or more audio source locations within the 3D environment. For example, a video may be presented on a virtual surface within an extended reality (XR) environment while audio associated with the video is spatialized to sound as if it is produced from an audio source location corresponding to that virtual surface. How the audio is provided may be determined based on the position of the viewer (e.g., the user or his/her device) relative to the presented visual content.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification