-
Publication No.: US20250104719A1
Publication date: 2025-03-27
Application No.: US18476267
Application date: 2023-09-27
Applicant: Apple Inc.
Inventor: Symeon Delikaris Manias
IPC: G10L19/008 , G10L25/03
Abstract: A method that includes receiving audio content in a first-order ambisonics (FOA) format that includes a first plurality of audio signals, producing a plurality of spatially rendered audio signals by spatially rendering the first plurality of audio signals according to a layout of a virtual loudspeaker array, determining one or more filters by performing a parametric analysis upon at least one of the first plurality of audio signals, filtering at least one of the plurality of spatially rendered audio signals using the one or more filters, and producing a second plurality of audio signals in a higher-order ambisonics (HOA) format based on the plurality of spatially rendered audio signals.
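The processing chain in this abstract (render FOA to a virtual loudspeaker array, filter the loudspeaker feeds using a parametric analysis, re-encode to HOA) can be sketched for a single time sample as follows. This is an illustration under stated assumptions, not the patented method: the icosahedral layout, the ACN/SN3D channel convention, the projection decoder, and the `sharpness` gain rule are all hypothetical choices, and the function names are invented for this example.

```python
import math

def sh_order2(x, y, z):
    """Real spherical harmonics (ACN order, SN3D scaling) up to order 2 for a unit vector."""
    s3 = math.sqrt(3.0)
    return [1.0, y, z, x,  # orders 0 and 1: W, Y, Z, X
            s3 * x * y, s3 * y * z, 0.5 * (3 * z * z - 1),
            s3 * x * z, 0.5 * s3 * (x * x - y * y)]

def icosahedron():
    """12 unit vectors at icosahedron vertices (assumed virtual loudspeaker layout)."""
    phi = (1 + math.sqrt(5)) / 2
    n = math.sqrt(1 + phi * phi)
    pts = []
    for a in (-1.0, 1.0):
        for b in (-phi, phi):
            pts += [(0.0, a / n, b / n), (a / n, b / n, 0.0), (b / n, 0.0, a / n)]
    return pts

def foa_to_hoa(foa, layout, sharpness=0):
    """Upmix one FOA sample [W, Y, Z, X] to 9 second-order channels."""
    w, y, z, x = foa
    # 1) Spatial rendering: projection decode onto the virtual loudspeakers.
    feeds = [(w + 3 * (x * ux + y * uy + z * uz)) / len(layout)
             for ux, uy, uz in layout]
    # 2) Parametric analysis: direction estimate from the intensity vector W*(X, Y, Z).
    ix, iy, iz = w * x, w * y, w * z
    norm = math.sqrt(ix * ix + iy * iy + iz * iz)
    if sharpness > 0 and norm > 0:
        d = (ix / norm, iy / norm, iz / norm)
        # Hypothetical filter: favor loudspeakers close to the estimated direction.
        gains = [max(0.0, d[0] * ux + d[1] * uy + d[2] * uz) ** sharpness
                 for ux, uy, uz in layout]
        feeds = [g * s for g, s in zip(gains, feeds)]
        total = sum(feeds)
        if abs(total) > 1e-12:
            feeds = [s * w / total for s in feeds]  # keep the omni channel unchanged
    # 3) Re-encode the (filtered) loudspeaker feeds into second-order ambisonics.
    hoa = [0.0] * 9
    for s, (ux, uy, uz) in zip(feeds, layout):
        for k, c in enumerate(sh_order2(ux, uy, uz)):
            hoa[k] += s * c
    return hoa

layout = icosahedron()
foa = [1.0, 0.0, 0.0, 1.0]          # unit plane wave arriving from +x
linear = foa_to_hoa(foa, layout)     # no parametric filtering
sharpened = foa_to_hoa(foa, layout, sharpness=4)
```

With `sharpness=0` the chain is purely linear, so the first four output channels reproduce the input and the second-order channels come out as zero. That suggests why a parametric filtering stage is useful here: only the signal-dependent gains put energy into the higher-order channels.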
-
Publication No.: US20240312468A1
Publication date: 2024-09-19
Application No.: US18605688
Application date: 2024-03-14
Applicant: Apple Inc.
Inventor: Ismael H. Nawfal , Symeon Delikaris Manias , Mehrez Souden , Joshua D. Atkins
IPC: G10L19/008 , H04S7/00
CPC classification number: G10L19/008 , H04S7/30 , H04S2420/11
Abstract: A sound scene is represented as first order Ambisonics (FOA) audio. A processor formats each signal of the FOA audio to a stream of audio frames, provides the formatted FOA audio to a machine learning model that reformats the formatted FOA audio in a target or desired higher order Ambisonics (HOA) format, and obtains output audio of the sound scene in the desired HOA format from the machine learning model. The output audio in the desired HOA format may then be rendered according to a playback audio format of choice. Other aspects are also described and claimed.
-
Publication No.: US20240284137A1
Publication date: 2024-08-22
Application No.: US18438312
Application date: 2024-02-09
Applicant: Apple Inc.
Inventor: Shai Messingher Lang , Symeon Delikaris Manias
IPC: H04S7/00
Abstract: A method may include determining a location of a user, determining a virtual playback format based on the location of the user, where the virtual playback format includes a position of a virtual speaker that is fixed in an environment of the user, and determining an acoustic model based on the location of the user. The method also includes rendering audio at the playback device based on the acoustic model and the virtual playback format.
-
Publication No.: US11546692B1
Publication date: 2023-01-03
Application No.: US17370679
Application date: 2021-07-08
Applicant: Apple Inc.
Inventor: Symeon Delikaris Manias , Mehrez Souden , Ante Jukic , Matthew S. Connolly , Sabine Webel , Ronald J. Guglielmone, Jr.
Abstract: An audio renderer can have a machine learning model that jointly processes audio and visual information of an audiovisual recording. The audio renderer can generate output audio channels. Sounds captured in the audiovisual recording and present in the output audio channels are spatially mapped based on the joint processing of the audio and visual information by the machine learning model. Other aspects are described.
-
Publication No.: US11252525B2
Publication date: 2022-02-15
Application No.: US17128910
Application date: 2020-12-21
Applicant: Apple Inc.
Inventor: Gaetan R. Lorho , Jonathan D. Sheaffer , Symeon Delikaris Manias , Frank Baumgarte
Abstract: Transfer functions can describe responses of microphones or ears to sounds at different locations on a sphere. The transfer functions can be compressed by determining, based on transfer functions, a) one or more basis transfer functions, and b) spherical harmonics coefficients that describe variations of the transfer functions with respect to spherical coordinates. Other aspects are described and claimed.
-
Publication No.: US10798511B1
Publication date: 2020-10-06
Application No.: US16378438
Application date: 2019-04-08
Applicant: Apple Inc.
Inventor: Jonathan D. Sheaffer , Juha O. Merimaa , Jason Wung , Martin E. Johnson , Peter A. Raffensperger , Joshua D. Atkins , Symeon Delikaris Manias , Mehrez Souden
IPC: H04S5/00 , G10K11/178 , H04R1/40
Abstract: Processing input audio channels for generating spatial audio can include receiving a plurality of microphone signals that capture a sound field. Each microphone signal can be transformed into a frequency domain signal. From each frequency domain signal, a direct component and a diffuse component can be extracted. The direct component can be processed with a parametric renderer. The diffuse component can be processed with a linear renderer. The components can be combined, resulting in a spatial audio output. The levels of the components can be adjusted to match a direct to diffuse ratio (DDR) of the output with the DDR of the captured sound field. Other aspects are also described and claimed.
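The final level-adjustment step in this abstract can be illustrated with a small sketch. It assumes the DDR is an energy ratio over a block of samples and that only the diffuse component is rescaled; the abstract states only that component levels are adjusted, so both choices and the function names are hypothetical.

```python
import math

def energy(signal):
    """Sum of squared samples over a block."""
    return sum(v * v for v in signal)

def match_ddr(direct_out, diffuse_out, target_ddr):
    """Scale the diffuse output so energy(direct) / energy(diffuse) equals target_ddr."""
    e_dir = energy(direct_out)
    e_dif = energy(diffuse_out)
    if e_dir == 0 or e_dif == 0 or target_ddr <= 0:
        return list(diffuse_out)  # ratio undefined, leave the levels untouched
    gain = math.sqrt(e_dir / (target_ddr * e_dif))
    return [gain * v for v in diffuse_out]

# Captured sound field: direct energy 4, diffuse energy 1, so DDR = 4.
captured_direct = [1.0, -1.0, 1.0, -1.0]
captured_diffuse = [0.5, 0.5, -0.5, -0.5]
target = energy(captured_direct) / energy(captured_diffuse)

# Renderer outputs whose levels have drifted apart.
direct_out = [2.0, -2.0, 2.0, -2.0]
diffuse_out = [0.1, 0.1, -0.1, -0.1]

diffuse_scaled = match_ddr(direct_out, diffuse_out, target)
mixed = [a + b for a, b in zip(direct_out, diffuse_scaled)]
```

Scaling by the square root of the energy ratio keeps the direct path untouched, which is one simple way to restore the captured balance before the two components are summed.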
-
Publication No.: US12283289B2
Publication date: 2025-04-22
Application No.: US17514694
Application date: 2021-10-29
Applicant: Apple Inc.
Inventor: Jonathan D. Sheaffer , Joshua D. Atkins , Mehrez Souden , Symeon Delikaris Manias , Sean A. Ramprashad
Abstract: Processing of ambience and speech can include extracting from audio signals, ambience and speech signals. One or more spatial parameters can be generated that define spatial characteristics of ambience sound in the one or more ambience audio signals. The primary speech signal, the one or more ambience audio signals, and the spatial parameters can be encoded into one or more encoded data streams. Other aspects are described and claimed.
-
Publication No.: US20250113158A1
Publication date: 2025-04-03
Application No.: US18477491
Application date: 2023-09-28
Applicant: Apple Inc.
Inventor: Symeon Delikaris Manias , Juha O. Merimaa
IPC: H04S7/00
Abstract: A method performed by at least one programmed processor of an electronic device includes receiving audio content and, for a first group of points associated with a three-dimensional (3D) sound field of the audio content, estimating a first group of spatial parameters associated with the audio content. The method generates a second group of points associated with a region of the 3D sound field of the audio content based on a comparison of the estimated spatial parameters of the first group of points, where the first group of points includes fewer points than the second group of points. For the second group of points, the method estimates a second group of spatial parameters associated with the audio content and stores the second group of spatial parameters associated with the audio content.
-
Publication No.: US20240388860A1
Publication date: 2024-11-21
Application No.: US18664057
Application date: 2024-05-14
Applicant: Apple Inc.
Inventor: Symeon Delikaris Manias , Shai Messingher Lang
Abstract: A method includes obtaining an acoustic model of an environment. The acoustic model indicates a set of one or more acoustical properties of the environment. The method includes determining a placement location for a microphone within the environment based on the set of one or more acoustical properties of the environment and a pickup pattern of the microphone. The method includes displaying, on the display, a representation of the environment and a visual indicator that is overlaid onto the representation of the environment in order to indicate the placement location for the microphone.
-
Publication No.: US20240107259A1
Publication date: 2024-03-28
Application No.: US18458965
Application date: 2023-08-30
Applicant: Apple Inc.
Inventor: Yoo Mi Hur , Ashrith Deshpande , Prateek Murgai , Joshua D. Atkins , Symeon Delikaris Manias
CPC classification number: H04S7/307 , H04R3/005 , H04R5/027 , H04S3/008 , H04S2400/01 , H04S2400/11 , H04S2400/15 , H04S2420/11
Abstract: A device may include microphones worn on a head of a user. The device may include a processor, configured to obtain microphone signals from the plurality of microphones. The processor may attenuate breathing sound from the user by processing the microphone signals, resulting in attenuated microphone signals. The processor may render one or more output audio channels based on the plurality of attenuated microphone signals.