-
公开(公告)号:US10798511B1
公开(公告)日:2020-10-06
申请号:US16378438
申请日:2019-04-08
Applicant: Apple Inc.
Inventor: Jonathan D. Sheaffer , Juha O. Merimaa , Jason Wung , Martin E. Johnson , Peter A. Raffensperger , Joshua D. Atkins , Symeon Delikaris Manias , Mehrez Souden
IPC: H04S5/00 , G10K11/178 , H04R1/40
Abstract: Processing input audio channels for generating spatial audio can include receiving a plurality of microphone signals that capture a sound field. Each microphone signal can be transformed into a frequency domain signal. From each frequency domain signal, a direct component and a diffuse component can be extracted. The direct component can be processed with a parametric renderer. The diffuse component can be processed with a linear renderer. The components can be combined, resulting in a spatial audio output. The levels of the components can be adjusted to match a direct to diffuse ratio (DDR) of the output with the DDR of the captured sound field. Other aspects are also described and claimed.
-
公开(公告)号:US10764684B1
公开(公告)日:2020-09-01
申请号:US16147140
申请日:2018-09-28
Applicant: Apple Inc.
Inventor: Jonathan D. Sheaffer , Ashrith Deshpande , Joshua D. Atkins
Abstract: Systems, methods, and computer readable media to improve the operation of an electronic device having multiple microphones organized in an arbitrary, but known, arrangement in the device are described (i.e., having a specific form-factor). In general, techniques are disclosed for using a priori knowledge of an electronic device's spatial acoustic transfer functions to recreate or reconstitute a prior recorded three-dimensional (3D) audio field or environment. More particularly, techniques disclosed herein enable the efficient recording of a 3D audio field. That audio field may later be reconstituted using an acoustic characterization based on the device's form-factor. In addition, sensor data may be used to rotate the audio field so as to enable generating an output audio field that takes into account the listener's head position.
-
公开(公告)号:US20200105291A1
公开(公告)日:2020-04-02
申请号:US16147138
申请日:2018-09-28
Applicant: Apple Inc
Inventor: Jonathan D. Sheaffer , Peter A. Raffensperger , Ashrith Deshpande
IPC: G10L21/10 , G06F3/0481 , G06F9/54 , G06F3/16 , H04R3/10
Abstract: An audio appliance can include a microphone transducer configured to receive sound from an environment and to convert the received sound into an audio signal and a display. The audio appliance can include an audio analytics module configured to detect an audio-input impairment by analyzing the audio signal and output a detection signal identifying the audio-input impairment in real-time. The audio-input impairment can include, for example, a poor-intelligibility impairment, a microphone-occlusion impairment, a handling-noise impairment, a wind-noise impairment, or a distortion impairment. The audio appliance can also include an impairment module configured to identify and emit a user-perceptible alert corresponding to the identified audio-input impairment in real-time; and an interactive guidance module configured to present a suggested action to address the audio-input impairment in real-time. Related aspects also are described.
-
公开(公告)号:US10178490B1
公开(公告)日:2019-01-08
申请号:US15639191
申请日:2017-06-30
Applicant: Apple Inc.
Inventor: Jonathan D. Sheaffer , Joshua D. Atkins , Martin E. Johnson , Stuart J. Wood
IPC: H04R5/00 , H04S7/00 , G10L19/008 , G06T7/20
Abstract: Image analysis of a video signal is performed to produce first metadata, and audio analysis of a multi-channel sound track associated with the video signal is performed to produce second metadata. A number of time segments of the sound track are processed, wherein each time segment is processed by either (i) spatial filtering of the audio signals or (ii) spatial rendering of the audio signals, not both, wherein for each time segment a decision was made to select between the spatial filtering or the spatial rendering, in accordance with the first and second metadata. A mix of the processed sound track and the video signal is generated. Other embodiments are also described and claimed.
-
公开(公告)号:US20180359294A1
公开(公告)日:2018-12-13
申请号:US15621890
申请日:2017-06-13
Applicant: Apple Inc.
Inventor: Suzanne C. Brown , Gaetan R. Lorho , Jonathan D. Sheaffer
CPC classification number: H04L65/403 , G10L17/005 , G10L19/008 , H04R1/1008 , H04R3/005 , H04S3/008 , H04S7/305 , H04S2400/01
Abstract: In one aspect herein, a pre-processor receives audio signals for a conference call from individual callers, each of the audio signals associated with corresponding metadata, analyzes the metadata, and associates each of the audio signals with a spatial position in a virtual representation of the conference call based on the analyzation of the metadata. A spatial arrangement processor generates a binaural room impulse response associated with the spatial position of each of the audio signals to filter the received audio signals to account for the spatial position associated with each of the audio signals and to account for the effect of the virtual representation of the conference call. A head-tracking controller tracks an orientation of a listener's head using a headset. A binaural renderer produces multi-channel audio data for playback on the headset according to the orientation of the listener's head and the binaural room impulse response associated with the spatial position of each of the audio signals.
-
-
-
-