-
公开(公告)号:US20220360899A1
公开(公告)日:2022-11-10
申请号:US17630897
申请日:2020-07-27
Inventor: Alan J. Seefeldt , Joshua B. Lando , Daniel Arteaga
Abstract: Individual loudspeaker dynamics processing configuration data, for each of a plurality of loudspeakers of a listening environment, may be obtained. Listening environment dynamics processing configuration data may be determined, based on the individual loudspeaker dynamics processing configuration data. Dynamics processing may be performed on received audio data based on the listening environment dynamics processing configuration data, to generate processed audio data. The processed audio data may be rendered for reproduction via a set of loudspeakers that includes at least some of the plurality of loudspeakers, to produce rendered audio signals. The rendered audio signals may be provided to, and reproduced by, the set of loudspeakers.
-
公开(公告)号:US11195511B2
公开(公告)日:2021-12-07
申请号:US17260569
申请日:2019-07-17
Inventor: Toni Hirvonen , Daniel Arteaga , Eduard Aylon Pla , Alex Cabrer Manning , Lie Lu , Karl Jonas Roeden
Abstract: Described herein is a method for creating object-based audio content from a text input for use in audio books and/or audio play, the method including the steps of: a) receiving the text input; b) performing a semantic analysis of the received text input; c) synthesizing speech and effects based on one or more results of the semantic analysis to generate one or more audio objects; d) generating metadata for the one or more audio objects; and e) creating the object-based audio content including the one or more audio objects and the metadata. Described herein are further a computer-based system including one or more processors configured to perform said method and a computer program product comprising a computer-readable storage medium with instructions adapted to carry out said method when executed by a device having processing capability.
-
公开(公告)号:US10405120B2
公开(公告)日:2019-09-03
申请号:US15647121
申请日:2017-07-11
Inventor: Jun Wang , Giulio Cengarle , Juan Felix Torres , Daniel Arteaga
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
-
公开(公告)号:US09949052B2
公开(公告)日:2018-04-17
申请号:US15451241
申请日:2017-03-06
Inventor: Jun Wang , Giulio Cengarle , Juan Felix Torres , Daniel Arteaga
CPC classification number: H04S3/002 , H04R5/02 , H04R5/04 , H04S7/30 , H04S7/308 , H04S2400/11 , H04S2400/13 , H04S2420/03
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
-
公开(公告)号:US20240284136A1
公开(公告)日:2024-08-22
申请号:US18637073
申请日:2024-04-16
Inventor: Alan J. Seefeldt , Joshua B. Lando , Daniel Arteaga , Glenn N. Dickins , Mark Richard Paul Thomas
Abstract: A rendering mode may be determined for received audio data, including audio signals and associated spatial data. The audio data may be rendered for reproduction via a set of loudspeakers of an environment according to the rendering mode, to produce rendered audio signals. Rendering the audio data may involve determining relative activation of a set of loudspeakers in an environment. The rendering mode may be variable between a reference spatial mode and one or more distributed spatial modes. The reference spatial mode may have an assumed listening position and orientation. In the distributed spatial mode(s), one or more elements of the audio data may each be rendered in a more spatially distributed manner than in the reference spatial mode and spatial locations of remaining elements of the audio data may be warped such that they span a rendering space of the environment more completely than in the reference spatial mode.
-
公开(公告)号:US20240267679A1
公开(公告)日:2024-08-08
申请号:US18637051
申请日:2024-04-16
Inventor: Alan J. Seefeldt , Joshua B. Lando , Daniel Arteaga
CPC classification number: H04R5/04 , H04R29/001 , H04R2400/01 , H04R2430/20 , H04S2400/15
Abstract: Methods for rendering audio for playback by two or more speakers are disclosed. The audio includes one or more audio signals, each with an associated intended perceived spatial position. Relative activation of the speakers may be a cost function of a model of perceived spatial position of the audio signals when played back over the speakers, a measure of proximity of the intended perceived spatial position of the audio signals to positions of the speakers, and one or more additional dynamically configurable functions. The dynamically configurable functions may be based on at least one or more properties of the audio signals, one or more properties of the set of speakers and/or one or more external inputs.
-
公开(公告)号:US12003946B2
公开(公告)日:2024-06-04
申请号:US17630098
申请日:2020-07-16
Inventor: Alan J. Seefeldt , Joshua B. Lando , Daniel Arteaga , Glenn N. Dickins , Mark Richard Paul Thomas
Abstract: A rendering mode may be determined for received audio data, including audio signals and associated spatial data. The audio data may be rendered for reproduction via a set of loudspeakers of an environment according to the rendering mode, to produce rendered audio signals. Rendering the audio data may involve determining relative activation of a set of loudspeakers in an environment. The rendering mode may be variable between a reference spatial mode and one or more distributed spatial modes. The reference spatial mode may have an assumed listening position and orientation. In the distributed spatial mode(s), one or more elements of the audio data may each be rendered in a more spatially distributed manner than in the reference spatial mode and spatial locations of remaining elements of the audio data may be warped such that they span a rendering space of the environment more completely than in the reference spatial mode.
-
公开(公告)号:US12003673B2
公开(公告)日:2024-06-04
申请号:US17628732
申请日:2020-07-29
Inventor: Glenn N. Dickins , Christopher Graham Hines , David Gunawan , Richard J. Cartwright , Alan J. Seefeldt , Daniel Arteaga , Mark R. P. Thomas , Joshua B. Lando
CPC classification number: H04M9/082 , G10L15/22 , G10L2015/223
Abstract: An audio processing method may involve receiving output signals from each microphone of a plurality of microphones in an audio environment, the output signals corresponding to a current utterance of a person and determining, based on the output signals, one or more aspects of context information relating to the person, including an estimated current proximity of the person to one or more microphone locations. The method may involve selecting two or more loudspeaker-equipped audio devices based, at least in part, on the one or more aspects of the context information, determining one or more types of audio processing changes to apply to audio data being rendered to loudspeaker feed signals for the audio devices and causing one or more types of audio processing changes to be applied. In some examples, the audio processing changes have the effect of increasing a speech to echo ratio at one or more microphones.
-
公开(公告)号:US11968268B2
公开(公告)日:2024-04-23
申请号:US17630779
申请日:2020-07-28
Inventor: Glenn N. Dickins , Mark R. P. Thomas , Alan J. Seefeldt , Joshua B. Lando , Daniel Arteaga , Carlos Medaglia Dyonisio , David Gunawan , Richard J. Cartwright , Christopher Graham Hines
CPC classification number: H04L67/141 , H04R1/326 , H04R1/403 , H04R1/406 , H04R3/005 , H04R3/12 , H04S7/303
Abstract: An audio session management method may involve: determining, by an audio session manager, one or more first media engine capabilities of a first media engine of a first smart audio device, the first media engine being configured for managing one or more audio media streams received by the first smart audio device and for performing first smart audio device signal processing for the one or more audio media streams according to a first media engine sample clock; receiving, by the audio session manager and via a first application communication link, first application control signals from the first application; and controlling the first smart audio device according to the first media engine capabilities, by the audio session manager, via first audio session management control signals transmitted to the first smart audio device via a first smart audio device communication link and without reference to the first media engine sample clock.
-
公开(公告)号:US20220322010A1
公开(公告)日:2022-10-06
申请号:US17630910
申请日:2020-07-25
Inventor: Alan J. Seefedlt , Joshua B. Lando , Daniel Arteaga
Abstract: Methods for rendering audio for playback by two or more speakers are disclosed. The audio includes one or more audio signals, each with an associated intended perceived spatial position. Relative activation of the speakers may be a cost function of a model of perceived spatial position of the audio signals when played back over the speakers, a measure of proximity of the intended perceived spatial position of the audio signals to positions of the speakers, and one or more additional dynamically configurable functions. The dynamically configurable functions may be based on at least one or more properties of the audio signals, one or more properties of the set of speakers and/or one or more external inputs.
-
-
-
-
-
-
-
-
-