-
公开(公告)号:US11937074B2
公开(公告)日:2024-03-19
申请号:US17161569
申请日:2021-01-28
Inventor: Michael William Mason , Juan Felix Torres , Antonio Mateos Sole , Daniel Arteaga , Adam J. Mills , Mark David de Burgh , Andrew Robert Owen
CPC classification number: H04S7/307 , H04R3/04 , H04R3/12 , H04R5/04 , H04S7/303 , H04S2400/07 , H04S2420/11
Abstract: The present document relates to methods and apparatus for rendering input audio for playback in a playback environment. The input audio includes at least one audio object and associated metadata, and the associated metadata indicates at least a location of the audio object. A method for rendering input audio including divergence metadata for playback in a playback environment comprises creating two additional audio objects associated with the audio object such that respective locations of the two additional audio objects are evenly spaced from the location of the audio object, on opposite sides of the location of the audio object when seen from an intended listener's position in the playback environment, determining respective weight factors for application to the audio object and the two additional audio objects, and rendering the audio object and the two additional audio objects to one or more speaker feeds in accordance with the determined weight factors. The present document further relates to methods and apparatus for rendering audio input including extent metadata and/or diffuseness metadata for playback in a playback environment.
-
公开(公告)号:US11843930B2
公开(公告)日:2023-12-12
申请号:US17833761
申请日:2022-06-06
Inventor: Jun Wang , Giulio Cengarle , Juan Felix Torres , Daniel Arteaga
CPC classification number: H04S3/002 , H04S7/30 , H04S7/308 , H04R5/02 , H04R5/04 , H04S2400/11 , H04S2400/13 , H04S2420/03
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
-
公开(公告)号:US20220386053A1
公开(公告)日:2022-12-01
申请号:US17833761
申请日:2022-06-06
Inventor: Jun Wang , Giulio Cengarle , Juan Felix Torres , Daniel Arteaga
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
-
公开(公告)号:US10897682B2
公开(公告)日:2021-01-19
申请号:US16555126
申请日:2019-08-29
Inventor: Jun Wang , Giulio Cengarle , Juan Felix Torres , Daniel Arteaga
Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
-
公开(公告)号:US10779084B2
公开(公告)日:2020-09-15
申请号:US16336861
申请日:2017-09-28
Inventor: Daniel Arteaga , Giulio Cengarle , David Matthew Fischer , Antonio Mateos Sole , Davide Scaini , Alan J. Seefeldt
Abstract: Embodiments are described for a method for localizing a set of speakers (106) and microphones (108), having only the times of arrival between each of the speakers and microphones. An autodiscovery process (107) uses an external input to set: a global translation (3 continuous parameters), a global rotation (3 continuous parameters), and discrete symmetries, i.e., an exchange of any axis pairs and/or reversal of any axis. Different time of arrival acquisition techniques may be used, such as ultrasonic sweeps or generic multitrack audio content. The autodiscovery algorithm is based in minimizing a certain cost function, and the process allows for latencies in the recordings, possibly linked to the latencies in the emission.
-
公开(公告)号:US20240323608A1
公开(公告)日:2024-09-26
申请号:US18732550
申请日:2024-06-03
Inventor: Alan J. Seefeldt , Joshua B. Lando , Daniel Arteaga
Abstract: Individual loudspeaker dynamics processing configuration data, for each of a plurality of loudspeakers of a listening environment, may be obtained. Listening environment dynamics processing configuration data may be determined, based on the individual loudspeaker dynamics processing configuration data. Dynamics processing may be performed on received audio data based on the listening environment dynamics processing configuration data, to generate processed audio data. The processed audio data may be rendered for reproduction via a set of loudspeakers that includes at least some of the plurality of loudspeakers, to produce rendered audio signals. The rendered audio signals may be provided to, and reproduced by, the set of loudspeakers.
-
公开(公告)号:US20240305952A1
公开(公告)日:2024-09-12
申请号:US18606301
申请日:2024-03-15
Inventor: Michael William Mason , Juan Felix Torres , Antonio Mateos Sole , Daniel Arteaga , Adam J. Mills , Mark David de Burgh , Andrew Robert Owen
CPC classification number: H04S7/307 , H04R3/04 , H04R3/12 , H04R5/04 , H04S7/303 , H04S2400/07 , H04S2420/11
Abstract: The present document relates to methods and apparatus for rendering input audio for playback in a playback environment. The input audio includes at least one audio object and associated metadata, and the associated metadata indicates at least a location of the audio object. A method for rendering input audio including divergence metadata for playback in a playback environment comprises creating two additional audio objects associated with the audio object such that respective locations of the two additional audio objects are evenly spaced from the location of the audio object, on opposite sides of the location of the audio object when seen from an intended listener's position in the playback environment, determining respective weight factors for application to the audio object and the two additional audio objects, and rendering the audio object and the two additional audio objects to one or more speaker feeds in accordance with the determined weight factors.
-
公开(公告)号:US20240163340A1
公开(公告)日:2024-05-16
申请号:US18415544
申请日:2024-01-17
Inventor: Glenn N. Dickins , Mark R.P. Thomas , Alan J. Seefeldt , Joshua B. Lando , Daniel Arteaga , Carlos Medaglia Dyonisio , David Gunawan , Richard J. Cartwright , Christopher Graham Hines
CPC classification number: H04L67/141 , H04R1/326 , H04R1/403 , H04R1/406 , H04R3/005 , H04R3/12 , H04S7/303
Abstract: An audio session management method may involve: determining, by an audio session manager, one or more first media engine capabilities of a first media engine of a first smart audio device, the first media engine being configured for managing one or more audio media streams received by the first smart audio device and for performing first smart audio device signal processing for the one or more audio media streams according to a first media engine sample clock; receiving, by the audio session manager and via a first application communication link, first application control signals from the first application; and controlling the first smart audio device according to the first media engine capabilities, by the audio session manager, via first audio session management control signals transmitted to the first smart audio device via a first smart audio device communication link and without reference to the first media engine sample clock.
-
公开(公告)号:US20230208921A1
公开(公告)日:2023-06-29
申请号:US17630779
申请日:2020-07-28
Inventor: Glenn N. Dickins , Mark Richard Paul Thomas , Alan J. Seefeldt , Joshua B. Lando , Daniel Arteaga , Carlos Medaglia Dyonisio , David Gunawan , Richard J. Cartwright , Christopher Graham Hines
CPC classification number: H04L67/141 , H04S7/303 , H04R1/326 , H04R1/403 , H04R1/406 , H04R3/005 , H04R3/12
Abstract: An audio session management method may involve: determining, by an audio session manager, one or more first media engine capabilities of a first media engine of a first smart audio device, the first media engine being configured for managing one or more audio media streams received by the first smart audio device and for performing first smart audio device signal processing for the one or more audio media streams according to a first media engine sample clock; receiving, by the audio session manager and via a first application communication link, first application control signals from the first application; and controlling the first smart audio device according to the first media engine capabilities, by the audio session manager, via first audio session management control signals transmitted to the first smart audio device via a first smart audio device communication link and without reference to the first media engine sample clock.
-
公开(公告)号:US20210295820A1
公开(公告)日:2021-09-23
申请号:US17260569
申请日:2019-07-17
Inventor: Toni Hirvonen , Daniel Arteaga , Eduard Aylon Pla , Alex Cabrer Manning , Lie Lu , Karl Jonas Roeden
Abstract: Described herein is a method for creating object-based audio content from a text input for use in audio books and/or audio play, the method including the steps of: a) receiving the text input; b) performing a semantic analysis of the received text input; c) synthesizing speech and effects based on one or more results of the semantic analysis to generate one or more audio objects; d) generating metadata for the one or more audio objects; and e) creating the object-based audio content including the one or more audio objects and the metadata. Described herein are further a computer-based system including one or more processors configured to perform said method and a computer program product comprising a computer-readable storage medium with instructions adapted to carry out said method when executed by a device having processing capability.
-
-
-
-
-
-
-
-
-