-
公开(公告)号:US11979733B2
公开(公告)日:2024-05-07
申请号:US18099658
申请日:2023-01-20
Inventor: Antonio Mateos Sole , Nicolas R. Tsingos
CPC classification number: H04S7/30 , H04S3/008 , H04S5/005 , H04S2400/01 , H04S2400/11 , H04S2400/13 , H04S2400/15
Abstract: Multiple virtual source locations may be defined for a volume within which audio objects can move. A set-up process for rendering audio data may involve receiving reproduction speaker location data and pre-computing gain values for each of the virtual sources according to the reproduction speaker location data and each virtual source location. The gain values may be stored and used during “run time,” during which audio reproduction data are rendered for the speakers of the reproduction environment. During run time, for each audio object, contributions from virtual source locations within an area or volume defined by the audio object position data and the audio object size data may be computed. A set of gain values for each output channel of the reproduction environment may be computed based, at least in part, on the computed contributions. Each output channel may correspond to at least one reproduction speaker of the reproduction environment.
-
公开(公告)号:US11937064B2
公开(公告)日:2024-03-19
申请号:US17737184
申请日:2022-05-05
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Lianwu Chen , Lie Lu , Nicolas R. Tsingos
IPC: H04S3/00 , G06F18/2321 , H04S7/00
CPC classification number: H04S3/008 , H04S7/30 , G06F18/2321 , H04S2400/01 , H04S2400/09 , H04S2400/11 , H04S2420/03
Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying an audio object into at least a category based rendering mode information metadata. The method further comprises assigning a predetermined number of clusters to the categories and rendering the audio object based on the rendering mode. Corresponding system and computer program product are also disclosed.
-
公开(公告)号:US20230388738A1
公开(公告)日:2023-11-30
申请号:US18141538
申请日:2023-05-01
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Nicolas R. Tsingos , Charles Q. Robinson , Jurgen W. Scharpf
CPC classification number: H04S7/307 , H04S3/008 , H04S7/308 , H04R5/02 , H04S3/00 , H04S5/00 , H04S7/40 , H04S2400/11 , H04S2400/01
Abstract: Improved tools for authoring and rendering audio reproduction data are provided. Some such authoring tools allow audio reproduction data to be generalized for a wide variety of reproduction environments. Audio reproduction data may be authored by creating metadata for audio objects. The metadata may be created with reference to speaker zones. During the rendering process, the audio reproduction data may be reproduced according to the reproduction speaker layout of a particular reproduction environment.
-
公开(公告)号:US20220272472A1
公开(公告)日:2022-08-25
申请号:US17742400
申请日:2022-05-12
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Christophe Chabanne , Nicolas R. Tsingos , Charles Q. Robinson
Abstract: Audio perception in local proximity to visual cues is provided. A device includes a video display, first row of audio transducers, and second row of audio transducers. The first and second rows can be vertically disposed above and below the video display. An audio transducer of the first row and an audio transducer of the second row form a column to produce, in concert, an audible signal. The perceived emanation of the audible signal is from a plane of the video display (e.g., a location of a visual cue) by weighing outputs of the audio transducers of the column. In certain embodiments, the audio transducers are spaced farther apart at a periphery for increased fidelity in a center portion of the plane and less fidelity at the periphery.
-
公开(公告)号:US11350231B2
公开(公告)日:2022-05-31
申请号:US17183360
申请日:2021-02-24
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Christophe Chabanne , Nicolas R. Tsingos , Charles Q. Robinson
Abstract: Audio perception in local proximity to visual cues is provided. A device includes a video display, first row of audio transducers, and second row of audio transducers. The first and second rows can be vertically disposed above and below the video display. An audio transducer of the first row and an audio transducer of the second row form a column to produce, in concert, an audible signal. The perceived emanation of the audible signal is from a plane of the video display (e.g., a location of a visual cue) by weighing outputs of the audio transducers of the column. In certain embodiments, the audio transducers are spaced farther apart at a periphery for increased fidelity in a center portion of the plane and less fidelity at the periphery.
-
公开(公告)号:US11277707B2
公开(公告)日:2022-03-15
申请号:US16938561
申请日:2020-07-24
Inventor: Dirk Jeroen Breebaart , Antonio Mateos Sole , Heiko Purnhagen , Nicolas R. Tsingos
Abstract: Described herein is a method (30) of rendering an audio signal (17) for playback in an audio environment (27) defined by a target loudspeaker system (23), the audio signal (17) including audio data relating to an audio object and associated position data indicative of an object position. Method (30) includes the initial step (31) of receiving the audio signal (17). At step (32) loudspeaker layout data for the target loudspeaker system (23) is received. At step (33) control data is received that is indicative of a position modification to be applied to the audio object in the audio environment (27). At step (38) in response to the position data, loudspeaker layout data and control data, rendering modification data is generated. Finally, at step (39) the audio signal (17) is rendered with the rendering modification data to output the audio signal (17) with the audio object at a modified object position that is between loudspeakers within the audio environment (27).
-
公开(公告)号:US10909998B2
公开(公告)日:2021-02-02
申请号:US16551785
申请日:2019-08-27
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Nicolas R. Tsingos
IPC: G10K11/16 , G10L21/0216 , G10L21/0208 , G10L21/0232 , B64C39/02
Abstract: In some embodiments, a method, apparatus and computer program for reducing noise from an audio signal captured by a drone (e.g., canceling the noise signature of a drone from the audio signal) using a model of noise emitted by the drone's propulsion system set, where the propulsion system set includes one or more propulsion systems, each of the propulsion systems including an electric motor, and wherein the noise reduction is performed in response to voltage data indicative of instantaneous voltage supplied to each electric motor of the propulsion system set. In some other embodiments, a method, apparatus and computer program for generating a noise model by determining the noise signature of at least one drone based upon a database of noise signals corresponding to at least one propulsion system and canceling the noise signature of the drone in an audio signal based upon the noise model.
-
28.
公开(公告)号:US10820097B2
公开(公告)日:2020-10-27
申请号:US16337923
申请日:2017-09-28
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Nicolas R. Tsingos , Pradeep Kumar Govindaraju
Abstract: The present document describes a method (700) for determining the position of at least one audio source (200). The method (700) includes capturing (701) first and second microphone signals at two or more microphone arrays (210, 220, 230), wherein the two or more microphone arrays (210, 220, 230) are placed at different positions. The two or more microphone arrays (210, 220, 230) each comprise at least a first microphone capsule to capture a first microphone signal and a second microphone capsule to capture a second microphone signal, wherein the first and second microphone capsules have differently oriented spatial directivities. Furthermore, the method (700) comprises determining (702), for each microphone array (210, 220, 230) and based on the respective first and second microphone signals, an incident direction (211, 221, 231) of at least one audio source (200) at the respective microphone array (210, 220, 230). In addition, the method (700) comprises determining (703) the position of the audio source (200) based on the incident directions (211, 221, 231) at the two or more microphone arrays (210, 220, 230).
-
公开(公告)号:US20200013424A1
公开(公告)日:2020-01-09
申请号:US16551785
申请日:2019-08-27
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Nicolas R. Tsingos
IPC: G10L21/0216 , G10L21/0232
Abstract: In some embodiments, a method, apparatus and computer program for reducing noise from an audio signal captured by a drone (e.g., canceling the noise signature of a drone from the audio signal) using a model of noise emitted by the drone's propulsion system set, where the propulsion system set includes one or more propulsion systems, each of the propulsion systems including an electric motor, and wherein the noise reduction is performed in response to voltage data indicative of instantaneous voltage supplied to each electric motor of the propulsion system set. In some other embodiments, a method, apparatus and computer program for generating a noise model by determining the noise signature of at least one drone based upon a database of noise signals corresponding to at least one propulsion system and canceling the noise signature of the drone in an audio signal based upon the noise model.
-
公开(公告)号:US10327092B2
公开(公告)日:2019-06-18
申请号:US16207006
申请日:2018-11-30
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Charles Q. Robinson , Nicolas R. Tsingos , Christophe Chabanne
Abstract: Embodiments are described for an adaptive audio system that processes audio data comprising a number of independent monophonic audio streams. One or more of the streams has associated with it metadata that specifies whether the stream is a channel-based or object-based stream. Channel-based streams have rendering information encoded by means of channel name; and the object-based streams have location information encoded through location expressions encoded in the associated metadata. A codec packages the independent audio streams into a single serial bitstream that contains all of the audio data. This configuration allows for the sound to be rendered according to an allocentric frame of reference, in which the rendering location of a sound is based on the characteristics of the playback environment (e.g., room size, shape, etc.) to correspond to the mixer's intent. The object position metadata contains the appropriate allocentric frame of reference information required to play the sound correctly using the available speaker positions in a room that is set up to play the adaptive audio content.
-
-
-
-
-
-
-
-
-