-
公开(公告)号:US20240406660A1
公开(公告)日:2024-12-05
申请号:US18671597
申请日:2024-05-22
Applicant: Apple Inc.
Inventor: Dipanjan Sen , Francois Becker , Moo Young Kim , Sang Uk Ryu
IPC: H04S7/00
Abstract: The various aspects of the disclosure here enable a content creation side to control how discrete audio objects that make up a sound program are rendered by a decoding side to achieve high spatial resolution, while giving the content creator the flexibility to decide how complex the spatial audio rendering should be in the decoding side. Metadata associated with the sound program will instruct a spatial audio renderer on how complex its listener motion compensation should be. Other aspects are also described and claimed.
-
公开(公告)号:US20240406656A1
公开(公告)日:2024-12-05
申请号:US18671861
申请日:2024-05-22
Applicant: Apple Inc.
Inventor: Dipanjan Sen , Moo Young Kim , Francois Becker , Sang Uk Ryu
IPC: H04S7/00 , G10L19/008
Abstract: The various aspects of the disclosure here enable a content creation side to control how a sound program is spatial audio rendered by a decoding side, so that an audio scene component in a metadata-specified three dimensional acoustic masking zone is not heard while another audio scene component in an un-masked zone of the sound program is heard by a listener of the playback. Other aspects are also described and claimed.
-
公开(公告)号:US20240114310A1
公开(公告)日:2024-04-04
申请号:US18471796
申请日:2023-09-21
Applicant: Apple Inc.
Inventor: Frank Baumgarte , Dipanjan Sen
IPC: H04S7/00
CPC classification number: H04S7/303
Abstract: A method that includes receiving a bitstream that comprises: an encoded version of an audio signal that is associated with a sound source that is within a first 3D scene, a scene tree structure that includes an origin of the first scene relative to an origin of a second scene, and a position of the sound source within the first scene relative to the origin of the first scene, wherein the position references the origin of the first scene using an identifier, wherein the scene tree structure defines an initial configuration of the sound source with respect to the first and second scenes; determining a position of a listener; producing a set of spatially rendered audio signals by spatially rendering the audio signal according to the position of the sound source with respect to the position of the listener; and using the spatially rendered audio signals to drive speakers.
-
公开(公告)号:US20240098444A1
公开(公告)日:2024-03-21
申请号:US18454508
申请日:2023-08-23
Applicant: Apple Inc.
Inventor: Sina Zamani , Moo Young Kim , Dipanjan Sen , Sang Uk Ryu , Juha O. Merimaa , Symeon Delikaris Manias
IPC: H04S7/00
CPC classification number: H04S7/303 , H04S2420/11
Abstract: In one aspect, a computer-implemented method, includes obtaining object audio and metadata that spatially describes the object audio, converting the object audio to Ambisonics audio based on the metadata, encoding, in a first bit stream, the Ambisonics audio, and encoding, in a second bit stream, at least a subset of the metadata.
-
公开(公告)号:US20230396921A1
公开(公告)日:2023-12-07
申请号:US18200262
申请日:2023-05-22
Applicant: APPLE INC.
Inventor: Abhaya Parthy , Dipanjan Sen , Bonnie W. Tom , Jonathan D. Sheaffer , Justin D. Crosby , Symeon Delikaris Manias , Emily A. Wigley
IPC: H04R3/00
Abstract: A multi-radius spherical microphone that includes an inner body defining an inner sphere having an inner radius from a center; a plurality of inner microphones coupled to the inner spherical body and defining an array of inner microphones; an outer body defining an dodecahedron, wherein the inner body and the outer body are concentric about the center; and a plurality of outer microphones coupled to the outer body at respective vertices of the dodecahedron and defining an array of outer microphones, wherein each of the plurality of outer microphones is positioned radially equidistant from the center.
-
公开(公告)号:US11430451B2
公开(公告)日:2022-08-30
申请号:US16584706
申请日:2019-09-26
Applicant: Apple Inc.
Inventor: Dipanjan Sen , Frank Baumgarte , Juha O. Merimaa
IPC: G10L19/008 , H04S7/00 , H04S3/00 , H04N21/2343 , H04N21/2368 , G11B27/22 , H04N21/439 , H04N21/81 , H04N21/6587
Abstract: A first layer of data having a first set of Ambisonic audio components can be decoded where the first set of Ambisonic audio components is generated based on ambience and one or more object-based audio signals. A second layer of data is decoded having at least one of the one or more object-based audio signals. One of the object-based audio signals is subtracted from the first set of Ambisonic audio components. The resulting Ambisonic audio components are rendered to generate a first set of audio channels. The one or more object-based audio signals are spatially rendered to generate a second set of audio channels. Other aspects are described and claimed.
-
公开(公告)号:US20220262373A1
公开(公告)日:2022-08-18
申请号:US17739901
申请日:2022-05-09
Applicant: Apple Inc.
Inventor: Dipanjan Sen , Frank Baumgarte , Juha O. Merimaa
IPC: G10L19/008 , H04S7/00 , H04S3/00
Abstract: A first layer of data having a first set of Ambisonic audio components can be decoded where the first set of Ambisonic audio components is generated based on ambience and one or more object-based audio signals. A second layer of data is decoded having at least one of the one or more object-based audio signals. One of the object-based audio signals is subtracted from the first set of Ambisonic audio components. The resulting Ambisonic audio components are rendered to generate a first set of audio channels. The one or more object-based audio signals are spatially rendered to generate a second set of audio channels. Other aspects are described and claimed.
-
公开(公告)号:US20250087225A1
公开(公告)日:2025-03-13
申请号:US18466663
申请日:2023-09-13
Applicant: Apple Inc.
Inventor: Ravi Kiran Chivukula , Dipanjan Sen , Tejaswi Nanjundaswamy
IPC: G10L19/16
Abstract: A target encoder may receive, from a source decoder, a source bitstream including an audio frame and a metadata frame associated with the audio frame. The target encoder may transcode the audio frame to a new audio frame in a target format associated with the target encoder. The target encoder may convert the metadata frame into a new metadata frame associated with the new audio frame. The target encoder may then generate a target bitstream including the new audio frame and the new metadata frame. Other aspects are also described and claimed.
-
公开(公告)号:US20240406669A1
公开(公告)日:2024-12-05
申请号:US18665488
申请日:2024-05-15
Applicant: Apple Inc.
Inventor: Dipanjan Sen , Moo Young Kim , Francois Becker , Sang Uk Ryu
IPC: H04S7/00 , G10L19/008 , H04S3/00
Abstract: The various aspects of the disclosure here enable a content creation side to control how discrete audio objects that make up a sound program are rendered by a decoding side to achieve greater realism, while enabling the decoder side to also control the rendering process to consider the positions and orientations of the objects as virtual sound sources relative to the listener. The same sound program can thus be optimally rendered by a variety of decoder side formats, such as binaural on headphone, cross-talked cancelled binaural on a stereo pair of speakers embedded in a device, or multichannel on an immersive loudspeaker layout, e.g., planar such as 5.1 and 7.1 surround sound layouts, 3D such as 7.1.4 or 22.2, etc. Other aspects are also described and claimed.
-
-
-
-
-
-
-
-