Apparatus, Methods and Computer Programs for Encoding Spatial Metadata

    Publication No.: US20240312469A1

    Publication date: 2024-09-19

    Application No.: US18678716

    Filing date: 2024-05-30

    IPC classification: G10L19/008 G10L19/00

    CPC classification: G10L19/008 G10L2019/0001

    Abstract: An apparatus configured to: obtain spatial audio content; decode encoded spatial metadata associated with the spatial audio content based, at least partially, on a configuration parameter indicative of a codec configuration used to encode, at least in part, spatial metadata; determine at least one prototype audio signal based, at least partially, on the spatial audio content and a configuration of at least one output device; determine one or more spatial audio signals based, at least partially, on the at least one prototype audio signal and the decoded spatial metadata; and provide, to the at least one output device, the one or more spatial audio signals.

    Audio processing
    Granted invention patent

    Publication No.: US11887616B2

    Publication date: 2024-01-30

    Application No.: US17418652

    Filing date: 2020-01-07

    Abstract: An apparatus, method and computer program are disclosed. The apparatus may comprise a means comprising at least one processor and at least one memory including computer program code, the at least one memory and computer program code configured to, with the at least one processor, receive multimedia data representing a scene, the multimedia data comprising at least audio data representing an audio component of the scene. Another operation may comprise determining a location of unwanted sound in the scene. Another operation may comprise performing first audio processing to remove at least part of the unwanted sound from the determined location. Another operation may comprise performing second audio processing to add artificial sound associated with the unwanted sound at the determined location.

    Audio processing
    Granted invention patent

    Publication No.: US11647350B2

    Publication date: 2023-05-09

    Application No.: US17410320

    Filing date: 2021-08-24

    IPC classification: H04S7/00 G06T19/00

    Abstract: An apparatus, method and computer program are disclosed. The apparatus may comprise means for providing first virtual content for presentation, which first virtual content is associated with a first space and comprises one or more first virtual objects. One of the first virtual objects may comprise one or more virtual objects having respective audio signals, and another may be a miniature representation of a second space into which the user can transition. The miniature representation may have a second scale, smaller or larger than the first scale, and may comprise one or more second virtual objects having respective audio signals audible from outside of the second space. The apparatus may further comprise means for determining whether at least part of the user is within the first space or the second space and, dependent on the determination, modifying the audio signal of one or more of the first or second virtual objects.

    Sound source distance estimation
    Granted invention patent

    Publication No.: US11644528B2

    Publication date: 2023-05-09

    Application No.: US16626242

    Filing date: 2018-06-13

    Abstract: An apparatus for generating at least one distance estimate to at least one sound source within a sound scene comprising the at least one sound source, the apparatus configured to: receive at least two audio signals from a microphone array located within the sound scene; receive at least one further audio signal associated with the at least one sound source; determine at least one portion of the at least two audio signals from the microphone array corresponding to the at least one further audio signal associated with the at least one sound source; and determine a distance estimate to the at least one sound source based on the at least one portion of the at least two audio signals from the microphone array corresponding to the at least one further audio signal associated with the at least one sound source.
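
The abstract does not say how the distance is derived from the matched portion. One plausible reading, assumed here purely for illustration, is a delay-based estimate: the further audio signal is treated as a time-synchronized close-up recording of the source, the lag at which it best matches one array microphone is found by cross-correlation, and that lag is converted to a distance. The function names, the single-microphone simplification, and the speed-of-sound constant are all hypothetical and not taken from the patent.

```python
# Hypothetical sketch: delay-based distance estimate between a close-up
# source signal and one microphone of the array. Not the patented method.

SPEED_OF_SOUND_M_S = 343.0  # assumption: dry air at roughly 20 degrees C


def best_lag(reference, observed, max_lag):
    """Return the lag in samples at which `observed` best matches `reference`.

    Uses a plain (unnormalized) cross-correlation over lags 0..max_lag.
    """
    best, best_score = 0, float("-inf")
    for lag in range(max_lag + 1):
        # Only compare samples that exist in both signals at this lag.
        n = min(len(reference), len(observed) - lag)
        score = sum(reference[i] * observed[i + lag] for i in range(n))
        if score > best_score:
            best, best_score = lag, score
    return best


def estimate_distance(reference, observed, sample_rate_hz, max_lag):
    """Convert the best-matching lag into a distance in metres."""
    lag = best_lag(reference, observed, max_lag)
    return lag / sample_rate_hz * SPEED_OF_SOUND_M_S
```

In this sketch `reference` stands in for the further audio signal and `observed` for one of the array signals. Searching only non-negative lags reflects the assumption that the sound reaches the array after it reaches the close-up capture.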

    Apparatus and method for processing volumetric audio

    Publication No.: US11521591B2

    Publication date: 2022-12-06

    Application No.: US16768968

    Filing date: 2018-11-29

    Abstract: A method including receiving an audio scene including at least one source captured using at least one near field microphone and at least one far field microphone. The method includes determining at least one room-impulse-response (RIR) associated with the audio scene based on the at least one near field microphone and the at least one far field microphone, accessing a predetermined scene geometry corresponding to the audio scene, and identifying a best match to the predetermined scene geometry in a scene geometry database. The method also includes performing an RIR comparison based on the at least one RIR and at least one geometric RIR associated with the best matching geometry, and rendering a volumetric audio scene based on a result of the RIR comparison.
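
The abstract leaves the comparison metric unspecified. As a minimal sketch, assuming both impulse responses are available as equally sampled lists of floats, one simple choice is the normalized correlation between the measured RIR and the geometric RIR. The function name `rir_similarity` and the choice of metric are assumptions made for illustration and do not come from the patent.

```python
import math

# Hypothetical sketch: score how closely a measured RIR matches a
# geometry-derived RIR. The metric is an assumption, not the patented one.


def rir_similarity(measured, geometric):
    """Return the normalized correlation of two RIRs, in the range [-1, 1].

    Only the overlapping leading samples are compared; a silent input
    yields 0.0 instead of dividing by zero.
    """
    n = min(len(measured), len(geometric))
    a, b = measured[:n], geometric[:n]
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0
```

A renderer could then, for example, prefer the geometric RIR when the score is high and fall back to the measured one otherwise; that decision rule is likewise an assumption.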

    Monitoring
    Granted invention patent

    Publication No.: US11429189B2

    Publication date: 2022-08-30

    Application No.: US15538739

    Filing date: 2015-12-21

    Abstract: A method including recognizing at least one gesture to define at least one computer-implemented virtual boundary in a monitoring space, wherein the gesture includes a motion along a path and at a location in a monitored scene space, and wherein there is a correspondence between the monitoring space and the monitored scene space; causing implementation of the at least one virtual boundary in the monitoring space corresponding to the scene space, wherein at least part of the virtual boundary is determined by the path in the monitored scene space and the at least part of the virtual boundary is located in the monitoring space at a corresponding location equivalent to the path location; and processing received data to generate a response event when there is, relative to the at least one virtual boundary, a change in a portion of the monitored scene space.
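
To make the idea of a gesture-defined boundary concrete, the sketch below assumes a much simplified setting: the gesture path has already been mapped into the monitoring space as a 2D polyline, and the "change relative to the boundary" is a tracked object moving from one position to another across that polyline. The 2D simplification and all names are hypothetical and are not taken from the patent.

```python
# Hypothetical sketch: detect whether a movement crosses a virtual boundary
# given as a 2D polyline. A simplification, not the patented method.


def _orient(a, b, c):
    """Signed area test: > 0 if c is left of the line a->b, < 0 if right."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])


def segments_cross(p1, p2, q1, q2):
    """True if segment p1-p2 strictly crosses segment q1-q2.

    Touching at an endpoint and collinear overlap are not counted.
    """
    d1 = _orient(q1, q2, p1)
    d2 = _orient(q1, q2, p2)
    d3 = _orient(p1, p2, q1)
    d4 = _orient(p1, p2, q2)
    return d1 * d2 < 0 and d3 * d4 < 0


def crosses_boundary(boundary_path, prev_pos, new_pos):
    """True if moving from prev_pos to new_pos crosses any boundary segment."""
    return any(
        segments_cross(prev_pos, new_pos, a, b)
        for a, b in zip(boundary_path, boundary_path[1:])
    )
```

A monitoring loop could call `crosses_boundary` for each pair of consecutive tracked positions and raise a response event when it returns true.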