-
公开(公告)号:US20220199100A1
公开(公告)日:2022-06-23
申请号:US17128544
申请日:2020-12-21
发明人: S M Akramus SALEHIN , Lae-Hoon KIM , Hannes PESSENTHEINER , Shuhua ZHANG , Sanghyun CHI , Erik VISSER , Shankar THAGADUR SHIVAPPA
IPC分类号: G10L21/0232 , H04R1/40 , H04R3/00 , H04S7/00 , H04S3/00 , G10L25/51 , G10L21/0324
摘要: A device includes one or more processors configured to obtain audio signals representing sound captured by at least three microphones and determine spatial audio data based on the audio signals. The one or more processors are further configured to determine a metric indicative of wind noise in the audio signals. The metric is based on a comparison of a first value and a second value. The first value corresponds to an aggregate signal based on the spatial audio data, and the second value corresponds to a differential signal based on the spatial audio data.
-
公开(公告)号:US20220201395A1
公开(公告)日:2022-06-23
申请号:US17127421
申请日:2020-12-18
发明人: S M Akramus SALEHIN , Lae-Hoon KIM , Vasudev NAYAK , Shankar THAGADUR SHIVAPPA , Isaac Garcia MUNOZ , Sanghyun CHI , Erik VISSER
摘要: In an aspect, a lens is zoomed in to create a zoomed lens. Lens data associated with the lens includes a direction of the lens relative to an object in a field-of-view of the zoomed lens and a magnification of the object resulting from the zoomed lens. An array of microphones capture audio signals including audio produced by the object and interference produced by other objects. The audio signals are processed to identify a directional component associated with the audio produced by the object and three orthogonal components associated with the interference produced by the other objects. Stereo beamforming is used to increase a magnitude of the directional component (relative to the interference) while retaining a binaural nature of the audio signals. The increase in magnitude of the directional component is based on an amount of the magnification provided by the zoomed lens to the object.
-
公开(公告)号:US20230060774A1
公开(公告)日:2023-03-02
申请号:US17446498
申请日:2021-08-31
发明人: S M Akramus SALEHIN , Lae-Hoon KIM , Xiaoxin ZHANG , Erik VISSER
摘要: A device includes one or more processors configured to determine, based on data descriptive of two or more audio environments, a geometry of a mutual audio environment. The one or more processors are also configured to process audio data, based on the geometry of the mutual audio environment, for output at an audio device disposed in a first audio environment of the two or more audio environments.
-
公开(公告)号:US20210304777A1
公开(公告)日:2021-09-30
申请号:US17210357
申请日:2021-03-23
IPC分类号: G10L19/038 , G10L19/002 , H04R5/00
摘要: A device includes a memory configured to store untransformed ambisonic coefficients at different time segments. The device also includes one or more processors configured to obtain the untransformed ambisonic coefficients at the different time segments, where the untransformed ambisonic coefficients at the different time segments represent a soundfield at the different time segments. The one or more processors are also configured to apply one adaptive network, based on a constraint, to the untransformed ambisonic coefficients at the different time segments to generate transformed ambisonic coefficients at the different time segments, wherein the transformed ambisonic coefficients at the different time segments represent a modified soundfield at the different time segments, that was modified based on the constraint.
-
公开(公告)号:US20220386060A1
公开(公告)日:2022-12-01
申请号:US17755578
申请日:2020-10-29
发明人: Nils Gunther PETERS , Shankar THAGADUR SHIVAPPA , S M Akramus SALEHIN , Jason FILOS , Siddhartha Goutham SWAMINATHAN , Ferdinando OLIVIERI
IPC分类号: H04S7/00
摘要: Methods, systems, computer-readable media, and apparatuses for manipulating a soundfield are presented. Some configurations include receiving a bitstream that comprises metadata and a soundfield description; parsing the metadata to obtain an effect identifier and at least one effect parameter value; and applying, to the soundfield description, an effect identified by the effect identifier. The applying may include using the at least one effect parameter value to apply the identified effect to the soundfield description.
-
公开(公告)号:US20200053505A1
公开(公告)日:2020-02-13
申请号:US16058760
申请日:2018-08-08
发明人: Nils Gunther PETERS , S M Akramus SALEHIN , Shankar THAGADUR SHIVAPPA , Moo Young KIM , Dipanjan SEN
摘要: One or more processors may obtain a first distance between a first audio zone of the two or more audio zones associated with the one or more interest points within the first audio zone, and a first device position of a device, obtain a second distance between a second audio zone of the two or more audio zones associated with the one or more interest points within the second audio zone, and the first device position of the device, and obtain an updated first distance and updated second distance after movement of the device has changed from the first device position to a second device position. The one or more processor(s) may independently control the first audio zone and the second audio zone, such that the audio data within the first audio zone and the second audio zone are adjusted based on the updated first distance and updated second distance.
-
7.
公开(公告)号:US20230260525A1
公开(公告)日:2023-08-17
申请号:US18138684
申请日:2023-04-24
IPC分类号: G10L19/038 , H04R5/00 , G10L19/002
CPC分类号: G10L19/038 , H04R5/00 , G10L19/002 , H04S2420/11 , H04R2430/21 , G10L19/008
摘要: A device includes a memory configured to store untransformed ambisonic coefficients at different time segments. The device includes one or more processors configured to obtain the untransformed ambisonic coefficients at the different time segments, where the untransformed ambisonic coefficients at the different time segments represent a soundfield at the different time segments. The one or more processors are configured to apply one adaptive network, based on a constraint that includes preservation of a spatial direction of one or more audio sources in the soundfield at the different time segments, to the untransformed ambisonic coefficients at the different time segments to generate transformed ambisonic coefficients at the different time segments, wherein the transformed ambisonic coefficients at the different time segments represent a modified soundfield at the different time segments, that was modified based on the constraint. The one or more processors are also configured to apply an additional adaptive network.
-
公开(公告)号:US20210281967A1
公开(公告)日:2021-09-09
申请号:US17329120
申请日:2021-05-24
发明人: Moo Young KIM , Nils Günther PETERS , S M Akramus SALEHIN , Siddhartha Goutham SWAMINATHAN , Dipanjan SEN
摘要: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.
-
公开(公告)号:US20210157543A1
公开(公告)日:2021-05-27
申请号:US16696798
申请日:2019-11-26
IPC分类号: G06F3/16 , G10L21/0388 , G10L19/16 , H04L29/06
摘要: Methods, systems, and devices for processing of multiple audio streams based on available bandwidth are described. Described techniques provide for receiving, at a device, one or more audio streams, identifying an available bandwidth for processing the one or more audio streams, locating (based on the available bandwidth) a first set of one or more objects contributing to the one or more audio streams that are located within a threshold radius from the device, and generating an object-based audio stream. The described techniques further provide for extracting a contribution of the first number of objects from the one or more audio streams, generating an HOA audio stream, and outputting an audio feed that includes the HOA audio stream and the object-based audio stream.
-
公开(公告)号:US20210092543A1
公开(公告)日:2021-03-25
申请号:US16743275
申请日:2020-01-15
摘要: An apparatus includes one or more processors configured to receive orientation data and to select, based on the orientation data, a particular filter from among multiple filters. The one or more processors are configured to perform signal processing operations associated with three-dimensional (3D) sound data based on the particular filter.
-
-
-
-
-
-
-
-
-