Patent search: ap:("QUALCOMM Incorporated") AND inv:"S M Akramus SALEHIN" (page 1)

1.

Invention application
SPATIAL AUDIO WIND NOISE DETECTION (granted)

Publication (announcement) No.: US20220199100A1

Publication (announcement) date: 2022-06-23

Application No.: US17128544

Filing date: 2020-12-21

Applicant: QUALCOMM Incorporated

Inventors: S M Akramus SALEHIN, Lae-Hoon KIM, Hannes PESSENTHEINER, Shuhua ZHANG, Sanghyun CHI, Erik VISSER, Shankar THAGADUR SHIVAPPA

IPC classification: G10L21/0232, H04R1/40, H04R3/00, H04S7/00, H04S3/00, G10L25/51, G10L21/0324

Abstract: A device includes one or more processors configured to obtain audio signals representing sound captured by at least three microphones and determine spatial audio data based on the audio signals. The one or more processors are further configured to determine a metric indicative of wind noise in the audio signals. The metric is based on a comparison of a first value and a second value. The first value corresponds to an aggregate signal based on the spatial audio data, and the second value corresponds to a differential signal based on the spatial audio data.
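
Illustrative sketch (not the patented method): the abstract compares a value from an aggregate signal with a value from a differential signal. One plausible reading, assumed here, is an energy ratio between the difference and the sum of two channels derived from the spatial audio data. Wind noise tends to be uncorrelated between channels, so the ratio rises toward 1, while a coherent acoustic source keeps it near 0. The function names, the two-channel simplification, and the choice of energy as the compared value are all assumptions.

```python
import math
import random


def energy(signal):
    """Mean squared amplitude of a sequence of samples."""
    return sum(x * x for x in signal) / len(signal)


def wind_metric(left, right, eps=1e-12):
    """Hypothetical metric: differential energy divided by aggregate energy.

    `left` and `right` stand in for two channels derived from the spatial
    audio data; the abstract does not say which channels are used.
    """
    aggregate = [a + b for a, b in zip(left, right)]     # sum signal
    differential = [a - b for a, b in zip(left, right)]  # difference signal
    return energy(differential) / (energy(aggregate) + eps)


if __name__ == "__main__":
    random.seed(0)
    # Coherent source: both channels carry the same tone.
    tone = [math.sin(0.05 * n) for n in range(4000)]
    # Wind-like input: independent noise on each channel.
    gust_a = [random.gauss(0.0, 1.0) for _ in range(4000)]
    gust_b = [random.gauss(0.0, 1.0) for _ in range(4000)]
    print("coherent tone:", wind_metric(tone, tone))
    print("uncorrelated noise:", wind_metric(gust_a, gust_b))
```

A detector built on this sketch would compare the metric against a tuned threshold; the threshold value is not given in the abstract.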

2.

Invention application
SPATIAL AUDIO ZOOM (granted)

Publication (announcement) No.: US20220201395A1

Publication (announcement) date: 2022-06-23

Application No.: US17127421

Filing date: 2020-12-18

Applicant: QUALCOMM Incorporated

Inventors: S M Akramus SALEHIN, Lae-Hoon KIM, Vasudev NAYAK, Shankar THAGADUR SHIVAPPA, Isaac Garcia MUNOZ, Sanghyun CHI, Erik VISSER

IPC classification: H04R3/00, H03G3/30, G02B27/00, G06K9/00

Abstract: In an aspect, a lens is zoomed in to create a zoomed lens. Lens data associated with the lens includes a direction of the lens relative to an object in a field-of-view of the zoomed lens and a magnification of the object resulting from the zoomed lens. An array of microphones capture audio signals including audio produced by the object and interference produced by other objects. The audio signals are processed to identify a directional component associated with the audio produced by the object and three orthogonal components associated with the interference produced by the other objects. Stereo beamforming is used to increase a magnitude of the directional component (relative to the interference) while retaining a binaural nature of the audio signals. The increase in magnitude of the directional component is based on an amount of the magnification provided by the zoomed lens to the object.
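
Illustrative sketch (not the patented method): the abstract ties the boost of the directional component to the lens magnification. The mono mix below assumes a simple mapping, a gain of 20·log10(magnification) dB capped at a maximum, and adds the boosted directional component back to the interference. The mapping, the cap, and the function name are assumptions; the stereo beamforming and the three orthogonal interference components are not modeled.

```python
import math


def zoom_mix(directional, interference, magnification, max_gain_db=12.0):
    """Boost the directional component according to lens magnification.

    Hypothetical mapping: gain in dB = 20*log10(magnification), limited to
    `max_gain_db`. Magnifications below 1 are treated as no zoom.
    """
    gain_db = min(max_gain_db, 20.0 * math.log10(max(magnification, 1.0)))
    gain = 10.0 ** (gain_db / 20.0)
    return [gain * d + i for d, i in zip(directional, interference)]


if __name__ == "__main__":
    target = [1.0, -1.0, 0.5]
    others = [0.2, 0.2, -0.1]
    print(zoom_mix(target, others, magnification=1.0))  # no zoom, plain sum
    print(zoom_mix(target, others, magnification=2.0))  # target roughly doubled
```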

3.

Invention application
AUGMENTED AUDIO FOR COMMUNICATIONS (granted)

Publication (announcement) No.: US20230060774A1

Publication (announcement) date: 2023-03-02

Application No.: US17446498

Filing date: 2021-08-31

Applicant: QUALCOMM Incorporated

Inventors: S M Akramus SALEHIN, Lae-Hoon KIM, Xiaoxin ZHANG, Erik VISSER

IPC classification: H04S7/00, G06F3/01

Abstract: A device includes one or more processors configured to determine, based on data descriptive of two or more audio environments, a geometry of a mutual audio environment. The one or more processors are also configured to process audio data, based on the geometry of the mutual audio environment, for output at an audio device disposed in a first audio environment of the two or more audio environments.

4.

Invention application
TRANSFORM AMBISONIC COEFFICIENTS USING AN ADAPTIVE NETWORK (granted)

Publication (announcement) No.: US20210304777A1

Publication (announcement) date: 2021-09-30

Application No.: US17210357

Filing date: 2021-03-23

Applicant: QUALCOMM Incorporated

Inventors: Lae-Hoon KIM, Shankar THAGADUR SHIVAPPA, S M Akramus SALEHIN, Shuhua ZHANG, Erik VISSER

IPC classification: G10L19/038, G10L19/002, H04R5/00

Abstract: A device includes a memory configured to store untransformed ambisonic coefficients at different time segments. The device also includes one or more processors configured to obtain the untransformed ambisonic coefficients at the different time segments, where the untransformed ambisonic coefficients at the different time segments represent a soundfield at the different time segments. The one or more processors are also configured to apply one adaptive network, based on a constraint, to the untransformed ambisonic coefficients at the different time segments to generate transformed ambisonic coefficients at the different time segments, wherein the transformed ambisonic coefficients at the different time segments represent a modified soundfield at the different time segments, that was modified based on the constraint.

5.

Invention application
SIGNALLING OF AUDIO EFFECT METADATA IN A BITSTREAM (granted)

Publication (announcement) No.: US20220386060A1

Publication (announcement) date: 2022-12-01

Application No.: US17755578

Filing date: 2020-10-29

Applicant: QUALCOMM Incorporated

Inventors: Nils Gunther PETERS, Shankar THAGADUR SHIVAPPA, S M Akramus SALEHIN, Jason FILOS, Siddhartha Goutham SWAMINATHAN, Ferdinando OLIVIERI

IPC classification: H04S7/00

Abstract: Methods, systems, computer-readable media, and apparatuses for manipulating a soundfield are presented. Some configurations include receiving a bitstream that comprises metadata and a soundfield description; parsing the metadata to obtain an effect identifier and at least one effect parameter value; and applying, to the soundfield description, an effect identified by the effect identifier. The applying may include using the at least one effect parameter value to apply the identified effect to the soundfield description.
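
Illustrative sketch (not the patented bitstream syntax): the parse-then-apply flow in the abstract can be shown with a made-up metadata layout. Everything specific here is hypothetical: the layout (one byte for the effect identifier, one byte for the parameter count, then little-endian float32 parameters), the `EFFECT_GAIN` identifier, and the representation of the soundfield description as a list of coefficient frames.

```python
import struct

EFFECT_GAIN = 1  # hypothetical effect identifier, not taken from the patent


def parse_effect_metadata(payload: bytes):
    """Read (effect_id, [params]) from the assumed layout:
    uint8 effect id, uint8 parameter count, then float32 parameters."""
    effect_id, count = struct.unpack_from("<BB", payload, 0)
    params = struct.unpack_from(f"<{count}f", payload, 2)
    return effect_id, list(params)


def apply_effect(soundfield, effect_id, params):
    """Apply the identified effect to a list of coefficient frames."""
    if effect_id == EFFECT_GAIN:
        gain = params[0]
        return [[gain * c for c in frame] for frame in soundfield]
    raise ValueError(f"unknown effect identifier: {effect_id}")


if __name__ == "__main__":
    metadata = struct.pack("<BBf", EFFECT_GAIN, 1, 0.5)
    effect_id, params = parse_effect_metadata(metadata)
    frames = [[1.0, 2.0, 0.0, -4.0]]
    print(apply_effect(frames, effect_id, params))
```

Dispatching on the identifier keeps the parser independent of any particular effect; a real decoder would take the identifier table and parameter encoding from the codec specification in use.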

6.

Invention application
RENDERING AUDIO DATA FROM INDEPENDENTLY CONTROLLED AUDIO ZONES (under examination, published)

Publication (announcement) No.: US20200053505A1

Publication (announcement) date: 2020-02-13

Application No.: US16058760

Filing date: 2018-08-08

Applicant: QUALCOMM Incorporated

Inventors: Nils Gunther PETERS, S M Akramus SALEHIN, Shankar THAGADUR SHIVAPPA, Moo Young KIM, Dipanjan SEN

IPC classification: H04S7/00, H04R5/04, G02B27/01

Abstract: One or more processors may obtain a first distance between a first audio zone of the two or more audio zones associated with the one or more interest points within the first audio zone, and a first device position of a device, obtain a second distance between a second audio zone of the two or more audio zones associated with the one or more interest points within the second audio zone, and the first device position of the device, and obtain an updated first distance and updated second distance after movement of the device has changed from the first device position to a second device position. The one or more processor(s) may independently control the first audio zone and the second audio zone, such that the audio data within the first audio zone and the second audio zone are adjusted based on the updated first distance and updated second distance.
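
Illustrative sketch (not the patented method): the abstract adjusts each audio zone independently from the distance between the device and the zone, and updates those distances when the device moves. The sketch assumes one interest point per zone and an inverse-distance gain law clamped at a reference distance; both choices, and all names, are assumptions.

```python
import math


def zone_gains(device_pos, zones, ref_distance=1.0):
    """Return a per-zone gain computed from device-to-interest-point distance.

    `zones` maps a zone name to the coordinates of its (single, assumed)
    interest point. Gain is ref_distance / distance, never above 1.
    """
    gains = {}
    for name, interest_point in zones.items():
        distance = math.dist(device_pos, interest_point)
        gains[name] = ref_distance / max(distance, ref_distance)
    return gains


if __name__ == "__main__":
    zones = {"stage": (4.0, 0.0), "bar": (0.0, 2.0)}
    print(zone_gains((0.0, 0.0), zones))  # gains at the first device position
    print(zone_gains((2.0, 0.0), zones))  # updated gains after the device moves
```

Because each zone has its own entry, a caller can override or scale one zone's gain without touching the other, which mirrors the independent control described in the abstract.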

7.

Invention publication
TRANSFORM AMBISONIC COEFFICIENTS USING AN ADAPTIVE NETWORK FOR PRESERVING SPATIAL DIRECTION (under examination, published)

Publication (announcement) No.: US20230260525A1

Publication (announcement) date: 2023-08-17

Application No.: US18138684

Filing date: 2023-04-24

Applicant: QUALCOMM Incorporated

Inventors: Lae-Hoon KIM, Shankar THAGADUR SHIVAPPA, S M Akramus SALEHIN, Shuhua ZHANG, Erik VISSER

IPC classification: G10L19/038, H04R5/00, G10L19/002

CPC classification: G10L19/038, H04R5/00, G10L19/002, H04S2420/11, H04R2430/21, G10L19/008

Abstract: A device includes a memory configured to store untransformed ambisonic coefficients at different time segments. The device includes one or more processors configured to obtain the untransformed ambisonic coefficients at the different time segments, where the untransformed ambisonic coefficients at the different time segments represent a soundfield at the different time segments. The one or more processors are configured to apply one adaptive network, based on a constraint that includes preservation of a spatial direction of one or more audio sources in the soundfield at the different time segments, to the untransformed ambisonic coefficients at the different time segments to generate transformed ambisonic coefficients at the different time segments, wherein the transformed ambisonic coefficients at the different time segments represent a modified soundfield at the different time segments, that was modified based on the constraint. The one or more processors are also configured to apply an additional adaptive network.

8.

Invention application
SIX DEGREES OF FREEDOM AND THREE DEGREES OF FREEDOM BACKWARD COMPATIBILITY (granted)

Publication (announcement) No.: US20210281967A1

Publication (announcement) date: 2021-09-09

Application No.: US17329120

Filing date: 2021-05-24

Applicant: QUALCOMM Incorporated

Inventors: Moo Young KIM, Nils Günther PETERS, S M Akramus SALEHIN, Siddhartha Goutham SWAMINATHAN, Dipanjan SEN

IPC classification: H04S7/00, G06K9/00, G06F3/0346, G06T19/00, H04W4/029, H04W4/02, G06F3/01

Abstract: A device and method for backward compatibility for virtual reality (VR), mixed reality (MR), augmented reality (AR), computer vision, and graphics systems. The device and method enable rendering audio data with more degrees of freedom on devices that support fewer degrees of freedom. The device includes memory configured to store audio data representative of a soundfield captured at a plurality of capture locations, metadata that enables the audio data to be rendered to support N degrees of freedom, and adaptation metadata that enables the audio data to be rendered to support M degrees of freedom. The device also includes one or more processors coupled to the memory, and configured to adapt, based on the adaptation metadata, the audio data to provide the M degrees of freedom, and generate speaker feeds based on the adapted audio data.

9.

Invention application
PROCESSING OF MULTIPLE AUDIO STREAMS BASED ON AVAILABLE BANDWIDTH (granted)

Publication (announcement) No.: US20210157543A1

Publication (announcement) date: 2021-05-27

Application No.: US16696798

Filing date: 2019-11-26

Applicant: QUALCOMM Incorporated

Inventors: S M Akramus SALEHIN, Siddhartha Goutham SWAMINATHAN

IPC classification: G06F3/16, G10L21/0388, G10L19/16, H04L29/06

Abstract: Methods, systems, and devices for processing of multiple audio streams based on available bandwidth are described. Described techniques provide for receiving, at a device, one or more audio streams, identifying an available bandwidth for processing the one or more audio streams, locating (based on the available bandwidth) a first set of one or more objects contributing to the one or more audio streams that are located within a threshold radius from the device, and generating an object-based audio stream. The described techniques further provide for extracting a contribution of the first number of objects from the one or more audio streams, generating an HOA audio stream, and outputting an audio feed that includes the HOA audio stream and the object-based audio stream.
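
Illustrative sketch (not the patented method): the abstract locates, based on available bandwidth, the objects within a threshold radius of the device and renders them as an object-based stream, leaving the remainder for an HOA stream. The sketch assumes the radius grows linearly with bandwidth; that rule, the `radius_per_kbps` constant, and the function name are hypothetical, and no actual audio extraction or HOA encoding is performed.

```python
import math


def split_objects(objects, device_pos, bandwidth_kbps, radius_per_kbps=0.05):
    """Partition sound objects into (object_stream, hoa_stream) name lists.

    `objects` maps an object name to its (x, y, z) position. The threshold
    radius is an assumed linear function of the available bandwidth.
    """
    radius = radius_per_kbps * bandwidth_kbps
    near, far = [], []
    for name, position in objects.items():
        if math.dist(position, device_pos) <= radius:
            near.append(name)   # rendered individually as audio objects
        else:
            far.append(name)    # folded into the HOA stream
    return near, far


if __name__ == "__main__":
    scene = {"voice": (1.0, 0.0, 0.0), "car": (10.0, 0.0, 0.0)}
    print(split_objects(scene, (0.0, 0.0, 0.0), bandwidth_kbps=128))
    print(split_objects(scene, (0.0, 0.0, 0.0), bandwidth_kbps=400))
```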

10.

Invention application
3D SOUND ORIENTATION ADAPTABILITY (granted)

Publication (announcement) No.: US20210092543A1

Publication (announcement) date: 2021-03-25

Application No.: US16743275

Filing date: 2020-01-15

Applicant: QUALCOMM Incorporated

Inventors: S M Akramus SALEHIN, Shankar THAGADUR SHIVAPPA, Sanghyun CHI, Nils Gunther PETERS

IPC classification: H04S3/00, G11B20/10

Abstract: An apparatus includes one or more processors configured to receive orientation data and to select, based on the orientation data, a particular filter from among multiple filters. The one or more processors are configured to perform signal processing operations associated with three-dimensional (3D) sound data based on the particular filter.
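
Illustrative sketch (not the patented method): selecting one filter from several based on orientation data can be shown with a nearest-angle lookup. The sketch assumes the orientation is a single yaw angle in degrees, that each stored filter is a short FIR tap list keyed by a reference yaw, and that the signal processing step is a plain causal FIR convolution. All names and the filter values are made up for illustration.

```python
def angular_distance(a_deg, b_deg):
    """Smallest absolute difference between two angles, in degrees."""
    return abs((a_deg - b_deg + 180.0) % 360.0 - 180.0)


def select_filter(yaw_deg, filters):
    """Pick the filter whose reference yaw is closest to the reported yaw."""
    nearest = min(filters, key=lambda ref: angular_distance(yaw_deg, ref))
    return filters[nearest]


def apply_fir(samples, taps):
    """Causal FIR filtering; the output has the same length as the input."""
    out = []
    for n in range(len(samples)):
        acc = 0.0
        for k, tap in enumerate(taps):
            if n - k >= 0:
                acc += tap * samples[n - k]
        out.append(acc)
    return out


if __name__ == "__main__":
    # Hypothetical filter bank keyed by reference yaw in degrees.
    bank = {0.0: [1.0], 90.0: [0.5, 0.5], 180.0: [0.25], 270.0: [0.5, -0.5]}
    taps = select_filter(95.0, bank)
    print(apply_fir([1.0, 1.0, 0.0], taps))
```

The wrap-around distance means a yaw of 350 degrees selects the 0-degree filter rather than the 270-degree one.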
