Patent search ap:("QUALCOMM INCORPORATED") AND inv:"Erik Visser" Page 10

91.

发明申请
Audio User Interaction Recognition and Context Refinement 审中-公开
Title translation: 音频用户交互识别和上下文优化

公开(公告)号：US20130304476A1

公开(公告)日：2013-11-14

申请号：US13674773

申请日：2012-11-12

Applicant: QUALCOMM INCORPORATED

Inventor： Lae-Hoon Kim , Jongwon Shin , Erik Visser

IPC: G10L21/00

CPC classification number: H04R29/005 , G01S3/80 , G10L17/00 , G10L21/00 , G10L25/06 , G10L25/48 , G10L2021/02166 , H04L65/403 , H04N7/15 , H04R1/406 , H04R3/005 , H04R29/008 , H04R2430/20 , H04R2460/01 , H04R2499/11

Abstract: A system which performs social interaction analysis for a plurality of participants includes a processor. The processor is configured to determine a similarity between a first spatially filtered output and each of a plurality of second spatially filtered outputs. The processor is configured to determine the social interaction between the participants based on the similarities between the first spatially filtered output and each of the second spatially filtered outputs and display an output that is representative of the social interaction between the participants. The first spatially filtered output is received from a fixed microphone array, and the second spatially filtered outputs are received from a plurality of steerable microphone arrays each corresponding to a different participant.

Abstract translation: 对多个参与者进行社交交互分析的系统包括处理器。处理器被配置为确定第一空间滤波输出与多个第二空间滤波输出中的每一个之间的相似度。处理器被配置为基于第一空间滤波输出和第二空间滤波输出中的每一个之间的相似度来确定参与者之间的社交交互，并且显示代表参与者之间的社交交互的输出。从固定的麦克风阵列接收第一空间滤波的输出，并且从多个可操纵的麦克风阵列接收第二空间滤波的输出，每个麦克风阵列对应于不同的参与者。

92.

发明申请
SYSTEMS AND METHODS FOR AUDIO SIGNAL PROCESSING 审中-公开
Title translation: 用于音频信号处理的系统和方法

公开(公告)号：US20130282372A1

公开(公告)日：2013-10-24

申请号：US13828158

申请日：2013-03-14

Applicant: QUALCOMM INCORPORATED

Inventor： Erik Visser , Lae-Hoon Kim , Yinyi Guo , Juhan Nam

IPC: G10L15/20

CPC classification number: G10L21/0208 , G10L15/20 , G10L21/0316 , G10L25/93 , G10L2021/02165

Abstract: A method for detecting voice activity by an electronic device is described. The method includes detecting near end speech based on a near end voiced speech detector and at least one single channel voice activity detector. The near end voiced speech detector is associated with a harmonic statistic based on a speech pitch histogram.

Abstract translation: 描述了一种用于由电子设备检测语音活动的方法。该方法包括基于近端浊音语音检测器和至少一个单声道语音活动检测器检测近端语音。近端浊音语音检测器与基于语音音调直方图的谐波统计量相关联。

93.

发明申请
SYSTEMS AND METHODS FOR DISPLAYING A USER INTERFACE 审中-公开
Title translation: 用于显示用户界面的系统和方法

公开(公告)号：US20130275873A1

公开(公告)日：2013-10-17

申请号：US13836543

申请日：2013-03-15

Applicant: QUALCOMM INCORPORATED

Inventor： Jeffrey C. Shaw , Jeremy P. Toman , Erik Visser , Phuong L. Ton , Lae-Hoon Kim

IPC: G06F3/16

CPC classification number: G10L17/005 , G01B21/00 , G01S3/80 , G01S3/8006 , G01S3/8083 , G01S5/18 , G01S5/186 , G01S15/025 , G01S15/876 , G06F1/1633 , G06F3/04817 , G06F3/0484 , G06F3/04883 , G06F3/167 , G06F16/433 , G10L2021/02166 , H04R1/08 , H04R3/00 , H04R3/005 , H04S7/40

Abstract: A method for displaying a user interface on an electronic device is described. The method includes presenting a user interface. The user interface includes a coordinate system. The coordinate system corresponds to physical coordinates based on sensor data. The method also includes displaying at least a target audio signal and an interfering audio signal on the user interface.

Abstract translation: 描述了在电子设备上显示用户界面的方法。该方法包括呈现用户界面。用户界面包括坐标系。坐标系对应于基于传感器数据的物理坐标。该方法还包括在用户界面上至少显示目标音频信号和干扰音频信号。

94.

发明申请
OBJECT RECOGNITION USING MULTI-MODAL MATCHING SCHEME 有权
Title translation: 使用多模式匹配方案的对象识别

公开(公告)号：US20130272548A1

公开(公告)日：2013-10-17

申请号：US13664295

申请日：2012-10-30

Applicant: QUALCOMM INCORPORATED

Inventor： Erik Visser , Haiyin Wang , Hasib A. Siddiqui , Lae-Hoon Kim

IPC: G06K9/00 , H04R3/00

CPC classification number: G06K9/00624 , G06K9/00 , G06K9/0063 , G06K9/3233 , G06K9/4628 , G06K9/4671 , G06K9/6293 , G06T7/20 , H04R3/00 , H04R3/005 , H04S7/30 , H04S2400/11 , H04S2400/15

Abstract: Methods, systems and articles of manufacture for recognizing and locating one or more objects in a scene are disclosed. An image and/or video of the scene are captured. Using audio recorded at the scene, an object search of the captured scene is narrowed down. For example, the direction of arrival (DOA) of a sound can be determined and used to limit the search area in a captured image/video. In another example, keypoint signatures may be selected based on types of sounds identified in the recorded audio. A keypoint signature corresponds to a particular object that the system is configured to recognize. Objects in the scene may then be recognized using a shift invariant feature transform (SIFT) analysis comparing keypoints identified in the captured scene to the selected keypoint signatures.

Abstract translation: 公开了用于识别和定位场景中的一个或多个物体的方法，系统和制品。拍摄场景的图像和/或视频。使用在场景录制的音频，捕获的场景的对象搜索变窄。例如，可以确定声音的到达方向（DOA）并用于限制捕获的图像/视频中的搜索区域。在另一示例中，可以基于记录的音频中识别的声音的类型来选择关键点签名。关键点签名对应于系统配置为识别的特定对象。然后可以使用移位不变特征变换（SIFT）分析来比较场景中的对象，比较在捕获的场景中识别的关键点与所选择的关键点签名。

95.

发明申请
SYSTEMS, METHODS, AND APPARATUS FOR SPATIALLY DIRECTIVE FILTERING 有权
Title translation: 用于空间指导性过滤的系统，方法和装置

公开(公告)号：US20130272539A1

公开(公告)日：2013-10-17

申请号：US13835139

申请日：2013-03-15

Applicant: QUALCOMM INCORPORATED

Inventor： Lae-Hoon Kim , Erik Visser

IPC: H04R3/00

CPC classification number: G01S3/8006 , G01B21/00 , G01S3/80 , G01S3/8083 , G01S5/18 , G01S5/186 , G01S15/025 , G01S15/876 , G06F1/1633 , G06F3/0484 , G06F3/167 , G10L2021/02166 , H04R1/08 , H04R3/00 , H04R3/005

Abstract: Systems, methods, and apparatus are described for applying, based on angles of arrival of source components relative to the axes of different microphone pairs, a spatially directive filter to a multichannel audio signal to produce an output signal.

Abstract translation: 描述了系统，方法和装置，用于基于源组件相对于不同麦克风对的轴的到达角度将空间指向滤波器应用于多声道音频信号以产生输出信号。

96.

发明授权
Processing of audio signals from multiple microphones 有权

公开(公告)号：US12244994B2

公开(公告)日：2025-03-04

申请号：US17814660

申请日：2022-07-25

Applicant: QUALCOMM Incorporated

Inventor： Erik Visser , Fatemeh Saki , Yinyi Guo , Lae-Hoon Kim , Rogerio Guedes Alves , Hannes Pessentheiner

IPC: H04R3/00 , H04R1/10 , H04R1/40 , H04R5/027 , H04S3/00 , H04S7/00

Abstract: A first device includes a memory configured to store instructions and one or more processors configured to receive audio signals from multiple microphones. The one or more processors are configured to process the audio signals to generate direction-of-arrival information corresponding to one or more sources of sound represented in one or more of the audio signals. The one or more processors are also configured to and send, to a second device, data based on the direction-of-arrival information and a class or embedding associated with the direction-of-arrival information.

97.

发明授权
Shared speech processing network for multiple speech applications 有权

公开(公告)号：US12200450B2

公开(公告)日：2025-01-14

申请号：US18324622

申请日：2023-05-26

Applicant: QUALCOMM Incorporated

Inventor： Lae-Hoon Kim , Sunkuk Moon , Erik Visser , Prajakt Kulkarni

IPC: H04R3/00 , G06F18/21 , G06N20/00 , G06V10/82 , G06V20/20 , G10L21/02 , H04L65/60 , H04L65/80 , H04R5/04

Abstract: A device to process speech includes a speech processing network that includes an input configured to receive audio data. The speech processing network also includes one or more network layers configured to process the audio data to generate a network output. The speech processing network includes an output configured to be coupled to multiple speech application modules to enable the network output to be provided as a common input to each of the multiple speech application modules.

98.

发明授权
Active self-voice naturalization using a bone conduction sensor 有权

公开(公告)号：US12063490B2

公开(公告)日：2024-08-13

申请号：US18167823

申请日：2023-02-10

Applicant: QUALCOMM Incorporated

Inventor： Lae-Hoon Kim , Rogerio Guedes Alves , Jacob Jon Bean , Erik Visser

IPC: H04R3/04

CPC classification number: H04R3/04 , H04R2460/13

Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device to receive an input audio signal from one or more outer microphones, an input audio signal from one or more inner microphones, and a bone conduction signal from a bone conduction sensor based on the input audio signals. The wearable device may filter the bone conduction signal based on a set of frequencies of the input audio signals, such as a low frequency portion of the input audio signals. For example, the wearable device may apply a filter to the bone conduction signal that accounts for an error in the input audio signals. The wearable device may add a gain to the filtered bone conduction signal and may equalize the filtered bone conduction signal based on the gain. The wearable device may output an audio signal to a speaker.

99.

发明公开
SYSTEMS, METHODS, APPARATUS, AND COMPUTER-READABLE MEDIA FOR GESTURAL MANIPULATION OF A SOUND FIELD 审中-公开

公开(公告)号：US20240098420A1

公开(公告)日：2024-03-21

申请号：US18507661

申请日：2023-11-13

Applicant: QUALCOMM Incorporated

Inventor： Pei Xiang , Erik Visser

IPC: H04R5/04 , G06F3/01 , H04R3/00 , H04S7/00

CPC classification number: H04R5/04 , G06F3/017 , H04R3/005 , H04S7/303 , H04R2203/12 , H04R2430/20

Abstract: Gesture-responsive modification of a generated sound field is described.

100.

发明授权
Method and apparatus for target sound detection 有权

公开(公告)号：US11862189B2

公开(公告)日：2024-01-02

申请号：US16837420

申请日：2020-04-01

Applicant: QUALCOMM Incorporated

Inventor： Prajakt Kulkarni , Yinyi Guo , Erik Visser

IPC: G10L25/78 , G10L15/16 , H04W52/02 , G06F18/211 , G06F18/241

CPC classification number: G10L25/78 , G06F18/211 , G06F18/241 , G10L15/16 , H04W52/0229 , H04W52/0261

Abstract: A device to perform target sound detection includes one or more processors. The one or more processors include a buffer configured to store audio data and a target sound detector. The target sound detector includes a first stage and a second stage. The first stage includes a binary target sound classifier configured to process the audio data. The first stage is configured to activate the second stage in response to detection of a target sound. The second stage is configured to receive the audio data from the buffer in response to the detection of the target sound.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification