Abstract:
A vehicle includes an interface device, an in-vehicle control unit, a functional unit, and processing circuitry. The interface device receives a spoken command to identify an in-cabin vehicle zone of two or more in-cabin vehicle zones of the vehicle, and receives background audio data concurrently with a portion of the spoken command. The in-vehicle control unit separates the background audio data from the spoken command, and selects which in-cabin vehicle zone of the two or more in-cabin vehicle zones is identified by the spoken command. The functional unit controls a function within the vehicle. The processing circuitry stores, to a command buffer, data processed from the received spoken command, and controls, based on the data processed from the received spoken command, the functional unit using audio input received from the selected in-cabin vehicle zone.
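Purely as an illustration of the control flow described above, here is a minimal Python sketch of zone selection and command buffering. The class name ZoneCommandRouter, the zone labels, and the keyword-matching rule are all invented for the example, not taken from the patent:

    # Illustrative sketch only; names and the matching rule are assumptions.
    from collections import deque

    class ZoneCommandRouter:
        def __init__(self, zones, buffer_size=16):
            self.zones = zones                          # in-cabin zone labels
            self.command_buffer = deque(maxlen=buffer_size)
            self.selected_zone = None

        def handle_spoken_command(self, text):
            # Select whichever in-cabin zone the spoken command names.
            for zone in self.zones:
                if zone.replace("_", " ") in text.lower():
                    self.selected_zone = zone
                    self.command_buffer.append(text)    # store processed command data
                    return zone
            return None

        def control_function(self, zone_audio):
            # Act on audio input only if it comes from the selected zone.
            if self.selected_zone and zone_audio["zone"] == self.selected_zone:
                return f"adjusting function using audio from {self.selected_zone}"
            return "ignored"

    router = ZoneCommandRouter(["driver", "front_passenger", "rear_left", "rear_right"])
    router.handle_spoken_command("Turn up the volume for the rear left seat")
    print(router.control_function({"zone": "rear_left", "samples": []}))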
Abstract:
A method for speech modeling by an electronic device is described. The method includes obtaining a real-time noise reference based on a noisy speech signal. The method also includes obtaining a real-time noise dictionary based on the real-time noise reference. The method further includes obtaining a first speech dictionary and a second speech dictionary. The method additionally includes reducing residual noise based on the real-time noise dictionary and the first speech dictionary to produce a residual noise-suppressed speech signal at a first modeling stage. The method also includes generating a reconstructed speech signal based on the residual noise-suppressed speech signal and the second speech dictionary at a second modeling stage.
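The two stages lend themselves to a dictionary-based (e.g., NMF-style) formulation. The following Python sketch assumes fixed nonnegative dictionaries and multiplicative activation updates, which is one plausible realization rather than the patented method; all sizes and data are toy values:

    # Illustrative two-stage dictionary suppression/reconstruction (NMF-style).
    import numpy as np

    def fit_activations(V, W, n_iter=50, eps=1e-9):
        # Fixed-dictionary NMF: find H >= 0 minimizing ||V - W H|| (Frobenius).
        H = np.random.rand(W.shape[1], V.shape[1])
        for _ in range(n_iter):
            H *= (W.T @ V) / (W.T @ (W @ H) + eps)
        return H

    rng = np.random.default_rng(0)
    V = np.abs(rng.standard_normal((64, 100)))      # |STFT| of noisy speech (toy)
    W_noise = np.abs(rng.standard_normal((64, 8)))  # real-time noise dictionary
    W_sp1 = np.abs(rng.standard_normal((64, 16)))   # first speech dictionary
    W_sp2 = np.abs(rng.standard_normal((64, 16)))   # second speech dictionary

    # Stage 1: explain the mixture with noise + speech atoms, then keep the
    # speech part via a Wiener-style mask (residual noise suppression).
    W1 = np.hstack([W_noise, W_sp1])
    H1 = fit_activations(V, W1)
    noise_hat = W_noise @ H1[:W_noise.shape[1]]
    speech_hat = W_sp1 @ H1[W_noise.shape[1]:]
    V_stage1 = V * speech_hat / (speech_hat + noise_hat + 1e-9)

    # Stage 2: re-express the suppressed signal on the second speech
    # dictionary to produce the reconstructed speech magnitude.
    H2 = fit_activations(V_stage1, W_sp2)
    V_recon = W_sp2 @ H2
    print(V_recon.shape)  # (64, 100)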
Abstract:
A multichannel acoustic system (MAS) comprises an arrangement of microphones and loudspeakers and a multichannel acoustic processor (MAP) that together enhance conversational speech between two or more persons in a shared acoustic space such as an automobile. The enhancements are achieved by receiving sound signals substantially originating from relatively near sound sources; filtering the sound signals to cancel at least one echo signal detected for at least one microphone from among the plurality of microphones; filtering the sound signals received by the plurality of microphones to cancel at least one feedback signal detected for at least one microphone from among the plurality of microphones; and reproducing the filtered sound signals for each microphone from among the plurality of microphones on a corresponding subset of loudspeakers that are relatively far from the source microphone.
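As a toy illustration of the final routing step (reproducing each microphone's filtered signal only on loudspeakers far from it), the following Python sketch uses an invented one-dimensional cabin geometry and distance threshold:

    # Sketch of far-loudspeaker routing; geometry and threshold are invented.
    import numpy as np

    # positions (meters along the cabin) of microphones and loudspeakers
    mic_pos = np.array([0.5, 1.5, 2.5, 3.5])
    spk_pos = np.array([0.5, 1.5, 2.5, 3.5])
    FAR_THRESHOLD = 1.5  # reproduce a mic only on loudspeakers at least this far away

    # routing[i, j] = 1 when loudspeaker j is "far" from microphone i
    routing = (np.abs(spk_pos[None, :] - mic_pos[:, None]) > FAR_THRESHOLD).astype(float)

    def mix_to_loudspeakers(mic_frames):
        # mic_frames: (n_mics, n_samples) of echo/feedback-filtered signals.
        # Each loudspeaker plays the sum of the far microphones' signals.
        return routing.T @ mic_frames

    frames = np.random.default_rng(1).standard_normal((4, 256))
    print(mix_to_loudspeakers(frames).shape)  # (4, 256)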
Abstract:
A method for multi-channel echo cancellation and noise suppression is described. One of multiple echo estimates is selected for non-linear echo cancellation. Echo notch masking is performed on a noise-suppressed signal based on an echo direction of arrival (DOA) to produce an echo-suppressed signal. Non-linear echo cancellation is performed on the echo-suppressed signal based, at least in part, on the selected echo estimate.
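One way to picture the echo notch masking step is as a per-bin attenuation around the echo DOA. The Python sketch below assumes per-time-frequency-bin DOA estimates are already available; the notch width and floor values are illustrative choices, not from the patent:

    # Sketch of echo notch masking over a noise-suppressed STFT: bins whose
    # estimated DOA falls inside a notch around the echo DOA are attenuated.
    import numpy as np

    def echo_notch_mask(doa_per_bin_deg, echo_doa_deg, width_deg=15.0, floor=0.1):
        # circular angular distance of each bin's DOA from the echo DOA
        dist = np.abs(((doa_per_bin_deg - echo_doa_deg + 180) % 360) - 180)
        mask = np.ones_like(doa_per_bin_deg, dtype=float)
        mask[dist < width_deg] = floor      # notch out bins arriving from the echo DOA
        return mask

    rng = np.random.default_rng(2)
    S = rng.standard_normal((257, 50)) + 1j * rng.standard_normal((257, 50))  # noise-suppressed STFT
    doas = rng.uniform(0, 360, S.shape)     # per-bin DOA estimates (degrees)
    S_echo_suppressed = S * echo_notch_mask(doas, echo_doa_deg=120.0)
    # non-linear cancellation using the selected echo estimate would follow here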
Abstract:
An apparatus includes multiple microphones to generate audio signals based on sound of a far-field acoustic environment. The apparatus also includes a signal processing system to process the audio signals to generate at least one processed audio signal. The signal processing system is configured to update one or more processing parameters while operating in a first operational mode and is configured to use a static version of the one or more processing parameters while operating in a second operational mode. The apparatus further includes a keyword detection system to perform keyword detection based on the at least one processed audio signal to determine whether the sound includes an utterance corresponding to a keyword and, based on a result of the keyword detection, to send a control signal to the signal processing system to change an operational mode of the signal processing system.
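The mode behavior can be illustrated with a processor whose only "parameter" is a recursive noise estimate that adapts in the first mode and is frozen in the second. The Python sketch below, including the set_adaptive control hook, is an assumption about one plausible realization:

    # Sketch of adaptive-vs-static parameters driven by a keyword detector.
    import numpy as np

    class SignalProcessor:
        def __init__(self, n_bins=257, alpha=0.9):
            self.adaptive = True
            self.noise_psd = np.zeros(n_bins)   # the "processing parameter"
            self.alpha = alpha

        def set_adaptive(self, flag):           # control signal from the keyword detector
            self.adaptive = flag

        def process(self, frame_psd):
            if self.adaptive:
                # first mode: keep updating the noise estimate
                self.noise_psd = self.alpha * self.noise_psd + (1 - self.alpha) * frame_psd
            # second mode: the frozen (static) estimate keeps processing stable
            gain = np.maximum(1.0 - self.noise_psd / (frame_psd + 1e-9), 0.1)
            return gain * frame_psd

    proc = SignalProcessor()
    out = proc.process(np.abs(np.random.default_rng(6).standard_normal(257)) ** 2)
    proc.set_adaptive(False)  # e.g., keyword detected: freeze parameters for the command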
Abstract:
A headset device includes a first earpiece configured to receive a reference sound and to generate a first reference audio signal based on the reference sound. The headset device further includes a second earpiece configured to receive the reference sound and to generate a second reference audio signal based on the reference sound. The headset device further includes a controller coupled to the first earpiece and to the second earpiece. The controller is configured to generate a first signal and a second signal based on a phase relationship between the first reference audio signal and the second reference audio signal. The controller is further configured to output the first signal to the first earpiece and output the second signal to the second earpiece.
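As a rough illustration of deriving the two output signals from the measured phase relationship, the Python sketch below estimates an inter-earpiece phase difference from the cross-spectrum of the two reference signals; this specific policy is an assumption, not the patent's construction:

    # Sketch only: cross-spectrum phase as the "phase relationship".
    import numpy as np

    def outputs_from_phase(ref_left, ref_right, n_fft=512):
        L = np.fft.rfft(ref_left, n_fft)
        R = np.fft.rfft(ref_right, n_fft)
        phase_diff = np.angle(L * np.conj(R))    # inter-earpiece phase relationship
        # Example policy: keep the left reference on the left ear, and give the
        # right ear a copy re-aligned to the measured phase relationship
        # (left magnitude, right phase).
        out_left = ref_left[:n_fft]
        out_right = np.fft.irfft(L * np.exp(-1j * phase_diff), n_fft)
        return out_left, out_right

    rng = np.random.default_rng(3)
    noise = rng.standard_normal(512)             # shared reference sound
    left, right = outputs_from_phase(noise, np.roll(noise, 3))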
Abstract:
Disclosed is an application interface that takes into account the user's gaze direction relative to who is speaking in an interactive multi-participant environment where audio-based contextual information and/or visual-based semantic information is being presented. Among the various implementations, two different types of microphone array devices (MADs) may be used. The first type of MAD is a steerable microphone array (a.k.a. a steerable array) which is worn by a user in a known orientation with regard to the user's eyes, and multiple users may each wear a steerable array. The second type of MAD is a fixed-location microphone array (a.k.a. a fixed array) which is placed in the same acoustic space as the users (one or more of whom are using steerable arrays).
Abstract:
A system that tracks a social interaction between a plurality of participants includes a fixed beamformer adapted to output a first spatially filtered output and configured to receive a plurality of second spatially filtered outputs from a plurality of steerable beamformers. Each steerable beamformer outputs a respective one of the second spatially filtered outputs associated with a different one of the participants. The system also includes a processor capable of determining a similarity between the first spatially filtered output and each of the second spatially filtered outputs. The processor determines the social interaction between the participants based on the similarity between the first spatially filtered output and each of the second spatially filtered outputs.
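A simple stand-in for the similarity measure is normalized cross-correlation between the fixed array's output and each steerable array's output, as in this Python sketch (the correlation choice and the participant names are illustrative, not from the patent):

    # Sketch: similarity between fixed and steerable beamformer outputs.
    import numpy as np

    def similarity(fixed_out, steerable_out):
        a = fixed_out - fixed_out.mean()
        b = steerable_out - steerable_out.mean()
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))

    rng = np.random.default_rng(4)
    fixed = rng.standard_normal(4000)                     # first spatially filtered output
    steerables = {                                        # one output per participant
        "alice": fixed + 0.1 * rng.standard_normal(4000), # highly similar: likely active talker
        "bob": rng.standard_normal(4000),
    }
    scores = {who: similarity(fixed, out) for who, out in steerables.items()}
    active = max(scores, key=scores.get)
    print(scores, "->", active)  # participant whose beam best matches the fixed array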
Abstract:
A crosstalk cancelation technique reduces feedback in a shared acoustic space by canceling out some or all parts of sound signals that would otherwise be produced by a loudspeaker only to be captured by a microphone that, recursively, would cause these sound signals to be reproduced again on the loudspeaker as feedback. Crosstalk cancelation can be used in a multichannel acoustic system (MAS) comprising an arrangement of microphones, loudspeakers, and a processor that together enhance conversational speech between persons in a shared acoustic space. To achieve crosstalk cancelation, the processor analyzes the input of each microphone, compares it to the output of the far loudspeaker(s) relative to each such microphone, cancels out any portion of a sound signal received by the microphone that matches signals just produced by the far loudspeaker(s), and sends only the remaining sound signal (if any) to such far loudspeakers.
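The compare-and-cancel step described above is essentially adaptive echo cancellation. The following Python sketch uses an NLMS filter for one microphone/far-loudspeaker pair; the filter length, step size, and toy coupling path are invented for the example:

    # Sketch: subtract the part of the mic signal matching what the far
    # loudspeaker just played, and forward only the residual.
    import numpy as np

    def nlms_cancel(mic, far_spk, n_taps=32, mu=0.5, eps=1e-6):
        w = np.zeros(n_taps)                       # estimate of loudspeaker-to-mic path
        out = np.zeros_like(mic)
        for n in range(n_taps, len(mic)):
            x = far_spk[n - n_taps:n][::-1]        # recent loudspeaker samples
            e = mic[n] - w @ x                     # residual after cancelling the match
            w += mu * e * x / (x @ x + eps)        # adapt toward the true coupling
            out[n] = e
        return out                                 # only this residual is re-sent onward

    rng = np.random.default_rng(5)
    spk = rng.standard_normal(8000)                # far loudspeaker output
    coupling = np.array([0.0, 0.6, 0.3, 0.1])      # toy acoustic path to the microphone
    mic = np.convolve(spk, coupling)[:8000] + 0.05 * rng.standard_normal(8000)
    residual = nlms_cancel(mic, spk)
    print(np.std(mic[1000:]), np.std(residual[1000:]))  # residual energy drops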
Abstract:
Techniques for processing directionally-encoded audio to account for spatial characteristics of a listener playback environment are disclosed. The directionally-encoded audio data includes spatial information indicative of one or more directions of sound sources in an audio scene. The audio data is modified based on input data identifying the spatial characteristics of the playback environment. The spatial characteristics may correspond to actual loudspeaker locations in the playback environment. The directionally-encoded audio may also be processed to permit focusing/defocusing on sound sources or particular directions in an audio scene. The disclosed techniques may allow a recorded audio scene to be more accurately reproduced at playback time, regardless of the output loudspeaker setup. Another advantage is that a user may dynamically configure audio data so that it better conforms to the user's particular loudspeaker layouts and/or the user's desired focus on particular subjects or areas in an audio scene.
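To make the playback-environment adaptation concrete, the Python sketch below re-pans a directionally-encoded source onto the listener's actual loudspeaker angles and applies a simple focus gain on a chosen direction; the pairwise constant-power panning and the gain values are assumed choices, not the disclosed techniques:

    # Sketch: render a directional source to an arbitrary loudspeaker layout.
    import numpy as np

    def pan_to_layout(source_deg, spk_deg):
        # Amplitude-pan between the two loudspeakers bracketing the source direction.
        spk = np.asarray(sorted(spk_deg), dtype=float)
        gains = np.zeros(len(spk))
        diffs = (spk - source_deg) % 360
        right = int(np.argmin(diffs))              # nearest speaker "after" the source
        left = (right - 1) % len(spk)
        span = (spk[right] - spk[left]) % 360 or 360
        frac = ((source_deg - spk[left]) % 360) / span
        gains[left] = np.cos(frac * np.pi / 2)     # constant-power pair panning
        gains[right] = np.sin(frac * np.pi / 2)
        return gains

    def focus_gain(source_deg, focus_deg, width_deg=30.0):
        dist = abs(((source_deg - focus_deg + 180) % 360) - 180)
        return 1.0 if dist < width_deg else 0.25   # de-emphasize off-focus directions

    layout = [-110, -30, 0, 30, 110]               # listener's actual speaker angles
    print(pan_to_layout(20.0, layout))             # gains for a source at 20 degrees
    print(focus_gain(20.0, focus_deg=15.0))        # focused near the source -> 1.0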