Patent search ap:("QUALCOMM Incorporated") AND inv:"Lae-Hoon KIM" Page 3

21.

发明申请
SEAMLESS LISTEN-THROUGH FOR A WEARABLE DEVICE 审中-公开

公开(公告)号：US20200304903A1

公开(公告)日：2020-09-24

申请号：US16896010

申请日：2020-06-08

Applicant: QUALCOMM Incorporated

Inventor： Lae-Hoon KIM , Dongmei WANG , Fatemeh SAKI , Taher SHAHBAZI MIRZAHASANLOO , Erik VISSER , Rogerio Guedes ALVES

IPC: H04R1/10

Abstract: Methods, systems, and devices for signal processing are described. Generally, in one example as provided for by the described techniques, a wearable device includes a processor configured to retrieve a plurality of external microphone signals that includes audio sound from outside of the device from a memory; to separate, based on at least information from an internal microphone signal, a self-voice component from a background component; to perform a first listen-through operation on the separated self-voice component to produce a first listen-through signal; and to produce an output audio signal that is based on at least the first listen-through signal, wherein the output audio signal includes an audio zoom signal that includes audio sound of the plurality of external microphone signals.

22.

发明公开
ACOUSTIC CONFIGURATION BASED ON RADIO FREQUENCY SENSING 审中-公开

公开(公告)号：US20240155303A1

公开(公告)日：2024-05-09

申请号：US18549508

申请日：2022-05-02

Applicant: QUALCOMM Incorporated

Inventor： Erik VISSER , Lae-Hoon KIM , Jason FILOS , Xiaoxin ZHANG

IPC: H04S7/00 , G01S7/00 , G01S7/41 , G01S13/72 , H04R3/00 , H04R5/027

CPC classification number: H04S7/303 , G01S7/006 , G01S7/411 , G01S13/726 , H04R3/005 , H04R5/027 , H04S2400/11 , H04S2400/15

Abstract: Disclosed are systems and techniques for detecting audio sources and configuring acoustic device settings. For instance, a wireless device can obtain a first set of radio frequency (RF) sensing data associated with a first plurality of received waveforms corresponding to a first transmitted waveform reflected off of a plurality of reflectors. Based on the first set of RF sensing data, the wireless device can determine a classification of a first reflector from the plurality of reflectors. The wireless device can determine at least one acoustic setting based on the classification of the at least one reflector.

23.

发明公开
SEPARATION OF SELF-VOICE SIGNAL FROM A BACKGROUND SIGNAL USING A SPEECH GENERATIVE NETWORK ON A WEARABLE DEVICE 审中-公开

公开(公告)号：US20230353929A1

公开(公告)日：2023-11-02

申请号：US18349920

申请日：2023-07-10

Applicant: Qualcomm Incorporated

Inventor： Lae-Hoon KIM , Dongmei WANG , Fatemeh SAKI , Taher SHAHBAZI MIRZAHASANLOO , Erik VISSER , Rogerio Guedes ALVES

IPC: H04R1/10

CPC classification number: H04R1/1083 , H04R1/1075 , H04R2420/07 , H04R2460/01 , H04R2460/13

Abstract: A wearable device may include a processor configured to detect a self-voice signal, based on one or more transducers. The processor may be configured to separate the self-voice signal from a background signal in an external audio signal based on using a multi-microphone speech generative network. The processor may also be configured to apply a first filter to an external audio signal, detected by at least one external microphone on the wearable device, during a listen through operation based on an activation of the audio zoom feature to generate a first listen-through signal that includes the external audio signal. The processor may be configured to produce an output audio signal that is based on at least the first listen-through signal that includes the external signal, and is based on the detected self-voice signal.

24.

发明公开
SHARED SPEECH PROCESSING NETWORK FOR MULTIPLE SPEECH APPLICATIONS 审中-公开

公开(公告)号：US20230300527A1

公开(公告)日：2023-09-21

申请号：US18324622

申请日：2023-05-26

Applicant: QUALCOMM Incorporated

Inventor： Lae-Hoon KIM , Sunkuk MOON , Erik VISSER , Prajakt KULKARNI

IPC: H04R3/00 , G10L21/02 , H04R5/04 , G06N20/00 , H04L65/60 , H04L65/80 , G06F18/21 , G06V10/82 , G06V20/20

CPC classification number: H04R3/005 , G10L21/02 , H04R5/04 , G06N20/00 , H04L65/60 , H04L65/80 , G06F18/217 , G06V10/82 , G06V20/20 , H04R2499/13 , H04R2420/07

Abstract: A device to process speech includes a speech processing network that includes an input configured to receive audio data. The speech processing network also includes one or more network layers configured to process the audio data to generate a network output. The speech processing network includes an output configured to be coupled to multiple speech application modules to enable the network output to be provided as a common input to each of the multiple speech application modules.

25.

发明公开
TRANSFORM AMBISONIC COEFFICIENTS USING AN ADAPTIVE NETWORK FOR PRESERVING SPATIAL DIRECTION 审中-公开

公开(公告)号：US20230260525A1

公开(公告)日：2023-08-17

申请号：US18138684

申请日：2023-04-24

Applicant: QUALCOMM Incorporated

Inventor： Lae-Hoon KIM , Shankar THAGADUR SHIVAPPA , S M Akramus SALEHIN , Shuhua ZHANG , Erik VISSER

IPC: G10L19/038 , H04R5/00 , G10L19/002

CPC classification number: G10L19/038 , H04R5/00 , G10L19/002 , H04S2420/11 , H04R2430/21 , G10L19/008

Abstract: A device includes a memory configured to store untransformed ambisonic coefficients at different time segments. The device includes one or more processors configured to obtain the untransformed ambisonic coefficients at the different time segments, where the untransformed ambisonic coefficients at the different time segments represent a soundfield at the different time segments. The one or more processors are configured to apply one adaptive network, based on a constraint that includes preservation of a spatial direction of one or more audio sources in the soundfield at the different time segments, to the untransformed ambisonic coefficients at the different time segments to generate transformed ambisonic coefficients at the different time segments, wherein the transformed ambisonic coefficients at the different time segments represent a modified soundfield at the different time segments, that was modified based on the constraint. The one or more processors are also configured to apply an additional adaptive network.

26.

发明申请
PROCESSING OF AUDIO SIGNALS FROM MULTIPLE MICROPHONES 有权

公开(公告)号：US20230036986A1

公开(公告)日：2023-02-02

申请号：US17814660

申请日：2022-07-25

Applicant: QUALCOMM Incorporated

Inventor： Erik VISSER , Fatemeh SAKI , Yinyi GUO , Lae-Hoon KIM , Rogerio Guedes ALVES , Hannes PESSENTHEINER

IPC: H04R1/40 , H04R3/00 , H04R1/10

Abstract: A first device includes a memory configured to store instructions and one or more processors configured to receive audio signals from multiple microphones. The one or more processors are configured to process the audio signals to generate direction-of-arrival information corresponding to one or more sources of sound represented in one or more of the audio signals. The one or more processors are also configured to and send, to a second device, data based on the direction-of-arrival information and a class or embedding associated with the direction-of-arrival information.

27.

发明申请
SEMANTICALLY-AUGMENTED CONTEXT REPRESENTATION GENERATION 有权

公开(公告)号：US20230034450A1

公开(公告)日：2023-02-02

申请号：US17383284

申请日：2021-07-22

Applicant: QUALCOMM Incorporated

Inventor： Arvind Krishna SRIDHAR , Ravi CHOUDHARY , Lae-Hoon KIM , Erik VISSER

IPC: G10L15/18 , G06K9/72 , G10L15/22

Abstract: A device includes a memory configured to store instructions. The device also includes one or more processors configured to execute the instructions to provide context and one or more items of interest corresponding to the context to a dependency network encoder to generate a semantic-based representation of the context. The one or more processors are also configured to provide the context to a data dependent encoder to generate a context-based representation. The one or more processors are further configured to combine the semantic-based representation and the context-based representation to generate a semantically-augmented representation of the context.

28.

发明申请
AUDIO ZOOM 有权

公开(公告)号：US20220360891A1

公开(公告)日：2022-11-10

申请号：US17316529

申请日：2021-05-10

Applicant: QUALCOMM Incorporated

Inventor： Lae-Hoon KIM , Fatemeh SAKI , Yoon Mo YANG , Erik VISSER

IPC: H04R1/40 , H04R5/04 , H04R5/033 , H04R1/24

Abstract: A device includes one or more processors configured to execute instructions to determine a first phase based on a first audio signal of first audio signals and to determine a second phase based on a second audio signal of second audio signals. The one or more processors are also configured to execute the instructions to apply spatial filtering to selected audio signals of the first audio signals and the second audio signals to generate an enhanced audio signal. The one or more processors are further configured to execute the instructions to generate a first output signal including combining a magnitude of the enhanced audio signal with the first phase and to generate a second output signal including combining the magnitude of the enhanced audio signal with the second phase. The first output signal and the second output signal correspond to an audio zoomed signal.

29.

发明申请
CONTEXT-BASED SPEECH ENHANCEMENT 有权

公开(公告)号：US20220310108A1

公开(公告)日：2022-09-29

申请号：US17209621

申请日：2021-03-23

Applicant: QUALCOMM Incorporated

Inventor： Kyungguen BYUN , Shuhua ZHANG , Lae-Hoon KIM , Erik VISSER , Sunkuk MOON , Vahid MONTAZERI

IPC: G10L21/038

Abstract: A device to perform speech enhancement includes one or more processors configured to obtain input spectral data based on an input signal. The input signal represents sound that includes speech. The one or more processors are also configured to process, using a multi-encoder transformer, the input spectral data and context data to generate output spectral data that represents a speech enhanced version of the input signal.

30.

发明申请
SHARED SPEECH PROCESSING NETWORK FOR MULTIPLE SPEECH APPLICATIONS 有权

公开(公告)号：US20220165285A1

公开(公告)日：2022-05-26

申请号：US17650595

申请日：2022-02-10

Applicant: QUALCOMM Incorporated

Inventor： Lae-Hoon KIM , Sunkuk MOON , Erik VISSER , Prajakt KULKARNI

IPC: G10L21/02 , H04R5/04 , H04R3/00 , G06N20/00 , H04L65/60 , H04L65/80 , G06K9/62

Abstract: A device to process speech includes a speech processing network that includes an input configured to receive audio data corresponding to audio captured by one or more microphones. The speech processing network also includes one or more network layers configured to process the audio data to generate a network output. The speech processing network includes an output configured to be coupled to multiple speech application modules to enable the network output to be provided as a common input to each of the multiple speech application modules. A first speech application module corresponds to a speaker verifier, and a second speech application module corresponds to a speech recognition network.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification