-
公开(公告)号:US20200304903A1
公开(公告)日:2020-09-24
申请号:US16896010
申请日:2020-06-08
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon KIM , Dongmei WANG , Fatemeh SAKI , Taher SHAHBAZI MIRZAHASANLOO , Erik VISSER , Rogerio Guedes ALVES
IPC: H04R1/10
Abstract: Methods, systems, and devices for signal processing are described. Generally, in one example as provided for by the described techniques, a wearable device includes a processor configured to retrieve a plurality of external microphone signals that includes audio sound from outside of the device from a memory; to separate, based on at least information from an internal microphone signal, a self-voice component from a background component; to perform a first listen-through operation on the separated self-voice component to produce a first listen-through signal; and to produce an output audio signal that is based on at least the first listen-through signal, wherein the output audio signal includes an audio zoom signal that includes audio sound of the plurality of external microphone signals.
-
公开(公告)号:US20240155303A1
公开(公告)日:2024-05-09
申请号:US18549508
申请日:2022-05-02
Applicant: QUALCOMM Incorporated
Inventor: Erik VISSER , Lae-Hoon KIM , Jason FILOS , Xiaoxin ZHANG
CPC classification number: H04S7/303 , G01S7/006 , G01S7/411 , G01S13/726 , H04R3/005 , H04R5/027 , H04S2400/11 , H04S2400/15
Abstract: Disclosed are systems and techniques for detecting audio sources and configuring acoustic device settings. For instance, a wireless device can obtain a first set of radio frequency (RF) sensing data associated with a first plurality of received waveforms corresponding to a first transmitted waveform reflected off of a plurality of reflectors. Based on the first set of RF sensing data, the wireless device can determine a classification of a first reflector from the plurality of reflectors. The wireless device can determine at least one acoustic setting based on the classification of the at least one reflector.
-
23.
公开(公告)号:US20230353929A1
公开(公告)日:2023-11-02
申请号:US18349920
申请日:2023-07-10
Applicant: Qualcomm Incorporated
Inventor: Lae-Hoon KIM , Dongmei WANG , Fatemeh SAKI , Taher SHAHBAZI MIRZAHASANLOO , Erik VISSER , Rogerio Guedes ALVES
IPC: H04R1/10
CPC classification number: H04R1/1083 , H04R1/1075 , H04R2420/07 , H04R2460/01 , H04R2460/13
Abstract: A wearable device may include a processor configured to detect a self-voice signal, based on one or more transducers. The processor may be configured to separate the self-voice signal from a background signal in an external audio signal based on using a multi-microphone speech generative network. The processor may also be configured to apply a first filter to an external audio signal, detected by at least one external microphone on the wearable device, during a listen through operation based on an activation of the audio zoom feature to generate a first listen-through signal that includes the external audio signal. The processor may be configured to produce an output audio signal that is based on at least the first listen-through signal that includes the external signal, and is based on the detected self-voice signal.
-
公开(公告)号:US20230300527A1
公开(公告)日:2023-09-21
申请号:US18324622
申请日:2023-05-26
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon KIM , Sunkuk MOON , Erik VISSER , Prajakt KULKARNI
IPC: H04R3/00 , G10L21/02 , H04R5/04 , G06N20/00 , H04L65/60 , H04L65/80 , G06F18/21 , G06V10/82 , G06V20/20
CPC classification number: H04R3/005 , G10L21/02 , H04R5/04 , G06N20/00 , H04L65/60 , H04L65/80 , G06F18/217 , G06V10/82 , G06V20/20 , H04R2499/13 , H04R2420/07
Abstract: A device to process speech includes a speech processing network that includes an input configured to receive audio data. The speech processing network also includes one or more network layers configured to process the audio data to generate a network output. The speech processing network includes an output configured to be coupled to multiple speech application modules to enable the network output to be provided as a common input to each of the multiple speech application modules.
-
25.
公开(公告)号:US20230260525A1
公开(公告)日:2023-08-17
申请号:US18138684
申请日:2023-04-24
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon KIM , Shankar THAGADUR SHIVAPPA , S M Akramus SALEHIN , Shuhua ZHANG , Erik VISSER
IPC: G10L19/038 , H04R5/00 , G10L19/002
CPC classification number: G10L19/038 , H04R5/00 , G10L19/002 , H04S2420/11 , H04R2430/21 , G10L19/008
Abstract: A device includes a memory configured to store untransformed ambisonic coefficients at different time segments. The device includes one or more processors configured to obtain the untransformed ambisonic coefficients at the different time segments, where the untransformed ambisonic coefficients at the different time segments represent a soundfield at the different time segments. The one or more processors are configured to apply one adaptive network, based on a constraint that includes preservation of a spatial direction of one or more audio sources in the soundfield at the different time segments, to the untransformed ambisonic coefficients at the different time segments to generate transformed ambisonic coefficients at the different time segments, wherein the transformed ambisonic coefficients at the different time segments represent a modified soundfield at the different time segments, that was modified based on the constraint. The one or more processors are also configured to apply an additional adaptive network.
-
公开(公告)号:US20230036986A1
公开(公告)日:2023-02-02
申请号:US17814660
申请日:2022-07-25
Applicant: QUALCOMM Incorporated
Inventor: Erik VISSER , Fatemeh SAKI , Yinyi GUO , Lae-Hoon KIM , Rogerio Guedes ALVES , Hannes PESSENTHEINER
Abstract: A first device includes a memory configured to store instructions and one or more processors configured to receive audio signals from multiple microphones. The one or more processors are configured to process the audio signals to generate direction-of-arrival information corresponding to one or more sources of sound represented in one or more of the audio signals. The one or more processors are also configured to and send, to a second device, data based on the direction-of-arrival information and a class or embedding associated with the direction-of-arrival information.
-
公开(公告)号:US20230034450A1
公开(公告)日:2023-02-02
申请号:US17383284
申请日:2021-07-22
Applicant: QUALCOMM Incorporated
Inventor: Arvind Krishna SRIDHAR , Ravi CHOUDHARY , Lae-Hoon KIM , Erik VISSER
Abstract: A device includes a memory configured to store instructions. The device also includes one or more processors configured to execute the instructions to provide context and one or more items of interest corresponding to the context to a dependency network encoder to generate a semantic-based representation of the context. The one or more processors are also configured to provide the context to a data dependent encoder to generate a context-based representation. The one or more processors are further configured to combine the semantic-based representation and the context-based representation to generate a semantically-augmented representation of the context.
-
公开(公告)号:US20220360891A1
公开(公告)日:2022-11-10
申请号:US17316529
申请日:2021-05-10
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon KIM , Fatemeh SAKI , Yoon Mo YANG , Erik VISSER
Abstract: A device includes one or more processors configured to execute instructions to determine a first phase based on a first audio signal of first audio signals and to determine a second phase based on a second audio signal of second audio signals. The one or more processors are also configured to execute the instructions to apply spatial filtering to selected audio signals of the first audio signals and the second audio signals to generate an enhanced audio signal. The one or more processors are further configured to execute the instructions to generate a first output signal including combining a magnitude of the enhanced audio signal with the first phase and to generate a second output signal including combining the magnitude of the enhanced audio signal with the second phase. The first output signal and the second output signal correspond to an audio zoomed signal.
-
公开(公告)号:US20220310108A1
公开(公告)日:2022-09-29
申请号:US17209621
申请日:2021-03-23
Applicant: QUALCOMM Incorporated
Inventor: Kyungguen BYUN , Shuhua ZHANG , Lae-Hoon KIM , Erik VISSER , Sunkuk MOON , Vahid MONTAZERI
IPC: G10L21/038
Abstract: A device to perform speech enhancement includes one or more processors configured to obtain input spectral data based on an input signal. The input signal represents sound that includes speech. The one or more processors are also configured to process, using a multi-encoder transformer, the input spectral data and context data to generate output spectral data that represents a speech enhanced version of the input signal.
-
公开(公告)号:US20220165285A1
公开(公告)日:2022-05-26
申请号:US17650595
申请日:2022-02-10
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon KIM , Sunkuk MOON , Erik VISSER , Prajakt KULKARNI
Abstract: A device to process speech includes a speech processing network that includes an input configured to receive audio data corresponding to audio captured by one or more microphones. The speech processing network also includes one or more network layers configured to process the audio data to generate a network output. The speech processing network includes an output configured to be coupled to multiple speech application modules to enable the network output to be provided as a common input to each of the multiple speech application modules. A first speech application module corresponds to a speaker verifier, and a second speech application module corresponds to a speech recognition network.
-
-
-
-
-
-
-
-
-