-
公开(公告)号:US20210011887A1
公开(公告)日:2021-01-14
申请号:US16586821
申请日:2019-09-27
Applicant: QUALCOMM Incorporated
Inventor: Erik VISSER , Rehana MAHFUZ , Ravi CHOUDHARY , Lae-Hoon KIM , Sunkuk MOON , Yinyi GUO , Fatemeh SAKI
Abstract: A device for activity tracking includes a memory and one or more processors. The memory is configured to store an activity log. The one or more processors are configured to update the activity log based on activity data. The activity data is received from a second device. The one or more processors are also configured to, responsive to receiving a natural language query, generate a query response based on the activity log.
-
公开(公告)号:US20200312341A1
公开(公告)日:2020-10-01
申请号:US16370812
申请日:2019-03-29
Applicant: QUALCOMM Incorporated
Inventor: Rogerio Guedes ALVES , Taher SHAHBAZI MIRZAHASANLOO , Erik VISSER , Lae-Hoon KIM , Fatemeh SAKI , Dongmei WANG
IPC: G10L21/0208 , G10L21/02
Abstract: A device includes a memory and one or more processors coupled to the memory. The one or more processors are configured to perform an active noise cancellation (ANC) operation on noisy input speech as captured by a first microphone, the noisy input speech as captured by a second microphone, or both, to suppress a noise level associated with the noisy input speech. The one or more processors are configured to match a second frequency spectrum of a second signal with a first frequency spectrum of a first signal. The first signal is representative of the noisy input speech as captured by the first microphone, and the second signal is representative of the noisy input speech as captured by the second microphone. The one or more processors are also configured to generate an output speech signal that is representative of input speech based on the second signal.
-
公开(公告)号:US20190139552A1
公开(公告)日:2019-05-09
申请号:US16140227
申请日:2018-09-24
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon KIM , Erik VISSER , Phuong Lam TON , Jeremy Patrick TOMAN , Jeffrey Clinton SHAW
IPC: G10L17/00 , G06F3/0481 , G06F3/0488 , H04S7/00 , G06F17/30
Abstract: An electronic device includes a display, wherein the display is configured to present a user interface, wherein the user interface comprises a coordinate system. The coordinate system corresponds to physical coordinates. The display is configured to present a sector selection feature that allows selection of at least one sector of the coordinate system. The at least one sector corresponds to captured audio from multiple microphones. The sector selection may also include an audio signal indicator. The electronic device includes operation circuitry coupled to the display. The operation circuitry is configured to perform an audio operation on the captured audio corresponding to the audio signal indicator based on the sector selection.
-
公开(公告)号:US20230276173A1
公开(公告)日:2023-08-31
申请号:US18167823
申请日:2023-02-10
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon KIM , Rogerio Guedes ALVES , Jacob Jon BEAN , Erik VISSER
IPC: H04R3/04
CPC classification number: H04R3/04 , H04R2460/13
Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device to receive an input audio signal from one or more outer microphones, an input audio signal from one or more inner microphones, and a bone conduction signal from a bone conduction sensor based on the input audio signals. The wearable device may filter the bone conduction signal based on a set of frequencies of the input audio signals, such as a low frequency portion of the input audio signals. For example, the wearable device may apply a filter to the bone conduction signal that accounts for an error in the input audio signals. The wearable device may add a gain to the filtered bone conduction signal and may equalize the filtered bone conduction signal based on the gain. The wearable device may output an audio signal to a speaker.
-
公开(公告)号:US20220230623A1
公开(公告)日:2022-07-21
申请号:US17154372
申请日:2021-01-21
Applicant: QUALCOMM Incorporated
Inventor: Kyungguen BYUN , Sunkuk MOON , Shuhua ZHANG , Vahid MONTAZERI , Lae-Hoon KIM , Erik VISSER
IPC: G10L13/047 , G06N3/04 , G10L13/033 , G10L25/63 , G10L19/02
Abstract: A device for speech generation includes one or more processors configured to receive one or more control parameters indicating target speech characteristics. The one or more processors are also configured to process, using a multi-encoder, an input representation of speech based on the one or more control parameters to generate encoded data corresponding to an audio signal that represents a version of the speech based on the target speech characteristics.
-
公开(公告)号:US20220199100A1
公开(公告)日:2022-06-23
申请号:US17128544
申请日:2020-12-21
Applicant: QUALCOMM Incorporated
Inventor: S M Akramus SALEHIN , Lae-Hoon KIM , Hannes PESSENTHEINER , Shuhua ZHANG , Sanghyun CHI , Erik VISSER , Shankar THAGADUR SHIVAPPA
IPC: G10L21/0232 , H04R1/40 , H04R3/00 , H04S7/00 , H04S3/00 , G10L25/51 , G10L21/0324
Abstract: A device includes one or more processors configured to obtain audio signals representing sound captured by at least three microphones and determine spatial audio data based on the audio signals. The one or more processors are further configured to determine a metric indicative of wind noise in the audio signals. The metric is based on a comparison of a first value and a second value. The first value corresponds to an aggregate signal based on the spatial audio data, and the second value corresponds to a differential signal based on the spatial audio data.
-
公开(公告)号:US20220115007A1
公开(公告)日:2022-04-14
申请号:US17308593
申请日:2021-05-05
Applicant: QUALCOMM Incorporated
Inventor: Taher SHAHBAZI MIRZAHASANLOO , Rogerio Guedes ALVES , Erik VISSER , Lae-Hoon KIM
Abstract: A device includes a memory configured to store instructions and one or more processors configured execute the instructions. The one or more processors are configured execute the instructions to receive audio data including first audio data corresponding to a first output of a first microphone and second audio data corresponding to a second output of a second microphone. The one or more processors are also configured to execute the instructions to provide the audio data to a dynamic classifier. The dynamic classifier is configured to generate a classification output corresponding to the audio data. The one or more processors are further configured to execute the instructions to determine, at least partially based on the classification output, whether the audio data corresponds to user voice activity.
-
公开(公告)号:US20220201395A1
公开(公告)日:2022-06-23
申请号:US17127421
申请日:2020-12-18
Applicant: QUALCOMM Incorporated
Inventor: S M Akramus SALEHIN , Lae-Hoon KIM , Vasudev NAYAK , Shankar THAGADUR SHIVAPPA , Isaac Garcia MUNOZ , Sanghyun CHI , Erik VISSER
Abstract: In an aspect, a lens is zoomed in to create a zoomed lens. Lens data associated with the lens includes a direction of the lens relative to an object in a field-of-view of the zoomed lens and a magnification of the object resulting from the zoomed lens. An array of microphones capture audio signals including audio produced by the object and interference produced by other objects. The audio signals are processed to identify a directional component associated with the audio produced by the object and three orthogonal components associated with the interference produced by the other objects. Stereo beamforming is used to increase a magnitude of the directional component (relative to the interference) while retaining a binaural nature of the audio signals. The increase in magnitude of the directional component is based on an amount of the magnification provided by the zoomed lens to the object.
-
公开(公告)号:US20210204053A1
公开(公告)日:2021-07-01
申请号:US17201998
申请日:2021-03-15
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon KIM , Dongmei WANG , Fatemeh SAKI , Taher SHAHBAZI MIRZAHASANLOO , Erik VISSER , Rogerio Guedes ALVES
IPC: H04R1/10
Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device may receive an input audio signal (e.g., including both an external signal and a self-voice signal). The wearable device may detect the self-voice signal in the input audio signal based on a self-voice activity detection (SVAD) procedure, and may implement the described techniques based thereon. The wearable device may perform beamforming operations or other separation procedures to isolate the external signal and the self-voice signal from the input audio signal. The wearable device may apply a first filter to the external signal, and a second filter to the self-voice signal. The wearable device may then mix the filtered signals, and generate an output signal that sounds natural to the user.
-
公开(公告)号:US20210151064A1
公开(公告)日:2021-05-20
申请号:US16685987
申请日:2019-11-15
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon KIM , Dongmei Wang , Cheng-Yu Hung , Erik Visser
Abstract: A device includes one or more processors configured to perform signal processing including a linear transformation and a non-linear transformation of an input signal to generate a reference target signal. The reference target signal has a linear component associated with the linear transformation and a non-linear component associated with the non-linear transformation. The one or more processors are also configured to perform linear filtering of the input signal by controlling adaptation of the linear filtering to generate an output signal that substantially matches the linear component of the reference target signal.
-
-
-
-
-
-
-
-
-