-
公开(公告)号:US11715480B2
公开(公告)日:2023-08-01
申请号:US17209621
申请日:2021-03-23
Applicant: QUALCOMM Incorporated
Inventor: Kyungguen Byun , Shuhua Zhang , Lae-Hoon Kim , Erik Visser , Sunkuk Moon , Vahid Montazeri
IPC: G10L21/0232 , G10L21/038 , G10L21/02
CPC classification number: G10L21/0232 , G10L21/02 , G10L21/038
Abstract: A device to perform speech enhancement includes one or more processors configured to obtain input spectral data based on an input signal. The input signal represents sound that includes speech. The one or more processors are also configured to process, using a multi-encoder transformer, the input spectral data and context data to generate output spectral data that represents a speech enhanced version of the input signal.
-
公开(公告)号:US20190251971A1
公开(公告)日:2019-08-15
申请号:US16396311
申请日:2019-04-26
Applicant: QUALCOMM Incorporated
Inventor: Erik Visser , Shuhua Zhang , Lae-Hoon Kim , Yinyi Guo , Sunkuk Moon
IPC: G10L15/26 , G10L25/48 , G10L13/047 , G10L21/00
CPC classification number: G10L15/26 , G10L13/047 , G10L21/00 , G10L21/003 , G10L25/48
Abstract: In a particular aspect, a speech generator includes a signal input configured to receive a first audio signal. The speech generator also includes at least one speech signal processor configured to generate a second audio signal based on information associated with the first audio signal and based further on automatic speech recognition (ASR) data associated with the first audio signal.
-
公开(公告)号:US10134422B2
公开(公告)日:2018-11-20
申请号:US14956212
申请日:2015-12-01
Applicant: QUALCOMM Incorporated
Inventor: Kyu Woong Hwang , Yongwoo Cho , Jun-Cheol Cho , Sunkuk Moon
IPC: G10L21/00 , G10L25/51 , G06K9/00 , G08B3/10 , G10L25/72 , G01S5/20 , G01S5/22 , G08B13/16 , G10L17/26 , G10L21/028 , G01S3/803 , G01S3/808 , G01S5/18 , G01S5/28 , G10L25/00 , G08B13/196
Abstract: A method of determining, by an electronic device, an audio event is disclosed. The method may include receiving an input sound from a sound source by a plurality of sound sensors. The method may also extracting, by a processor, at least one sound feature from the received input sound, determining, by the processor, location information of the sound source based on the input sound received by the sound sensors, determining, by the processor, the audio event indicative of the input sound based on the at least one sound feature and the location information, and transmitting, by a communication unit, a notification of the audio event to an external electronic device.
-
公开(公告)号:US20170154638A1
公开(公告)日:2017-06-01
申请号:US14956212
申请日:2015-12-01
Applicant: QUALCOMM Incorporated
Inventor: Kyu Woong Hwang , Yongwoo Cho , Jun-Cheol Cho , Sunkuk Moon
CPC classification number: G10L25/51 , G01S3/803 , G01S3/808 , G01S5/18 , G01S5/20 , G01S5/22 , G01S5/28 , G06K9/00711 , G06K9/00771 , G06K2009/00738 , G08B3/10 , G08B13/1672 , G08B13/19695 , G10L17/26 , G10L21/028 , G10L25/72
Abstract: A method of determining, by an electronic device, an audio event is disclosed. The method may include receiving an input sound from a sound source by a plurality of sound sensors. The method may also extracting, by a processor, at least one sound feature from the received input sound, determining, by the processor, location information of the sound source based on the input sound received by the sound sensors, determining, by the processor, the audio event indicative of the input sound based on the at least one sound feature and the location information, and transmitting, by a communication unit, a notification of the audio event to an external electronic device.
-
公开(公告)号:US12200450B2
公开(公告)日:2025-01-14
申请号:US18324622
申请日:2023-05-26
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Sunkuk Moon , Erik Visser , Prajakt Kulkarni
IPC: H04R3/00 , G06F18/21 , G06N20/00 , G06V10/82 , G06V20/20 , G10L21/02 , H04L65/60 , H04L65/80 , H04R5/04
Abstract: A device to process speech includes a speech processing network that includes an input configured to receive audio data. The speech processing network also includes one or more network layers configured to process the audio data to generate a network output. The speech processing network includes an output configured to be coupled to multiple speech application modules to enable the network output to be provided as a common input to each of the multiple speech application modules.
-
公开(公告)号:US11700484B2
公开(公告)日:2023-07-11
申请号:US17650595
申请日:2022-02-10
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Sunkuk Moon , Erik Visser , Prajakt Kulkarni
IPC: H04R3/00 , G10L21/02 , H04R5/04 , G06N20/00 , H04L65/60 , H04L65/80 , G06F18/21 , G06V10/82 , G06V20/20
CPC classification number: H04R3/005 , G06F18/217 , G06N20/00 , G06V10/82 , G06V20/20 , G10L21/02 , H04L65/60 , H04L65/80 , H04R5/04 , H04R2420/07 , H04R2499/13
Abstract: A device to process speech includes a speech processing network that includes an input configured to receive audio data corresponding to audio captured by one or more microphones. The speech processing network also includes one or more network layers configured to process the audio data to generate a network output. The speech processing network includes an output configured to be coupled to multiple speech application modules to enable the network output to be provided as a common input to each of the multiple speech application modules. A first speech application module corresponds to a speaker verifier, and a second speech application module corresponds to a speech recognition network.
-
公开(公告)号:US20190355351A1
公开(公告)日:2019-11-21
申请号:US15982851
申请日:2018-05-17
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Yinyi Guo , Ravi Choudhary , Sunkuk Moon , Erik Visser , Fatemeh Saki
IPC: G10L15/22 , G06F3/16 , G10L15/18 , G10L25/63 , G06F3/0484
Abstract: A device includes a memory configured to store a user experience evaluation unit. A processor is configured to receive a first user input corresponding to a user command to initiate a particular task, the first user input received via a first sensor. The processor is configured to, after receiving the first user input, receive one or more subsequent user inputs, the one or subsequent user inputs including a second user input received via a second sensor. The processor is configured to initiate a remedial action in response to determining, based on the user experience evaluation unit, that the one or more subsequent user inputs correspond to a negative user experience.
-
公开(公告)号:US09837068B2
公开(公告)日:2017-12-05
申请号:US14682009
申请日:2015-04-08
Applicant: QUALCOMM Incorporated
Inventor: Sunkuk Moon , Minho Jin , Haiying Xia , Hesu Huang , Warren Frederick Dale
CPC classification number: G10L15/02 , G10L15/063 , G10L15/08 , G10L15/22 , G10L2015/022 , G10L2015/025 , G10L2015/027
Abstract: A method for verifying at least one sound sample to be used in generating a sound detection model in an electronic device includes receiving a first sound sample; extracting a first acoustic feature from the first sound sample; receiving a second sound sample; extracting a second acoustic feature from the second sound sample; and determining whether the second acoustic feature is similar to the first acoustic feature.
-
公开(公告)号:US11676571B2
公开(公告)日:2023-06-13
申请号:US17154372
申请日:2021-01-21
Applicant: QUALCOMM Incorporated
Inventor: Kyungguen Byun , Sunkuk Moon , Shuhua Zhang , Vahid Montazeri , Lae-Hoon Kim , Erik Visser
IPC: G10L13/10 , G10L13/06 , G10L15/22 , G10L13/00 , G10L13/047 , G10L13/033 , G10L19/02 , G10L25/63 , G06N3/045 , G10L21/013
CPC classification number: G10L13/047 , G06N3/045 , G10L13/033 , G10L19/02 , G10L25/63 , G10L2021/0135
Abstract: A device for speech generation includes one or more processors configured to receive one or more control parameters indicating target speech characteristics. The one or more processors are also configured to process, using a multi-encoder, an input representation of speech based on the one or more control parameters to generate encoded data corresponding to an audio signal that represents a version of the speech based on the target speech characteristics.
-
公开(公告)号:US11094316B2
公开(公告)日:2021-08-17
申请号:US15972011
申请日:2018-05-04
Applicant: QUALCOMM Incorporated
Inventor: Erik Visser , Fatemeh Saki , Yinyi Guo , Sunkuk Moon , Lae-Hoon Kim , Ravi Choudhary
Abstract: A device includes a memory configured to store category labels associated with categories of a natural language processing library. A processor is configured to analyze input audio data to generate a text string and to perform natural language processing on at least the text string to generate an output text string including an action associated with a first device, a speaker, a location, or a combination thereof. The processor is configured to compare the input audio data to audio data of the categories to determine whether the input audio data matches any of the categories and, in response to determining that the input audio data does not match any of the categories: create a new category label, associate the new category label with at least a portion of the output text string, update the categories with the new category label, and generate a notification indicating the new category label.
-
-
-
-
-
-
-
-
-