-
公开(公告)号:US20190355351A1
公开(公告)日:2019-11-21
申请号:US15982851
申请日:2018-05-17
发明人: Lae-Hoon Kim , Yinyi Guo , Ravi Choudhary , Sunkuk Moon , Erik Visser , Fatemeh Saki
IPC分类号: G10L15/22 , G06F3/16 , G10L15/18 , G10L25/63 , G06F3/0484
摘要: A device includes a memory configured to store a user experience evaluation unit. A processor is configured to receive a first user input corresponding to a user command to initiate a particular task, the first user input received via a first sensor. The processor is configured to, after receiving the first user input, receive one or more subsequent user inputs, the one or subsequent user inputs including a second user input received via a second sensor. The processor is configured to initiate a remedial action in response to determining, based on the user experience evaluation unit, that the one or more subsequent user inputs correspond to a negative user experience.
-
公开(公告)号:US09837068B2
公开(公告)日:2017-12-05
申请号:US14682009
申请日:2015-04-08
发明人: Sunkuk Moon , Minho Jin , Haiying Xia , Hesu Huang , Warren Frederick Dale
CPC分类号: G10L15/02 , G10L15/063 , G10L15/08 , G10L15/22 , G10L2015/022 , G10L2015/025 , G10L2015/027
摘要: A method for verifying at least one sound sample to be used in generating a sound detection model in an electronic device includes receiving a first sound sample; extracting a first acoustic feature from the first sound sample; receiving a second sound sample; extracting a second acoustic feature from the second sound sample; and determining whether the second acoustic feature is similar to the first acoustic feature.
-
公开(公告)号:US20190251971A1
公开(公告)日:2019-08-15
申请号:US16396311
申请日:2019-04-26
发明人: Erik Visser , Shuhua Zhang , Lae-Hoon Kim , Yinyi Guo , Sunkuk Moon
IPC分类号: G10L15/26 , G10L25/48 , G10L13/047 , G10L21/00
CPC分类号: G10L15/26 , G10L13/047 , G10L21/00 , G10L21/003 , G10L25/48
摘要: In a particular aspect, a speech generator includes a signal input configured to receive a first audio signal. The speech generator also includes at least one speech signal processor configured to generate a second audio signal based on information associated with the first audio signal and based further on automatic speech recognition (ASR) data associated with the first audio signal.
-
公开(公告)号:US10134422B2
公开(公告)日:2018-11-20
申请号:US14956212
申请日:2015-12-01
发明人: Kyu Woong Hwang , Yongwoo Cho , Jun-Cheol Cho , Sunkuk Moon
IPC分类号: G10L21/00 , G10L25/51 , G06K9/00 , G08B3/10 , G10L25/72 , G01S5/20 , G01S5/22 , G08B13/16 , G10L17/26 , G10L21/028 , G01S3/803 , G01S3/808 , G01S5/18 , G01S5/28 , G10L25/00 , G08B13/196
摘要: A method of determining, by an electronic device, an audio event is disclosed. The method may include receiving an input sound from a sound source by a plurality of sound sensors. The method may also extracting, by a processor, at least one sound feature from the received input sound, determining, by the processor, location information of the sound source based on the input sound received by the sound sensors, determining, by the processor, the audio event indicative of the input sound based on the at least one sound feature and the location information, and transmitting, by a communication unit, a notification of the audio event to an external electronic device.
-
公开(公告)号:US20170154638A1
公开(公告)日:2017-06-01
申请号:US14956212
申请日:2015-12-01
发明人: Kyu Woong Hwang , Yongwoo Cho , Jun-Cheol Cho , Sunkuk Moon
CPC分类号: G10L25/51 , G01S3/803 , G01S3/808 , G01S5/18 , G01S5/20 , G01S5/22 , G01S5/28 , G06K9/00711 , G06K9/00771 , G06K2009/00738 , G08B3/10 , G08B13/1672 , G08B13/19695 , G10L17/26 , G10L21/028 , G10L25/72
摘要: A method of determining, by an electronic device, an audio event is disclosed. The method may include receiving an input sound from a sound source by a plurality of sound sensors. The method may also extracting, by a processor, at least one sound feature from the received input sound, determining, by the processor, location information of the sound source based on the input sound received by the sound sensors, determining, by the processor, the audio event indicative of the input sound based on the at least one sound feature and the location information, and transmitting, by a communication unit, a notification of the audio event to an external electronic device.
-
公开(公告)号:US11094316B2
公开(公告)日:2021-08-17
申请号:US15972011
申请日:2018-05-04
发明人: Erik Visser , Fatemeh Saki , Yinyi Guo , Sunkuk Moon , Lae-Hoon Kim , Ravi Choudhary
摘要: A device includes a memory configured to store category labels associated with categories of a natural language processing library. A processor is configured to analyze input audio data to generate a text string and to perform natural language processing on at least the text string to generate an output text string including an action associated with a first device, a speaker, a location, or a combination thereof. The processor is configured to compare the input audio data to audio data of the categories to determine whether the input audio data matches any of the categories and, in response to determining that the input audio data does not match any of the categories: create a new category label, associate the new category label with at least a portion of the output text string, update the categories with the new category label, and generate a notification indicating the new category label.
-
公开(公告)号:US11017783B2
公开(公告)日:2021-05-25
申请号:US16296733
申请日:2019-03-08
发明人: Sunkuk Moon , Bicheng Jiang , Erik Visser
摘要: A device includes a processor configured to determine a feature vector based on an utterance and to determine a first embedding vector by processing the feature vector using a trained embedding network. The processor is configured to determine a first distance metric based on distances between the first embedding vector and each embedding vector of a speaker template. The processor is configured to determine, based on the first distance metric, that the utterance is verified to be from a particular user. The processor is configured to, based on a comparison of a first particular distance metric associated with the first embedding vector to a second distance metric associated with a first test embedding vector of the speaker template, generate an updated speaker template by adding the first embedding vector as a second test embedding vector and removing the first test embedding vector from test embedding vectors of the speaker template.
-
公开(公告)号:US20190341026A1
公开(公告)日:2019-11-07
申请号:US15972011
申请日:2018-05-04
发明人: Erik Visser , Fatemeh Saki , Yinyi Guo , Sunkuk Moon , Lae-Hoon Kim , Ravi Choudhary
摘要: A device includes a memory configured to store category labels associated with categories of a natural language processing library. A processor is configured to analyze input audio data to generate a text string and to perform natural language processing on at least the text string to generate an output text string including an action associated with a first device, a speaker, a location, or a combination thereof. The processor is configured to compare the input audio data to audio data of the categories to determine whether the input audio data matches any of the categories and, in response to determining that the input audio data does not match any of the categories: create a new category label, associate the new category label with at least a portion of the output text string, update the categories with the new category label, and generate a notification indicating the new category label.
-
公开(公告)号:US20180233127A1
公开(公告)日:2018-08-16
申请号:US15430791
申请日:2017-02-13
发明人: ERIK VISSER , Shuhua Zhang , Lae-Hoon Kim , Yinyi Guo , Sunkuk Moon
IPC分类号: G10L13/027 , G10L13/047 , G10L25/78 , G10L25/21 , G10L25/63 , G10L25/90 , G10L15/26
CPC分类号: G10L15/26 , G10L13/047 , G10L21/00 , G10L21/003 , G10L25/48
摘要: In a particular aspect, an apparatus includes an audio sensor configured to receive an input audio signal. The apparatus also includes speech generative circuitry configured to generate a synthesized audio signal based at least partly on automatic speech recognition (ASR) data associated with the input audio signal and based on one or more parameters indicative of state information associated with the input audio signal.
-
公开(公告)号:US11626104B2
公开(公告)日:2023-04-11
申请号:US17115158
申请日:2020-12-08
发明人: Soo Jin Park , Sunkuk Moon , Lae-Hoon Kim , Erik Visser
IPC分类号: G10L17/00 , G10L15/07 , G06F1/3231 , G10L15/04 , G10L15/16
摘要: A device includes processors configured to determine, in a first power mode, whether an audio stream corresponds to speech of at least two talkers. The processors are configured to, based on determining that the audio stream corresponds to speech of at least two talkers, analyze, in a second power mode, audio feature data of the audio stream to generate a segmentation result. The processors are configured to perform a comparison of a plurality of user speech profiles to an audio feature data set of a plurality of audio feature data sets of a talker-homogenous audio segment to determine whether the audio feature data set matches any of the user speech profiles. The processors are configured to, based on determining that the audio feature data set does not match any of the plurality of user speech profiles, generate a user speech profile based on the plurality of audio feature data sets.
-
-
-
-
-
-
-
-
-