-
公开(公告)号:US10657976B2
公开(公告)日:2020-05-19
申请号:US16521104
申请日:2019-07-24
发明人: Ho-sang Sung , Konstantin Osipov , Yi Lu
IPC分类号: G10L19/00 , G10L19/02 , G10L19/032 , G10L19/24 , G10L19/038 , G10L19/12 , G10L19/22 , G10L21/00 , G10L19/002 , G10L19/20
摘要: The present invention relates to a method and an apparatus for encoding and decoding spectrum coefficients in the frequency domain. The spectrum encoding method may comprise the steps of: selecting an encoding type on the basis of bit allocation information of respective bands; performing zero encoding with respect to a zero band; and encoding information of selected significant frequency components with respect to respective non-zero bands. The spectrum encoding method enables encoding and decoding of spectrum coefficients which is adaptive to various bit-rates and various sub-band sizes. In addition, a spectrum can be encoded using a TCQ method at a fixed bit rate using a bit-rate control module in a codec that supports multiple rates. Encoding performance of the codec can be maximised by encoding high performance TCQ at a precise target bit rate.
-
公开(公告)号:US10656909B2
公开(公告)日:2020-05-19
申请号:US16044114
申请日:2018-07-24
IPC分类号: G10L21/00 , G06F3/16 , G10L15/22 , G10L15/18 , G06F3/01 , G06F3/03 , G06N20/00 , G06F3/00 , G10L15/04 , G10L15/26 , G10L15/24
摘要: A method and system are provided. The method includes receiving, by a microphone and camera, user utterances indicative of user commands and associated user gestures for the user utterances. The method further includes parsing, by a hardware-based recognizer, sample utterances and the user utterances into verb parts and noun parts. The method also includes recognizing, by a hardware-based recognizer, the user utterances and the associated user gestures based on the sample utterances and descriptions of associated supporting gestures for the sample utterances. The recognizing step includes comparing the verb parts and the noun parts from the user utterances individually and as pairs to the verb parts and the noun parts of the sample utterances. The method additionally includes selectively performing a given one of the user commands responsive to a recognition result.
-
公开(公告)号:US20200081686A1
公开(公告)日:2020-03-12
申请号:US16688317
申请日:2019-11-19
摘要: A playback sound provision device includes: a surrounding information detection device configured to detect detection information including information on a three-dimensional object or a planar display around the vehicle; and a control device configured to determine a playback method for a playback sound based on a music piece based on the detection information when a predetermined target is included in the detection information, and provide the playback sound based on the playback method.
-
公开(公告)号:US10580406B2
公开(公告)日:2020-03-03
申请号:US15807025
申请日:2017-11-08
申请人: 2236008 Ontario Inc.
IPC分类号: G10L15/00 , G10L15/26 , G10L21/00 , G10L21/06 , G10L25/00 , G10L15/22 , G10L15/18 , G10L15/32 , G10L15/30
摘要: A system and method receives a spoken utterance and converts the spoken utterance into recognized speech results through automatic speech recognition modules. The system and method renders a composite recognition speech result comprising the recognized speech results joined in a return function. The system and method interprets the recognized speech results joined in a return function from each of the automatic speech recognition modules through multiple conversation modules.
-
公开(公告)号:US10579133B2
公开(公告)日:2020-03-03
申请号:US16132673
申请日:2018-09-17
发明人: Herve Dallet , Francis Chauvet
IPC分类号: G10L21/00 , G06F3/01 , G06F3/02 , G06F3/023 , G05B19/05 , G05F1/46 , G06F1/26 , G06F13/10 , G08B5/36 , G10L25/00 , H01H13/83 , H01H13/02
摘要: A control module for a human-machine dialogue system, the system including a plurality of human-machine dialogue members, each human-machine dialogue member including a functional element that includes at least one electrical contact and/or a signalling indicator. The control module is configured to operate in a write mode to attribute a state to each signalling indicator of each human-machine dialogue member and in a read mode to read a state of the electrical contact of each human-machine dialogue member.
-
公开(公告)号:US20200066292A1
公开(公告)日:2020-02-27
申请号:US16666237
申请日:2019-10-28
发明人: Per EKSTRAND
IPC分类号: G10L21/00 , H03H17/02 , H04S7/00 , G10L19/26 , G10L19/02 , G10L19/008 , G06F17/11 , H04R3/04 , G06F17/10
摘要: An apparatus and method are disclosed for processing an audio signal. The apparatus includes an input interface, a digital filterbank having an analysis part and a synthesis part, a first phase shifter, a spectral envelope adjuster, a second phase shifter, and an output interface. The first phase shifter and the second phase shifter reduce a complexity of the digital filterbank, which includes both analysis and synthesis filters that are complex-exponential modulated versions of a prototype filter.
-
公开(公告)号:US20200005790A1
公开(公告)日:2020-01-02
申请号:US16569849
申请日:2019-09-13
发明人: DAE GYU BAE , TAE HWAN CHA , HO JEONG YOU
IPC分类号: G10L15/22 , H04N5/44 , H04N21/422 , G10L15/26 , H04N21/431 , H04N21/439 , G10L21/00 , G10L15/08 , G10L17/22 , H03G3/02 , H03G3/30
摘要: Provided are an image display apparatus and a method of controlling the same. The image display apparatus enabling voice recognition includes: a first voice inputter which receives a user-side audio signal; an audio outputter which outputs an audio signal processed by the image display apparatus; a first voice recognizer which recognizes the user-side audio signal received through the first voice inputter; and a controller which decreases a volume of the audio signal output through the audio outputter to a predetermined level if a voice recognition start command is received.
-
68.
公开(公告)号:US10522144B2
公开(公告)日:2019-12-31
申请号:US15911965
申请日:2018-03-05
申请人: ELIZA Corporation
IPC分类号: G10L15/22 , H04M3/46 , H04M3/493 , H04M3/523 , G10L15/06 , G10L21/00 , G10L25/03 , G10L25/00 , H04M3/38 , H04M3/51
摘要: A system for conducting a telephonic speech recognition application includes an automated telephone device for making telephonic contact with a respondent and a speech recognition device which, upon the telephonic contact being made, presents the respondent with at least one introductory prompt for the respondent to reply to; receives a spoken response from the respondent; and performs a speech recognition analysis on the spoken response to determine a capability of the respondent to complete the application. If the speech recognition device, based on the spoken response to the introductory prompt, determines that the respondent is capable of competing the application, the speech recognition device presents at least one application prompt to the respondent. If the speech recognition device, based on the spoken response to the introductory prompt, determines that the respondent is not capable of completing the application, the speech recognition system presents instructions on completing the application to the respondent.
-
公开(公告)号:US10515640B2
公开(公告)日:2019-12-24
申请号:US15806667
申请日:2017-11-08
申请人: INTEL CORPORATION
发明人: Jonathan Huang , David Pearce , Willem M. Beltman
IPC分类号: G10L21/00 , G10L17/22 , G10L17/06 , G10L25/60 , G10L21/0208 , G10L17/02 , G10L15/08 , G10L15/18 , G10L17/12
摘要: An example apparatus for generating dialogue includes an audio receiver to receive audio data including speech. The apparatus also includes a verification score generator to generate a verification score based on the audio data. The apparatus further includes a user detector to detect that the verification score exceeds a lower threshold but does not exceed a higher threshold. The apparatus includes a dialogue generator to generate dialogue to solicit additional audio data to be used to generate an updated verification score in response to detecting that the verification score exceeds a lower threshold but does not exceed a higher threshold.
-
公开(公告)号:US10514886B2
公开(公告)日:2019-12-24
申请号:US15945127
申请日:2018-04-04
IPC分类号: G06F3/16 , G10L21/00 , G10L21/02 , G10L21/003 , G10L21/013 , G10L21/0316 , G10L21/043 , H04R3/00
摘要: A playback sound provision device includes: a surrounding information detection device configured to detect detection information including information on a three-dimensional object or a planar display around the vehicle; and a control device configured to determine a playback method for a playback sound based on a music piece based on the detection information when a predetermined target is included in the detection information, and provide the playback sound based on the playback method.
-
-
-
-
-
-
-
-
-