Signal encoding method and apparatus, and signal decoding method and apparatus

    公开(公告)号:US10657976B2

    公开(公告)日:2020-05-19

    申请号:US16521104

    申请日:2019-07-24

    摘要: The present invention relates to a method and an apparatus for encoding and decoding spectrum coefficients in the frequency domain. The spectrum encoding method may comprise the steps of: selecting an encoding type on the basis of bit allocation information of respective bands; performing zero encoding with respect to a zero band; and encoding information of selected significant frequency components with respect to respective non-zero bands. The spectrum encoding method enables encoding and decoding of spectrum coefficients which is adaptive to various bit-rates and various sub-band sizes. In addition, a spectrum can be encoded using a TCQ method at a fixed bit rate using a bit-rate control module in a codec that supports multiple rates. Encoding performance of the codec can be maximised by encoding high performance TCQ at a precise target bit rate.

    Learning intended user actions
    62.
    发明授权

    公开(公告)号:US10656909B2

    公开(公告)日:2020-05-19

    申请号:US16044114

    申请日:2018-07-24

    摘要: A method and system are provided. The method includes receiving, by a microphone and camera, user utterances indicative of user commands and associated user gestures for the user utterances. The method further includes parsing, by a hardware-based recognizer, sample utterances and the user utterances into verb parts and noun parts. The method also includes recognizing, by a hardware-based recognizer, the user utterances and the associated user gestures based on the sample utterances and descriptions of associated supporting gestures for the sample utterances. The recognizing step includes comparing the verb parts and the noun parts from the user utterances individually and as pairs to the verb parts and the noun parts of the sample utterances. The method additionally includes selectively performing a given one of the user commands responsive to a recognition result.

    PLAYBACK SOUND PROVISION DEVICE
    63.
    发明申请

    公开(公告)号:US20200081686A1

    公开(公告)日:2020-03-12

    申请号:US16688317

    申请日:2019-11-19

    摘要: A playback sound provision device includes: a surrounding information detection device configured to detect detection information including information on a three-dimensional object or a planar display around the vehicle; and a control device configured to determine a playback method for a playback sound based on a music piece based on the detection information when a predetermined target is included in the detection information, and provide the playback sound based on the playback method.

    Method of and system for providing adaptive respondent training in a speech recognition application

    公开(公告)号:US10522144B2

    公开(公告)日:2019-12-31

    申请号:US15911965

    申请日:2018-03-05

    申请人: ELIZA Corporation

    摘要: A system for conducting a telephonic speech recognition application includes an automated telephone device for making telephonic contact with a respondent and a speech recognition device which, upon the telephonic contact being made, presents the respondent with at least one introductory prompt for the respondent to reply to; receives a spoken response from the respondent; and performs a speech recognition analysis on the spoken response to determine a capability of the respondent to complete the application. If the speech recognition device, based on the spoken response to the introductory prompt, determines that the respondent is capable of competing the application, the speech recognition device presents at least one application prompt to the respondent. If the speech recognition device, based on the spoken response to the introductory prompt, determines that the respondent is not capable of completing the application, the speech recognition system presents instructions on completing the application to the respondent.