Environmental noise detection for dialog systems

    Publication No.: US09818404B2

    Publication date: 2017-11-14

    Application No.: US14979189

    Filing date: 2015-12-22

    Abstract: Embodiments are directed to receiving a speech signal representative of audible speech, processing the speech signal to interpret the speech signal by a dialog system implemented at least partially in hardware, determining, by the dialog system, that the speech signal cannot be correctly interpreted, receiving a noise signal representative of audible background noise, identifying a noise level from the noise signal, determining, by the dialog system, that the noise level is too high for the speech signal to be correctly interpreted, and providing, by the dialog system, a message indicating that the noise level is too high for the speech signal to be correctly interpreted.
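
The decision flow in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the RMS-based level estimate, the `NOISE_THRESHOLD_DBFS` value, and the function names `noise_level_dbfs` and `respond` are assumptions chosen for illustration, and the speech interpreter is stood in for by an optional string (`None` meaning the speech could not be interpreted).

```python
import math
from typing import Optional, Sequence

# Hypothetical threshold; the abstract does not state a numeric value.
NOISE_THRESHOLD_DBFS = -30.0


def noise_level_dbfs(samples: Sequence[float]) -> float:
    """Return the RMS level of samples in [-1, 1], in dB relative to full scale."""
    if not samples:
        return float("-inf")
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return 20.0 * math.log10(rms) if rms > 0 else float("-inf")


def respond(interpretation: Optional[str],
            noise_samples: Sequence[float],
            threshold: float = NOISE_THRESHOLD_DBFS) -> str:
    """Pick a reply; `interpretation` is None when the speech was not understood."""
    if interpretation is not None:
        return f"OK: {interpretation}"
    # Interpretation failed: check whether background noise is the likely cause.
    if noise_level_dbfs(noise_samples) > threshold:
        return "It is too noisy here for me to understand you."
    return "Sorry, I did not understand. Could you repeat that?"
```

For example, `respond(None, [0.5, -0.5])` reports that the environment is too noisy, because the RMS level of about -6 dBFS exceeds the assumed threshold.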

    SYSTEMS AND METHODS FOR PROVIDING NON-LEXICAL CUES IN SYNTHESIZED SPEECH

    Publication No.: US20200243064A1

    Publication date: 2020-07-30

    Application No.: US16851444

    Filing date: 2020-04-17

    Abstract: Systems and methods are disclosed for providing non-lexical cues in synthesized speech. An example system includes one or more storage devices including instructions and a processor to execute the instructions. The processor is to execute the instructions to: generate first and second non-lexical cues to enhance speech to be synthesized from text; determine a first insertion point of the first non-lexical cue in the text; determine a second insertion point of the second non-lexical cue in the text; and insert the first non-lexical cue at the first insertion point and the second non-lexical cue at the second insertion point. The example system also includes a transmitter to communicate the text with the inserted first non-lexical cue and the inserted second non-lexical cue over a network.
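
Inserting two cues at two insertion points, as this abstract describes, can be sketched in a few lines of Python. The `<cue:...>` tag syntax, the character-offset representation of insertion points, and the name `insert_cues` are hypothetical; the abstract specifies none of them, and the network transmission step is omitted here.

```python
from typing import List, Tuple


def insert_cues(text: str, cues: List[Tuple[int, str]]) -> str:
    """Insert each (offset, cue) pair into text; offsets refer to the original text."""
    out = text
    # Work from the highest offset down so earlier offsets remain valid.
    for offset, cue in sorted(cues, key=lambda c: c[0], reverse=True):
        if not 0 <= offset <= len(text):
            raise ValueError(f"insertion point {offset} is outside the text")
        out = out[:offset] + cue + out[offset:]
    return out


text = "I think the answer is forty-two."
# First cue at the start of the text, second cue before the word "the".
augmented = insert_cues(text, [(0, "<cue:um> "), (8, "<cue:pause> ")])
```

Applying the insertions right to left avoids having to recompute the second insertion point after the first cue lengthens the text.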

    Systems and methods for providing non-lexical cues in synthesized speech
    14.
    Granted invention patent (status: in force)

    Publication No.: US09542929B2

    Publication date: 2017-01-10

    Application No.: US14497994

    Filing date: 2014-09-26

    Abstract: Systems and methods for providing non-lexical cues in synthesized speech are described herein. Original text is analyzed to determine characteristics of the text and/or to derive or augment an intent (e.g., an intent code). Non-lexical cue insertion points are determined based on the characteristics of the text and/or the intent. One or more non-lexical cues are inserted at insertion points to generate augmented text. The augmented text is synthesized into speech, including converting the non-lexical cues to speech output.
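
The analysis stage in this abstract (derive an intent, choose insertion points, produce augmented text) might look like the following Python sketch. Everything here is an assumption for illustration: the keyword rules in `derive_intent`, the `CUE_FOR_INTENT` table, the bracketed cue notation, and the choice of sentence starts as insertion points. The final speech synthesis step is left out.

```python
import re
from typing import List, Optional

# Hypothetical mapping from an intent code to a non-lexical cue.
CUE_FOR_INTENT = {"uncertain": "[um]", "thinking": "[hmm]"}


def derive_intent(text: str) -> str:
    """Toy rule-based intent derivation from characteristics of the text."""
    if re.search(r"\b(maybe|perhaps|not sure)\b", text, re.IGNORECASE):
        return "uncertain"
    if text.rstrip().endswith("?"):
        return "thinking"
    return "neutral"


def insertion_points(text: str) -> List[int]:
    """Return the character offset at which each sentence starts."""
    return [m.start() for m in re.finditer(r"(?:^|(?<=[.!?]\s))\S", text)]


def augment(text: str, intent: Optional[str] = None) -> str:
    """Insert the cue for the intent at every insertion point."""
    cue = CUE_FOR_INTENT.get(intent or derive_intent(text))
    if cue is None:
        return text  # no cue is defined for this intent
    out = text
    for pos in reversed(insertion_points(text)):
        out = out[:pos] + cue + " " + out[pos:]
    return out
```

Passing `intent` explicitly corresponds to the case where an intent code arrives with the text; leaving it out corresponds to deriving the intent from the text itself.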

    USER ADAPTIVE INTERFACES
    15.
    Invention application (status: under examination, published)

    Publication No.: US20160092160A1

    Publication date: 2016-03-31

    Application No.: US14497984

    Filing date: 2014-09-26

    Abstract: Systems and methods for providing a user adaptive natural language interface are disclosed. The disclosed embodiments may receive and analyze user input to derive current user behavior data, including data indicative of characteristics of the user input. The user input is classified based on prior user behavior data previously logged during one or more previous user-system interactions and the current user behavior data to generate a classification of the user input. Machine learning algorithms can be employed to classify the user input. User adaptive utterances are selected based on the user input and the classification of the user input. The user-system interaction is logged for use as prior user behavior data in future user-system interactions. A response to the user input is generated, including synthesizing output speech from the user adaptive utterances selected. Example applications of the disclosed systems and methods provide user adaptive navigation directions in navigation systems.
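
The classify, select, and log loop in this abstract can be sketched as below. This is a deliberately simple stand-in: a fixed ratio rule replaces the machine learning classifier the abstract mentions, and the `AdaptiveInterface` class, the `UTTERANCES` table, the behavior features, and the 0.5 cutoff are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# Hypothetical user adaptive utterances for a navigation prompt.
UTTERANCES = {
    "familiar": "Turn left on Main.",
    "unfamiliar": "In 300 meters, turn left onto Main Street, just after the gas station.",
}


@dataclass
class AdaptiveInterface:
    # Behavior data logged during earlier user-system interactions.
    log: List[Dict[str, object]] = field(default_factory=list)

    def current_behavior(self, user_input: str) -> Dict[str, object]:
        """Derive current user behavior data from the input."""
        words = user_input.lower().split()
        return {"asked_repeat": "repeat" in words or "again" in words,
                "length": len(words)}

    def classify(self, current: Dict[str, object]) -> str:
        """Classify using prior plus current behavior (rule-based stand-in)."""
        history = self.log + [current]
        repeats = sum(1 for h in history if h["asked_repeat"])
        return "unfamiliar" if repeats / len(history) >= 0.5 else "familiar"

    def respond(self, user_input: str) -> str:
        current = self.current_behavior(user_input)
        label = self.classify(current)
        self.log.append(current)  # becomes prior data for the next interaction
        return UTTERANCES[label]
```

A user who asks for a repeat in at least half of the logged interactions receives the more detailed direction; otherwise the terse one is selected.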
