Dialog management using knowledge graph-driven information state in a natural language processing system

    公开(公告)号:US11288457B1

    公开(公告)日:2022-03-29

    申请号:US16265668

    申请日:2019-02-01

    Abstract: Systems and methods are disclosed for determining a move driven by an interaction. In some embodiments, a processor determines an operational state of an interaction with a user based on parameter values of a data structure. The processor identifies a plurality of candidate moves for changing the operational state by determining a domain in which the interaction is occurring, retrieving a set of candidate moves that correspond to the domain from a knowledge graph, and adding the set to the plurality of candidate moves. The processor encodes input of the user received during the interaction into encoded terms, and determines a move for changing the operational state based on a match of the encoded terms to the set of candidate moves. The processor updates the parameter values of the data structure based on the move to reflect a current operational state led to by the move.

    HIERARCHICAL SPEECH RECOGNITION DECODER
    4.
    发明申请

    公开(公告)号:US20190035389A1

    公开(公告)日:2019-01-31

    申请号:US16148884

    申请日:2018-10-01

    CPC classification number: G10L15/197 G10L15/02 G10L15/063 G10L2015/0631

    Abstract: A speech interpretation module interprets the audio of user utterances as sequences of words. To do so, the speech interpretation module parameterizes a literal corpus of expressions by identifying portions of the expressions that correspond to known concepts, and generates a parameterized statistical model from the resulting parameterized corpus. When speech is received the speech interpretation module uses a hierarchical speech recognition decoder that uses both the parameterized statistical model and language sub-models that specify how to recognize a sequence of words. The separation of the language sub-models from the statistical model beneficially reduces the size of the literal corpus needed for training, reduces the size of the resulting model, provides more fine-grained interpretation of concepts, and improves computational efficiency by allowing run-time incorporation of the language sub-models.

    Hierarchical speech recognition decoder

    公开(公告)号:US10096317B2

    公开(公告)日:2018-10-09

    申请号:US15131833

    申请日:2016-04-18

    Abstract: A speech interpretation module interprets the audio of user utterances as sequences of words. To do so, the speech interpretation module parameterizes a literal corpus of expressions by identifying portions of the expressions that correspond to known concepts, and generates a parameterized statistical model from the resulting parameterized corpus. When speech is received the speech interpretation module uses a hierarchical speech recognition decoder that uses both the parameterized statistical model and language sub-models that specify how to recognize a sequence of words. The separation of the language sub-models from the statistical model beneficially reduces the size of the literal corpus needed for training, reduces the size of the resulting model, provides more fine-grained interpretation of concepts, and improves computational efficiency by allowing run-time incorporation of the language sub-models.

    System and method for enhancing voice-enabled search based on automated demographic identification
    6.
    发明授权
    System and method for enhancing voice-enabled search based on automated demographic identification 有权
    基于自动人口统计学识别来增强语音搜索的系统和方法

    公开(公告)号:US09189483B2

    公开(公告)日:2015-11-17

    申请号:US13847173

    申请日:2013-03-19

    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating responses to a user speech query in voice-enabled search based on metadata that include demographic features of the speaker. A system practicing the method recognizes received speech from a speaker to generate recognized speech, identifies metadata about the speaker from the received speech, and feeds the recognized speech and the metadata to a question-answering engine. Identifying the metadata about the speaker is based on voice characteristics of the received speech. The demographic features can include age, gender, socio-economic group, nationality, and/or region. The metadata identified about the speaker from the received speech can be combined with or override self-reported speaker demographic information.

    Abstract translation: 本文公开的是基于包括说话者的人口统计特征的元数据的用于在基于语音的搜索中近似对用户语音查询的响应的系统,方法和非暂时计算机可读存储介质。 实施该方法的系统识别来自扬声器的接收到的语音以产生识别的语音,从接收到的语音识别关于说话者的元数据,并将识别的语音和元数据馈送到问答引擎。 识别关于扬声器的元数据是基于所接收语音的语音特征。 人口特征可以包括年龄,性别,社会经济群体,国籍和/或地区。 从接收到的语音中识别的关于说话者的元数据可以与自报告的说话者人口统计信息进行组合或覆盖。

    Accelerating agent performance in a natural language processing system

    公开(公告)号:US11314942B1

    公开(公告)日:2022-04-26

    申请号:US16825856

    申请日:2020-03-20

    Abstract: A computer-implemented method for providing agent assisted transcriptions of user utterances. A user utterance is received in response to a prompt provided to the user at a remote client device. An automatic transcription is generated from the utterance using a language model based upon an application or context, and presented to a human agent. The agent reviews the transcription and may replace at least a portion of the transcription with a corrected transcription. As the agent inputs the corrected transcription, accelerants are presented to the user comprising suggested texted to be inputted. The accelerants may be determined based upon an agent input, an application or context of the transcription, the portion of the transcription being replaced, or any combination thereof. In some cases, the user provides textual input, to which the agent transcribes an intent associated with the input with the aid of one or more accelerants.

Patent Agency Ranking