LIBRARY OF EXISTING SPOKEN DIALOG DATA FOR USE IN GENERATING NEW NATURAL LANGUAGE SPOKEN DIALOG SYSTEMS
    1.
    发明申请
    LIBRARY OF EXISTING SPOKEN DIALOG DATA FOR USE IN GENERATING NEW NATURAL LANGUAGE SPOKEN DIALOG SYSTEMS 审中-公开
    现有的语音对话数据库用于生成新的自然语言语音对话系统

    公开(公告)号:US20160093300A1

    公开(公告)日:2016-03-31

    申请号:US14963408

    申请日:2015-12-09

    CPC classification number: G10L15/1815 G10L15/063 G10L15/22 G10L15/28 G10L25/48

    Abstract: A machine-readable medium may include a group of reusable components for building a spoken dialog system. The reusable components may include a group of previously collected audible utterances. A machine-implemented method to build a library of reusable components for use in building a natural language spoken dialog system may include storing a dataset in a database. The dataset may include a group of reusable components for building a spoken dialog system. The reusable components may further include a group of previously collected audible utterances. A second method may include storing at least one set of data. Each one of the at least one set of data may include ones of the reusable components associated with audible data collected during a different collection phase.

    Abstract translation: 机器可读介质可以包括用于构建口语对话系统的一组可重复使用的组件。 可重复使用的组件可以包括一组先前收集的可听话语。 用于构建用于构建自然语言对话系统的可重用组件库的机器实现的方法可以包括将数据集存储在数据库中。 数据集可以包括一组用于构建口语对话系统的可重复使用的组件。 可重复使用的组件还可以包括一组先前收集的可听话语。 第二种方法可以包括存储至少一组数据。 所述至少一组数据中的每一个可以包括与在不同收集阶段期间收集的可听数据相关联的可重用组件中的一个。

    Method and apparatus for responding to an inquiry
    2.
    发明授权
    Method and apparatus for responding to an inquiry 有权
    响应查询的方法和装置

    公开(公告)号:US08719010B2

    公开(公告)日:2014-05-06

    申请号:US13782616

    申请日:2013-03-01

    Abstract: Disclosed is a method and apparatus for responding to an inquiry from a client via a network. The method and apparatus receive the inquiry from a client via a network. Based on the inquiry, question-answer pairs retrieved from the network are analyzed to determine a response to the inquiry. The QA pairs are not predefined. As a result, the QA pairs have to be analyzed in order to determine whether they are responsive to a particular inquiry. Questions of the QA pairs may be repetitive and, without more, will not be useful in determining whether their corresponding answer responds to an inquiry.

    Abstract translation: 公开了一种用于经由网络从客户机响应询问的方法和装置。 该方法和装置经由网络从客户端接收询问。 基于查询,分析从网络检索的问答对以确定对查询的响应。 QA对未预先定义。 因此,必须分析QA对以确定它们是否响应于特定查询。 质量保证对的问题可能是重复的,而在更多的情况下,确定他们对应的答案是否对询问作出回应将不会有用。

    Method and Apparatus for Responding to an Inquiry
    3.
    发明申请
    Method and Apparatus for Responding to an Inquiry 有权
    响应查询的方法和装置

    公开(公告)号:US20130177893A1

    公开(公告)日:2013-07-11

    申请号:US13782616

    申请日:2013-03-01

    Abstract: Disclosed is a method and apparatus for responding to an inquiry from a client via a network. The method and apparatus receive the inquiry from a client via a network. Based on the inquiry, question-answer pairs retrieved from the network are analyzed to determine a response to the inquiry. The QA pairs are not predefined. As a result, the QA pairs have to be analyzed in order to determine whether they are responsive to a particular inquiry. Questions of the QA pairs may be repetitive and, without more, will not be useful in determining whether their corresponding answer responds to an inquiry.

    Abstract translation: 公开了一种用于经由网络从客户机响应询问的方法和装置。 该方法和装置经由网络从客户端接收询问。 基于查询,分析从网络检索的问答对以确定对查询的响应。 QA对未预先定义。 因此,必须分析QA对以确定它们是否响应于特定查询。 质量保证对的问题可能是重复的,而在更多的情况下,确定他们对应的答案是否对询问作出回应将不会有用。

    Unsupervised and active learning in automatic speech recognition for call classification
    4.
    发明授权
    Unsupervised and active learning in automatic speech recognition for call classification 有权
    无监督和主动学习自动语音识别呼叫分类

    公开(公告)号:US09159318B2

    公开(公告)日:2015-10-13

    申请号:US14468375

    申请日:2014-08-26

    Abstract: Utterance data that includes at least a small amount of manually transcribed data is provided. Automatic speech recognition is performed on ones of the utterance data not having a corresponding manual transcription to produce automatically transcribed utterances. A model is trained using all of the manually transcribed data and the automatically transcribed utterances. A predetermined number of utterances not having a corresponding manual transcription are intelligently selected and manually transcribed. Ones of the automatically transcribed data as well as ones having a corresponding manual transcription are labeled. In another aspect of the invention, audio data is mined from at least one source, and a language model is trained for call classification from the mined audio data to produce a language model.

    Abstract translation: 提供了至少包含少量手动转录数据的语音数据。 对没有相应的手动转录的话语数据中的一个进行自动语音识别以产生自动转录的话语。 使用所有手动转录数据和自动转录的话语训练模型。 智能地选择并且手动地转录预定数量的不具有对应的手动转录的话语。 自动转录的数据以及具有相应手动转录的数据的标签。 在本发明的另一方面,音频数据从至少一个源开始,并且语言模型被训练用于从所开采的音频数据进行呼叫分类以产生语言模型。

    Answer Determination for Natural Language Questioning
    7.
    发明申请
    Answer Determination for Natural Language Questioning 审中-公开
    自然语言提问的答案决定

    公开(公告)号:US20150052113A1

    公开(公告)日:2015-02-19

    申请号:US14479765

    申请日:2014-09-08

    CPC classification number: G06F16/243 G06F16/3331 G06F16/951 G06F17/279

    Abstract: Open-domain question answering is the task of finding a concise answer to a natural language question using a large domain, such as the Internet. The use of a semantic role labeling approach to the extraction of the answers to an open domain factoid (Who/When/What/Where) natural language question that contains a predicate is described. Semantic role labeling identities predicates and semantic argument phrases in the natural language question and the candidate sentences. When searching for an answer to a natural language question, the missing argument in the question is matched using semantic parses of the candidate answers. Such a technique may improve the accuracy of a question answering system and may decrease the length of answers for enabling voice interface to a question answering system.

    Abstract translation: 开放域问答是使用大型域(如互联网)找到一个简明的自然语言问题答案的任务。 描述了使用语义角色标注方法来提取对包含谓词的开放域factoid(Who / When / What / Where)自然语言问题的答案。 自然语言问题和候选句子中的语义角色标识身份谓词和语义参数短语。 在搜索自然语言问题的答案时,使用候选答案的语义分析来匹配问题中的缺失参数。 这样的技术可以提高问答系统的准确性,并且可以减少用于启用语音接口到问答系统的答案的长度。

    System and method of providing an automated data-collection in spoken dialog systems
    8.
    发明授权
    System and method of providing an automated data-collection in spoken dialog systems 有权
    在口头对话系统中提供自动数据收集的系统和方法

    公开(公告)号:US08914294B2

    公开(公告)日:2014-12-16

    申请号:US14246216

    申请日:2014-04-07

    CPC classification number: G10L15/063 G10L15/183 G10L15/22

    Abstract: The invention relates to a system and method for gathering data for use in a spoken dialog system. An aspect of the invention is generally referred to as an automated hidden human that performs data collection automatically at the beginning of a conversation with a user in a spoken dialog system. The method comprises presenting an initial prompt to a user, recognizing a received user utterance using an automatic speech recognition engine and classifying the recognized user utterance using a spoken language understanding module. If the recognized user utterance is not understood or classifiable to a predetermined acceptance threshold, then the method re-prompts the user. If the recognized user utterance is not classifiable to a predetermined rejection threshold, then the method transfers the user to a human as this may imply a task-specific utterance. The received and classified user utterance is then used for training the spoken dialog system.

    Abstract translation: 本发明涉及一种用于收集在口头对话系统中使用的数据的系统和方法。 本发明的一个方面通常被称为在与对话系统中的用户的对话开始时自动执行数据收集的自动隐藏人。 该方法包括向用户呈现初始提示,使用自动语音识别引擎识别接收到的用户话语,并使用口语理解模块对所识别的用户话语进行分类。 如果识别的用户话语不能被理解或可被分类到预定的接受阈值,则该方法重新提示用户。 如果识别的用户话语不能被分类为预定的拒绝阈值,则该方法将用户转移给人,因为这可能意味着任务特定的话语。 然后,接收和分类的用户话语用于训练口语对话系统。

    Apparatus and Method for Model Adaptation for Spoken Language Understanding
    9.
    发明申请
    Apparatus and Method for Model Adaptation for Spoken Language Understanding 有权
    用于语言理解的模型适应的装置和方法

    公开(公告)号:US20140330565A1

    公开(公告)日:2014-11-06

    申请号:US14282054

    申请日:2014-05-20

    Inventor: Gokhan Tur

    CPC classification number: G10L15/065

    Abstract: An apparatus and a method are provided for building a spoken language understanding model. Labeled data may be obtained for a target application. A new classification model may be formed for use with the target application by using the labeled data for adaptation of an existing classification model. In some implementations, the existing classification model may be used to determine the most informative examples to label.

    Abstract translation: 提供了一种用于构建口语理解模型的装置和方法。 可以为目标应用获得标签数据。 可以通过使用用于适应现有分类模型的标记数据来形成用于目标应用的新分类模型。 在一些实施方式中,现有的分类模型可用于确定最具信息性的标签示例。

    System and Method of Providing an Automated Data-Collection in Spoken Dialog Systems
    10.
    发明申请
    System and Method of Providing an Automated Data-Collection in Spoken Dialog Systems 有权
    在口语对话系统中提供自动数据收集的系统和方法

    公开(公告)号:US20140222426A1

    公开(公告)日:2014-08-07

    申请号:US14246216

    申请日:2014-04-07

    CPC classification number: G10L15/063 G10L15/183 G10L15/22

    Abstract: The invention relates to a system and method for gathering data for use in a spoken dialog system. An aspect of the invention is generally referred to as an automated hidden human that performs data collection automatically at the beginning of a conversation with a user in a spoken dialog system. The method comprises presenting an initial prompt to a user, recognizing a received user utterance using an automatic speech recognition engine and classifying the recognized user utterance using a spoken language understanding module. If the recognized user utterance is not understood or classifiable to a predetermined acceptance threshold, then the method re-prompts the user. If the recognized user utterance is not classifiable to a predetermined rejection threshold, then the method transfers the user to a human as this may imply a task-specific utterance. The received and classified user utterance is then used for training the spoken dialog system.

    Abstract translation: 本发明涉及一种用于收集在口头对话系统中使用的数据的系统和方法。 本发明的一个方面通常被称为在与对话系统中的用户的对话开始时自动执行数据收集的自动隐藏人。 该方法包括向用户呈现初始提示,使用自动语音识别引擎识别接收到的用户话语,并使用口语理解模块对所识别的用户话语进行分类。 如果识别的用户话语不能被理解或可被分类到预定的接受阈值,则该方法重新提示用户。 如果识别的用户话语不能被分类为预定的拒绝阈值,则该方法将用户转移给人,因为这可能意味着任务特定的话语。 然后,接收和分类的用户话语用于训练口语对话系统。

Patent Agency Ranking