MACHINE COMPREHENSION OF UNSTRUCTURED TEXT
    2.
    发明公开

    公开(公告)号:EP3443467A1

    公开(公告)日:2019-02-20

    申请号:EP17727049.3

    申请日:2017-05-17

    申请人: Maluuba Inc.

    IPC分类号: G06F17/27

    摘要: Described herein are systems and methods for providing a natural language comprehension system that employs a two-stage process for machine comprehension of text. The first stage indicates words in one or more text passages that potentially answer a question. The first stage outputs a set of candidate answers for the question, along with a first probability of correctness for each candidate answer. The second stage forms one or more hypotheses by inserting each candidate answer into the question and determines whether a sematic relationship exists between each hypothesis and each sentence in the text. The second processing circuitry generates a second probability of correctness for each candidate answer and combines the first probability with the second probability to produce a score that is used to rank the candidate answers. The candidate answer with the highest score is selected as a predicted answer.

    SPEECH RECOGNITION USING MODELS ASSOCIATED WITH A GEOGRAPHIC LOCATION
    5.
    发明公开
    SPEECH RECOGNITION USING MODELS ASSOCIATED WITH A GEOGRAPHIC LOCATION 审中-公开
    使用与地理位置相关的模型进行语音识别

    公开(公告)号:EP3198593A1

    公开(公告)日:2017-08-02

    申请号:EP15827991.9

    申请日:2015-07-31

    申请人: Maluuba Inc.

    CPC分类号: G10L15/18

    摘要: A natural language system for recognizing geographic specific language embodied within a query received at a computing device is disclosed. A given territory such as a country may be divided into sub-territories. The data source content may be limited to a predetermined number 5 of each type of entity determined by establishing a radius for each type of entity from the center of the particular sub-territory, and only including each entity with the distance of the radius. One or more sentence templates may be gathered from common queries, and training sentences may be created by substituting entities into the sentence patterns. When the natural language system receives a query, the system may apply a speech recognition module associated with 10 the geographic location of the computing device so that geographic specific language such as businesses, street and cities may be recognized by the particular speech recognition model.

    摘要翻译: 公开了一种用于识别在计算设备处接收的查询内包含的地理特定语言的自然语言系统。 一个特定的领土,如一个国家,可能会被划分为分区域。 数据源内容可以被限制为通过为来自特定子领土的中心的每种类型的实体建立半径而确定的每种类型的实体的预定数量5,并且仅包括具有半径距离的每个实体。 可以从普通查询中收集一个或多个句子模板,并且可以通过将实体替换为句子模式来创建训练句子。 当自然语言系统接收查询时,系统可以应用与计算设备的地理位置相关联的语音识别模块,使得诸如商业,街道和城市的地理特定语言可以被特定语音识别模型识别。

    METHOD AND SERVER FOR CLASSIFYING QUERIES
    6.
    发明公开
    METHOD AND SERVER FOR CLASSIFYING QUERIES 审中-公开
    分类查询的方法和服务器

    公开(公告)号:EP3201803A1

    公开(公告)日:2017-08-09

    申请号:EP15822796.7

    申请日:2015-07-17

    申请人: Maluuba Inc.

    IPC分类号: G06F17/30 G06F17/16

    摘要: A server, method, and non-transitory computer readable medium for classifying queries based on contextual information are provided. The server includes a network interface, a memory storage unit and a processor. The method involves receiving a plurality of queries, analyzing the queries and determining a likelihood divergence and selecting a domain. The non-transitory computer readable medium is encoded with codes to direct a processor to carry out the method.

    摘要翻译: 提供了用于基于上下文信息来分类查询的服务器,方法和非暂时性计算机可读介质。 该服务器包括网络接口,存储器存储单元和处理器。 该方法涉及接收多个查询,分析查询并确定可能性发散并选择域。 非暂时性计算机可读介质被编码以指导处理器执行该方法。

    CONVERSATIONAL AGENT
    7.
    发明公开
    CONVERSATIONAL AGENT 审中-公开
    交谈AGENT

    公开(公告)号:EP2839391A4

    公开(公告)日:2016-01-27

    申请号:EP13777880

    申请日:2013-04-22

    申请人: MALUUBA INC

    IPC分类号: G06F17/27 G06F17/30 G10L15/22

    摘要: A method, system, and computer program product provide a conversation agent to process natural language queries expressed by a user and perform commands according to the derived intention of the user. A natural language processing (NLP) engine derives intent using conditional random fields to identify a domain and at least one task embodied in the query. The NLP may further identify one or more subdomains, and one or more entities related to the identified command. A template system creates a data structure for information relevant to the derived intent and passes a template to a services manager for interfacing with one or more services capable of accomplishing the task. A dialog manager may elicit more entities from the user if required by the services manager and otherwise engage in conversation with the user. In one embodiment, the conversational agent allows a user to engage in multiple conversations simultaneously.

    Method and system for linking data sources for processing composite concepts
    8.
    发明公开
    Method and system for linking data sources for processing composite concepts 审中-公开
    一种用于连接源用于处理复合概念的方法和系统

    公开(公告)号:EP2757510A1

    公开(公告)日:2014-07-23

    申请号:EP14152193.0

    申请日:2014-01-22

    申请人: Maluuba Inc.

    IPC分类号: G06Q10/10 G06F17/30

    摘要: A computer-implemented method and system and computer-readable medium are disclosed for linking an ontology provided by a content service (i.e. category ontology) with a word expansion ontology (i.e. lexical ontology). A user may provide an input such as a voice command to an application. The voice command is processed by a natural language processing (NLP) engine to derive the user's intent and to extract relevant entities embodied in the command. The NLP engine may create a composite concept set containing multiple permutations of the concepts (entities extracted) and provide the composite concept set to a concept mapper. The concept mapper searches a mapping file and applies one or more scoring operations to determine a best match between the composite concept set and at least one category provided by the category ontology. The content service is searched using the category and the results are displayed to the user.

    摘要翻译: 一种计算机实现的方法和系统以及计算机可读介质盘为游离缺失链接到与字扩张本体由内容服务(即类别本体)提供本体(即词汇本体)。 用户可以提供输入以应用程序作为一个语音命令来搜索到。 该语音命令是由自然语言处理(NLP)引擎处理以导出用户的意图,并提取包含在该命令相关实体。 NLP引擎可以创建复合概念集含的(提取的实体)的概念的多个排列,并提供复合的概念设置为一个概念映射器。 概念映射器搜索映射文件和应用一个或更多的得分操作以确定性矿复合概念集和由类别本体设置有至少一个类别之间的最佳匹配。 内容服务使用类别搜索,并将结果显示给用户。

    Speech recognition using phoneme matching
    10.
    发明公开
    Speech recognition using phoneme matching 审中-公开
    on。。。。。。。。。

    公开(公告)号:EP2851896A1

    公开(公告)日:2015-03-25

    申请号:EP14185452.1

    申请日:2014-09-18

    申请人: Maluuba Inc.

    IPC分类号: G10L15/26

    CPC分类号: G10L15/26 G10L2015/088

    摘要: A system, method and computer program is provided for generating customized text representations of audio commands. A first speech recognition module may be used for generating a first text representation of an audio command based on a general language grammar. A second speech recognition module may be used for generating a second text representation of the audio command, the second module including a custom language grammar that may include contacts for a particular user. Entity extraction is applied to the second text representation and the entities are checked against a file containing personal language. If the entities are found in the user-specific language, the two text representations may be fused into a combined text representation and named entity recognition may be performed again to extract further entities.

    摘要翻译: 提供了一种用于生成音频命令的定制文本表示的系统,方法和计算机程序。 第一语音识别模块可以用于基于通用语言语法来生成音频命令的第一文本表示。 第二语音识别模块可以用于生成音频命令的第二文本表示,第二模块包括定制语言语法,其可以包括特定用户的联系人。 实体提取被应用于第二文本表示,并且针对包含个人语言的文件检查实体。 如果以用户特定语言找到实体,则两个文本表示可以被融合成组合的文本表示,并且可以再次执行命名实体识别以提取其他实体。