Systems and methods for extracting meaning from multimodal inputs using finite-state devices
    51.
    发明授权
    Systems and methods for extracting meaning from multimodal inputs using finite-state devices 有权
    使用有限状态设备从多模态输入中提取意义的系统和方法

    公开(公告)号:US07069215B1

    公开(公告)日:2006-06-27

    申请号:US09904253

    申请日:2001-07-12

    IPC分类号: G10L15/28

    摘要: Finite-state systems and methods allow multiple input streams to be parsed and integrated by a single finite-state device. These systems and methods not only address multimodal recognition, but are also able to encode semantics and syntax into a single finite-state device. The finite-state device provides models for recognizing multimodal inputs, such as speech and gesture, and composes the meaning content from the various input streams into a single semantic representation. Compared to conventional multimodal recognition systems, finite-state systems and methods allow for compensation among the various input streams. Finite-state systems and methods allow one input stream to dynamically alter a recognition model used for another input stream, and can reduce the computational complexity of multidimensional multimodal parsing. Finite-state devices provide a well-understood probabilistic framework for combining the probability distributions associated with the various input streams and for selecting among competing multimodal interpretations.

    摘要翻译: 有限状态系统和方法允许通过单个有限状态设备解析和集成多个输入流。 这些系统和方法不仅解决了多模态识别,而且还能够将语义和语法编码成单个有限状态的设备。 有限状态设备提供用于识别多模态输入(如语音和手势)的模型,并将来自各种输入流的含义构成单个语义表示。 与传统的多模式识别系统相比,有限状态系统和方法允许在各种输入流之间进行补偿。 有限状态系统和方法允许一个输入流动态地改变用于另一个输入流的识别模型,并且可以降低多维多模式解析的计算复杂度。 有限状态设备提供了一个很好理解的概率框架,用于组合与各种输入流相关联的概率分布,并用于在竞争的多模式解释之间进行选择。

    System and method for natural language generation
    52.
    发明申请
    System and method for natural language generation 有权
    自然语言生成的系统和方法

    公开(公告)号:US20050267751A1

    公开(公告)日:2005-12-01

    申请号:US11195973

    申请日:2005-08-03

    IPC分类号: G06F17/27 G06F17/28 G10L15/08

    CPC分类号: G06F17/271 G06F17/2881

    摘要: A system, method and computer-readable medium for generating natural language utilizes a stochastic process to choose a derivation tree according to a predetermined grammar, such as tree-adjoined grammar (TAG). A word lattice is created from a single semi-specified derivation tree and the proper path (i.e., desired output string) is selected from the lattice using a least cost, or other appropriate algorithms.

    摘要翻译: 用于生成自然语言的系统,方法和计算机可读介质利用随机过程根据诸如树相邻语法(TAG)的预定语法来选择导出树。 从单个半指定的导出树创建一个字格,并且使用最小成本或其他适当的算法从网格中选择适当的路径(即期望的输出串)。

    Copying human interactions through learning and discovery
    53.
    发明授权
    Copying human interactions through learning and discovery 有权
    通过学习和发现来复制人际交往

    公开(公告)号:US08990126B1

    公开(公告)日:2015-03-24

    申请号:US11462068

    申请日:2006-08-03

    IPC分类号: G06F15/18 G06F17/21

    摘要: A method, system and computer readable medium that generates a dialog model for use in automated dialog is disclosed. The method may include collecting a plurality of task-oriented dialog interactions between users and human agents for a given domain, identifying one or more task in each dialog interaction, identifying one or more subtasks in each identified task and associating relations between the subtasks, identifying a dialog act and a set of predicate-argument relations for each subtask, generating one or more clauses from the set of predicate-argument relations, storing the tasks, subtasks, dialog acts predicate-argument relations, and clauses from each dialog interaction as a dialog interaction set, generating a dialog management model using the stored dialog interaction sets.

    摘要翻译: 公开了一种生成用于自动对话中的对话模型的方法,系统和计算机可读介质。 该方法可以包括针对给定域收集用户和人类代理之间的多个面向任务的对话交互,识别每个对话交互中的一个或多个任务,识别每个被识别的任务中的一个或多个子任务并且关联子任务之间的关系,识别 每个子任务的对话行为和一组谓词 - 参数关系,从一组谓词参数关系生成一个或多个子句,将任务,子任务,对话动作谓词参数关系和每个对话框交互的子句存储为 对话交互集,使用存储的对话交互集生成对话管理模型。

    On-Demand language translation for television programs
    54.
    发明授权
    On-Demand language translation for television programs 有权
    电视节目的按需语言翻译

    公开(公告)号:US08589146B2

    公开(公告)日:2013-11-19

    申请号:US12772580

    申请日:2010-05-03

    IPC分类号: G06F17/28

    摘要: A method, a system and a machine-readable medium are provided for an on demand translation service. A translation module including at least one language pair module for translating a source language to a target language may be made available for use by a subscriber. The subscriber may be charged a fee for use of the requested on demand translation service or may be provided use of the on demand translation service for free in exchange for displaying commercial messages to the subscriber. A video signal may be received including information in the source language, which may be obtained as text from the video signal and may be translated from the source language to the target language by use of the translation module. Translated information, based on the translated text, may be added into the received video signal. The video signal including the translated information in the target language may be sent to a display device.

    摘要翻译: 为按需翻译服务提供方法,系统和机器可读介质。 包括用于将源语言翻译成目标语言的至少一个语言对模块的翻译模块可以被用户使用。 用户可能会收取使用所请求的按需翻译服务的费用,或者可以免费使用按需翻译服务,以便向用户显示商业消息。 可以接收包括源语言的信息的视频信号,其可以从视频信号获取为文本,并且可以通过使用翻译模块从源语言翻译成目标语言。 基于翻译文本的翻译信息可以被添加到接收的视频信号中。 可以将包括目标语言的翻译信息的视频信号发送到显示装置。

    System and method for enriching spoken language translation with prosodic information
    55.
    发明授权
    System and method for enriching spoken language translation with prosodic information 有权
    用韵律信息丰富口语翻译的系统和方法

    公开(公告)号:US08571849B2

    公开(公告)日:2013-10-29

    申请号:US12241660

    申请日:2008-09-30

    IPC分类号: G06F17/28

    CPC分类号: G06F17/289 G10L13/10

    摘要: Disclosed herein are systems, methods, and computer readable-media for enriching spoken language translation with prosodic information in a statistical speech translation framework. The method includes receiving speech for translation to a target language, generating pitch accent labels representing segments of the received speech which are prosodically prominent, and injecting pitch accent labels with word tokens within the translation engine to create enriched target language output text. A further step may be added of synthesizing speech in the target language based on the prosody enriched target language output text. An automatic prosody labeler can generate pitch accent labels. An automatic prosody labeler can exploit lexical, syntactic, and prosodic information of the speech. A maximum entropy model may be used to determine which segments of the speech are prosodically prominent. A pitch accent label can include an indication of certainty that a respective segment of the speech is prosodically prominent and/or an indication of prosodic prominence of a respective segment of speech.

    摘要翻译: 本文公开了用于在统计语音翻译框架中丰富口头语言翻译与韵律信息的系统,方法和计算机可读介质。 该方法包括接收用于翻译到目标语言的语音,产生表示接收到的语音段的韵律突出的音调重音标签,以及在翻译引擎内用词令牌注入音调重音标签,以创建丰富的目标语言输出文本。 可以基于韵律丰富的目标语言输出文本,添加目标语言中的合成语音的另一步骤。 自动韵律贴标机可以产生音调重音标签。 自动韵律标签者可以利用言语的词汇,句法和韵律信息。 可以使用最大熵模型来确定语音的哪些段韵律突出。 音调重音标签可以包括确定语音的相应片段韵律突出的指示和/或相应语音段的韵律突出的指示。

    System and method for referring to entities in a discourse domain
    56.
    发明授权
    System and method for referring to entities in a discourse domain 有权
    用于引用话语域中的实体的系统和方法

    公开(公告)号:US08566090B2

    公开(公告)日:2013-10-22

    申请号:US13465685

    申请日:2012-05-07

    IPC分类号: G10L13/027

    摘要: Systems, methods, and non-transitory computer-readable media for referring to entities. The method includes receiving domain-specific training data of sentences describing a target entity in a context, extracting a speaker history and a visual context from the training data, selecting attributes of the target entity based on at least one of the speaker history, the visual context, and speaker preferences, generating a text expression referring to the target entity based on at least one of the selected attributes, the speaker history, and the context, and outputting the generated text expression. The weighted finite-state automaton can represent partial orderings of word pairs in the domain-specific training data. The weighted finite-state automaton can be speaker specific or speaker independent. The weighted finite-state automaton can include a set of weighted partial orderings of the training data for each possible realization.

    摘要翻译: 用于引用实体的系统,方法和非暂时计算机可读介质。 该方法包括接收在上下文中描述目标实体的句子的特定领域的训练数据,从训练数据中提取讲者历史和视觉上下文,基于说话者的历史,视觉上的至少一个来选择目标实体的属性 上下文和说话人首选项,基于所选择的属性,说话者历史和上下文中的至少一个生成参考目标实体的文本表达,并输出所生成的文本表达。 加权有限状态自动机可以表示域特定训练数据中单词对的部分排序。 加权有限状态自动机可以是扬声器专用或扬声器独立的。 加权有限状态自动机可以包括用于每个可能实现的训练数据的一组加权部分排序。

    Method and apparatus for building sales tools by mining data from websites
    57.
    发明授权
    Method and apparatus for building sales tools by mining data from websites 失效
    通过网站挖掘数据建立销售工具的方法和设备

    公开(公告)号:US08359307B2

    公开(公告)日:2013-01-22

    申请号:US13088935

    申请日:2011-04-18

    IPC分类号: G06F17/30

    摘要: A website mining tool is disclosed that extracts information from, for example, a company's website and presents the extracted information in a graphical user interface (GUI). In one embodiment, web pages from a website are stored in, for example, computer memory and a structure of the web pages is identified. A plurality of blocks of information is then extracted as a function of this structure and a category is assigned to each block of information. The elements in the blocks of information are then displayed, for example to a salesperson, as a function of these categories. In another embodiment, Document Object Modeling parsing is used to identify the structure of the web pages. In yet another embodiment, a support vector machine is used to categorize each block of information.

    摘要翻译: 公开了一种网站挖掘工具,其提取例如公司网站的信息,并将所提取的信息呈现在图形用户界面(GUI)中。 在一个实施例中,来自网站的网页被存储在例如计算机存储器中,并且识别网页的结构。 然后根据该结构提取多个信息块,并将类别分配给每个信息块。 然后,作为这些类别的函数,将信息块中的元素显示为例如销售人员。 在另一个实施例中,文档对象建模解析用于识别网页的结构。 在另一个实施例中,支持向量机用于对每个信息块进行分类。

    SYSTEM AND METHOD FOR OPTIMIZING RESPONSE HANDLING TIME AND CUSTOMER SATISFACTION SCORES
    58.
    发明申请
    SYSTEM AND METHOD FOR OPTIMIZING RESPONSE HANDLING TIME AND CUSTOMER SATISFACTION SCORES 有权
    优化响应处理时间和客户满意度的系统和方法

    公开(公告)号:US20120271898A1

    公开(公告)日:2012-10-25

    申请号:US13539896

    申请日:2012-07-02

    IPC分类号: G06F15/16

    摘要: A system and method disclosed for using and updating a database of template responses for a live agent in response to user communications. The method includes computing an average string distance between each response from a live agent and a template, use to generate the response, modifying the computed average string distance based on a customer satisfaction score associated with each response and selecting a response that minimizes the computed average string distance and maximizes customer satisfaction. Upon receiving a further communication on a certain issue, the system presents a prototype response that has been added to the template database to the live agent for use in generating a response to the further communication that reduces handling time and increases customer satisfaction.

    摘要翻译: 公开了用于响应于用户通信使用和更新活动代理的模板响应的数据库的系统和方法。 该方法包括计算来自活动代理和模板的每个响应之间的平均字符串距离,用于生成响应,基于与每个响应相关联的客户满意度得分修改所计算的平均字符串距离,并且选择使计算出的平均值最小化的响应 弦距,并最大化客户满意度。 在一个特定的问题上接收到进一步的通信之后,系统呈现已经被添加到模板数据库中的实时代理的原型响应,用于产生对进一步通信的响应,这减少了处理时间并提高了客户满意度。

    System and method of spoken language understanding in human computer dialogs
    59.
    发明授权
    System and method of spoken language understanding in human computer dialogs 有权
    在人机对话中口语理解的系统和方法

    公开(公告)号:US08190436B2

    公开(公告)日:2012-05-29

    申请号:US10310596

    申请日:2002-12-05

    IPC分类号: G10L21/00 G06F17/20 G06F17/27

    摘要: A system and method are disclosed that improve automatic speech recognition in a spoken dialog system. The method comprises partitioning speech recognizer output into self-contained clauses, identifying a dialog act in each of the self-contained clauses, qualifying dialog acts by identifying a current domain object and/or a current domain action, and determining whether further qualification is possible for the current domain object and/or current domain action. If further qualification is possible, then the method comprises identifying another domain action and/or another domain object associated with the current domain object and/or current domain action, reassigning the another domain action and/or another domain object as the current domain action and/or current domain object and then recursively qualifying the new current domain action and/or current object. This process continues until nothing is left to qualify.

    摘要翻译: 公开了一种提高口语对话系统中的自动语音识别的系统和方法。 该方法包括将语音识别器输出划分为独立子句,识别每个自包含子句中的对话行为,通过识别当前域对象和/或当前域动作进行限定对话行为,以及确定是否可进一步限定 对于当前域对象和/或当前域操作。 如果可以进一步鉴定,则该方法包括识别与当前域对象和/或当前域操作相关联的另一域操作和/或另一域对象,将另一域操作和/或另一域对象重新分配为当前域操作,以及 /或当前域对象,然后递归地限定新的当前域操作和/或当前对象。 这个过程一直持续到没有什么是剩下的资格。

    Text edit tracker that categorizes communications, determines distances between templates, codes templates in color, and uses a morphing score based on edits
    60.
    发明授权
    Text edit tracker that categorizes communications, determines distances between templates, codes templates in color, and uses a morphing score based on edits 有权
    对通信进行分类的文本编辑跟踪器,确定模板之间的距离,颜色的代码模板,并使用基于编辑的变形分数

    公开(公告)号:US08170961B2

    公开(公告)日:2012-05-01

    申请号:US12252732

    申请日:2008-10-16

    IPC分类号: G06F11/00

    CPC分类号: G06F17/248

    摘要: A method for monitoring edits to a template for responding to an incoming communication includes categorizing the incoming communication into a category associated with the template for a response to the incoming communication. The method also includes determining distances between the template and each of a set of responses based on the template, at a predetermined level of granularity. The method also includes coding the template in accordance with the determined distances and displaying the coded template. A method for extracting a new template based on responses to an existing template includes selecting factors that affect quantitative measures for preparing a response to the incoming communication. The method includes using a mathematical model of the factors to cluster a set of responses created based on the existing template into two clusters. The method further includes restricting a first cluster centroid to be the existing template and searching for a second cluster centroid for a second cluster.

    摘要翻译: 用于监视对模板的编辑以响应传入通信的方法包括将传入通信分类为与模板相关联的类别以对于进入通信的响应。 该方法还包括以预定的粒度级确定基于模板的模板与一组响应中的每一个之间的距离。 该方法还包括根据确定的距离对模板进行编码并显示编码的模板。 基于对现有模板的响应来提取新模板的方法包括选择影响对进入通信的响应的定量测量的因素。 该方法包括使用这些因素的数学模型将基于现有模板创建的一组响应聚类成两个聚类。 该方法还包括将第一聚类中心限制为现有模板并搜索第二聚簇的第二聚类质心。