Machine Translation with Side Information
    1.
    发明申请
    Machine Translation with Side Information 有权
    机器翻译与侧面信息

    公开(公告)号:US20110282648A1

    公开(公告)日:2011-11-17

    申请号:US12779751

    申请日:2010-05-13

    IPC分类号: G06F17/28 G06F17/30 G06F7/00

    CPC分类号: G06F17/2818

    摘要: A method of identifying and using side information available to statistical machine translation systems within an enterprise setting, the method including extracting user-specific interaction and non-interaction-based information from at least one corresponding database within the enterprise for each of a plurality of users, aggregating the user-specific interaction and non-interaction based information from a plurality of users, by using a processor on a computer, to tune and adapt background translation and language models, and updating all relevant models within the enterprise after user activity based on the tuned and adapted translation and language models.

    摘要翻译: 一种识别和使用可用于企业设置内的统计机器翻译系统的侧面信息的方法,所述方法包括从多个用户中的每一个的企业内的至少一个对应的数据库中提取用户特定交互和非基于交互的信息 ,通过使用计算机上的处理器来聚合来自多个用户的用户特定交互和非基于交互的信息,以调整和适应背景翻译和语言模型,以及在基于用户活动的用户活动之后更新企业内的所有相关模型 调整和适应的翻译和语言模型。

    Machine translation with side information
    2.
    发明授权
    Machine translation with side information 有权
    机器翻译与侧面信息

    公开(公告)号:US08768686B2

    公开(公告)日:2014-07-01

    申请号:US12779751

    申请日:2010-05-13

    IPC分类号: G06F17/28

    CPC分类号: G06F17/2818

    摘要: A method of identifying and using side information available to statistical machine translation systems within an enterprise setting, the method including extracting user-specific interaction and non-interaction-based information from at least one corresponding database within the enterprise for each of a plurality of users, aggregating the user-specific interaction and non-interaction based information from a plurality of users, by using a processor on a computer, to tune and adapt background translation and language models, and updating all relevant models within the enterprise after user activity based on the tuned and adapted translation and language models.

    摘要翻译: 一种识别和使用可用于企业设置内的统计机器翻译系统的侧面信息的方法,所述方法包括从多个用户中的每一个的企业内的至少一个对应的数据库中提取用户特定交互和非基于交互的信息 ,通过使用计算机上的处理器来聚合来自多个用户的用户特定交互和非基于交互的信息,以调整和适应背景翻译和语言模型,以及在基于用户活动的用户活动之后更新企业内的所有相关模型 调整和适应的翻译和语言模型。

    Word classing for language modeling
    3.
    发明授权
    Word classing for language modeling 有权
    用于语言建模的词分类

    公开(公告)号:US09367526B1

    公开(公告)日:2016-06-14

    申请号:US13190891

    申请日:2011-07-26

    摘要: A language processing application employs a classing function optimized for the underlying production application context for which it is expected to process speech. A combination of class based and word based features generates a classing function optimized for a particular production application, meaning that a language model employing the classing function uses word classes having a high likelihood of accurately predicting word sequences encountered by a language model invoked by the production application. The classing function optimizes word classes by aligning the objective of word classing with the underlying language processing task to be performed by the production application. The classing function is optimized to correspond to usage in the production application context using class-based and word-based features by computing a likelihood of a word in an n-gram and a frequency of a word within a class of the n-gram.

    摘要翻译: 语言处理应用程序使用针对其预期处理语音的底层生产应用程序环境进行优化的分类功能。 基于类和基于字的特征的组合产生针对特定生产应用优化的分类功能,这意味着采用分类函数的语言模型使用具有准确预测由生产调用的语言模型遇到的单词序列的高似然性的单词类 应用。 分类函数通过将单词分类的目标与生产应用程序执行的底层语言处理任务进行对齐来优化单词类。 通过计算n-gram中的单词和n-gram类中的单词的可能性,使用基于类和基于单词的特征来优化分类功能以对应于生产应用上下文中的使用。

    MT Based Spoken Dialog Systems Customer/Machine Dialog
    4.
    发明申请
    MT Based Spoken Dialog Systems Customer/Machine Dialog 有权
    基于MT的口语对话系统客户/机器对话框

    公开(公告)号:US20130073276A1

    公开(公告)日:2013-03-21

    申请号:US13236016

    申请日:2011-09-19

    IPC分类号: G06F17/28

    摘要: Operation of an automated dialog system is described using a source language to conduct a real time human machine dialog process with a human user using a target language. A user query in the target language is received and automatically machine translated into the source language. An automated reply of the dialog process is then delivered to the user in the target language. If the dialog process reaches an initial assistance state, a first human agent using the source language is provided to interact in real time with the user in the target language by machine translation to continue the dialog process. Then if the dialog process reaches a further assistance state, a second human agent using the target language is provided to interact in real time with the user in the target language to continue the dialog process.

    摘要翻译: 使用源语言来描述自动对话系统的操作,以使用目标语言与人类用户进行实时的人机对话过程。 接收目标语言的用户查询并自动机器翻译成源语言。 然后将对话过程的自动回复以目标语言传递给用户。 如果对话过程达到初始辅助状态,则使用源语言的第一人机代理被提供以通过机器翻译以目标语言与用户实时交互以继续对话过程。 然后,如果对话过程达到进一步的辅助状态,则使用目标语言的第二人机代理被提供以与目标语言的用户实时交互以继续对话过程。

    Game based method for translation data acquisition and evaluation
    6.
    发明授权
    Game based method for translation data acquisition and evaluation 有权
    基于游戏的翻译数据采集和评估方法

    公开(公告)号:US08566078B2

    公开(公告)日:2013-10-22

    申请号:US12697047

    申请日:2010-01-29

    CPC分类号: A63F9/24 G06F17/28

    摘要: A method of generating a statistical machine translation database through a game in which a monolingual structure is provided to a plurality of players. A first translation attempt is received from each of the plurality of players. The first translation attempt from each of the plurality of players is compared. Feedback is provided to each of the plurality of players and the attempts are received and compared to provide feedback to iteratively converge subsequent translations from each of the plurality of players into a final translated structure.

    摘要翻译: 一种通过游戏产生统计机器翻译数据库的方法,其中向多个玩家提供单语构造。 从多个玩家中的每一个接收第一翻译尝试。 比较来自多个玩家中的每一个的第一翻译尝试。 向多个玩家中的每一个提供反馈,并且尝试被接收并且进行比较以提供反馈以迭代地收敛从多个玩家中的每一个到随后的翻译结构的后续翻译。

    GAME BASED METHOD FOR TRANSLATION DATA ACQUISITION AND EVALUATION
    7.
    发明申请
    GAME BASED METHOD FOR TRANSLATION DATA ACQUISITION AND EVALUATION 有权
    基于游戏的翻译数据获取和评估方法

    公开(公告)号:US20110191096A1

    公开(公告)日:2011-08-04

    申请号:US12697047

    申请日:2010-01-29

    IPC分类号: G06F17/28 A63F9/24

    CPC分类号: A63F9/24 G06F17/28

    摘要: A method of generating a statistical machine translation database through a game in which a monolingual structure is provided to a plurality of players. A first translation attempt is received from each of the plurality of players. The first translation attempt from each of the plurality of players is compared. Feedback is provided to each of the plurality of players and the attempts are received and compared to provide feedback to iteratively converge subsequent translations from each of the plurality of players into a final translated structure.

    摘要翻译: 一种通过游戏产生统计机器翻译数据库的方法,其中向多个玩家提供单语构造。 从多个玩家中的每一个接收第一翻译尝试。 比较来自多个玩家中的每一个的第一翻译尝试。 向多个玩家中的每一个提供反馈,并且尝试被接收并且进行比较以提供反馈以迭代地收敛从多个玩家中的每一个到随后的翻译结构的后续翻译。

    Machine translation in continuous space
    8.
    发明授权
    Machine translation in continuous space 失效
    机器翻译在连续空间

    公开(公告)号:US08229729B2

    公开(公告)日:2012-07-24

    申请号:US12054636

    申请日:2008-03-25

    CPC分类号: G06F17/2818

    摘要: A system and method for training a statistical machine translation model and decoding or translating using the same is disclosed. A source word versus target word co-occurrence matrix is created to define word pairs. Dimensionality of the matrix may be reduced. Word pairs are mapped as vectors into continuous space where the word pairs are vectors of continuous real numbers and not discrete entities in the continuous space. A machine translation parametric model is trained using an acoustic model training method based on word pair vectors in the continuous space.

    摘要翻译: 公开了一种用于训练统计机器翻译模型和使用其的解码或翻译的系统和方法。 创建源词与目标词同现矩阵以定义单词对。 可以减小矩阵的尺寸。 字对被映射为连续空间中的向量,其中单词对是连续实数的向量,而不是连续空间中的离散实体。 使用基于连续空间中的字对矢量的声学模型训练方法训练机器翻译参数模型。

    Semantic language modeling and confidence measurement
    9.
    发明申请
    Semantic language modeling and confidence measurement 有权
    语义语言建模和置信度测量

    公开(公告)号:US20050055209A1

    公开(公告)日:2005-03-10

    申请号:US10655838

    申请日:2003-09-05

    IPC分类号: G10L15/18 G10L15/28 G10L15/00

    CPC分类号: G10L15/1815

    摘要: A system and method for speech recognition includes generating a set of likely hypotheses in recognizing speech, rescoring the likely hypotheses by using semantic content by employing semantic structured language models, and scoring parse trees to identify a best sentence according to the sentence's parse tree by employing the semantic structured language models to clarify the recognized speech.

    摘要翻译: 一种用于语音识别的系统和方法包括在识别语音中产生一组可能的假设,通过使用语义结构化语言模型通过使用语义内容来重新计算可能的假设,并且通过采用语法结构语言模型对解析树进行评分以识别根据句子的解析树的最佳句子 语义结构语言模型来澄清公认的言语。