Method for fast semi-automatic semantic annotation
    1.
    Type: Granted invention patent
    Legal status: In force

    Publication No.: US07610191B2

    Publication date: 2009-10-27

    Application No.: US10959523

    Filing date: 2004-10-06

    IPC classification: G06F17/27

    CPC classification: G06F17/271 G06F17/2755

    Abstract: A method, apparatus and computer instructions are provided for fast semi-automatic semantic annotation. Given a limited annotated corpus, the present invention assigns a tag and a label to each word of the next chunk of the corpus using a parser engine, a similarity engine, and an SVM engine. A rover then combines the parse trees from the three engines and annotates that chunk with a confidence measure, such that the effort required for human annotation is reduced.
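
    The combination step can be pictured as per-word voting across the three engines. The sketch below is only an illustration under stated assumptions: each engine is assumed to emit one (tag, label) pair per word, and the rover is assumed to use simple majority voting with the agreement fraction as its confidence. The abstract does not specify the combination rule, and the function name `rover_combine`, the `threshold` parameter, and the sample tags are hypothetical.

```python
from collections import Counter

def rover_combine(parser_out, similarity_out, svm_out, threshold=2 / 3):
    """Majority-vote the per-word (tag, label) pairs proposed by three engines.

    Returns one (pair, confidence, needs_review) tuple per word.
    The voting rule and the threshold are illustrative assumptions.
    """
    if not (len(parser_out) == len(similarity_out) == len(svm_out)):
        raise ValueError("all engines must annotate the same words")
    combined = []
    for votes in zip(parser_out, similarity_out, svm_out):
        # On a three-way disagreement the first engine's proposal is kept.
        pair, count = Counter(votes).most_common(1)[0]
        confidence = count / len(votes)
        # Words below the threshold are flagged for a human annotator.
        combined.append((pair, confidence, confidence < threshold))
    return combined
```

    Words on which at least two engines agree are accepted automatically in this sketch; only the flagged words would be passed to a human annotator.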

Method and apparatus for fast semi-automatic semantic annotation
    2.
    Type: Granted invention patent
    Legal status: In force

    Publication No.: US07996211B2

    Publication date: 2011-08-09

    Application No.: US12123778

    Filing date: 2008-05-20

    IPC classification: G06F17/27

    CPC classification: G06F17/271 G06F17/2755

    Abstract: A method, apparatus and computer instructions are provided for fast semi-automatic semantic annotation. Given a limited annotated corpus, the present invention assigns a tag and a label to each word of the next chunk of the corpus using a parser engine, a similarity engine, and an SVM engine. A rover then combines the parse trees from the three engines and annotates that chunk with a confidence measure, such that the effort required for human annotation is reduced.

Method and Apparatus for Fast Semi-Automatic Semantic Annotation
    3.
    Type: Invention patent application
    Legal status: In force

    Publication No.: US20080221874A1

    Publication date: 2008-09-11

    Application No.: US12123778

    Filing date: 2008-05-20

    IPC classification: G06F17/27

    CPC classification: G06F17/271 G06F17/2755

    Abstract: A method, apparatus and computer instructions are provided for fast semi-automatic semantic annotation. Given a limited annotated corpus, the present invention assigns a tag and a label to each word of the next chunk of the corpus using a parser engine, a similarity engine, and an SVM engine. A rover then combines the parse trees from the three engines and annotates that chunk with a confidence measure, such that the effort required for human annotation is reduced.

Speaker adaptation system and method based on class-specific pre-clustering training speakers
    4.
    Type: Granted invention patent
    Legal status: Lapsed

    Publication No.: US06073096A

    Publication date: 2000-06-06

    Application No.: US18350

    Filing date: 1998-02-04

    IPC classification: G10L15/07 G10L15/06

    CPC classification: G10L15/07

    Abstract: A method of speech recognition in accordance with the present invention includes the steps of grouping acoustics to form classes based on acoustic features; clustering training speakers by the classes to provide class-specific cluster systems; selecting, from the cluster systems, a subset of cluster systems closest to adaptation data from a test speaker; transforming the subset of cluster systems, based on the adaptation data, to bring it closer to the test speaker and form adapted cluster systems; and combining the adapted cluster systems to create a speaker-adapted system for decoding speech from the test speaker. Systems and methods for building speech recognition systems, as well as for adapting speaker systems for class-specific speaker clusters, are included.
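
    The select, transform, and combine steps can be illustrated for a single acoustic class with a deliberately simplified sketch. It assumes each cluster system is summarized by one mean feature vector, that closeness is Euclidean distance to the mean of the adaptation data, that the transform is a linear shift toward that mean, and that the combination is a plain average. None of these choices come from the patent, and the names `adapt`, `k`, and `step` are hypothetical.

```python
import math

def mean_vector(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def adapt(cluster_means, adaptation_frames, k=2, step=0.5):
    """Build a speaker-adapted mean for one acoustic class.

    cluster_means: one mean vector per training-speaker cluster (assumed model).
    adaptation_frames: feature vectors from the test speaker.
    """
    target = mean_vector(adaptation_frames)
    # Select the k cluster systems closest to the adaptation data.
    nearest = sorted(cluster_means, key=lambda m: math.dist(m, target))[:k]
    # Transform each selected cluster toward the test speaker.
    moved = [[c + step * (t - c) for c, t in zip(m, target)] for m in nearest]
    # Combine the adapted clusters into a single adapted system.
    return mean_vector(moved)
```

    In this toy form, a distant cluster such as `[10.0, 10.0]` never influences the result as long as `k` closer clusters exist, which is the point of selecting a subset before transforming.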

Methods and apparatus for training a pattern recognition system using maximal rank likelihood as an optimization function
    5.
    Type: Granted invention patent
    Legal status: Lapsed

    Publication No.: US06850888B1

    Publication date: 2005-02-01

    Application No.: US09680706

    Filing date: 2000-10-06

    IPC classification: G10L15/14 G10L15/00

    CPC classification: G10L15/144

    Abstract: A method and apparatus are disclosed for training a pattern recognition system, such as a speech recognition system, using an improved objective function. The concept of rank likelihood, previously applied only to the decoding process, is applied in a novel manner to the parameter estimation of the training phase of a pattern recognition system. The disclosed objective function is based on a pseudo-rank likelihood that not only maximizes the likelihood of an observation for the correct class, but also minimizes the likelihoods of the observation for all other classes, such that the discrimination between classes is maximized. A training process is disclosed that utilizes the pseudo-rank likelihood objective function to identify model parameters that will result in a pattern recognizer with the lowest possible recognition error rate. The discrete nature of the rank-based rank likelihood objective function is transformed to allow the parameter estimations to be optimized during the training phase.
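
    One way to picture the move from a discrete rank to an optimizable quantity is to replace the hard count of classes that outscore the correct class with a sum of sigmoids. The sketch below is an assumption made for illustration, not the formulation used in the patent; `hard_rank`, `pseudo_rank`, and the `sharpness` parameter are hypothetical names.

```python
import math

def _sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)

def hard_rank(log_likes, correct):
    """Discrete rank: 1 + number of classes scoring above the correct class."""
    target = log_likes[correct]
    return 1 + sum(1 for j, ll in enumerate(log_likes)
                   if j != correct and ll > target)

def pseudo_rank(log_likes, correct, sharpness=1.0):
    """Smooth surrogate: each competitor adds sigmoid(sharpness * (ll_j - ll_correct))."""
    target = log_likes[correct]
    return 1.0 + sum(_sigmoid(sharpness * (ll - target))
                     for j, ll in enumerate(log_likes) if j != correct)
```

    Raising the log-likelihood of the correct class or lowering that of any competitor reduces the surrogate, so a gradient-based optimizer has a usable signal; as `sharpness` grows, the surrogate approaches the hard rank.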

Method and apparatus for estimating phone class probabilities a-posteriori using a decision tree
    6.
    Type: Granted invention patent
    Legal status: Lapsed

    Publication No.: US5680509A

    Publication date: 1997-10-21

    Application No.: US312584

    Filing date: 1994-09-27

    IPC classification: G10L15/06 G10L15/08 G10L5/06

    CPC classification: G10L15/063 G10L15/08

    Abstract: A method and apparatus are disclosed for estimating the a-posteriori probability of phones in the context of not only the acoustic feature at the current time but also the acoustic features in the vicinity of the current time, and for using that estimate to cut down the search space in a speech recognition system. The method constructs and uses a decision tree, with the predictors of the decision tree being the vector-quantized acoustic feature vectors at the current time and in the vicinity of the current time. The process starts with an enumeration of all (predictor, class) events in the training data at the root node, and successively partitions the data at a node according to the most informative split at that node. An iterative algorithm is used to design the binary partitioning. After the construction of the tree is completed, the probability distribution of the predicted class is stored at all of its terminal leaves. The decision tree is used during the decoding process by tracing a path down to one of its leaves, based on the answers to binary questions about the vector-quantized acoustic feature vector at the current time and its vicinity.
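
    Choosing "the most informative split" at a node can be illustrated with an entropy-based criterion. The sketch assumes each training event is a (predictors, phone class) pair, where the predictors are vector-quantizer codeword indices for the current frame and its neighbors, and that a binary question asks whether the codeword at one position belongs to a given set. The candidate questions are supplied by the caller; the patent's iterative algorithm for designing the partition is not reproduced. All names are hypothetical.

```python
import math
from collections import Counter

def entropy(classes):
    """Shannon entropy, in bits, of a list of class labels."""
    total = len(classes)
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(classes).values())

def best_split(events, questions):
    """Return (gain, question) for the question with the largest information gain.

    events: list of (predictors, phone_class) pairs.
    questions: list of (position, codeword_set) candidates.
    """
    base = entropy([c for _, c in events])
    best = (0.0, None)
    for pos, codewords in questions:
        yes = [c for p, c in events if p[pos] in codewords]
        no = [c for p, c in events if p[pos] not in codewords]
        if not yes or not no:
            continue  # this question does not partition the data
        remainder = (len(yes) * entropy(yes) + len(no) * entropy(no)) / len(events)
        gain = base - remainder
        if gain > best[0]:
            best = (gain, (pos, codewords))
    return best
```

    Applying `best_split` recursively to the "yes" and "no" subsets, and storing the class distribution of each subset that is no longer split, would yield a tree of the general kind the abstract describes.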

Techniques for enhancing the performance of concatenative speech synthesis
    7.
    Type: Granted invention patent
    Legal status: In force

    Publication No.: US08145491B2

    Publication date: 2012-03-27

    Application No.: US10208453

    Filing date: 2002-07-30

    IPC classification: G10L13/06

    CPC classification: G10L13/07

    Abstract: When the pitch of a speech segment is being modified from a current pitch to a requested pitch, and the difference between these is relatively large, a pitch modification algorithm is used to modify the pitch of the speech segment. When the difference between the current and requested pitches is relatively small, the pitch of the speech segment is not modified. After one or the other speech modification technique is used, the resultant modified speech segment is overlapped and added to previously modified speech segments. A modification ratio is determined in order to quantify the difference between the current and requested pitches for a speech segment. The modification ratio is a ratio between the requested and current pitches. Low and high ratio thresholds are used to determine when pitch is being modified to a predetermined high degree, and whether the pitch of the speech segment will or will not be modified.
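
    The threshold test on the modification ratio can be sketched in a few lines. The ratio is taken as requested pitch divided by current pitch, as the abstract states; the threshold values 0.8 and 1.25 and the function names are hypothetical and chosen only for illustration.

```python
def modification_ratio(current_hz, requested_hz):
    """Ratio between the requested and current pitches."""
    if current_hz <= 0:
        raise ValueError("current pitch must be positive")
    return requested_hz / current_hz

def should_modify_pitch(current_hz, requested_hz, low=0.8, high=1.25):
    """True when the requested change is large enough to warrant pitch modification.

    Ratios inside [low, high] are treated as small changes and left unmodified.
    The default thresholds are illustrative assumptions.
    """
    ratio = modification_ratio(current_hz, requested_hz)
    return ratio < low or ratio > high
```

    For example, a request to move a 100 Hz segment to 105 Hz falls inside the band and the segment would be left as is, while a request for 150 Hz would trigger the pitch modification algorithm.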

Automatic segmentation of continuous text using statistical approaches
    9.
    Type: Granted invention patent
    Legal status: Lapsed

    Publication No.: US5806021A

    Publication date: 1998-09-08

    Application No.: US700823

    Filing date: 1996-09-04

    IPC classification: G06F17/27 G06F17/20

    CPC classification: G06F17/277

    Abstract: An automatic segmenter for continuous text segments such text in a rapid, consistent and semantically accurate manner. Two statistical methods for segmentation of continuous text are used. The first method, called "forward-backward matching", is easy and fast but can produce occasional errors in long phrases. The second method, called "statistical stack search segmenter", utilizes statistical language models to generate more accurate segmentation output at the expense of two times more execution time than the "forward-backward matching" method. In applications where speed is a major concern, "forward-backward matching" can be used, while in applications where highly accurate output is desired, "statistical stack search segmenter" is ideal.
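
    A common dictionary-based reading of "forward-backward matching" is to run greedy longest-match segmentation in both directions and compare the results. The sketch below follows that reading as an assumption; the abstract does not give the algorithm's details, and the tie-break used here (prefer fewer words, otherwise the backward result) is a widely used heuristic rather than the patent's rule. The function names and the `max_len` parameter are hypothetical.

```python
def forward_match(text, vocab, max_len):
    """Greedy longest-match segmentation, scanning left to right."""
    words, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocab:  # single characters always match
                words.append(piece)
                i += size
                break
    return words

def backward_match(text, vocab, max_len):
    """Greedy longest-match segmentation, scanning right to left."""
    words, j = [], len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            piece = text[j - size:j]
            if size == 1 or piece in vocab:
                words.append(piece)
                j -= size
                break
    return words[::-1]

def forward_backward_match(text, vocab, max_len=4):
    """Segment in both directions and keep one result (assumed tie-break)."""
    fwd = forward_match(text, vocab, max_len)
    bwd = backward_match(text, vocab, max_len)
    return fwd if len(fwd) < len(bwd) else bwd
```

    With the vocabulary `{"ab", "abc", "cd"}` and `max_len=3`, the forward pass splits `"abcd"` into `abc | d` while the backward pass yields `ab | cd`; the two passes disagree, which is the kind of case where a purely greedy matcher can make the occasional errors the abstract mentions.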

Methods and apparatus for adapting output speech in accordance with context of communication
    10.
    Type: Granted invention patent
    Legal status: In force

    Publication No.: US07490042B2

    Publication date: 2009-02-10

    Application No.: US11092057

    Filing date: 2005-03-29

    IPC classification: G10L15/00

    CPC classification: G10L13/027 G10L15/22

    Abstract: A technique for producing speech output in an automatic dialog system in accordance with a detected context is provided. Communication is received from a user at the automatic dialog system. A context of the communication from the user is detected in a context detector of the automatic dialog system. A message is created in a natural language generator of the automatic dialog system in communication with the context detector. The message is conveyed to the user through a speech synthesis system of the automatic dialog system, in communication with the natural language generator and the context detector. Responsive to a detected level of ambient noise, the context detector provides at least one command in a markup language to cause the natural language generator to create the message using maximally intelligible words and to cause the speech synthesis system to convey the message with increased volume and decreased speed.
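
    The noise-triggered markup command can be pictured with an SSML-style wrapper. The abstract does not name the markup language, so the use of SSML's `prosody` element here is an assumption, as are the 70 dB threshold and the function name `render_message`. The sketch covers only the volume and speed adjustment, not the selection of maximally intelligible words.

```python
from xml.sax.saxutils import escape

def render_message(text, ambient_noise_db, noisy_threshold_db=70.0):
    """Wrap a message in SSML-style markup, louder and slower when it is noisy.

    The threshold and the choice of SSML are illustrative assumptions.
    """
    body = escape(text)  # protect &, < and > inside the markup
    if ambient_noise_db >= noisy_threshold_db:
        body = f'<prosody volume="loud" rate="slow">{body}</prosody>'
    return f"<speak>{body}</speak>"
```

    In a quiet setting the message passes through unchanged apart from the `speak` wrapper; above the threshold, a synthesizer that honors the markup would raise the volume and lower the speaking rate.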
