Web-based voice dialog interface
    1.
    发明授权
    Web-based voice dialog interface 有权
    基于Web的语音对话界面

    公开(公告)号:US06604075B1

    公开(公告)日:2003-08-05

    申请号:US09524964

    申请日:2000-03-14

    IPC分类号: G10L1500

    摘要: A web-based voice dialog interface for use in communicating dialog information between a user at a client machine and one or more servers coupled to the client machine via the Internet or other computer network. The interface in an illustrative embodiment includes a web page interpreter for receiving information relating to one or more web pages. The web page interpreter generates a rendering of at least a portion of the information for presentation to a user in an audibly-perceptible format. A grammar processing device utilizes interpreted web page information received from the web page interpreter to generate syntax information and semantic information. A speech recognizer processes received user speech in accordance with the syntax information, and a natural language interpreter processes the resulting recognized speech in accordance with the semantics information to generate output for delivery to a web server in conjunction with a voice dialog which includes the user speech and the rendering of the web page(s). The output may be processed by a common gateway interface (CGI) formatter prior to delivery to a CGI associated with the web server.

    摘要翻译: 一种基于网络的语音对话界面,用于在客户端机器上的用户和经由因特网或其他计算机网络耦合到客户机器的一个或多个服务器之间传送对话信息。 说明性实施例中的界面包括用于接收与一个或多个网页有关的信息的网页解释器。 网页解释器产生用于以可听见的格式呈现给用户的信息的至少一部分的呈现。 语法处理装置利用从网页解释器接收到的解释性网页信息来生成语法信息和语义信息。 语音识别器根据语法信息处理接收到的用户语音,并且自然语言解释器根据语义信息处理所得到的识别语音,以便结合包括用户语音的语音对话来产生用于递送到web服务器的输出 以及网页的呈现。 输出可以在传送到与web服务器相关联的CGI之前由公共网关接口(CGI)格式器处理。

    Large vocabulary connected speech recognition system and method of
language representation using evolutional grammer to represent context
free grammars
    2.
    发明授权
    Large vocabulary connected speech recognition system and method of language representation using evolutional grammer to represent context free grammars 失效
    大词汇连接的语音识别系统和使用演化语法的语言表示方法来表示上下文无关语法

    公开(公告)号:US5719997A

    公开(公告)日:1998-02-17

    申请号:US697152

    申请日:1996-08-20

    摘要: A method of recognizing speech input selectively creates and maintains grammar representations of the speech input in essentially real time. Speech input frames are received by a speech recognition system. Grammar representations are created for each speech frame and a probability score is derived for the representations indicating the probability of the accuracy of the representations to the speech input. Representations having a probability score below a predetermined threshold are not maintained. Those grammar representations having probability scores above the predetermined threshold are maintained. As more speech frames are received by the system, additional grammar representations are created and the probability scores are updated. When the entire speech input has been received, the chain of grammar representations having the highest probability score is identified as the speech input.

    摘要翻译: 识别语音输入的方法选择性地以基本上实时的方式创建和维护语音输入的语法表示。 语音输入帧由语音识别系统接收。 为每个语音帧创建语法表示,并且为表示语音输入的表示的精度的概率的表示导出概率分数。 不保持概率分数低于预定阈值的表示。 保持具有高于预定阈值的概率分数的那些语法表示。 随着系统接收到更多的语音帧,创建了附加的语法表示,并且更新了概率得分。 当接收到整个语音输入时,将具有最高概率得分的语法表示链识别为语音输入。

    Acoustic speech recognizer system and method
    3.
    发明授权
    Acoustic speech recognizer system and method 有权
    声学语音识别系统及方法

    公开(公告)号:US06574601B1

    公开(公告)日:2003-06-03

    申请号:US09229809

    申请日:1999-01-13

    IPC分类号: G10L1520

    CPC分类号: G10L15/22

    摘要: An adaptive endpointer system and method are used in speech recognition applications, such as telephone-based Internet browsers, to determine barge-in events during the processing of speech. The endpointer system includes a signal energy level estimator for estimating signal levels in speech data; a noise energy level estimator for estimating noise levels in the speech data; and a barge-in detector for increasing a threshold used in comparing the signal levels and the noise levels to detect the barge-in event in the speech data corresponding to a speech prompt during speech recognition.

    摘要翻译: 在诸如基于电话的因特网浏览器的语音识别应用中使用自适应终端系统和方法来确定在语音处理期间的插入事件。 终端系统包括用于估计语音数据中的信号电平的信号能级估计器; 用于估计语音数据中的噪声电平的噪声能级估计器; 以及用于在语音识别期间增加用于比较信号电平和噪声电平以用于检测对应于语音提示的语音数据中的插入事件的阈值的插入检测器。

    Large vocabulary connected speech recognition system and method of
language representation using evolutional grammar to represent context
free grammars
    4.
    发明授权
    Large vocabulary connected speech recognition system and method of language representation using evolutional grammar to represent context free grammars 失效
    大词汇连接的语音识别系统和使用演化语法的语言表示方法来表示上下文无关语法

    公开(公告)号:US5907634A

    公开(公告)日:1999-05-25

    申请号:US846014

    申请日:1997-04-25

    摘要: A method of recognizing speech input selectively creates and maintains grammar representations of the speech input in essentially real time. Speech input frames are received by a speech recognition system. Grammar representations are created for each speech frame and a probability score is derived for the representations indicating the probability of the accuracy of the representations to the speech input. Representations having a probability score below a predetermined threshold are not maintained. Those grammar representations having probability scores above the predetermined threshold are maintained. As more speech frames are received by the system, additional grammar representations are created and the probability scores are updated. When the entire speech input has been received, the chain of grammar representations having the highest probability score is identified as the speech input.

    摘要翻译: 识别语音输入的方法选择性地以基本上实时的方式创建和维护语音输入的语法表示。 语音输入帧由语音识别系统接收。 为每个语音帧创建语法表示,并且为表示语音输入的表示的精度的概率的表示导出概率分数。 不保持概率分数低于预定阈值的表示。 保持具有高于预定阈值的概率分数的那些语法表示。 随着系统接收到更多的语音帧,创建了附加的语法表示,并且更新了概率得分。 当接收到整个语音输入时,将具有最高概率得分的语法表示链识别为语音输入。

    Methods and systems for performing handwriting recognition from raw
graphical image data
    5.
    发明授权
    Methods and systems for performing handwriting recognition from raw graphical image data 失效
    从原始图形图像数据执行手写识别的方法和系统

    公开(公告)号:US5875256A

    公开(公告)日:1999-02-23

    申请号:US691995

    申请日:1996-08-02

    摘要: Methods and systems for performing handwriting recognition which include, in part, application of stochastic modeling techniques in conjunction with language modeling. Handwriting recognition is performed on a received data set, which is representative of a handwriting sample comprised of one or more symbols. Recognition is performed by selectively segmenting the data set into one or more strokes utilizing an evolution grammar for identifying each one of the strokes among one or more alternatives. Each one of the strokes represents a segment of the handwriting sample. The identified strokes are evaluated as a stroke sequence, representative of one or more of the handwriting sample's symbols, to identify the handwriting sample.

    摘要翻译: 用于执行手写识别的方法和系统,其部分地包括结合语言建模的随机建模技术的应用。 对接收到的数据集执行手写识别,该数据集代表由一个或多个符号组成的手写样本。 通过使用用于识别一个或多个替代方案中的每个笔划的演进语法来选择性地将数据集分成一个或多个笔划来执行识别。 每个笔画都代表手写样本的一段。 所识别的笔画被评估为表示手写样本的一个或多个符号的笔画序列,以识别手写样本。

    Large vocabulary connected speech recognition system and method of
language representation using evolutional grammar to represent context
free grammars
    6.
    发明授权
    Large vocabulary connected speech recognition system and method of language representation using evolutional grammar to represent context free grammars 失效
    大词汇连接的语音识别系统和使用演化语法的语言表示方法来表示上下文无关语法

    公开(公告)号:US5699456A

    公开(公告)日:1997-12-16

    申请号:US184811

    申请日:1994-01-21

    摘要: A method of recognizing speech input selectively creates and maintains grammar representations of the speech input in essentially real time. Speech input frames received by a speech recognition system. Grammar representations are created for each speech frame and a probability score is derived for the representations indicating the probability of the accuracy of the representations to the speech input. Representations having a probability score below a predetermined threshold are not maintained. Those grammar representations having probability scores above predetermined threshold are maintained. As more speech frames are received by the system, additional grammar representations are created and the probability scores are updated. When the entire speech input has been received, the chain of grammar representations having the highest probability score is identified as the speech input.

    摘要翻译: 识别语音输入的方法选择性地以基本上实时的方式创建和维护语音输入的语法表示。 由语音识别系统接收的语音输入帧。 为每个语音帧创建语法表示,并且为表示语音输入的表示的精度的概率的表示导出概率分数。 不保持概率分数低于预定阈值的表示。 保持具有高于预定阈值的概率分数的那些语法表示。 随着系统接收到更多的语音帧,创建了附加的语法表示,并且更新了概率得分。 当接收到整个语音输入时,将具有最高概率得分的语法表示链识别为语音输入。