Speech recognition interface system suitable for window systems and
speech mail systems
    2.
    发明授权
    Speech recognition interface system suitable for window systems and speech mail systems 失效
    语音识别接口系统适用于窗口系统和语音邮件系统

    公开(公告)号:US5632002A

    公开(公告)日:1997-05-20

    申请号:US178731

    申请日:1993-12-28

    IPC分类号: G06F3/16 H04M3/533 G10L5/06

    摘要: A speech recognition interface system capable of handling a plurality of application programs simultaneously, and realizing convenient speech input and output modes which are suitable for the applications in the window systems and the speech mail systems. The system includes a speech recognition unit for carrying out a speech recognition processing for a speech input made by a user to obtain a recognition result; a program management table for managing program management data indicating a speech recognition interface function required by each application program; and a message processing unit for exchanging messages with the plurality of application programs in order to specify an appropriate recognition vocabulary to be used in the speech recognition processing of the speech input to the speech recognition unit, and to transmit the recognition result for the speech input obtained by the speech recognition unit by using the appropriate recognition vocabulary to appropriate ones of the plurality of application programs, according to the program management data managed by the program management table.

    摘要翻译: 一种能够同时处理多个应用程序的语音识别接口系统,并且实现了适合于窗口系统和语音邮件系统中的应用的方便的语音输入和输出模式。 该系统包括语音识别单元,用于对由用户进行的语音输入执行语音识别处理以获得识别结果; 用于管理指示每个应用程序所需的语音识别接口功能的程序管理数据的程序管理表; 以及消息处理单元,用于与多个应用程序交换消息,以便指定要在语音识别单元输入的语音的语音识别处理中使用的适当的识别词汇,并且发送用于语音输入的识别结果 根据由程序管理表管理的程序管理数据,由语音识别单元通过使用适当的识别词汇对多个应用程序中的适当的应用程序进行获取。

    Speech dialogue system for facilitating improved human-computer
interaction
    3.
    发明授权
    Speech dialogue system for facilitating improved human-computer interaction 失效
    语音对话系统,促进改善人机交互

    公开(公告)号:US5577165A

    公开(公告)日:1996-11-19

    申请号:US312541

    申请日:1994-09-26

    CPC分类号: G06F3/16 G10L15/26

    摘要: A speech dialogue system capable of realizing natural and smooth dialogue between the system and a human user, and easy maneuverability of the system. In the system, a semantic content of input speech from a user is understood and a semantic content determination of a response output is made according to the understood semantic content of the input speech. Then, a speech response and a visual response according to the determined response output are generated and outputted to the user. The dialogue between the system and the user is managed by controlling transitions between user states during which the input speech is to be entered and system states during which the system response is to be outputted. The understanding of a semantic content of input speech from a user is made by detecting keywords in the input speech, with the keywords to be detected in the input speech limited in advance, according to a state of a dialogue between the user and the system.

    摘要翻译: 一种语音对话系统,能够实现系统和人类用户之间的自然而平稳的对话,并且系统的易操作性。 在系统中,理解来自用户的输入语音的语义内容,并且根据输入语音的理解语义内容进行响应输出的语义内容确定。 然后,产生根据所确定的响应输出的语音响应和视觉响应并向用户输出。 通过控制要输入输入语音的用户状态之间的转换以及要输出系统响应的系统状态来管理系统和用户之间的对话。 根据用户和系统之间的对话状态,通过检测输入语音中的关键字,使输入语音中要检测的关键字被预先限制,从而进行对用户输入语音的语义内容的理解。

    Speech dialogue system for facilitating improved human-computer
interaction
    4.
    发明授权
    Speech dialogue system for facilitating improved human-computer interaction 失效
    语音对话系统,促进改善人机交互

    公开(公告)号:US5357596A

    公开(公告)日:1994-10-18

    申请号:US978521

    申请日:1992-11-18

    CPC分类号: G06F3/16 G10L15/26

    摘要: A speech dialogue system capable of realizing natural and smooth dialogue between the system and a human user, and easy maneuverability of the system. In the system, a semantic content of input speech from a user is understood and a semantic content determination of a response output is made according to the understood semantic content of the input speech. Then, a speech response and a visual response according to the determined response output are generated and outputted to the user. The dialogue between the system and the user is managed by controlling transitions between user states during which the input speech is to be entered and system states during which the system response is to be outputted. The understanding of a semantic content of input speech from a user is made by detecting keywords in the input speech, with the keywords to be detected in the input speech limited in advance, according to a state of a dialogue between the user and the system.

    摘要翻译: 一种语音对话系统,能够实现系统和人类用户之间的自然而平稳的对话,并且系统的易操作性。 在系统中,理解来自用户的输入语音的语义内容,并且根据输入语音的理解语义内容进行响应输出的语义内容确定。 然后,产生根据所确定的响应输出的语音响应和视觉响应并向用户输出。 通过控制要输入输入语音的用户状态之间的转换以及要输出系统响应的系统状态来管理系统和用户之间的对话。 根据用户和系统之间的对话状态,通过检测输入语音中的关键字,使输入语音中要检测的关键字被预先限制,从而进行对用户输入语音的语义内容的理解。

    Language processing system
    5.
    发明申请
    Language processing system 失效
    语言处理系统

    公开(公告)号:US20070055496A1

    公开(公告)日:2007-03-08

    申请号:US11508841

    申请日:2006-08-24

    申请人: Shigenobu Seto

    发明人: Shigenobu Seto

    IPC分类号: G06F17/27

    摘要: A language processing system including: a forbidden word memory part that stores a forbidden word; a sequence candidate generator that generates a plurality of word sequence candidates where each words are described separately from plain text; and a word sequence estimator that reads the forbidden word from the forbidden word memory part, excludes the word sequence candidate containing the forbidden word from the plurality of word sequence candidates, and selects an estimated word sequence with the highest concatenation possibility of the words from among the plurality of word sequence candidates.

    摘要翻译: 一种语言处理系统,包括:禁止字存储部分,存储禁止字; 序列候选生成器,其生成与纯文本分开地描述每个单词的多个单词序列候选; 以及从禁止字存储器部分读取禁止字的字序列估计器,从多个字序列候选中排除包含禁止字的字序列候选,并选择具有最高级联可能性的估计字序列, 多个单词序列候选。

    Clustered patterns for text-to-speech synthesis
    6.
    发明授权
    Clustered patterns for text-to-speech synthesis 有权
    文本到语音合成的聚类模式

    公开(公告)号:US06529874B2

    公开(公告)日:2003-03-04

    申请号:US09149036

    申请日:1998-09-08

    IPC分类号: G10L1308

    CPC分类号: G10L13/10

    摘要: A representative pattern memory stores a plurality of initial representative patterns as a noise pattern. Different attribute is affixed to each initial representative pattern. A pitch pattern memory stores a large number of natural pitch patterns as an accent phrase. A clustering unit classifies each natural pitch pattern to the initial representative pattern based on the attribute of the accent phrase. A transformation parameter generation unit calculates an error between a transformed representative pattern and each natural pitch pattern classified to the initial representative pattern. A representative pattern generation unit calculates an evaluation function of the sum of the error between the transformed-representative pattern and each natural pitch pattern classified to the initial representative pattern, and updates each initial representative pattern. The representative pattern memory stores each updated representative pattern as a clustered pattern of the attribute affixed to the corresponding initial representative pattern.

    摘要翻译: 代表性图案存储器将多个初始代表图案存储为噪声图案。 每个初始代表模式附加不同的属性。 音调模式存储器存储大量自然音高模式作为重音短语。 聚类单元基于重音短语的属性将每个自然音调模式分类为初始代表模式。 变换参数生成单元计算变换后的代表性图案与分类为初始代表图案的每个自然间距图案之间的误差。 代表图案生成单元计算变换代表图案与分类为初始代表图案的每个自然间距图案之间的误差之和的评估函数,并且更新每个初始代表图案。 代表性图案存储器将每个更新的代表图案存储为附加到对应的初始代表图案的属性的聚类图案。

    Language processing system
    7.
    发明授权
    Language processing system 失效
    语言处理系统

    公开(公告)号:US07917352B2

    公开(公告)日:2011-03-29

    申请号:US11508841

    申请日:2006-08-24

    申请人: Shigenobu Seto

    发明人: Shigenobu Seto

    IPC分类号: G06F17/20

    摘要: A language processing system including: a forbidden word memory part that stores a forbidden word; a sequence candidate generator that generates a plurality of word sequence candidates where each words are described separately from plain text; and a word sequence estimator that reads the forbidden word from the forbidden word memory part, excludes the word sequence candidate containing the forbidden word from the plurality of word sequence candidates, and selects an estimated word sequence with the highest concatenation possibility of the words from among the plurality of word sequence candidates.

    摘要翻译: 一种语言处理系统,包括:禁止字存储部分,存储禁止字; 序列候选生成器,其生成与纯文本分开地描述每个单词的多个单词序列候选; 以及从禁止字存储器部分读取禁止字的字序列估计器,从多个字序列候选中排除包含禁止字的字序列候选,并选择具有最高级联可能性的估计字序列, 多个单词序列候选。