Speech dialogue system for realizing improved communication between user
and system
    1.
    发明授权
    Speech dialogue system for realizing improved communication between user and system 失效
    语音对话系统,用于实现用户与系统之间的通信

    公开(公告)号:US5548681A

    公开(公告)日:1996-08-20

    申请号:US929106

    申请日:1992-08-13

    CPC分类号: G10L15/22 G10L2021/02087

    摘要: In the system, a speech input uttered by a human is received by a microphone which outputs microphone output signals. The speech input received by the microphone is then recognized by a speech recognition unit, and a synthetic speech response appropriate for the speech input recognized by the speech recognition unit is generated and outputted from a loudspeaker to the human. In recognizing the speech input, the speech recognition unit receives input signals in which the synthetic speech response, outputted from the loudspeaker and then received by the microphone, is cancelled from the microphone output signals.

    摘要翻译: 在该系统中,由输出麦克风输出信号的麦克风接收由人发出的语音输入。 由麦克风接收到的语音输入然后由语音识别单元识别,并且生成适合于由语音识别单元识别的语音输入的合成语音响应并从扬声器输出到人。 在识别语音输入时,语音识别单元接收输入信号,其中从扬声器输出然后由麦克风接收的合成语音响应从麦克风输出信号中消除。

    Method and system for microphone array input type speech recognition
using band-pass power distribution for sound source position/direction
estimation
    2.
    发明授权
    Method and system for microphone array input type speech recognition using band-pass power distribution for sound source position/direction estimation 失效
    用于声源位置/方向估计的带通功率分配的麦克风阵列输入型语音识别的方法和系统

    公开(公告)号:US06009396A

    公开(公告)日:1999-12-28

    申请号:US818672

    申请日:1997-03-14

    申请人: Yoshifumi Nagata

    发明人: Yoshifumi Nagata

    CPC分类号: G10L15/26 G10L2021/02166

    摘要: A microphone array input type speech recognition scheme capable of realizing a high precision sound source position or direction estimation by a small amount of calculations, and thereby realizing a high precision speech recognition. A band-pass waveform, which is a waveform for each frequency bandwidth, is obtained from input signals of the microphone array, and a band-pass power of the sound source is directly obtained from the band-pass waveform. Then, the obtained band-pass power is used as the speech parameter. It is also possible to realize the sound source estimation and the band-pass power estimation at high precision while further reducing an amount of calculations, by utilizing a sound source position search processing in which a low resolution position estimation and a high resolution position estimation are combined.

    摘要翻译: 一种能够通过少量计算实现高精度声源位置或方向估计的麦克风阵列输入型语音识别方案,从而实现高精度语音识别。 从麦克风阵列的输入信号获得作为每个频率带宽的波形的带通波形,并且从带通波形直接获得声源的带通功率。 然后,将获得的带通功率用作语音参数。 还可以通过利用低分辨率位置估计和高分辨率位置估计的声源位置搜索处理,以高精度实现声源估计和带通功率估计,同时进一步减少计算量 结合在一起

    Speech dialogue system for facilitating improved human-computer
interaction
    3.
    发明授权
    Speech dialogue system for facilitating improved human-computer interaction 失效
    语音对话系统,促进改善人机交互

    公开(公告)号:US5577165A

    公开(公告)日:1996-11-19

    申请号:US312541

    申请日:1994-09-26

    CPC分类号: G06F3/16 G10L15/26

    摘要: A speech dialogue system capable of realizing natural and smooth dialogue between the system and a human user, and easy maneuverability of the system. In the system, a semantic content of input speech from a user is understood and a semantic content determination of a response output is made according to the understood semantic content of the input speech. Then, a speech response and a visual response according to the determined response output are generated and outputted to the user. The dialogue between the system and the user is managed by controlling transitions between user states during which the input speech is to be entered and system states during which the system response is to be outputted. The understanding of a semantic content of input speech from a user is made by detecting keywords in the input speech, with the keywords to be detected in the input speech limited in advance, according to a state of a dialogue between the user and the system.

    摘要翻译: 一种语音对话系统,能够实现系统和人类用户之间的自然而平稳的对话,并且系统的易操作性。 在系统中,理解来自用户的输入语音的语义内容,并且根据输入语音的理解语义内容进行响应输出的语义内容确定。 然后,产生根据所确定的响应输出的语音响应和视觉响应并向用户输出。 通过控制要输入输入语音的用户状态之间的转换以及要输出系统响应的系统状态来管理系统和用户之间的对话。 根据用户和系统之间的对话状态,通过检测输入语音中的关键字,使输入语音中要检测的关键字被预先限制,从而进行对用户输入语音的语义内容的理解。

    Speech dialogue system for facilitating improved human-computer
interaction
    4.
    发明授权
    Speech dialogue system for facilitating improved human-computer interaction 失效
    语音对话系统,促进改善人机交互

    公开(公告)号:US5357596A

    公开(公告)日:1994-10-18

    申请号:US978521

    申请日:1992-11-18

    CPC分类号: G06F3/16 G10L15/26

    摘要: A speech dialogue system capable of realizing natural and smooth dialogue between the system and a human user, and easy maneuverability of the system. In the system, a semantic content of input speech from a user is understood and a semantic content determination of a response output is made according to the understood semantic content of the input speech. Then, a speech response and a visual response according to the determined response output are generated and outputted to the user. The dialogue between the system and the user is managed by controlling transitions between user states during which the input speech is to be entered and system states during which the system response is to be outputted. The understanding of a semantic content of input speech from a user is made by detecting keywords in the input speech, with the keywords to be detected in the input speech limited in advance, according to a state of a dialogue between the user and the system.

    摘要翻译: 一种语音对话系统,能够实现系统和人类用户之间的自然而平稳的对话,并且系统的易操作性。 在系统中,理解来自用户的输入语音的语义内容,并且根据输入语音的理解语义内容进行响应输出的语义内容确定。 然后,产生根据所确定的响应输出的语音响应和视觉响应并向用户输出。 通过控制要输入输入语音的用户状态之间的转换以及要输出系统响应的系统状态来管理系统和用户之间的对话。 根据用户和系统之间的对话状态,通过检测输入语音中的关键字,使输入语音中要检测的关键字被预先限制,从而进行对用户输入语音的语义内容的理解。

    Apparatus and method for correcting the difference in frequency
characteristics between microphones for analyzing speech and for
creating a recognition dictionary
    5.
    发明授权
    Apparatus and method for correcting the difference in frequency characteristics between microphones for analyzing speech and for creating a recognition dictionary 失效
    用于校正用于分析语音和用于创建识别词典的麦克风之间的频率特性差异的装置和方法

    公开(公告)号:US6032115A

    公开(公告)日:2000-02-29

    申请号:US935082

    申请日:1997-09-26

    CPC分类号: G10L15/065 G10L15/20

    摘要: In sound recognition apparatus of the present invention, user's utterance or a sound provided by an output section using previously stored sound waveforms is simultaneously inputted through a basic microphone of known frequency characteristics and an input microphone of unknown frequency characteristics. An analysis section respectively analyzes the frequency of the input speech through the basic microphone and the input microphone. A frequency characteristics calculation section calculates first difference data between the frequencies of the input speech of the basic microphone and the input microphone, and calculates frequency characteristics of the input microphone according to the first difference data and the frequency characteristics of the basic microphone. A frequency characteristics correction section calculates second difference data between the frequency characteristics of the input microphone and known frequency characteristics of a dictionary data microphone, and corrects input speech to be recognized through the input microphone as speech data of the frequency characteristics of the dictionary data microphone according to the second difference data. A recognition section recognizes corrected speech data by referring to a recognition dictionary storing data previously created through the dictionary data microphone.

    摘要翻译: 在本发明的声音识别装置中,通过已知频率特性的基本麦克风和未知频率特性的输入麦克风同时输入使用先前存储的声音波形的输出部分提供的用户发声或声音。 分析部分分别通过基本麦克风和输入麦克风分析输入语音的频率。 频率特性计算部分计算基本麦克风和输入麦克风的输入语音的频率之间的第一差分数据,并根据第一差分数据和基本麦克风的频率特性来计算输入麦克风的频率特性。 频率特性校正部分计算输入麦克风的频率特性与字典数据麦克风的已知频率特性之间的第二差分数据,并通过输入麦克风校正要识别的输入语音作为词典数据麦克风的频率特性的语音数据 根据第二个差异数据。 识别部分通过参考存储先前通过字典数据麦克风创建的数据的识别字典识别校正的语音数据。

    Apparatus for detecting position of object capable of simultaneously
detecting plural objects and detection method therefor
    6.
    发明授权
    Apparatus for detecting position of object capable of simultaneously detecting plural objects and detection method therefor 失效
    用于检测能够同时检测多个物体的物体的位置的装置及其检测方法

    公开(公告)号:US6157403A

    公开(公告)日:2000-12-05

    申请号:US905387

    申请日:1997-08-04

    申请人: Yoshifumi Nagata

    发明人: Yoshifumi Nagata

    IPC分类号: G01S15/46 G10L21/02 H04N5/225

    摘要: An apparatus for detecting a position of an object, including a signal output portion for generating a predetermined signal to radiate the signal into a space toward an arbitrary object, a signal input portion having a plurality of sensors for individually receiving signals reflected from the object, an impulse response calculating portion for obtaining an impulse response for each sensor in accordance with the signal radiated from the signal output portion and the signals received by the plural sensors, and an object position estimating portion for calculating the weight of a virtual position determined at an arbitrary point on the assumption that the signal radiated to the space by the signal output portion is reflected by the virtual position in such a manner that transmission time required for the signal to reach the signal input portion is measured and the components of each impulse response calculated in accordance with the transmission time are used to calculate the weight and calculating the weight while shifting the virtual position to estimate a virtual position, at which the weight exceeds a predetermined threshold value, to be the position of the object.

    摘要翻译: 一种用于检测物体的位置的装置,包括用于产生预定信号的信号输出部分,以将信号辐射到朝向任意物体的空间中;具有多个传感器的信号输入部分,用于分别接收从物体反射的信号, 脉冲响应计算部分,用于根据从信号输出部分辐射的信号和由多个传感器接收的信号获得每个传感器的脉冲响应;以及对象位置估计部分,用于计算在一个 假设通过信号输出部分辐射到空间的信号被虚拟位置反射,使得测量信号到达信号输入部分所需的传输时间并且计算每个脉冲响应的分量的任意点 按照传输时间用于计算重量和c 在移动虚拟位置时计算重量,以估计重量超过预定阈值的虚拟位置作为对象的位置。

    Speech recognition interface system suitable for window systems and
speech mail systems
    7.
    发明授权
    Speech recognition interface system suitable for window systems and speech mail systems 失效
    语音识别接口系统适用于窗口系统和语音邮件系统

    公开(公告)号:US5632002A

    公开(公告)日:1997-05-20

    申请号:US178731

    申请日:1993-12-28

    IPC分类号: G06F3/16 H04M3/533 G10L5/06

    摘要: A speech recognition interface system capable of handling a plurality of application programs simultaneously, and realizing convenient speech input and output modes which are suitable for the applications in the window systems and the speech mail systems. The system includes a speech recognition unit for carrying out a speech recognition processing for a speech input made by a user to obtain a recognition result; a program management table for managing program management data indicating a speech recognition interface function required by each application program; and a message processing unit for exchanging messages with the plurality of application programs in order to specify an appropriate recognition vocabulary to be used in the speech recognition processing of the speech input to the speech recognition unit, and to transmit the recognition result for the speech input obtained by the speech recognition unit by using the appropriate recognition vocabulary to appropriate ones of the plurality of application programs, according to the program management data managed by the program management table.

    摘要翻译: 一种能够同时处理多个应用程序的语音识别接口系统,并且实现了适合于窗口系统和语音邮件系统中的应用的方便的语音输入和输出模式。 该系统包括语音识别单元,用于对由用户进行的语音输入执行语音识别处理以获得识别结果; 用于管理指示每个应用程序所需的语音识别接口功能的程序管理数据的程序管理表; 以及消息处理单元,用于与多个应用程序交换消息,以便指定要在语音识别单元输入的语音的语音识别处理中使用的适当的识别词汇,并且发送用于语音输入的识别结果 根据由程序管理表管理的程序管理数据,由语音识别单元通过使用适当的识别词汇对多个应用程序中的适当的应用程序进行获取。