Method for goal-oriented speech translation in hand-held devices using meaning extraction and dialogue
    1.
    发明授权
    Method for goal-oriented speech translation in hand-held devices using meaning extraction and dialogue 有权
    使用意义提取和对话的手持设备中面向目标的语音翻译方法

    公开(公告)号:US06233561B1

    公开(公告)日:2001-05-15

    申请号:US09290628

    申请日:1999-04-12

    IPC分类号: G10L1522

    CPC分类号: G10L15/1822 G10L15/1815

    摘要: A computer-implemented method and apparatus is provided for processing a spoken request from a user. A speech recognizer converts the spoken request into a digital format. A frame data structure associates semantic components of the digitized spoken request with predetermined slots. The slots are indicative of data which are used to achieve a predetermined goal. A speech understanding module which is connected to the speech recognizer and to the frame data structure determines semantic components of the spoken request. The slots are populated based upon the determined semantic components. A dialog manager which is connected to the speech understanding module may determine at least one slot which is unpopulated based upon the determined semantic components and in a preferred embodiment may provide confirmation of the populated slots. A computer generated-request is formulated in order for the user to provide data related to the unpopulated slot. The method and apparatus are well-suited (but not limited) to use in a hand-held speech translation device.

    摘要翻译: 提供了一种用于处理来自用户的口头请求的计算机实现的方法和装置。 语音识别器将口头请求转换为数字格式。 帧数据结构将数字化语音请求的语义分量与预定时隙相关联。 这些时隙指示用于实现预定目标的数据。 连接到语音识别器和帧数据结构的语音理解模块确定语音请求的语义分量。 基于确定的语义分量来填充时隙。 连接到语音理解模块的对话管理器可以基于所确定的语义组件来确定未填充的至少一个时隙,并且在优选实施例中可以提供填充时隙的确认。 制定计算机生成请求以便用户提供与未填充槽相关的数据。 该方法和装置非常适合(但不限于)在手持语音翻译装置中使用。

    Speech recognition system employing multiple grammar networks
    2.
    发明授权
    Speech recognition system employing multiple grammar networks 失效
    语音识别系统采用多种语法网络

    公开(公告)号:US5991720A

    公开(公告)日:1999-11-23

    申请号:US834358

    申请日:1997-04-16

    摘要: The input speech is segmented using plural grammar networks, including a network that includes a filler model designed to represent noise or extraneous speech. Recognition processing results in plural lists of candidates, each list containing the N-best candidates generated. The lists are then separately aligned with the dictionary of valid names to generate two lists of valid names. The final recognition pass combines these two lists of names into a dynamic grammar and this dynamic grammar may be used to find the best candidate name using Viterbi recognition. A telephone call routing application based on the recognition system selects the best candidate name corresponding to the name spelled by the user, whether the user pronounces the name prior to spelling, or not.

    摘要翻译: 使用多个语法网络对输入语音进行分段,包括一个网络,其中包括一个设计用于表示噪声或无关语音的填充模型。 识别处理产生多个候选列表,每个列表包含生成的N个最佳候选。 然后将列表与有效名称的字典分开对齐,以生成两个有效名称列表。 最终识别通过将这两个名称列表组合成动态语法,并且可以使用该动态语法来使用维特比识别来找到最佳候选名。 基于识别系统的电话呼叫路由应用选择与用户拼写的名称相对应的最佳候选名称,用户是否在拼写之前发音名称。

    Call routing device employing continuous speech
    3.
    发明授权
    Call routing device employing continuous speech 失效
    呼叫路由设备采用连续语音

    公开(公告)号:US5799065A

    公开(公告)日:1998-08-25

    申请号:US642766

    申请日:1996-05-06

    摘要: The call routing device plugs into existing extensions of the office telephone network or PBX system and acts as a "virtual" operator, prompting incoming callers to spell the name of the desired recipient. The speech recognizer uses a multipass procedure employing Hidden Markov Models and dynamic programming. The N-best hypotheses are propagated between passes, allowing the more computationally costly routines to be reserved until the final pass, when the size of the search space is significantly reduced. The routing device prompts the user to confirm that the selected name is correct, whereupon the device signals the telephone network to automatically switch the incoming call to the telephone extension of the selected recipient.

    摘要翻译: 呼叫路由设备插入办公室电话网络或PBX系统的现有扩展,并充当“虚拟”运营商,提示来电者拼写所需接收者的名称。 语音识别器使用采用隐马尔可夫模型和动态规划的多通道程序。 N个最佳假设在传递之间传播,允许在搜索空间的大小显着减小时,保留更多的计算上昂贵的例程,直到最终通过。 路由设备提示用户确认所选择的名称是否正确,从而该设备发信号通知电话网络,以便将来电自动切换到所选收件人的电话分机。

    Method and apparatus using probabilistic language model based on confusable sets for speech recognition
    4.
    发明授权
    Method and apparatus using probabilistic language model based on confusable sets for speech recognition 失效
    基于混合语言识别的概率语言模型的方法和装置

    公开(公告)号:US06182039B2

    公开(公告)日:2001-01-30

    申请号:US09047274

    申请日:1998-03-24

    IPC分类号: G10L1504

    CPC分类号: G10L15/193 G10L15/197

    摘要: The speech recognizer incorporates a language model that reduces the number of acoustic pattern matching sequences that must be performed by the recognizer. The language model is based on knowledge of a pre-defined set of syntactically defined content and includes a data structure that organizes the content according to acoustic confusability. A spelled name recognition system based on the recognizer employs a language model based on classes of letters that the recognizer frequently confuses for one another. The language model data structure is optionally an N-gram data structure, a tree data structure, or an incrementally configured network that is built during a training sequence. The incrementally configured network has nodes that are selected based on acoustic distance from a predetermined lexicon.

    摘要翻译: 语音识别器结合了一种语言模型,其减少必须由识别器执行的声学模式匹配序列的数量。 语言模型基于预定义的一组语法定义的内容的知识,并且包括根据声学可混淆性来组织内容的数据结构。 基于识别器的拼写名识别系统采用基于识别器经常彼此混淆的字母类的语言模型。 语言模型数据结构可选地是在训练序列期间构建的N-gram数据结构,树数据结构或递增配置的网络。 增量配置的网络具有基于来自预定词典的声学距离来选择的节点。

    METHOD AND SYSTEM OF IDENTIFYING A USER OF A HANDHELD DEVICE
    5.
    发明申请
    METHOD AND SYSTEM OF IDENTIFYING A USER OF A HANDHELD DEVICE 审中-公开
    识别手持设备用户的方法和系统

    公开(公告)号:US20110043475A1

    公开(公告)日:2011-02-24

    申请号:US12988745

    申请日:2009-04-21

    IPC分类号: G06F3/041

    摘要: A system and method for identifying a user of a handheld device is herein disclosed. The device implementing the method and system may attempt to identify a user based on signals that are incidental to a user's handling of the device. The signals are generated by a variety of sensors dispersed along the periphery or within the housing. The sensors range may include touch sensors, inertial sensors, acoustic sensors, pulse oximiters, and a touchpad. Based on the sensors and corresponding signals, identification information is generated. The identification information is used to identify the user of the handheld device. The handheld device may implement various statistical learning and data mining techniques to increase the robustness of the system. The device may also authenticate the user based on the user drawing a circle, or other shape.

    摘要翻译: 本文公开了一种用于识别手持设备的用户的系统和方法。 实现该方法和系统的设备可以基于用户对设备的处理附带的信号来尝试识别用户。 信号由沿着周边或壳体内分散的各种传感器产生。 传感器范围可以包括触摸传感器,惯性传感器,声学传感器,脉冲嗅觉器和触摸板。 基于传感器和相应的信号,生成识别信息。 识别信息用于识别手持设备的用户。 手持设备可以实现各种统计学习和数据挖掘技术以增加系统的鲁棒性。 设备还可以基于用户绘制圆形或其他形状来认证用户。

    Factorization for generating a library of mouth shapes

    公开(公告)号:US07069214B2

    公开(公告)日:2006-06-27

    申请号:US10095813

    申请日:2002-03-12

    IPC分类号: G10L13/00 G10L21/06

    CPC分类号: G10L13/04 G10L2021/0135

    摘要: A library of mouth shapes is created by separating speaker-dependent and speaker independent variability. Preferably, speaker dependent variability is modeled by a speaker space while the speaker independent variability (i.e. context dependency), is modeled by a set of normalized mouth shapes that need be built only once. Given a small amount of data from a new speaker, it is possible to construct a corresponding mouth shape library by estimating a point in speaker space that maximizes the likelihood of adaptation data and by combining speaker dependent and speaker independent variability. Creation of talking heads is simplified because creation of a library of mouth shapes is enabled with only a few mouth shape instances. To build the speaker space, a context independent mouth shape parametric representation is obtained. Then a supervector containing the set of context-independent mouth shapes is formed for each speaker included in the speaker space. Dimensionality reduction is used to find the areas of the speaker space.

    Intelligent nurse robot
    7.
    发明申请
    Intelligent nurse robot 审中-公开
    智能护士机器人

    公开(公告)号:US20050154265A1

    公开(公告)日:2005-07-14

    申请号:US10755862

    申请日:2004-01-12

    摘要: A robotic nursing system for use with a patient comprises a nursing robot having at least one patient condition sensor, a transmitter, and a receiver mounted therein. A display device for displays data sensed by the patient condition sensor. The display device includes a receiver in communication with the nursing robot. The nursing robot senses patient physiological conditions using the patient condition sensor and transmits the physiological conditions to the display device using the transmitter. The display device then displays the physiological conditions for review by a user. One or another or both. The nursing robot also transmits the physiological conditions to a patient database for storage.

    摘要翻译: 一种与患者一起使用的机器人护理系统包括具有安装在其中的至少一个患者状况传感器,发射器和接收器的护理机器人。 一种用于由病人状况传感器感测的显示数据的显示装置。 显示装置包括与护理机器人通信的接收器。 护理机器人使用患者状况传感器来感测患者的生理状态,并使用发射器将生理条件发送到显示装置。 显示装置然后显示用于用户审查的生理条件。 一个或另一个或两者。 护理机器人还将生理条件发送到患者数据库以进行存储。

    Focused language models for improved speech input of structured documents
    8.
    发明授权
    Focused language models for improved speech input of structured documents 有权
    用于改进结构化文档语音输入的专注语言模型

    公开(公告)号:US06901364B2

    公开(公告)日:2005-05-31

    申请号:US09951093

    申请日:2001-09-13

    CPC分类号: G10L15/1815 G10L15/30

    摘要: An e-mail message process is provided for use with a personal digital assistant which allows for the use of input speech messaging which is converted to text using a focused language model which is downloaded by a cellular phone connection to an Internet server which provides the focused language model based upon a topic for the intended e-mail message. The text that is generated from the input speech method can be summarized by the e-mail message processor and can be edited by the user. The generated e-mail message can then be transmitted again via cellular connection to an Internet e-mail server for transmitting the e-mail message to a recipient.

    摘要翻译: 提供电子邮件消息处理以与个人数字助理一起使用,该个人数字助理允许使用输入语音消息传送,其使用由通过蜂窝电话连接下载的聚焦语言模型转换为文本,该互联网服务器提供聚焦 基于预期电子邮件的主题的语言模型。 从输入语音方法生成的文本可以由电子邮件消息处理器来总结,并且可以由用户编辑。 然后可以通过蜂窝连接再次将生成的电子邮件消息发送到Internet电子邮件服务器,以将电子邮件消息发送给接收者。

    Eigenvoice re-estimation technique of acoustic models for speech recognition, speaker identification and speaker verification
    9.
    发明授权
    Eigenvoice re-estimation technique of acoustic models for speech recognition, speaker identification and speaker verification 有权
    用于语音识别,扬声器识别和说话人验证的声学模型的本征语重新估计技术

    公开(公告)号:US06895376B2

    公开(公告)日:2005-05-17

    申请号:US09849174

    申请日:2001-05-04

    IPC分类号: G10L15/06 G10L17/00

    CPC分类号: G10L15/07 G10L17/02

    摘要: A reduced dimensionality eigenvoice analytical technique is used during training to develop context-dependent acoustic models for allophones. Re-estimation processes are performed to more strongly separate speaker-dependent and speaker-independent components of the speech model. The eigenvoice technique is also used during run time upon the speech of a new speaker. The technique removes individual speaker idiosyncrasies, to produce more universally applicable and robust allophone models. In one embodiment the eigenvoice technique is used to identify the centroid of each speaker, which may then be “subtracted out” of the recognition equation.

    摘要翻译: 在训练期间使用减小的维度本征语音分析技术来开发用于异音素的上下文相关的声学模型。 执行重新估计过程以更强烈地分离语音模型的与扬声器相关的和与扬声器无关的组件。 特定语音技术在运行时也用于新演讲者的演讲。 该技术可以消除单个扬声器的特性,从而产生更普遍适用和强大的异音模型。 在一个实施例中,本征语音技术用于识别每个说话者的质心,然后可以将其“减去”识别方程。

    Apparatus for efficient dispatch and selection of information in law enforcement applications
    10.
    发明授权
    Apparatus for efficient dispatch and selection of information in law enforcement applications 有权
    用于在执法应用程序中高效地发送和选择信息的装置

    公开(公告)号:US06571174B2

    公开(公告)日:2003-05-27

    申请号:US09929634

    申请日:2001-08-14

    IPC分类号: G01C2134

    摘要: A navigation apparatus is disclosed which may be used by law enforcement personnel for rapid intervention to a location while adding safety and reliability to the process. The apparatus includes a computer system, having an operating system, memory and a user interface. The system further includes a positioning system, such as a GPS system for determining the position of a vehicle. The positioning system communicates with the operating system. An information database, communicating with the operating system, contains data related to routing information concerning routes for travel by the vehicle. The routing information includes safety information concerning route safety in the traveling region accessible by the vehicle. The apparatus further includes a routing system in communication with the operating system that determines a route based at least in part on the routing information. Driving directions and call information are provided multi-modally to provide the officer with critical information in an efficient and timely fashion.

    摘要翻译: 公开了一种导航装置,其可以被执法人员用于对位置的快速干预,同时为该过程增加安全性和可靠性。 该装置包括具有操作系统,存储器和用户界面的计算机系统。 该系统还包括诸如用于确定车辆位置的GPS系统的定位系统。 定位系统与操作系统通信。 与操作系统通信的信息数据库包含与车辆行驶路线有关的路线信息的数据。 路线信息包括关于车辆可接近的行驶区域中的路线安全的安全信息。 该装置还包括与操作系统通信的路由系统,其至少部分地基于路由信息来确定路由。 驾驶方向和通话信息以多方式提供,以有效和及时的方式向官员提供关键信息。