Portable acoustic interface for remote access to automatic
speech/speaker recognition server
    11.
    发明授权
    Portable acoustic interface for remote access to automatic speech/speaker recognition server 失效
    便携式声学接口,用于远程访问自动语音/扬声器识别服务器

    公开(公告)号:US5953700A

    公开(公告)日:1999-09-14

    申请号:US873079

    申请日:1997-06-11

    CPC分类号: G10L15/30

    摘要: A portable acoustic signal (speech signal) preprocessing (SSP) device for accessing an automatic speech/speaker recognition (ASSR) server comprises a microphone for converting sound including speech, silence and background noise signals to analog signals; an analog signals to digital converter for converting the analog signals to digital signals; a digital signal processor (DSP) for generating feature vector data representing the digitized speech and silence/background noise, and for generating channel characterization signals; and an acoustic coupler for converting the feature vector data and the characterization signals to acoustic signals and coupling the acoustic signals to a communication channel to access the ASSR server to perform speech and speaker recognition at a remote location. The SSP device may also be configured to compress and encrypt data transmitted to the ASSR server via the DSP and encryption keys stored in a memory. The ASSR server receives the preprocessed acoustic signals to perform speech/speaker recognition by setting references, selecting appropriate decoding models and algorithms to decode the acoustic signals by modeling the channel transfer function from the channel characterization signals and processing the silence/background noise data to reduce word error rate for speech recognition and to perform accurate speaker recognition. A client/server system having the portable SSP device and the ASSR server can be used to remotely activate, reset, or change personal identification numbers (PINs) or user passwords for smartcards, magnetic cards, or electronic money cards.

    摘要翻译: 用于访问自动语音/说话人识别(ASSR)服务器的便携式声音信号(语音信号)预处理(SSP)设备包括用于将包括语音,静音和背景噪声信号的声音转换为模拟信号的麦克风; 模拟信号到数字转换器,用于将模拟信号转换成数字信号; 数字信号处理器(DSP),用于产生表示数字化语音和静音/背景噪声的特征向量数据,并用于产生信道表征信号; 以及用于将特征矢量数据和表征信号转换为声信号并将声信号耦合到通信信道以访问ASSR服务器以在远程位置执行语音和说话者识别的声耦合器。 SSP设备还可以被配置为经由DSP和存储在存储器中的加密密钥来压缩和加密发送到ASSR服务器的数据。 ASSR服务器通过设置参考来接收预处理的声信号以执行语音/说话者识别,通过从信道表征信号建模信道传递函数并处理静音/背景噪声数据来减少声音信号,选择适当的解码模型和算法来解码声信号 用于语音识别的字错误率并执行准确的说话者识别。 具有便携式SSP设备和ASSR服务器的客户端/服务器系统可用于远程激活,重置或更改智能卡,磁卡或电子货币卡的个人识别号码(PIN)或用户密码。

    Methods and Apparatus for Receiving Data in a Packet Network
    12.
    发明申请
    Methods and Apparatus for Receiving Data in a Packet Network 审中-公开
    用于在分组网络中接收数据的方法和装置

    公开(公告)号:US20080225845A1

    公开(公告)日:2008-09-18

    申请号:US12127943

    申请日:2008-05-28

    IPC分类号: H04L12/56

    CPC分类号: H04L45/00 H04L63/0861

    摘要: Methods and apparatus are disclosed for transmitting data, such as biometric data or Internet telephone data, in a packet network. Packets are split and interchanged prior to transmission across a packet network, such that packets that teach their destination may be processed, even in the presence of lost or delayed packets. Packets of biometric data, such as fingerprints, retinal scans or voice characteristics, or sampled voice packets are split, and optionally interchanged prior to transmission. If some packets are lost or delayed, while some of the packets reach their destination and provide sufficient data for user identification, then the user may be authenticated without requesting the retransmission of the lost or delayed data. If some packets are lost or delayed, while some packets teach their destination, then the received speech samples may be reproduced without requesting the retransmission of the lost or delayed data.

    摘要翻译: 公开了用于在分组网络中发送诸如生物特征数据或因特网电话数据的数据的方法和装置。 在分组网络传输之前,分组和交换分组,使得即使在存在丢失或延迟的分组的情况下也可以处理教导其目的地的分组。 诸如指纹,视网膜扫描或语音特征或采样的语音分组之类的生物特征数据包被分割,并且可选地在传输之前互换。 如果一些数据包丢失或延迟,而一些数据包到达其目的地并提供足够的数据用于用户标识,则可以对用户进行认证,而不需要重传丢失或延迟的数据。 如果一些分组丢失或延迟,而一些分组教导其目的地,则可以再现所接收的语音样本,而不需要重传丢失或延迟的数据。

    Conversational data mining
    13.
    发明授权
    Conversational data mining 有权
    会话数据挖掘

    公开(公告)号:US06665644B1

    公开(公告)日:2003-12-16

    申请号:US09371400

    申请日:1999-08-10

    IPC分类号: G10L1500

    摘要: A method for collecting data associated with the voice of a voice system user includes conducting a plurality of conversations with a plurality of voice system users. For each conversation, a speech waveform is captured and digitized, and at least one acoustic feature is extracted. The features are correlated with at least one attribute such as gender, age, accent, native language, dialect, socioeconomic classification, educational level and emotional state. Attribute data and at least one identifying indicia are stored for each user in a data warehouse, in a form to facilitate subsequent data mining thereon. The resulting collection of stored data is then mined to provide information for modifying underlying business logic of the voice system. An apparatus suitable for carrying out the method includes a dialog management unit, an audio capture module, an acoustic from end, a processing module and a data warehouse. Appropriate method steps can be implemented by a digital computer running a suitable program stored on a program storage device.

    摘要翻译: 用于收集与语音系统用户的语音相关联的数据的方法包括与多个语音系统用户进行多个对话。 对于每个会话,语音波形被捕获并数字化,并且提取至少一个声学特征。 这些特征与至少一个属性(如性别,年龄,口音,母语,方言,社会经济分类,教育水平和情绪状态)相关联。 以数据仓库中的每个用户存储属性数据和至少一个识别标记,以便于其后面的数据挖掘。 然后,所得到的存储数据集合被开采以提供用于修改语音系统的底层业务逻辑的信息。 适用于执行该方法的装置包括对话管理单元,音频捕获模块,来自端部的声音,处理模块和数据仓库。 可以通过运行存储在程序存储设备上的合适程序的数字计算机来实现适当的方法步骤。

    Visor mounting of microphone for vehicle operator
    15.
    发明授权
    Visor mounting of microphone for vehicle operator 失效
    用于车辆操作人员的麦克风安装

    公开(公告)号:US06345103B1

    公开(公告)日:2002-02-05

    申请号:US09239357

    申请日:1999-01-28

    IPC分类号: H04R1104

    CPC分类号: H04R11/04

    摘要: A structural means is provided that positions an operator's voice communication microphone in a vehicle in the vicinity of the visor without interfering with the movement and functions of the visor. The positioning being achieved by attaching a portion of a microphone holder in connection with an escutcheon- type plate that is part of the visor retention and the visor support member and attaching the microphone to another portion of the microphone holder so as to extend the microphone to a position above the visor when the visor is in the stored position.

    摘要翻译: 提供了一种结构装置,其将操作员的语音通信麦克风定位在护目镜附近的车辆中,而不会妨碍遮阳板的运动和功能。 通过将麦克风保持器的一部分连接到作为遮阳板保持件和面罩支撑构件的一部分的孔罩型板并将麦克风附接到麦克风保持器的另一部分以便将麦克风延伸到 当遮阳板处于储存位置时,遮阳板上方的位置。

    Portable information and transaction processing system and method
utilizing biometric authorization and digital certificate security

    公开(公告)号:US06016476A

    公开(公告)日:2000-01-18

    申请号:US8122

    申请日:1998-01-16

    摘要: The present invention is a portable client PDA with a touch screen or other equivalent user interface and having a microphone and local central processing unit (CPU) for processing voice commands and for processing biometric data to provide user verification. The PDA also includes a memory for storing financial and personal information of the user and I/O capability for reading and writing information to various cards such as smartcards, magnetic cards, optical cards or EAROM cards. The PDA includes a Universal Card, which is common generic smartcard with a unique imprint provided by a service provider, on which selected financial or personal information stored in the PDA can be downloaded to perform certain consumer transactions. The PDA includes a modem, a serial port and/or a parallel port so as to provide direct communication capability with peripheral devices (such as POS and ATM terminals) and is capable of transmitting or receiving information through wireless communications such as radio frequency (RF) and infrared (IR) communication. The present invention is preferably operated in two modes, i.e., a client/server mode and a local mode. The client/server mode is periodically performed to download a temporary digital certificate (which is necessary to access selected information stored in the PDA and to write such information to the Universal Card) from a central server of the service provider of the PDA and Universal Card. Next, the local mode of operation is performed by providing the PDA with biometric data and selecting one of the pre-enrolled credit cards that are stored in the PDA. Upon biometric verification, the Universal Card is written with the selected card information, which is then used to initiate a consumer transaction. In the absence of an unexpired digital certificate, however, the selected card information will not be written to the Universal Card, notwithstanding that the user may have passed local biometric verification.

    Corporate voice dialing with shared directories
    17.
    发明授权
    Corporate voice dialing with shared directories 失效
    公司语音拨号与共享目录

    公开(公告)号:US5924070A

    公开(公告)日:1999-07-13

    申请号:US870373

    申请日:1997-06-06

    摘要: Voice-controlled customized commands including customization of the command to be preformed, such as a number to be dialed to make a connection with an address of a corporate voice dialing system, and the speech pattern or utterance which may be enrolled by a user to invoke the command can be used by other users, if authorized by the enrolling user. When a current user wants to use a customized command enrolled by another user, a preferably voice actuated command is invoked to cause the search of a database containing a page of customized commands for each user and the return of commands to which access of a current user is authorized in accordance with aliases established by the enrolling user. The returned commands are preferably presented to the current user as a menu from which the current user can make a selection and obtain execution of the authorized command.

    摘要翻译: 语音控制的定制命令,包括定制要执行的命令,例如要拨打的号码以与公司语音拨号系统的地址进行连接,以及可由用户注册的语音模式或话语 该命令可由其他用户使用,如果由注册用户授权。 当当前用户希望使用由另一用户登记的定制命令时,调用优选语音激活命令,以引起对包含每个用户的定制命令页面的数据库的搜索,以及返回当前用户的访问权限 根据登记用户建立的别名进行授权。 返回的命令优选地作为当前用户可以进行选择并获得授权命令的执行的菜单呈现给当前用户。

    Method and apparatus for processing information signals based on content
    18.
    发明授权
    Method and apparatus for processing information signals based on content 有权
    基于内容处理信息信号的方法和装置

    公开(公告)号:US07092496B1

    公开(公告)日:2006-08-15

    申请号:US09664300

    申请日:2000-09-18

    IPC分类号: H04M1/652

    摘要: Methods and apparatus are provided for processing an information signal containing content presented in accordance with at least one modality. In one aspect of the present invention, a method of processing an information signal containing content presented in accordance with at least one modality, comprises the steps of: (i) obtaining the information signal; (ii) performing content detection on the information signal to detect whether the information signal includes particular content presented in accordance with the at least one modality; and (iii) generating a control signal, when the particular content is detected, for use in controlling a rendering property of the particular content and/or implementation of a specific action relating to the particular content. Various illustrative embodiments in the context of speech signal processing for use in voicemail and/or cellular phone applications are provided, as well as illustrative embodiments associated with the processing of multi-modal or multimedia information signals. Also, the present invention provides for storing selectively marked information, even in the absence of content detection, such that the information may be rendered and/or used at a later time. The invention also extends to processing of text-based and markup language-based signals, e.g., XML documents.

    摘要翻译: 提供了用于处理包含根据至少一种模态呈现的内容的信息信号的方法和装置。 在本发明的一个方面,一种处理包含根据至少一种模态呈现的内容的信息信号的方法包括以下步骤:(i)获得信息信号; (ii)对所述信息信号执行内容检测,以检测所述信息信号是否包括根据所述至少一种模式呈现的特定内容; 以及(iii)当检测到特定内容时,生成控制信号,以用于控制特定内容的呈现属性和/或与特定内容相关的特定动作的实现。 提供了在语音邮件和/或蜂窝电话应用中使用的语音信号处理的上下文中的各种说明性实施例,以及与多模式或多媒体信息信号的处理相关联的说明性实施例。 此外,本发明提供了即使在没有内容检测的情况下存储选择性标记的信息,使得可以在稍后时间呈现和/或使用该信息。 本发明还扩展到处理基于文本和标记语言的信号,例如XML文档。

    Methods and apparatus for correlating biometric attributes and biometric attribute production features
    19.
    发明授权
    Methods and apparatus for correlating biometric attributes and biometric attribute production features 有权
    将生物特征属性和生物特征属性生产特征相关联的方法和装置

    公开(公告)号:US06411933B1

    公开(公告)日:2002-06-25

    申请号:US09444684

    申请日:1999-11-22

    IPC分类号: G10L2100

    摘要: A method of validating production of a biometric attribute allegedly associated with a user comprises the following steps. A first signal is generated representing data associated with the biometric attribute allegedly received in association with the user. A second signal is also generated representing data associated with at least one feature detected in association with the production of the biometric attribute allegedly received from the user. Then, the first signal and the second signal are compared to determine a correlation level between the biometric attribute and the production feature, wherein the validation of the production of the biometric attribute depends on the correlation level. Accordingly, the invention serves to provide substantial assurance that the biometric attribute offered by the user has been physically generated by the user.

    摘要翻译: 据称与用户相关联的验证生物特征属性的生成的方法包括以下步骤。 生成第一信号,其表示与被认为与用户相关联地接收到的生物特征属性相关联的数据。 还生成第二信号,表示与据称从用户接收的生物特征属性的生成相关联地检测到的至少一个特征相关联的数据。 然后,比较第一信号和第二信号以确定生物特征属性和生产特征之间的相关性水平,其中生物特征属性生产的验证取决于相关级别。 因此,本发明用于提供用户提供的生物特征属性已经由用户物理地生成的实质性保证。

    Apparatus and methods for identifying homophones among words in a speech recognition system
    20.
    发明授权
    Apparatus and methods for identifying homophones among words in a speech recognition system 有权
    用于在语音识别系统中识别单词之间的同音词的装置和方法

    公开(公告)号:US06269335B1

    公开(公告)日:2001-07-31

    申请号:US09134261

    申请日:1998-08-14

    IPC分类号: G10L2100

    CPC分类号: G10L15/22

    摘要: A method of identifying homophones of a word uttered by a user from at least a portion of existing words of a vocabulary of a speech recognition engine comprises the steps of: a user uttering the word; decoding the uttered word; computing respective measures between the decoded word and at least a portion of the other existing vocabulary words, the respective measures indicative of acoustic similarity between the word and the at least a portion of other existing words; if at least one measure is within a threshold range, indicating, to the user, results associated with the at least one measure, the results preferably including the decoded word and the other existing vocabulary word associated with the at least one measure; and the user preferably making a selection depending on the word the user intended to utter.

    摘要翻译: 从语音识别引擎的词汇表的现有单词的至少一部分中识别用户发出的单词的同音词的方法包括以下步骤:用户说出该单词; 解码发音字; 计算解码字与至少一部分其他现有词汇词之间的相应度量,所述各个度量指示词与其他现有词的至少一部分之间的声学​​相似性; 如果至少一个度量在阈值范围内,则向用户指示与至少一个度量相关联的结果,结果优选地包括与所述至少一个度量相关联的解码词和其他现有词汇单; 并且用户优选地根据用户想要发出的词进行选择。