Spatial sound conference system and method
    1.
    发明授权
    Spatial sound conference system and method 失效
    空间会议系统和方法

    公开(公告)号:US08170193B2

    公开(公告)日:2012-05-01

    申请号:US11355171

    申请日:2006-02-16

    IPC分类号: H04M3/42

    摘要: The spatial sound conference system enables participants in a teleconference to distinguish between speakers even during periods of interruption and overtalk, identify speakers based on spatial location cues, understand low volume speech, and block out background noise using spatial sound information. Spatial sound information may be captured using microphones positioned at the ear locations of a dummy head at a conference table, or spatial sound information may be added to a participant's monaural audio signal using head-related transfer functions. Head-related transfer functions simulate the frequency response of audio signals across the head from one ear to the other ear to create a spatial location for a sound. Spatial sound is transmitted across a communication channel, such as ISDN, and reproduced using spatially disposed loudspeakers positioned at the ears of a participant. By inserting a spatial sound component in a teleconference, a speaker other than the loudest speaker may be heard during periods of interruption and overtalk. Additionally, speakers may be more readily identified when they have a spatial sound position, and the perception of background noise is reduced.

    摘要翻译: 空间声音会议系统使得电话会议的参与者即使在中断和覆盖期间也能区分扬声器,基于空间位置提示识别扬声器,理解低音量语音并且使用空间声音信息来阻止背景噪声。 空间声音信息可以使用位于会议台的虚拟头部的耳朵位置处的麦克风来捕获,或者可以使用头部相关的传递函数将空间声音信息添加到参与者的单声道音频信号。 与头相关的传输功能模拟从一个耳朵到另一只耳朵的头部的音频信号的频率响应,以创建声音的空间位置。 空中声音通过诸如ISDN的通信信道传输,并且使用位于参与者的耳朵处的空间布置的扬声器进行再现。 通过在电话会议中插入空间声音分量,可以在中断和overtalk期间听到除最大扬声器之外的扬声器。 此外,当扬声器具有空间声音位置时,可以更容易地识别扬声器,并且降低背景噪声的感知。

    Spatial sound conference system and apparatus
    2.
    发明授权
    Spatial sound conference system and apparatus 失效
    空间声音会议系统和设备

    公开(公告)号:US07012630B2

    公开(公告)日:2006-03-14

    申请号:US08598457

    申请日:1996-02-08

    IPC分类号: H04R5/00

    摘要: The spatial sound conference system enables participants in a teleconference to distinguish between speakers even during periods of interruption and overtalk, identify speakers based on spatial location cues, understand low volume speech, and block out background noise using spatial sound information. Spatial sound information may be captured using microphones positioned at the ear locations of a dummy head at a conference table, or spatial sound information may be added to a participant's monaural audio signal using head-related transfer functions. Head-related transfer functions simulate the frequency response of audio signals across the head from one ear to the other ear to create a spatial location for a sound. Spatial sound is transmitted across a communication channel, such as ISDN, and reproduced using spatially disposed loudspeakers positioned at the ears of a participant. By inserting a spatial sound component in a teleconference, a speaker other than the loudest speaker may be heard during periods of interruption and overtalk. Additionally, speakers may be more readily identified when they have a spatial sound position, and the perception of background noise is reduced.

    摘要翻译: 空间声音会议系统使得电话会议的参与者即使在中断和覆盖期间也能区分扬声器,基于空间位置提示识别扬声器,理解低音量语音并且使用空间声音信息来阻止背景噪声。 空间声音信息可以使用位于会议台的虚拟头部的耳朵位置处的麦克风来捕获,或者可以使用头部相关的传递函数将空间声音信息添加到参与者的单声道音频信号。 与头相关的传输功能模拟从一个耳朵到另一只耳朵的头部的音频信号的频率响应,以创建声音的空间位置。 空中声音通过诸如ISDN的通信信道传输,并且使用位于参与者的耳朵处的空间布置的扬声器进行再现。 通过在电话会议中插入空间声音分量,可以在中断和overtalk期间听到除最大扬声器之外的扬声器。 此外,当扬声器具有空间声音位置时,可以更容易地识别扬声器,并且降低背景噪声的感知。

    Personal message service with enhanced text to speech synthesis
    3.
    发明授权
    Personal message service with enhanced text to speech synthesis 失效
    具有增强的文本到语音合成的个人消息服务

    公开(公告)号:US07027568B1

    公开(公告)日:2006-04-11

    申请号:US08948328

    申请日:1997-10-10

    IPC分类号: H04M1/64

    摘要: A server in a network gathers textual information, such as news items, E-mail and the like. From that information, the server develops or identifies messages for use by individual subscribers. The same server that accumulates the text messages or another server in the network converts the textual information in each message to a sequence of speech synthesizer instructions. The converted messages, containing the sequences of speech synthesizer instructions, are transmitted to each identified subscriber's terminal device. A synthesizer in the terminal generates an audio waveform signal, representing the speech information, in response to the instructions. In the preferred embodiment, the terminals utilize concatenative type speech synthesizers, each of which has an associated vocabulary of stored fundamental sound samples. The instructions identify the sound samples, in order. The instructions also provide parameters for controlling characteristics of the signal generated during waveform synthesis for each sound sample in each sequence. For example, the instructions may specify the pitch, duration, amplitude, attack envelope and decay envelope for each sample. The division of the text to speech synthesis processing between the server and the terminals places the cost of the front end processing in the server, which is a shared resource. As a result, the hardware and software of the terminal may be relatively simple and inexpensive. Also, it is possible to upgrade the quality of the synthesis by upgrading the server software, without modifying the terminals.

    摘要翻译: 网络中的服务器收集文本信息,例如新闻项目,电子邮件等。 根据该信息,服务器开发或识别个人订户使用的消息。 在网络中累积文本消息或其他服务器的相同服务器将每个消息中的文本信息转换为语音合成器指令序列。 包含语音合成器指令序列的经转换的消息被发送到每个识别的用户的终端设备。 终端中的合成器响应于指令产生表示语音信息的音频波形信号。 在优选实施例中,终端使用级联型语音合成器,每个语音合成器具有存储的基本声音样本的相关词汇。 指令按顺序识别声音样本。 指令还提供用于控制在每个序列中的每个声音样本的波形合成期间产生的信号的特性的参数。 例如,指令可以指定每个样本的音调,持续时间,幅度,攻击包络和衰减包络。 服务器和终端之间的文本到语音合成处理的划分将前端处理的成本放在作为共享资源的服务器中。 结果,终端的硬件和软件可能相对简单和便宜。 此外,可以通过升级服务器软件来升级合成的质量,而无需修改终端。

    Network accessed personal secretary
    4.
    发明授权
    Network accessed personal secretary 失效
    网络访问个人秘书

    公开(公告)号:US5771273A

    公开(公告)日:1998-06-23

    申请号:US596657

    申请日:1996-02-05

    摘要: A method and system is disclosed for accessing a remote personalized secretarial platform that permits a wide variety of functions with high flexibility, while being easily usable by an individual telephone subscriber. The platform can be accessed whenever the subscriber telephone is off-hook through a voice recognition monitor that monitors the subscriber line and is responsive to a preselected utterance to generate an access signal. Placed at the telephone switch facility, a monitor module is speech responsive individually to a plurality of lines that are off hook to generate signals that effect switch functions including bridging to the platform and modifying a subscriber switch feature profile.

    摘要翻译: 公开了一种用于访问远程个性化秘书平台的方法和系统,其允许具有高灵活性的各种功能,同时易于由个人电话用户使用。 只要用户电话通过监视用户线路的语音识别监视器摘机,并响应于预先选择的话语来生成接入信号,则可以访问该平台。 放置在电话交换机设备处,监视器模块单独地响应于多个线路进行语音响应,这些线路被钩挂以产生影响交换机功能的信号,包括桥接到平台并修改订户交换机特征轮廓。

    Providing automated voice responses with variable user prompting
    5.
    发明授权
    Providing automated voice responses with variable user prompting 有权
    提供可变用户提示的自动语音响应

    公开(公告)号:US06385584B1

    公开(公告)日:2002-05-07

    申请号:US09302432

    申请日:1999-04-30

    IPC分类号: G10L1500

    摘要: A voice response unit (VRU) includes a library of content equivalent messages and prompts which may be substituted for one another to vary the presentation of messages provided to a user and thereby more closely simulate a human operator. Groups of content equivalent messages and prompts include multiple audio files, each with a slightly different wording or phraseology, but conveying substantially the same information. After a particular message content is selected, the corresponding group of messages is identified and a random number is generated and used to select one of the audio files of the group for playback. The VRU may be included as part of an automated dialer or auto attendant. In such a system, a calling party is greeted by the VRU and is prompted by a randomly selected prompt to speak the name of the called party. The system accesses a telephone directory, attempts to identify a name corresponding to the name spoken, and dials the number. The caller may interrupt or request alternative processing during a predetermined time period after the system has selected and read back a closest matching name or its corresponding telephone number. If processing is halted by the caller indicating that the name or telephone number selected by the system is incorrect, the system will attempt to identify a second closest guess, or if none is available, to ask the caller to reinput the name of the called party. Alternative processing includes hearing the telephone number without having it dialed, and diverting a call to voice mail.

    摘要翻译: 语音应答单元(VRU)包括内容等同消息和提示的库,其可以替代彼此以改变提供给用户的消息的呈现,从而更接近地模拟人类操作者。 等效消息和提示的内容组包括多个音频文件,每个音频文件具有略微不同的措辞或措辞,但传达基本相同的信息。 在选择特定消息内容之后,识别相应的消息组,并且生成随机数并用于选择组中的一个音频文件进行回放。 VRU可以作为自动拨号器或自动助理的一部分。 在这样一个系统中,主叫方被VRU打招呼,并随机选择提示来提示被叫方的名字。 系统访问电话簿,尝试识别与所使用的名称相对应的名称,并拨打号码。 在系统选择和读回最接近的匹配名称或其对应的电话号码之后,呼叫者可以在预定时间段内中断或请求替代处理。 如果呼叫者停止处理,指示系统选择的名称或电话号码不正确,系统将尝试识别第二个最接近的猜测,或者如果没有可用的话,请求呼叫方重新输入被叫方的名称 。 替代处理包括在不拨打电话号码的情况下听取电话号码,并将呼叫转移到语音信箱。

    Phonetic voice activated dialing
    6.
    发明授权
    Phonetic voice activated dialing 失效
    语音语音激活拨号

    公开(公告)号:US5991364A

    公开(公告)日:1999-11-23

    申请号:US828781

    申请日:1997-03-27

    IPC分类号: H04M3/42 H04M7/06 H04M1/64

    摘要: A telephone communications system Advanced Intelligent Network (AIN) platform provides a voice activated call dialing functionality through speaker independent phoneme speech recognition having a minimum volume of storage without requiring user template training. Speaker independent phoneme recognition identifies phoneme strings of caller spoken utterances which are then compared to phoneme string representations that previously have been stored in respective caller processing records (CPRs) for those subscribers listed in the ISCP database, or stored in an equivalent peripheral database with which the ISCP can communicate. Each stored phoneme string representation is associated in the CPR with a destination telephone number that may then be extracted to route a call.

    摘要翻译: 电话通信系统高级智能网络(AIN)平台通过具有最小存储容量的扬声器独立音素语音识别提供语音激活的呼叫拨号功能,而不需要用户模板训练。 扬声器独立音素识别识别呼叫者语音话音的音素字符串,然后与先前已经存储在ISCP数据库中列出的那些用户的相应呼叫者处理记录(CPR)中的音素字符串表示进行比较,或存储在等效的外围数据库中, ISCP可以通信。 每个存储的音素串表示在CPR中与目的地电话号码相关联,然后可以提取目的地电话号码以路由呼叫。

    Automated directory assistance system using word recognition and phoneme
processing method
    7.
    发明授权
    Automated directory assistance system using word recognition and phoneme processing method 失效
    使用字识别和音素处理方法的自动目录辅助系统

    公开(公告)号:US5638425A

    公开(公告)日:1997-06-10

    申请号:US333988

    申请日:1994-11-02

    摘要: A mechanized directory assistance system for use in a telecommunications network includes multiple speech recognition devices comprising a word recognition device, a phoneme recognition device, and an alphabet recognition device. Also provided is a voice processing unit and a computer operating under stored program control. A database is utilized which may comprise the same database as used for operator directory assistance. The system operates as follows: A directory assistance caller is prompted to speak the city or location desired. The response is digitized and simultaneously inputted to the word and phoneme recognition devices which each output a translation signal plus a probability level signal. These are compared and the highest probability level translation is selected. The caller is prompted to speak the name of the sought party. The response is processed in the same manner as the location word. In the event that the probability level fails to meet a predetermined standard the caller is prompted to spell all or part of the location and/or name. The resulting signal is inputted to the alphabet device. When translations are obtained having a satisfactory probability level the database is accessed. If plural listings are located these are articulated and the caller is prompted to respond affirmatively or negatively as to each. When a single directory number has been located a signal is transmitted to the caller to articulate this number. The system also includes provision for DTMF keyboard input in aid of the spelling procedure.

    摘要翻译: 在电信网络中使用的机械化目录辅助系统包括包括字识别装置,音素识别装置和字母识别装置的多个语音识别装置。 还提供了语音处理单元和在存储的程序控制下操作的计算机。 使用数据库,其可以包括与用于操作者目录帮助相同的数据库。 系统操作如下:提示目录协助呼叫者说出所需的城市或地点。 该响应被数字化并且同时输入到单词和音素识别装置,其中每个输出翻译信号加上概率级信号。 将这些进行比较,并选择最高概率级别的翻译。 呼叫者被提示说出所寻求的一方的名字。 以与位置字相同的方式处理响应。 在概率级不能满足预定标准的情况下,呼叫者被提示拼写所有或部分位置和/或名称。 所得到的信号被输入到字母设备。 当获得具有令人满意的概率级别的翻译时,访问数据库。 如果找到多个列表,则这些列表被清楚地表达,并且呼叫者被提示作出肯定地或消极地响应于每一个。 当已经找到单个目录号码时,将一个信号发送给呼叫者来表达此号码。 该系统还包括提供DTMF键盘输入以辅助拼写过程。

    Personal telephone service with transportable script control of services
    9.
    发明授权
    Personal telephone service with transportable script control of services 失效
    个人电话服务,可传输脚本控制服务

    公开(公告)号:US06317484B1

    公开(公告)日:2001-11-13

    申请号:US09056844

    申请日:1998-04-08

    IPC分类号: H04M164

    摘要: Personal dial tone service is used to identify the user of a subscriber line to a telephone terminal and, based on that identification, the system and method dynamically configures that line with the personal profile of that user. Such a line is used in a roaming situation to provide voice mail service to the roamer through an emulation of the roamer's home voice mail interface. The emulation is accomplished by storage at the home locale of the roamer of object oriented script associated with both executable and non-executable data duplicating or emulating executable and non-executable data in the roamer's home voice mail system. The script directs the running of the executables using the non-executables at the roaming central office to provide to the roamer at that remote office voice mail service using virtually the same interface as the interface to which the roamer is accustomed at his home locale. The script is stored in an Intelligent Peripheral wherein the executables are run pursuant to the script. Voice mail messages may be stored either in the remote or home locals.

    摘要翻译: 个人拨号音服务用于识别到电话终端的用户线路的用户,并且基于该识别,系统和方法动态地将该线路与该用户的个人简档配置。 这种线路在漫游的情况下被用于通过模拟漫游者的家庭语音邮件接口向漫游者提供语音邮件服务。 通过在漫游者家庭语音邮件系统中复制或模拟可执行和不可执行数据的可执行和不可执行数据相关联的漫游者的家庭区域的存储来实现仿真。 该脚本使用漫游中心局的不可执行文件指导可执行文件的运行,以使用与漫游者在其家庭环境中习惯的界面几乎相同的界面向远程办公室语音邮件服务的漫游者提供。 脚本存储在智能外设中,其中可执行文件根据脚本运行。 语音邮件消息可以存储在远程或本地居民当中。

    Personal area network for personal telephone services
    10.
    发明授权
    Personal area network for personal telephone services 失效
    个人电话服务个人区域网络

    公开(公告)号:US6104913A

    公开(公告)日:2000-08-15

    申请号:US38100

    申请日:1998-03-11

    IPC分类号: H04B5/00

    摘要: A personal area network (PAN) device enables the communication of data using galvanic properties of the skin. A person can wear a processor coupled to a PAN device. When the person touches a sensor capable of communicating with the PAN, the processor sends and receive data through the PAN and the sensor. In accord with the invention, the processor stores personal information related to the wearer's telephone service, such as the person's identification and billing information. The processor also may store information relating to the person's telephone subscriber profile, defining that person's individualized telephone services. When the wearer touches a sensor on a pay telephone, the processor supplies the data through the PAN and the sensor to a processor in the telephone. The telephone communicates the data through the telephone network, to enable the network to provide personalized services. For example, the network uses the billing information to bill any calls that the person makes to the person's normal telephone account, in a manner analogous to a credit card type billing procedure. A feature of the invention is that virtually positive identification of a person is implemented preferably using biometric characteristics of the actual caller.

    摘要翻译: 个人区域网络(PAN)设备能够使用皮肤的电流特性进行数据通信。 一个人可以佩戴耦合到PAN装置的处理器。 当人触摸能够与PAN通信的传感器时,处理器通过PAN和传感器发送和接收数据。 根据本发明,处理器存储与佩戴者的电话服务相关的个人信息,例如该人的识别和记帐信息。 处理器还可以存储与该人的电话用户简档相关的信息,定义该人的个性化电话服务。 当佩戴者接触付费电话上的传感器时,处理器通过PAN和传感器将数据提供给电话中的处理器。 电话通过电话网络传送数据,使网络能够提供个性化的服务。 例如,网络使用计费信息来以类似于信用卡类型计费程序的方式对该人对该人的正常电话帐户的任何呼叫进行计费。 本发明的一个特征是优选地使用实际呼叫者的生物特征来实现人的几乎正面的识别。