Method for exemplary voice morphing
    1.
    发明申请
    Method for exemplary voice morphing 有权
    示范声音变形的方法

    公开(公告)号:US20130311173A1

    公开(公告)日:2013-11-21

    申请号:US13673708

    申请日:2012-11-09

    申请人: Jordan Cohen

    发明人: Jordan Cohen

    IPC分类号: G10L21/013

    CPC分类号: G10L21/013 G10L2021/0135

    摘要: A method of morphing speech from an original speaker into the speech of a second, target speaker with decomposing either speech into source and filter, and without the need to determine the formant positions by warping spectral envelops.

    摘要翻译: 一种将来自原始说话者的语音变形成第二个目标扬声器的语音的方法,其将语音分解成源和滤波器,并且不需要通过扭曲频谱包络来确定共振峰位置。

    Text messaging via phrase recognition
    2.
    发明申请
    Text messaging via phrase recognition 审中-公开
    通过短信识别的短信

    公开(公告)号:US20050149327A1

    公开(公告)日:2005-07-07

    申请号:US10935691

    申请日:2004-09-07

    摘要: A method of constructing a text message on a mobile communications device, the method involving: storing a plurality of text phrases; for each of the text phrases, storing a representation that is derived from that text phrase; receiving a spoken phrase from a user; from the received spoken phrase generating an acoustic representation thereof; based on the acoustic representation, searching among the stored representations to identify a stored text phrase that best matches the spoken phrase; and inserting into an electronic document the text phrase that is identified from searching.

    摘要翻译: 一种在移动通信设备上构建文本消息的方法,所述方法涉及:存储多个文本短语; 对于每个文本短语,存储从该文本短语导出的表示; 从用户接收口头短语; 从所接收的口头短语产生其声表示; 基于声学表示,在所存储的表示之间进行搜索以识别与语音短语最匹配的存储的文本短语; 并从电子文档中插入从搜索中识别的文本短语。

    Installing language modules in a mobile communication device
    3.
    发明申请
    Installing language modules in a mobile communication device 审中-公开
    在移动通信设备中安装语言模块

    公开(公告)号:US20050131685A1

    公开(公告)日:2005-06-16

    申请号:US10988994

    申请日:2004-11-15

    摘要: A method including: providing a mobile device (e.g. cellular phone) with a core engine for performing speech recognition; providing a plurality of sets of language-specific modules, each set of the plurality of sets for enabling the core engine to recognize a different language; selecting one set of language-specific modules among the plurality of sets of language-specific modules; and loading into memory within the mobile communication device the selected set of language-specific modules so as to enable the mobile communication device to recognize speech spoken in the language of the selected set.

    摘要翻译: 一种方法,包括:提供具有用于执行语音识别的核心引擎的移动设备(例如,蜂窝电话); 提供多组语言特定模块,所述多个集合中的每一组用于使得核心引擎能够识别不同的语言; 在所述多组语言特定模块中选择一组语言特定模块; 以及将所选择的语言特定模块集合加载到所述移动通信设备内的存储器中,以使所述移动通信设备能够识别所选择的集合的语言中所说出的语音。

    Phone number and name pronunciation interchange via cell phone
    4.
    发明申请
    Phone number and name pronunciation interchange via cell phone 审中-公开
    电话号码和名称通过手机发音互换

    公开(公告)号:US20050118986A1

    公开(公告)日:2005-06-02

    申请号:US10937890

    申请日:2004-09-09

    IPC分类号: H04M1/2745 H04M1/725 H04M1/00

    CPC分类号: H04M1/274516 H04M1/72552

    摘要: A method of transferring phone book information from one cell phone to another cell phone includes compiling the phone book information relating to one or more cell phone users into a data transmission package, and sending the data transmission package from the first cell phone to the second cell phone, via a communication channel native to the first and second cell phones.

    摘要翻译: 将电话簿信息从一个手机转移到另一个手机的方法包括将与一个或多个蜂窝电话用户有关的电话簿信息编译成数据传输包,并将数据传输包从第一手机发送到第二个小区 通过本地对第一和第二手机的通信信道进行电话。

    Speech recognition using ambiguous or phone key spelling and/or filtering
    5.
    发明申请
    Speech recognition using ambiguous or phone key spelling and/or filtering 有权
    使用模糊或手机键拼写和/或过滤的语音识别

    公开(公告)号:US20050043947A1

    公开(公告)日:2005-02-24

    申请号:US10950090

    申请日:2004-09-24

    CPC分类号: G10L15/22 G10L15/19

    摘要: Alphabetic filtering of the speech recognition of words uses a key press to indicate a desired character in an alphabetic filter string, where each key press represents two or more letters. The key presses can be disambiguated by recognizing a key-disambiguation utterance in association with a given key press. A user can select a desired recognition candidate from a choice list produced by such filtered word recognition. Ambiguous alphabetic filtering can be performed iteratively in response to the addition of successive ambiguous key presses. A user can select to re-recognize the utterance using filtering based on ambiguous key input after seeing the results of recognition without such filtering. Unambiguous alphabetic filtering can be performed by using multiple presses of an ambiguous key to disambiguate which letter is intended. A user can select between entering text by either large vocabulary speech recognition or by spelling text by pressing phone keys.

    摘要翻译: 字母语音识别的字母过滤使用按键来在字母过滤器字符串中指示期望的字符,其中每个按键表示两个或多个字母。 通过识别与给定的重点新闻相关的关键消歧话语,可以消除按键。 用户可以从由这种经过过滤的字识别产生的选择列表中选择所需的识别候选。 响应于添加连续模糊的按键,可以迭代地执行不确定的字母过滤。 用户可以在看到没有这种过滤的识别结果之后,基于模糊键输入使用过滤来重新识别话语。 无歧义的字母过滤可以通过使用多个不明确的键来进行,以消除哪个字母的意图。 用户可以通过大词汇语音识别输入文本或通过按电话键拼写文本来进行选择。

    Extendable voice commands
    6.
    发明申请
    Extendable voice commands 有权
    可扩展语音命令

    公开(公告)号:US20050288005A1

    公开(公告)日:2005-12-29

    申请号:US11158994

    申请日:2005-06-22

    摘要: A mobile device, such as a cellular telephone includes a voice interface that includes one part that may not be specific to a particular carrier, and a second part that provides an interface to services that are specific to a carrier or to service or information providers that are not necessarily available with all carriers. A voice command interface provides easy access to the carrier services. The set of carrier services is optionally extendible by the carrier.

    摘要翻译: 诸如蜂窝电话的移动设备包括语音接口,其包括可能不是特定于特定载波的一个部分,以及第二部分,其提供与运营商或服务或信息提供者特有的服务的接口, 不一定适用于所有运营商。 语音命令界面提供了对运营商服务的轻松访问。 运营商服务的集合可以由运营商可选地扩展。

    Automated testing of voice recognition software
    7.
    发明申请
    Automated testing of voice recognition software 有权
    自动测试语音识别软件

    公开(公告)号:US20050197836A1

    公开(公告)日:2005-09-08

    申请号:US11031955

    申请日:2005-01-07

    IPC分类号: G10L15/00

    CPC分类号: G10L15/01

    摘要: A method and a system for testing a voice enabled application on a target device, the method including conducting one or more interactions with the target device, at least some of the interactions including presenting an acoustic utterance in an acoustic environment to the target device, receiving an output of the target device in response to the acoustic utterance, and comparing the output to an output expected from the acoustic utterance.

    摘要翻译: 一种用于测试目标设备上的支持语音的应用的方法和系统,所述方法包括与目标设备进行一个或多个交互,所述交互中的至少一些包括在声学环境中向目标设备呈现声学发声,接收 响应于声学发声的目标装置的输出,以及将输出与从声学语音预期的输出进行比较。

    Voice enabled phone book interface for speaker dependent name recognition and phone number categorization
    9.
    发明申请
    Voice enabled phone book interface for speaker dependent name recognition and phone number categorization 审中-公开
    支持语音功能的电话簿界面,用于与扬声器相关的名称识别和电话号码分类

    公开(公告)号:US20050154587A1

    公开(公告)日:2005-07-14

    申请号:US10935690

    申请日:2004-09-07

    IPC分类号: H04M1/27 H04M1/725 G10L15/00

    CPC分类号: H04M1/271 H04M1/725

    摘要: A method of operating a mobile communication device that includes a speaker independent recognizer and a memory storing phonebook including a plurality of names, the method involving: generating a first voice signal from a first voice input received from a user, the first voice input specifying a selected one of a plurality of names; comparing the first voice signal to a plurality of voice tags that are stored in the device to identify the selected name in the phonebook; generating a second voice signal from a second speech input received from the user, the second voice input specifying a selected one of a plurality of phone number types; using the speaker independent recognizer to identify the selected phone number type; retrieving a phone number that is stored in association with the identified type for the identified name; and initiating a call to the phone number associated with the identified type for the identified name.

    摘要翻译: 一种操作移动通信设备的方法,该移动通信设备包括独立于扬声器的识别器和包含多个名称的存储电话簿的存储器,所述方法包括:从从用户接收的第一语音输入中产生第一语音信号,所述第一语音输入指定 选择多个名称中的一个; 将所述第一语音信号与存储在所述设备中的多个语音标签进行比较,以识别所述电话簿中所选择的名称; 从从用户接收的第二语音输入产生第二语音信号,所述第二语音输入指定多个电话号码类型中的所选择的一个; 使用扬声器独立识别器来识别所选择的电话号码类型; 检索与识别的名称的识别类型相关联地存储的电话号码; 以及发起与所识别的名称的所识别类型相关联的电话号码的呼叫。

    Speech recognition using selectable recognition modes
    10.
    发明申请
    Speech recognition using selectable recognition modes 有权
    使用可选识别模式的语音识别

    公开(公告)号:US20050049880A1

    公开(公告)日:2005-03-03

    申请号:US10950092

    申请日:2004-09-24

    CPC分类号: G10L15/22 G10L15/19

    摘要: The present invention relates to speech recognition using selectable recognition modes. This includes innovations such as: large vocabulary speech recognition programming that supplies recognized words to external program as they are recognized, and allows a user to select between large vocabulary recognition of an utterance with and without language context from the prior utterance independently of state of the external program; allowing a user to select between continuous and discrete speech recognition that use substantially the same vocabulary; allowing a user to select between continuous and discrete large-vocabulary speech recognition modes; allowing a user to select between at least two different alphabetic entry speech recognition modes; and allowing a user to select from among four or more of the following recognitions modes when creating text: a large-vocabulary mode, an alphabetic entry mode, a number entry mode, and a punctuation entry mode.

    摘要翻译: 本发明涉及使用可选择识别模式的语音识别。 这包括创新,例如:大量词汇语音识别程序,在识别出外部程序时,将识别的词提供给外部程序,并允许用户在与先前的语言无关的语言语境的大量词汇识别与非语言语境之间进行选择 外部程序; 允许用户在使用基本相同词汇的连续和离散语音识别之间进行选择; 允许用户在连续和离散的大词汇语音识别模式之间进行选择; 允许用户在至少两个不同的字母进入语音识别模式之间进行选择; 并且允许用户在创建文本时从四种或更多种以下识别模式中进行选择:大词汇模式,字母输入模式,数字输入模式和标点输入模式。