专利检索 caee:"Voice Signal Technologies, Inc." 第 1 页

1.

发明授权
System and method for conducting a search using a wireless mobile device 有权
标题翻译：使用无线移动设备进行搜索的系统和方法

公开(公告)号：US08285273B2

公开(公告)日：2012-10-09

申请号：US12350848

申请日：2009-01-08

申请人： Daniel L. Roth

发明人： Daniel L. Roth

IPC分类号： H04W4/00

CPC分类号： G06F17/30867 , G06F17/30554 , G06Q30/0247 , G06Q30/0256 , G06Q30/0267 , G06Q30/0277 , G06Q40/08 , G10L13/00 , G10L15/26 , H04L67/02 , H04L67/04 , H04L67/2823 , Y10S707/99933 , Y10S707/99935

摘要： A method and system are provided by which a wireless mobile device takes a vocally entered query and transmits it in a text message format over a wireless network to a search engine; receives search results based on the query from the search engine over the wireless network; and displays the search results.

摘要翻译： 提供了一种方法和系统，通过该方法和系统，无线移动设备通过无线移动设备进行语音输入的查询，并通过无线网络将其以文本消息格式发送到搜索引擎; 基于无线网络上搜索引擎的查询接收搜索结果; 并显示搜索结果。

2.

发明授权
Word recognition using choice lists 有权
标题翻译：使用选择列表的Word识别

公开(公告)号：US07809574B2

公开(公告)日：2010-10-05

申请号：US10950074

申请日：2004-09-24

申请人： Daniel L. Roth , Jordan R. Cohen , David F. Johnston , Edward W. Porter

发明人： Daniel L. Roth , Jordan R. Cohen , David F. Johnston , Edward W. Porter

IPC分类号： G10L21/00 , G06F3/16 , G06F3/14

CPC分类号： G10L15/14

摘要： One aspect of the invention involves word recognition that uses scrollable choice lists in which choices are listed in character-order. Another aspect relates to a scrollable, visually-displayed word recognition choice list, where the recognition candidates on the choice list are each associated with a choice-selecting symbol the user can use to select a desired recognition candidate by pressing an associated button, and where the same choice-selecting symbol is used for different choices displayed on the display at different times as a result of scrolling. Another aspect of the invention relates to providing a choice list of best scoring characters for a particular character position in the spelling of a filter that is used to filter word recognition. Another aspect of the invention relates to a choice list used in word recognition in which the choice list can be scrolled horizontally.

摘要翻译： 本发明的一个方面涉及使用可滚动选择列表的单词识别，其中以字符顺序列出选择。另一方面涉及可滚动的，视觉上显示的词识别选择列表，其中选择列表上的识别候选者各自与选择选择符号相关联，用户可以通过按下相关联的按钮来选择期望的识别候选，并且其中相同的选择选择符号用于在不同时间显示在显示器上作为滚动的结果的不同选择。本发明的另一方面涉及提供用于滤波器的拼写中用于过滤词识别的特定字符位置的最佳评分字符的选择列表。本发明的另一方面涉及用于字识别中的选择列表，其中选择列表可以水平滚动。

3.

发明授权
Multilingual speech recognition 有权
标题翻译：多语言语音识别

公开(公告)号：US07716050B2

公开(公告)日：2010-05-11

申请号：US10716027

申请日：2003-11-17

申请人： Laurence S. Gillick , Thomas E. Lynch , Michael J. Newman , Daniel L. Roth , Steven A. Wegmann , Jonathan P. Yamron

发明人： Laurence S. Gillick , Thomas E. Lynch , Michael J. Newman , Daniel L. Roth , Steven A. Wegmann , Jonathan P. Yamron

IPC分类号： G10L15/00

CPC分类号： G10L15/005

摘要： A method for speech recognition. The method uses a single pronunciation estimator to train acoustic phoneme models and recognize utterances from multiple languages. The method includes accepting text spellings of training words in a plurality of sets of training words, each set corresponding to a different one of a plurality of languages. The method also includes, for each of the sets of training words in the plurality, receiving pronunciations for the training words in the set, the pronunciations being characteristic of native speakers of the language of the set, the pronunciations also being in terms of subword units at least some of which are common to two or more of the languages. The method also includes training a single pronunciation estimator using data comprising the text spellings and the pronunciations of the training words.

摘要翻译： 一种语音识别方法。该方法使用单个发音估计器来训练声音音素模型并识别来自多种语言的语音。该方法包括接受多组训练词中训练词的文本拼写，每组训练单词对应于多种语言中的不同语言。该方法还包括对于多个训练词集合中的每一组，接收组中的训练单词的发音，发音是该组语言的母语者的特征，发音还以子单位其中至少有一些是两种或多种语言的共同之处。该方法还包括使用包括文本拼写和训练词的发音的数据训练单个发音估计器。

4.

发明申请
ON A MOBILE DEVICE TRACKING USE OF SEARCH RESULTS DELIVERED TO THE MOBILE DEVICE 审中-公开
标题翻译：移动设备跟踪使用提供给移动设备的搜索结果

公开(公告)号：US20080154608A1

公开(公告)日：2008-06-26

申请号：US11673992

申请日：2007-02-12

申请人： Gunnar Evermann , Daniel L. ROTH , Laurence S. GILLICK , James Coughlin

发明人： Gunnar Evermann , Daniel L. ROTH , Laurence S. GILLICK , James Coughlin

IPC分类号： G10L11/00

CPC分类号： H04M1/72522 , G06F16/90335 , G06F16/951 , G10L15/26 , H04M2250/74

摘要： A method implemented on a mobile device that includes speech recognition functionality involves: receiving an utterance that includes a search request from a user of the device; recognizing that the utterance includes a search request; sending a representation of the search request to a remote server over a wireless data connection; receiving information over the wireless data connection that is responsive to the search request; presenting the information on the mobile device; receiving an input from the user selecting an item present in the received information, the item identifying a remote resource; using the selected item to connect to the remote resource, the connection to the remote resource not involving the remote server; and sending to the remote server an indication that a connection was made to the resource identified by the selected item. The method further involves storing a log of the user's connection to remote resources and sending the log to the server.

摘要翻译： 在包括语音识别功能的移动设备上实现的方法包括：接收包括来自设备的用户的搜索请求的话语; 认识到话语包括搜索请求; 通过无线数据连接将搜索请求的表示发送到远程服务器; 通过无线数据连接接收响应于该搜索请求的信息; 呈现移动设备上的信息; 从所述用户接收选择存在于接收到的信息中的项目的输入，所述项目标识远程资源; 使用所选项目连接到远程资源，与远程资源的连接不涉及远程服务器; 以及向所述远程服务器发送与由所选项目标识的资源进行连接的指示。该方法还包括将用户连接的日志存储到远程资源并将日志发送到服务器。

5.

发明申请
Methods and apparatus for formant-based voice systems 有权
标题翻译：基于共振峰的语音系统的方法和装置

公开(公告)号：US20070061145A1

公开(公告)日：2007-03-15

申请号：US11225524

申请日：2005-09-13

申请人： Michael Edgington , Laurence Gillick , Jordan Cohen

发明人： Michael Edgington , Laurence Gillick , Jordan Cohen

IPC分类号： G10L13/00

CPC分类号： G10L13/027 , G10L13/033 , G10L25/15

摘要： In one aspect, a method of processing a voice signal to extract information to facilitate training a speech synthesis model is provided. The method comprises acts of detecting a plurality of candidate features in the voice signal, performing at least one comparison between one or more combinations of the plurality of candidate features and the voice signal, and selecting a set of features from the plurality of candidate features based, at least in part, on the at least one comparison. In another aspect, the method is performed by executing a program encoded on a computer readable medium. In another aspect, a speech synthesis model is provided by, at least in part, performing the method.

摘要翻译： 在一个方面，提供一种处理语音信号以提取信息以便于训练语音合成模型的方法。该方法包括检测语音信号中的多个候选特征的动作，执行多个候选特征的一个或多个组合与语音信号之间的至少一个比较，以及从多个候选特征中选择一组特征，至少部分地在至少一个比较上。在另一方面，通过执行在计算机可读介质上编码的程序来执行该方法。在另一方面，通过至少部分地执行该方法来提供语音合成模型。

6.

发明申请
Codec-dependent unit selection for mobile devices 审中-公开
标题翻译：适用于移动设备的编解码器依赖单元选择

公开(公告)号：US20060161433A1

公开(公告)日：2006-07-20

申请号：US11262482

申请日：2005-10-28

申请人： Michael Edgington , Laurence Gillick , Igor Zlokarnik

发明人： Michael Edgington , Laurence Gillick , Igor Zlokarnik

IPC分类号： G10L15/08

CPC分类号： G10L13/06

摘要： A method of extracting a subset of speech units from a larger set of speech units for use by a speech synthesizer in synthesizing speech, wherein the speech units are stored in a compressed encoded representation that was generated by a codec, the method comprising: selecting members of the subset of speech units based on an overall cost associated with using the speech synthesizer to synthesize a test set of speech, wherein the overall cost includes at least one error introduced by using the codec to decode the stored representations of the speech units; and storing the selected subset of speech units on a speech-enabled device.

摘要翻译： 一种从较大语音单元组中提取语音单元的子集的方法，供语音合成器在合成语音中使用，其中所述语音单元以由编解码器生成的压缩编码表示形式存储，所述方法包括：选择成员基于与使用语音合成器合成语音合成器相关联的总体成本的语音单元子集，其中总体成本包括通过使用编解码器引入的至少一个错误来解码所存储的语音单元的表示; 以及将所选择的语音单元的子集存储在启用语音的设备上。

7.

发明申请
Automatic voice addressing and messaging methods and apparatus 审中-公开
标题翻译：自动语音寻址和消息传递方法和设备

公开(公告)号：US20050137878A1

公开(公告)日：2005-06-23

申请号：US10938419

申请日：2004-09-10

申请人： Daniel Roth , Laurence Gillick , Jordan Cohen , William Barton

发明人： Daniel Roth , Laurence Gillick , Jordan Cohen , William Barton

IPC分类号： G10L15/26 , H04M1/27 , H04M1/725 , G10L21/00

CPC分类号： H04M1/72522 , G10L2015/228 , H04M1/271 , H04M1/72547 , H04M1/72561

摘要： A method of operating a device that includes speech recognition capabilities includes implementing on a device a plurality of user interfaces, wherein at least one said user interfaces is a voice interface. The method also includes launching a first application, and as part of launching the first application, launching a second application, the second application optionally presenting to a user at least one query using the voice interface and populating an address field in the first application in response to the query using the speech recognition capabilities. The second application is launched either simultaneously or subsequent to the launching of the first application. Populating the address field comprises accessing address information from a plurality of databases resident in the device.

摘要翻译： 一种操作包括语音识别功能的设备的方法包括在设备上实现多个用户接口，其中至少一个所述用户接口是语音接口。该方法还包括启动第一应用程序，并且作为启动第一应用程序的一部分，启动第二应用程序，第二应用程序可任选地向用户呈现至少一个使用语音接口的查询并响应于第一应用程序中填充地址字段使用语音识别功能查询。第二个应用程序是在第一个应用程序启动之后同时或之后启动的。填充地址字段包括从驻留在设备中的多个数据库访问地址信息。

8.

发明授权
Speech recognition using automatic recognition turn off 有权
标题翻译：语音识别使用自动识别关闭

公开(公告)号：US07716058B2

公开(公告)日：2010-05-11

申请号：US10949972

申请日：2004-09-24

申请人： Daniel L. Roth , Jordan R. Cohen , David F. Johnston

发明人： Daniel L. Roth , Jordan R. Cohen , David F. Johnston

IPC分类号： G10L15/28

CPC分类号： G10L15/22 , G10L15/19

摘要： Large vocabulary speech recognition can automatically turn recognition off in one or more ways. A user command can turn on recognition that is automatically turned off after the next end of utterance. A plurality of buttons can each be associated with a different speech mode and the touch of a given button can turn on, and then automatically turn off, the given button's associated speech recognition mode. These selectable modes can include large vocabulary and alphabetic entry modes, or continuous and discrete modes. A first user input can start recognition that allows a sequence of vocabulary words to be recognized and a second user input can start recognition that turns off after one word has been recognized. A first user input can start recognition that allows a sequence of utterances to be recognized and a second user input can start recognition that allows only a single utterance to be recognized.

摘要翻译： 大词汇语音识别可以以一种或多种方式自动转移识别。用户命令可以打开在下一个结束语句后自动关闭的识别。多个按钮可以各自与不同的语音模式相关联，并且给定按钮的触摸可以打开，然后自动关闭给定按钮的相关语音识别模式。这些可选择的模式可以包括大词汇和字母输入模式，或连续和离散模式。第一用户输入可以开始识别，其允许识别词汇序列的序列，并且第二用户输入可以开始识别，一个字被识别之后关闭。第一用户输入可以开始识别，其允许识别一系列话语，并且第二用户输入可以开始仅允许单个话语被识别的识别。

9.

发明授权
Method of producing alternate utterance hypotheses using auxiliary information on close competitors 有权
标题翻译：使用辅助信息在密切的竞争对手上产生替代发音假设的方法

公开(公告)号：US07676367B2

公开(公告)日：2010-03-09

申请号：US10783518

申请日：2004-02-20

申请人： Robert Roth , Arkady Khasin , Laurence S. Gillick

发明人： Robert Roth , Arkady Khasin , Laurence S. Gillick

IPC分类号： G10L15/04 , G10L15/00

CPC分类号： G10L15/08 , G10L15/10 , G10L2015/085

摘要： A method of constructing a list of alternate transcripts from a recognized transcript includes generating a list of close call records, matching partial sub-histories from the recognized transcript with one of the history pairs stored in each of the records, and substituting the other of the history pairs for the partial sub-history of the recognized transcript. A close call record is generated each time a pair of partial hypotheses attempt to seed a common word. Each close call record includes history information and scoring information associated with a particular pair of partial hypotheses seeding a common word. Alternate transcripts are constructed by substituting close call histories for partial histories of the recognized transcripts, and also by substituting close call histories for partial histories of other alternate transcript.

摘要翻译： 从识别的记录中构建候选抄本的列表的方法包括生成紧密呼叫记录的列表，将来自所识别抄本的部分子历史与存储在每个记录中的历史对之一进行匹配，历史对对于识别的成绩单的部分子历史记录。每当一对部分假设尝试种植一个共同词时，就会产生一个接近通话记录。每个近距离通话记录包括历史信息和与特定的一对部分假设相关联的评分信息，播种公共字。替代的成绩单是通过将认可的记录的部分历史代替关闭呼叫历史，并通过替代其他替代记录的部分历史的近距离呼叫历史来代替。

10.

发明申请
INTEGRATED VOICE SEARCH COMMANDS FOR MOBILE COMMUNICATION DEVICES 审中-公开
标题翻译：用于移动通信设备的集成语音搜索命令

公开(公告)号：US20080154611A1

公开(公告)日：2008-06-26

申请号：US11673988

申请日：2007-02-12

申请人： Gunnar Evermann , Daniel L. ROTH , Laurence S. GILLICK , James COUGHLIN

发明人： Gunnar Evermann , Daniel L. ROTH , Laurence S. GILLICK , James COUGHLIN

IPC分类号： G10L21/00

CPC分类号： H04M3/4931 , G06F16/957 , G10L15/30 , H04M1/72522 , H04M1/72561 , H04M7/0036 , H04M2250/74

摘要： A method implemented on a mobile device that includes speech recognition functionality involves presenting to a user of the mobile device a voice-control interface that supports two types of commands at a common level of the interface, the two types of commands including a first type and a second type, the first type being command and control commands and the second type being search request commands. The method further involves: receiving an utterance from the user that corresponds to a command of either of the first type or the second type; recognizing the utterance; if the received utterance is a command of the first type, performing a corresponding command and control function; and if the received utterance is a command of the second type, generating a representation of a corresponding search request and then using the representation to request a search that is responsive to the search request.

摘要翻译： 在包括语音识别功能的移动设备上实现的方法包括向移动设备的用户呈现在接口的公共级支持两种类型的命令的语音控制接口，所述两种类型的命令包括第一类型和第二类型，第一类型是命令和控制命令，第二类型是搜索请求命令。该方法还包括：从用户接收对应于第一类型或第二类型中的任一种的命令的话语; 承认话语; 如果所接收的话语是第一类型的命令，则执行相应的命令和控制功能; 并且如果接收的话语是第二类型的命令，则生成对应的搜索请求的表示，然后使用该表示来请求响应于该搜索请求的搜索。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类