Word recognition using choice lists
    2.
    发明授权
    Word recognition using choice lists 有权
    使用选择列表的Word识别

    公开(公告)号:US07809574B2

    公开(公告)日:2010-10-05

    申请号:US10950074

    申请日:2004-09-24

    IPC分类号: G10L21/00 G06F3/16 G06F3/14

    CPC分类号: G10L15/14

    摘要: One aspect of the invention involves word recognition that uses scrollable choice lists in which choices are listed in character-order. Another aspect relates to a scrollable, visually-displayed word recognition choice list, where the recognition candidates on the choice list are each associated with a choice-selecting symbol the user can use to select a desired recognition candidate by pressing an associated button, and where the same choice-selecting symbol is used for different choices displayed on the display at different times as a result of scrolling. Another aspect of the invention relates to providing a choice list of best scoring characters for a particular character position in the spelling of a filter that is used to filter word recognition. Another aspect of the invention relates to a choice list used in word recognition in which the choice list can be scrolled horizontally.

    摘要翻译: 本发明的一个方面涉及使用可滚动选择列表的单词识别,其中以字符顺序列出选择。 另一方面涉及可滚动的,视觉上显示的词识别选择列表,其中选择列表上的识别候选者各自与选择选择符号相关联,用户可以通过按下相关联的按钮来选择期望的识别候选,并且其中 相同的选择选择符号用于在不同时间显示在显示器上作为滚动的结果的不同选择。 本发明的另一方面涉及提供用于滤波器的拼写中用于过滤词识别的特定字符位置的最佳评分字符的选择列表。 本发明的另一方面涉及用于字识别中的选择列表,其中选择列表可以水平滚动。

    Multilingual speech recognition
    3.
    发明授权
    Multilingual speech recognition 有权
    多语言语音识别

    公开(公告)号:US07716050B2

    公开(公告)日:2010-05-11

    申请号:US10716027

    申请日:2003-11-17

    IPC分类号: G10L15/00

    CPC分类号: G10L15/005

    摘要: A method for speech recognition. The method uses a single pronunciation estimator to train acoustic phoneme models and recognize utterances from multiple languages. The method includes accepting text spellings of training words in a plurality of sets of training words, each set corresponding to a different one of a plurality of languages. The method also includes, for each of the sets of training words in the plurality, receiving pronunciations for the training words in the set, the pronunciations being characteristic of native speakers of the language of the set, the pronunciations also being in terms of subword units at least some of which are common to two or more of the languages. The method also includes training a single pronunciation estimator using data comprising the text spellings and the pronunciations of the training words.

    摘要翻译: 一种语音识别方法。 该方法使用单个发音估计器来训练声音音素模型并识别来自多种语言的语音。 该方法包括接受多组训练词中训练词的文本拼写,每组训练单词对应于多种语言中的不同语言。 该方法还包括对于多个训练词集合中的每一组,接收组中的训练单词的发音,发音是该组语言的母语者的特征,发音还以子单位 其中至少有一些是两种或多种语言的共同之处。 该方法还包括使用包括文本拼写和训练词的发音的数据训练单个发音估计器。

    ON A MOBILE DEVICE TRACKING USE OF SEARCH RESULTS DELIVERED TO THE MOBILE DEVICE
    4.
    发明申请
    ON A MOBILE DEVICE TRACKING USE OF SEARCH RESULTS DELIVERED TO THE MOBILE DEVICE 审中-公开
    移动设备跟踪使用提供给移动设备的搜索结果

    公开(公告)号:US20080154608A1

    公开(公告)日:2008-06-26

    申请号:US11673992

    申请日:2007-02-12

    IPC分类号: G10L11/00

    摘要: A method implemented on a mobile device that includes speech recognition functionality involves: receiving an utterance that includes a search request from a user of the device; recognizing that the utterance includes a search request; sending a representation of the search request to a remote server over a wireless data connection; receiving information over the wireless data connection that is responsive to the search request; presenting the information on the mobile device; receiving an input from the user selecting an item present in the received information, the item identifying a remote resource; using the selected item to connect to the remote resource, the connection to the remote resource not involving the remote server; and sending to the remote server an indication that a connection was made to the resource identified by the selected item. The method further involves storing a log of the user's connection to remote resources and sending the log to the server.

    摘要翻译: 在包括语音识别功能的移动设备上实现的方法包括:接收包括来自设备的用户的搜索请求的话语; 认识到话语包括搜索请求; 通过无线数据连接将搜索请求的表示发送到远程服务器; 通过无线数据连接接收响应于该搜索请求的信息; 呈现移动设备上的信息; 从所述用户接收选择存在于接收到的信息中的项目的输入,所述项目标识远程资源; 使用所选项目连接到远程资源,与远程资源的连接不涉及远程服务器; 以及向所述远程服务器发送与由所选项目标识的资源进行连接的指示。 该方法还包括将用户连接的日志存储到远程资源并将日志发送到服务器。

    Methods and apparatus for formant-based voice systems
    5.
    发明申请
    Methods and apparatus for formant-based voice systems 有权
    基于共振峰的语音系统的方法和装置

    公开(公告)号:US20070061145A1

    公开(公告)日:2007-03-15

    申请号:US11225524

    申请日:2005-09-13

    IPC分类号: G10L13/00

    摘要: In one aspect, a method of processing a voice signal to extract information to facilitate training a speech synthesis model is provided. The method comprises acts of detecting a plurality of candidate features in the voice signal, performing at least one comparison between one or more combinations of the plurality of candidate features and the voice signal, and selecting a set of features from the plurality of candidate features based, at least in part, on the at least one comparison. In another aspect, the method is performed by executing a program encoded on a computer readable medium. In another aspect, a speech synthesis model is provided by, at least in part, performing the method.

    摘要翻译: 在一个方面,提供一种处理语音信号以提取信息以便于训练语音合成模型的方法。 该方法包括检测语音信号中的多个候选特征的动作,执行多个候选特征的一个或多个组合与语音信号之间的至少一个比较,以及从多个候选特征中选择一组特征 ,至少部分地在至少一个比较上。 在另一方面,通过执行在计算机可读介质上编码的程序来执行该方法。 在另一方面,通过至少部分地执行该方法来提供语音合成模型。

    Codec-dependent unit selection for mobile devices
    6.
    发明申请
    Codec-dependent unit selection for mobile devices 审中-公开
    适用于移动设备的编解码器依赖单元选择

    公开(公告)号:US20060161433A1

    公开(公告)日:2006-07-20

    申请号:US11262482

    申请日:2005-10-28

    IPC分类号: G10L15/08

    CPC分类号: G10L13/06

    摘要: A method of extracting a subset of speech units from a larger set of speech units for use by a speech synthesizer in synthesizing speech, wherein the speech units are stored in a compressed encoded representation that was generated by a codec, the method comprising: selecting members of the subset of speech units based on an overall cost associated with using the speech synthesizer to synthesize a test set of speech, wherein the overall cost includes at least one error introduced by using the codec to decode the stored representations of the speech units; and storing the selected subset of speech units on a speech-enabled device.

    摘要翻译: 一种从较大语音单元组中提取语音单元的子集的方法,供语音合成器在合成语音中使用,其中所述语音单元以由编解码器生成的压缩编码表示形式存储,所述方法包括:选择成员 基于与使用语音合成器合成语音合成器相关联的总体成本的语音单元子集,其中总体成本包括通过使用编解码器引入的至少一个错误来解码所存储的语音单元的表示; 以及将所选择的语音单元的子集存储在启用语音的设备上。

    Automatic voice addressing and messaging methods and apparatus
    7.
    发明申请
    Automatic voice addressing and messaging methods and apparatus 审中-公开
    自动语音寻址和消息传递方法和设备

    公开(公告)号:US20050137878A1

    公开(公告)日:2005-06-23

    申请号:US10938419

    申请日:2004-09-10

    摘要: A method of operating a device that includes speech recognition capabilities includes implementing on a device a plurality of user interfaces, wherein at least one said user interfaces is a voice interface. The method also includes launching a first application, and as part of launching the first application, launching a second application, the second application optionally presenting to a user at least one query using the voice interface and populating an address field in the first application in response to the query using the speech recognition capabilities. The second application is launched either simultaneously or subsequent to the launching of the first application. Populating the address field comprises accessing address information from a plurality of databases resident in the device.

    摘要翻译: 一种操作包括语音识别功能的设备的方法包括在设备上实现多个用户接口,其中至少一个所述用户接口是语音接口。 该方法还包括启动第一应用程序,并且作为启动第一应用程序的一部分,启动第二应用程序,第二应用程序可任选地向用户呈现至少一个使用语音接口的查询并响应于第一应用程序中填充地址字段 使用语音识别功能查询。 第二个应用程序是在第一个应用程序启动之后同时或之后启动的。 填充地址字段包括从驻留在设备中的多个数据库访问地址信息。

    Speech recognition using automatic recognition turn off
    8.
    发明授权
    Speech recognition using automatic recognition turn off 有权
    语音识别使用自动识别关闭

    公开(公告)号:US07716058B2

    公开(公告)日:2010-05-11

    申请号:US10949972

    申请日:2004-09-24

    IPC分类号: G10L15/28

    CPC分类号: G10L15/22 G10L15/19

    摘要: Large vocabulary speech recognition can automatically turn recognition off in one or more ways. A user command can turn on recognition that is automatically turned off after the next end of utterance. A plurality of buttons can each be associated with a different speech mode and the touch of a given button can turn on, and then automatically turn off, the given button's associated speech recognition mode. These selectable modes can include large vocabulary and alphabetic entry modes, or continuous and discrete modes. A first user input can start recognition that allows a sequence of vocabulary words to be recognized and a second user input can start recognition that turns off after one word has been recognized. A first user input can start recognition that allows a sequence of utterances to be recognized and a second user input can start recognition that allows only a single utterance to be recognized.

    摘要翻译: 大词汇语音识别可以以一种或多种方式自动转移识别。 用户命令可以打开在下一个结束语句后自动关闭的识别。 多个按钮可以各自与不同的语音模式相关联,并且给定按钮的触摸可以打开,然后自动关闭给定按钮的相关语音识别模式。 这些可选择的模式可以包括大词汇和字母输入模式,或连续和离散模式。 第一用户输入可以开始识别,其允许识别词汇序列的序列,并且第二用户输入可以开始识别,一个字被识别之后关闭。 第一用户输入可以开始识别,其允许识别一系列话语,并且第二用户输入可以开始仅允许单个话语被识别的识别。

    Method of producing alternate utterance hypotheses using auxiliary information on close competitors
    9.
    发明授权
    Method of producing alternate utterance hypotheses using auxiliary information on close competitors 有权
    使用辅助信息在密切的竞争对手上产生替代发音假设的方法

    公开(公告)号:US07676367B2

    公开(公告)日:2010-03-09

    申请号:US10783518

    申请日:2004-02-20

    IPC分类号: G10L15/04 G10L15/00

    摘要: A method of constructing a list of alternate transcripts from a recognized transcript includes generating a list of close call records, matching partial sub-histories from the recognized transcript with one of the history pairs stored in each of the records, and substituting the other of the history pairs for the partial sub-history of the recognized transcript. A close call record is generated each time a pair of partial hypotheses attempt to seed a common word. Each close call record includes history information and scoring information associated with a particular pair of partial hypotheses seeding a common word. Alternate transcripts are constructed by substituting close call histories for partial histories of the recognized transcripts, and also by substituting close call histories for partial histories of other alternate transcript.

    摘要翻译: 从识别的记录中构建候选抄本的列表的方法包括生成紧密呼叫记录的列表,将来自所识别抄本的部分子历史与存储在每个记录中的历史对之一进行匹配, 历史对对于识别的成绩单的部分子历史记录。 每当一对部分假设尝试种植一个共同词时,就会产生一个接近通话记录。 每个近距离通话记录包括历史信息和与特定的一对部分假设相关联的评分信息,播种公共字。 替代的成绩单是通过将认可的记录的部分历史代替关闭呼叫历史,并通过替代其他替代记录的部分历史的近距离呼叫历史来代替。

    INTEGRATED VOICE SEARCH COMMANDS FOR MOBILE COMMUNICATION DEVICES
    10.
    发明申请
    INTEGRATED VOICE SEARCH COMMANDS FOR MOBILE COMMUNICATION DEVICES 审中-公开
    用于移动通信设备的集成语音搜索命令

    公开(公告)号:US20080154611A1

    公开(公告)日:2008-06-26

    申请号:US11673988

    申请日:2007-02-12

    IPC分类号: G10L21/00

    摘要: A method implemented on a mobile device that includes speech recognition functionality involves presenting to a user of the mobile device a voice-control interface that supports two types of commands at a common level of the interface, the two types of commands including a first type and a second type, the first type being command and control commands and the second type being search request commands. The method further involves: receiving an utterance from the user that corresponds to a command of either of the first type or the second type; recognizing the utterance; if the received utterance is a command of the first type, performing a corresponding command and control function; and if the received utterance is a command of the second type, generating a representation of a corresponding search request and then using the representation to request a search that is responsive to the search request.

    摘要翻译: 在包括语音识别功能的移动设备上实现的方法包括向移动设备的用户呈现在接口的公共级支持两种类型的命令的语音控制接口,所述两种类型的命令包括第一类型和 第二类型,第一类型是命令和控制命令,第二类型是搜索请求命令。 该方法还包括:从用户接收对应于第一类型或第二类型中的任一种的命令的话语; 承认话语; 如果所接收的话语是第一类型的命令,则执行相应的命令和控制功能; 并且如果接收的话语是第二类型的命令,则生成对应的搜索请求的表示,然后使用该表示来请求响应于该搜索请求的搜索。