专利检索 cpc:"G10L2015/086" 第 3 页

21.

发明申请
NUMBER-ASSISTANT VOICE INPUT SYSTEM, NUMBER-ASSISTANT VOICE INPUT METHOD FOR VOICE INPUT SYSTEM AND NUMBER-ASSISTANT VOICE CORRECTING METHOD FOR VOICE INPUT SYSTEM 审中-公开
标题翻译：数字辅助语音输入系统，用于语音输入系统的辅助语音输入方法和语音输入系统的数字辅助语音校正方法

公开(公告)号：US20120303368A1

公开(公告)日：2012-11-29

申请号：US13117491

申请日：2011-05-27

申请人： Ting MA

发明人： Ting MA

IPC分类号： G10L15/26

CPC分类号： G10L15/22 , G10L2015/086 , G10L2015/221

摘要： The present invention discloses a number-assistant voice input system, a number-assistant voice input method for a voice input system and a number-assistant voice correcting method for a voice input system, which apply software to drive a voice input system of an electronic device to provide a voice input logic circuit module. The voice input logic circuit module defines the pronunciation of numbers 1 to 26 as the paths to respectively input letters A to Z in the voice input system and allows users to selectively input or correct a letter by reading a number from 1 to 26 instead of a letter from A to Z.

摘要翻译： 本发明公开了一种语音输入系统的辅助语音输入系统，语音输入系统的辅助语音输入方法以及用于语音输入系统的数字助理语音校正方法，该方法应用软件来驱动电子语音输入系统设备提供语音输入逻辑电路模块。语音输入逻辑电路模块将数字1至26的发音定义为在语音输入系统中分别输入字母A至Z的路径，并且允许用户通过从1至26读取数字来选择性地输入或校正字母，而不是从A到Z的信。

22.

发明申请
SYSTEM AND METHOD FOR SPELLING RECOGNITION USING SPEECH AND NON-SPEECH INPUT 有权
标题翻译：使用语音和非语音输入来识别识别的系统和方法

公开(公告)号：US20110218808A1

公开(公告)日：2011-09-08

申请号：US13109293

申请日：2011-05-17

申请人： Sarangarajan PARTHASARATHY

发明人： Sarangarajan PARTHASARATHY

IPC分类号： G10L15/18

CPC分类号： G06F3/038 , G06F3/0237 , G06F2203/0381 , G10L15/187 , G10L15/197 , G10L15/22 , G10L2015/086

摘要： A system and method for non-speech input or keypad-aided word and spelling recognition is disclosed. The method includes generating an unweighted grammar, selecting a database of words, generating a weighted grammar using the unweighted grammar and a statistical letter model trained on the database of words, receiving speech from a user after receiving the non-speech input and after generating the weighted grammar, and performing automatic speech recognition on the speech and non-speech input using the weighted grammar If a confidence is below a predetermined level, then the method includes receiving non-speech input from the user, disambiguating possible spellings by generating a letter lattice based on a user input modality, and constraining the letter lattice and generating a new letter string of possible word spellings until a letter string is correctly recognized.

摘要翻译： 公开了一种用于非语音输入或键盘辅助字和拼写识别的系统和方法。该方法包括生成未加权语法，选择字数据库，使用未加权语法生成加权语法，以及在字数据库上训练的统计字母模型，在接收到非语音输入之后从用户接收语音，并在生成加权语法，并使用加权语法对语音和非语音输入执行自动语音识别如果置信度低于预定水平，则该方法包括从用户接收非语音输入，通过生成字母格来消除可能的拼写基于用户输入模式，并约束字母格并生成可能的字拼写的新字母串，直到正确识别字母串。

23.

发明授权
Phonetic spelling for speech recognition 有权
标题翻译：语音识别语音拼写

公开(公告)号：US06321196B1

公开(公告)日：2001-11-20

申请号：US09346355

申请日：1999-07-02

申请人： Carlos Antonio Franceschi

发明人： Carlos Antonio Franceschi

IPC分类号： G10L1506

CPC分类号： G10L15/22 , G10L2015/0631 , G10L2015/086

摘要： Speech recognition apparatus includes means for determining when a speaker desires to spell a first word. The speaker may then say a sequence of words selected from a large vocabulary without being restricted to a pre-specified phonetic alphabet. The apparatus recognizes the spoken words, associates letters with these words and then arranges the letters to form the first word. The speaker may also indicate a desire to stop phonetic spelling. Apparatus may also be used for selecting items from a list.

摘要翻译： 语音识别装置包括用于确定说话人何时拼写第一个字的装置。然后，扬声器可以说从大词汇表中选择的一系列词，而不限于预先指定的语音字母表。该装置识别口语，与这些单词相关联的字母，然后排列字母形成第一个单词。演讲者也可能表示希望停止拼音。设备也可用于从列表中选择项目。

24.

发明授权
Text editor for speech input 失效
标题翻译：用于语音输入的文本编辑器

公开(公告)号：US4914704A

公开(公告)日：1990-04-03

申请号：US666212

申请日：1984-10-30

申请人： Alan G. Cole , Robert H. Riekert

发明人： Alan G. Cole , Robert H. Riekert

IPC分类号： G06F3/16 , G06F17/21 , G06F17/24 , G06F17/27

CPC分类号： G06F17/273 , G06F17/21 , G06F17/24 , G06F17/277 , G06F17/2775 , G06F3/167 , G10L2015/086

摘要： A text editor is connected to a speech recognizing unit for editing preferably spoken input text using a display speech. For each text word (including digits), and each punctuation mark that can be recognized and is contained in a dictionary, a token is stored for holding information on character count, capitalization, left and right concatenation of the respective item, and for providing fields for context conditions. For each segment or entity recognized spoken text, a respective character string and associated token is transferred to storage in the editor to allow automatic formatting and correct displaying or printing of the text, including spaces and capitalization where required. Tokens are updated during editing to reflect modifications such as in the beginning of a sentence or in concatenation. Switching to spelling mode is provided for entering single spelled characters in case where a word cannot be recognized or where spelling is desired.

摘要翻译： 文本编辑器连接到语音识别单元，用于使用显示语音来编辑优选地说出的输入文本。对于每个文本字（包括数字）以及可以被识别并包含在字典中的每个标点符号，存储用于保存关于字符数，大小写，相应项的左和右连接的信息的令牌，并且用于提供字段对于上下文条件。对于每个段或实体识别的口头文本，相应的字符串和相关联的令牌被传送到编辑器中的存储器，以允许自动格式化并且正确地显示或打印文本，包括在需要时的空格和大小写。令牌在编辑过程中被更新，以反映诸如在句子开头或连接中的修改。提供拼写模式，以便在无法识别单词或需要拼写的情况下输入单个拼写字符。

25.

发明公开
AUTOMATED WORD CORRECTION IN SPEECH RECOGNITION SYSTEMS 审中-公开

公开(公告)号：US20230410792A1

公开(公告)日：2023-12-21

申请号：US18211732

申请日：2023-06-20

申请人： Rovi Guides, Inc.

发明人： Ankur Anil Aher , Jeffry Copps Robert Jose

IPC分类号： G10L15/01 , G10L15/06 , G10L15/08 , G10L15/22

CPC分类号： G10L15/01 , G10L15/063 , G10L15/08 , G10L15/22 , G10L2015/0636 , G10L2015/086

摘要： Systems and methods for correcting recognition errors in speech recognition systems are disclosed herein. Natural conversational variations are identified to determine whether a query intends to correct a speech recognition error or whether the query is a new command. When the query intends to correct a speech recognition error, the system identifies a location of the error and performs the correction. The corrected query can be presented to the user or be acted upon as a command for the system.

26.

发明公开
APPARATUSES AND METHODS FOR SELECTIVELY INSERTING TEXT INTO A VIDEO RESUME 审中-公开

公开(公告)号：US20230298630A1

公开(公告)日：2023-09-21

申请号：US18084742

申请日：2022-12-20

申请人： MY JOB MATCHER, INC. D/B/A JOB.COM

发明人： Arran Stewart

IPC分类号： G11B27/036 , G06F40/279 , G10L15/08 , G06V20/40 , G10L25/57 , G06V30/19 , G06N20/00 , G10L15/22

CPC分类号： G11B27/036 , G06F40/279 , G10L15/08 , G06V20/49 , G10L25/57 , G06V30/19 , G06N20/00 , G10L15/22 , G10L2015/086

摘要： Aspects relate to apparatuses and methods for selectively inserting text into a video resume. An exemplary apparatus includes a processor and a memory communicatively connected to the processor, the memory containing instructions configuring the processor to receive a video resume from a user, divide the video resume is into temporal sections, acquire a plurality of textual inputs from a user, wherein the plurality of textual inputs pertains to the same user of received video resume, classify the plurality of textual inputs to corresponding temporal sections of the received video resume and display, as a function of the classification, the received video resume with a corresponding plurality of textual inputs.

27.

发明申请
MULTI-CHANNEL VOICE RECOGNITION FOR A VEHICLE ENVIRONMENT 审中-公开

公开(公告)号：US20190237067A1

公开(公告)日：2019-08-01

申请号：US15884437

申请日：2018-01-31

申请人： Toyota Motor Engineering & Manufacturing North America, Inc.

发明人： Scott A. Friedman , Prince R. Remegio , Tim Uwe Falkenmayer , Roger Akira Kyle , Ryoma Kakimi , Luke D. Heide , Nishikant Narayan Puranik

IPC分类号： G10L15/22 , H04R1/40 , G10L15/08 , H04R3/00

CPC分类号： G10L15/22 , G10L15/08 , G10L2015/086 , G10L2015/223 , H04R1/406 , H04R3/005 , H04R2499/13

摘要： A method and device for providing voice command operation in a passenger vehicle cabin having multiple occupants are disclosed. The method and device operate to monitor microphone data relating to voice commands within a vehicle cabin and determine whether the microphone data includes wake-up-word data. When the wake-up-word data relates to more than one of a plurality of vehicle cabin zones and more than one wake-up-words are coincident, the method and device operate to monitor respective microphone data for voice command data from each of the more than one of the respective ones of the plurality of vehicle cabin zones. Upon detection, the voice command data may be processed to produce respective vehicle device commands and the vehicle device command(s) can be transmitted to effect the voice command data.

28.

发明申请
METHOD OF CREATING ANIMATED IMAGE BASED ON KEY INPUT, AND USER TERMINAL FOR PERFORMING THE METHOD 审中-公开

公开(公告)号：US20190073817A1

公开(公告)日：2019-03-07

申请号：US16122027

申请日：2018-09-05

申请人： KAKAO CORP.

发明人： Kyung Ho SUNG , Ji Hyung HONG , Ji Soo HWANG , Hye Won SHIN , Hyun A KIM , Yea Joon PARK , Shin Hyang OH

IPC分类号： G06T13/80 , H04M1/725 , G10L15/08 , G10L15/02 , G06T11/00

CPC分类号： G06T13/80 , G06F3/167 , G06T11/001 , G10L15/02 , G10L15/08 , G10L15/26 , G10L2015/027 , G10L2015/086 , H04M1/72555 , H04M2250/22 , H04M2250/52

摘要： A method of creating an animated image based on a key input, and a user terminal for performing the method are provided. The method includes acquiring a snapshot image using a camera installed in a user terminal every time a key is input to the user terminal, and creating an animated image by merging the acquired snapshot image with the input key.

29.

发明授权
Speech recognition apparatus for recognizing user's utterance 有权
标题翻译：用于识别用户话语的语音识别装置

公开(公告)号：US09437190B2

公开(公告)日：2016-09-06

申请号：US14239315

申请日：2012-08-31

申请人： Tomoyuki Kumai , Toshiyuki Miyazaki

发明人： Tomoyuki Kumai , Toshiyuki Miyazaki

IPC分类号： G10L15/00 , G10L15/04 , G10L15/187 , G10L15/08

CPC分类号： G10L15/187 , G10L2015/086

摘要： In accordance with alphabet input method information for each user, a word formed of an alphabet string is registered in a word dictionary, in a state where “dotto” being added before each alphabet and one of a set of alphabets difficult to distinguish from each other like “M and N” and “B and P” is repeated twice. For example, a word “PAM” and a feature of time series corresponding to “dotto P P doddo A dotto M” are registered in association with each other. When a user performs a speech input of “PAM”, in accordance with the user's alphabet input method information, the user utters “dotto P P dotto A dotto M”. A speech recognition is performed on this sound data using the word dictionary corresponding to the user's alphabet input method information.

摘要翻译： 根据每个用户的字母输入法信息，将字母串形成的字记录在单词字典中，在每个字母表之前添加“dotto”的状态，并且一组字母表中的一个难以彼此区分像“M”和“N”，“B”和“P”重复两次。例如，与“dotto P P doddo A dotto M”对应的单词“PAM”和时间序列的特征被相互关联地登记。当用户执行“PAM”的语音输入时，根据用户的字母输入方法信息，用户发出“dotto P P dotto A dotto M”。使用与用户的字母表输入法信息对应的单词字典对该声音数据进行语音识别。

30.

发明申请
METHODS AND SYSTEMS FOR MANAGING DIALOG OF SPEECH SYSTEMS 有权
标题翻译：用于管理语音系统对话的方法和系统

公开(公告)号：US20140316782A1

公开(公告)日：2014-10-23

申请号：US13866829

申请日：2013-04-19

申请人： GM GLOBAL TECHNOLOGY OPERATIONS LLC

发明人： Eli TZIRKEL-HANCOCK , Gaurav TALWAR , Xufang ZHAO , Greg T. Lindemann

IPC分类号： G10L15/06 , G10L15/08

CPC分类号： G10L15/06 , G10L15/08 , G10L15/19 , G10L15/22 , G10L2015/086 , G10L2015/221 , H04M3/493 , H04M2201/40

摘要： Methods and systems are provided for managing speech dialog of a speech system. In one embodiment, a method includes: receiving a first utterance from a user of the speech system; determining a first list of possible results from the first utterance, wherein the first list includes at least two elements that each represent a possible result; analyzing the at least two elements of the first list to determine an ambiguity of the elements; and generating a speech prompt to the user based on partial orthography and the ambiguity.

摘要翻译： 提供了用于管理语音系统的语音对话的方法和系统。在一个实施例中，一种方法包括：从语音系统的用户接收第一话语; 确定来自第一话语的可能结果的第一列表，其中第一列表包括每个表示可能结果的至少两个元素; 分析第一列表的至少两个元素以确定元素的模糊性; 以及基于部分正字法和模糊度向用户生成语音提示。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类