Abstract:
The present invention discloses a number-assistant voice input system, a number-assistant voice input method for a voice input system, and a number-assistant voice correcting method for a voice input system, which apply software to drive a voice input system of an electronic device to provide a voice input logic circuit module. The voice input logic circuit module defines the pronunciations of the numbers 1 to 26 as the paths for inputting the letters A to Z, respectively, in the voice input system, and allows users to selectively input or correct a letter by reading a number from 1 to 26 instead of a letter from A to Z.
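The number-to-letter mapping described above can be illustrated with a minimal Python sketch. It assumes the speech recognizer has already turned the spoken numbers into integers; the names `NUMBER_TO_LETTER`, `letters_from_numbers`, and `correct_letter` are hypothetical and do not come from the abstract.

```python
import string

# Hypothetical lookup table: a recognized number 1..26 selects the letter A..Z.
NUMBER_TO_LETTER = {i: ch for i, ch in enumerate(string.ascii_uppercase, start=1)}


def letters_from_numbers(numbers):
    """Convert already-recognized numbers (1-26) into the letters they stand for."""
    out = []
    for n in numbers:
        if n not in NUMBER_TO_LETTER:
            raise ValueError(f"{n} is outside the 1-26 range")
        out.append(NUMBER_TO_LETTER[n])
    return "".join(out)


def correct_letter(text, position, number):
    """Replace the letter at `position` (0-based) with the one selected by `number`."""
    if not 0 <= position < len(text):
        raise IndexError("position out of range")
    return text[:position] + letters_from_numbers([number]) + text[position + 1:]
```

For example, reading "three, one, twenty" would select C, A, and T under this mapping, and reading "two" for the first position would then correct the result to "BAT".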
Abstract:
A system and method for non-speech input or keypad-aided word and spelling recognition is disclosed. The method includes generating an unweighted grammar, selecting a database of words, generating a weighted grammar using the unweighted grammar and a statistical letter model trained on the database of words, receiving speech from a user after receiving the non-speech input and after generating the weighted grammar, and performing automatic speech recognition on the speech and non-speech input using the weighted grammar. If a confidence is below a predetermined level, then the method includes receiving non-speech input from the user, disambiguating possible spellings by generating a letter lattice based on a user input modality, and constraining the letter lattice and generating a new letter string of possible word spellings until a letter string is correctly recognized.
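One way to picture the keypad-constrained letter lattice is the rough Python sketch below. It is not the method claimed in the abstract: the speech recognition step is left out entirely, and a simple unnormalized letter-bigram score stands in for the statistical letter model. The keypad layout and all function names are assumptions made for illustration.

```python
from collections import Counter
from itertools import product
import math

# Assumed telephone keypad layout; each key press constrains one lattice position.
KEYPAD = {"2": "abc", "3": "def", "4": "ghi", "5": "jkl",
          "6": "mno", "7": "pqrs", "8": "tuv", "9": "wxyz"}


def train_bigrams(words):
    """Count letter bigrams (with start/end markers) over a word database."""
    counts = Counter()
    for w in words:
        padded = "^" + w + "$"
        counts.update(zip(padded, padded[1:]))
    return counts


def score(candidate, counts):
    """Unnormalized log score; add-one keeps unseen bigrams at zero, not -inf."""
    padded = "^" + candidate + "$"
    return sum(math.log(counts[pair] + 1) for pair in zip(padded, padded[1:]))


def rank_spellings(keys, counts):
    """Expand the key sequence into a letter lattice and rank every path."""
    lattice = [KEYPAD[k] for k in keys]
    candidates = ("".join(path) for path in product(*lattice))
    return sorted(candidates, key=lambda c: score(c, counts), reverse=True)
```

Enumerating every path is only practical for short key sequences; a real system would search the lattice rather than expand it in full.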
Abstract:
Speech recognition apparatus includes means for determining when a speaker desires to spell a first word. The speaker may then say a sequence of words selected from a large vocabulary without being restricted to a pre-specified phonetic alphabet. The apparatus recognizes the spoken words, associates letters with these words and then arranges the letters to form the first word. The speaker may also indicate a desire to stop phonetic spelling. Apparatus may also be used for selecting items from a list.
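The spelling behavior described above can be sketched in Python as follows. The abstract does not say how letters are associated with the spoken words, so this sketch assumes the initial letter of each word is used. The `begin-spelling` and `end-spelling` markers are hypothetical stand-ins for whatever signals the start and end of spelling.

```python
def spell_from_words(recognized_words):
    """Build a word from the initial letter of each recognized word (assumption)."""
    letters = []
    for word in recognized_words:
        word = word.strip()
        if not word or not word[0].isalpha():
            raise ValueError(f"cannot derive a letter from {word!r}")
        letters.append(word[0].upper())
    return "".join(letters)


def transcribe(tokens, start="begin-spelling", stop="end-spelling"):
    """Pass ordinary words through and collapse spelled spans into one word."""
    out, buffer, spelling = [], [], False
    for tok in tokens:
        if tok == start:
            spelling, buffer = True, []
        elif tok == stop:
            if spelling and buffer:
                out.append(spell_from_words(buffer))
            spelling, buffer = False, []
        elif spelling:
            buffer.append(tok)
        else:
            out.append(tok)
    if spelling and buffer:  # input ended while still in spelling mode
        out.append(spell_from_words(buffer))
    return out
```

Because any word can stand for its first letter in this sketch, "kilo india mike" and "kitchen igloo monday" would both yield "KIM".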
Abstract:
A text editor is connected to a speech recognizing unit for editing preferably spoken input text using a display speech. For each text word (including digits) and each punctuation mark that can be recognized and is contained in a dictionary, a token is stored for holding information on character count, capitalization, and left and right concatenation of the respective item, and for providing fields for context conditions. For each segment or entity of recognized spoken text, a respective character string and associated token are transferred to storage in the editor to allow automatic formatting and correct displaying or printing of the text, including spaces and capitalization where required. Tokens are updated during editing to reflect modifications such as in the beginning of a sentence or in concatenation. Switching to spelling mode is provided for entering single spelled characters in cases where a word cannot be recognized or where spelling is desired.
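A per-item token that drives spacing and capitalization might look like the Python sketch below. The field names (`join_left`, `join_right`, `capitalize_next`) and the rendering rules are assumptions chosen for illustration. The abstract names only the kinds of information a token holds, and its context-condition fields are not modeled here.

```python
from dataclasses import dataclass


@dataclass
class Token:
    text: str
    join_left: bool = False        # no space before this item (e.g. "," or ".")
    join_right: bool = False       # no space after this item (e.g. "(")
    capitalize_next: bool = False  # the next word starts a sentence

    @property
    def char_count(self):
        return len(self.text)


def render(tokens):
    """Join items into display text, adding spaces and capitals where required."""
    out = []
    capitalize = True      # the first word of the text starts a sentence
    suppress_space = True  # no leading space at the very beginning
    for tok in tokens:
        text = tok.text
        if capitalize and text[:1].isalpha():
            text = text[0].upper() + text[1:]
            capitalize = False
        if not (suppress_space or tok.join_left):
            out.append(" ")
        out.append(text)
        suppress_space = tok.join_right
        if tok.capitalize_next:
            capitalize = True
    return "".join(out)
```

Keeping the formatting hints on the token rather than in the text means an edit, such as deleting a period, only requires updating the neighboring tokens before rendering again.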
Abstract:
Systems and methods for correcting recognition errors in speech recognition systems are disclosed herein. Natural conversational variations are identified to determine whether a query intends to correct a speech recognition error or whether the query is a new command. When the query intends to correct a speech recognition error, the system identifies a location of the error and performs the correction. The corrected query can be presented to the user or be acted upon as a command for the system.
Abstract:
Aspects relate to apparatuses and methods for selectively inserting text into a video resume. An exemplary apparatus includes a processor and a memory communicatively connected to the processor, the memory containing instructions configuring the processor to receive a video resume from a user, divide the video resume into temporal sections, acquire a plurality of textual inputs from a user, wherein the plurality of textual inputs pertains to the same user of the received video resume, classify the plurality of textual inputs to corresponding temporal sections of the received video resume, and display, as a function of the classification, the received video resume with a corresponding plurality of textual inputs.
Abstract:
A method and device for providing voice command operation in a passenger vehicle cabin having multiple occupants are disclosed. The method and device operate to monitor microphone data relating to voice commands within a vehicle cabin and determine whether the microphone data includes wake-up-word data. When the wake-up-word data relates to more than one of a plurality of vehicle cabin zones and more than one wake-up-word is coincident, the method and device operate to monitor respective microphone data for voice command data from each of the more than one of the respective ones of the plurality of vehicle cabin zones. Upon detection, the voice command data may be processed to produce respective vehicle device commands, and the vehicle device command(s) can be transmitted to effect the voice command data.
Abstract:
A method of creating an animated image based on a key input, and a user terminal for performing the method are provided. The method includes acquiring a snapshot image using a camera installed in a user terminal every time a key is input to the user terminal, and creating an animated image by merging the acquired snapshot image with the input key.
Abstract:
In accordance with alphabet input method information for each user, a word formed of an alphabet string is registered in a word dictionary in a state where “dotto” is added before each letter and one letter of each set of letters that are difficult to distinguish from each other, such as “M and N” and “B and P”, is repeated twice. For example, the word “PAM” and a time-series feature corresponding to “dotto P P dotto A dotto M” are registered in association with each other. When a user performs a speech input of “PAM”, the user utters “dotto P P dotto A dotto M” in accordance with the user's alphabet input method information. Speech recognition is performed on this sound data using the word dictionary corresponding to the user's alphabet input method information.
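The registration rule above can be sketched in Python as follows. The sketch works on text only, so the time-series acoustic features are replaced by the spoken string itself. The doubled set is an assumption: "P" follows the "PAM" example, while "N" is inferred from the fact that "M" is not doubled there. The function names are hypothetical.

```python
def spoken_form(word, prefix="dotto", doubled=frozenset({"P", "N"})):
    """Return the utterance a user would speak to enter `word` letter by letter."""
    parts = []
    for letter in word.upper():
        parts.append(prefix)
        parts.append(letter)
        if letter in doubled:  # repeat the letter chosen from a confusable pair
            parts.append(letter)
    return " ".join(parts)


def build_dictionary(words, **rules):
    """Map each spoken form back to its word, per one user's input method."""
    return {spoken_form(w, **rules): w for w in words}
```

With these defaults, "PAM" and "PAN" get the distinct entries "dotto P P dotto A dotto M" and "dotto P P dotto A dotto N N", which is what lets the easily confused final letters be told apart.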
Abstract:
Methods and systems are provided for managing speech dialog of a speech system. In one embodiment, a method includes: receiving a first utterance from a user of the speech system; determining a first list of possible results from the first utterance, wherein the first list includes at least two elements that each represent a possible result; analyzing the at least two elements of the first list to determine an ambiguity of the elements; and generating a speech prompt to the user based on partial orthography and the ambiguity.