摘要:
A likely source language of a media item can be identified by attempting an initial language identification of the media item based on intrinsic or extrinsic factors, such as words in the media item and languages known by the media item author. This initial identification can generate a list of most likely source languages with corresponding likelihood factors. Translations can then be performed presuming each of the most likely source languages. The translations can be performed for multiple output languages. Each resulting translation can receive a corresponding score based on a number of factors. The scores can be combined where they have a common source language. These combined scores can be used to weight the previously identified likelihood factors for the source languages of the media item.
摘要:
When a French character is selected while an English character string candidate is presented, a mobile terminal presents English and French character string candidates by using an English/French mixed candidate dictionary. Furthermore, when a French character is selected while English and French character string candidates are presented by using the English/French mixed candidate dictionary, the mobile terminal presents a French character string candidate by using a French candidate dictionary. Moreover, when a character other than French is selected while a French character string candidate is presented by using the French candidate dictionary, the mobile terminal presents an English character string candidate by using an English candidate dictionary. The technique can also be applied to one and two byte Katakana characters.
摘要:
An automatic speech recognition (ASR) device may be configured to predict pronunciations of textual identifiers (for example, song names, etc.) based on predicting one or more languages of origin of the textual identifier. The one or more languages of origin may be determined based on the textual identifier. The pronunciations may include a hybrid pronunciation including a pronunciation in one language, a pronunciation in a second language and a hybrid pronunciation that combines multiple languages. The pronunciations may be added to a lexicon and matched to the content item (e.g., song) and/or textual identifier. The ASR device may receive a spoken utterance from a user requesting the ASR device to access the content item. The ASR device determines whether the spoken utterance matches one of the pronunciations of the content item in the lexicon. The ASR device then accesses the content when the spoken utterance matches one of the potential textual identifier pronunciations.
摘要:
L'invention se rapporte à un procédé, mis en oeuvre par l'intermédiaire d'un dispositif électronique, de détermination d'une différence entre un contenu à optimiser (CntO) et un ensemble de contenus de référence (Cnt1,..., CntN), lesdits contenus de référence étant réunis au sein d'un corpus (COR). Un tel procédé comprend : la création (20) de champs contextuels (ChTxt0, ChTxt1,...,ChTxtN, ChtTxtX), répartis en un champ contextuel (ChTxt0) du contenu à optimiser (CntO), au moins un champ contextuel (ChTxt1,...,ChTxtN) correspondant à au moins un contenu de référence (Cnt1,..., CntN) et un champ contextuel global (ChtTxtX) obtenu à partir d'au moins un contenu de référence (Cnt1,..., CntN) réuni au sein d'un contenu global, chaque champ contextuel (ChTxt0, ChTxt1,...,ChTxtN, ChtTxtX) comprenant au moins un ensemble de sommets deux-à-deux adjacents dénommé clique, chaque sommet comprenant un mot et chaque relation entre mot étant représentative d'une signification ; le calcul (30) d'un gap sémantique (GapS) sur la base des champs contextuels (ChTxt0, ChTxt1,...,ChTxtN, ChtTxtX) créés.
摘要:
Systems, methods, and products for language learning that may extract text from various resources having text, using various natural-language processing features, which can be combined with custom-designed learning activities to offer a needs-based, adaptive learning methodology. The system may receive a resource, extract keywords pedagogically valuable to non-native language learning and academic exercises. Metadata describing various aspects of resources from which keywords are extracted may be associated with keywords. Metadata describing various aspects of keywords may also be associated with keywords. Extracted keywords may be stored into a keyword store along with any metadata associated with keywords.
摘要:
Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments may enable multi-lingual communications through different modes of communications including, for example, Internet-based chat, e-mail, text-based mobile phone communications, postings to online forums, postings to online social media services, and the like. Certain embodiments may implement communications systems and methods that translate text between two or more languages (e.g., spoken), while handling/accommodating for one or more of the following in the text: specialized/domain-related jargon, abbreviations, acronyms, proper nouns, common nouns, diminutives, colloquial words or phrases, and profane words or phrases.
摘要:
A diacritic chording system uses a single standard keyboard layout to generate diacritical characters used, for example, by Latin-based languages or languages based on a Roman character set, providing a universal keyboard. The key combination required to select a diacritic is consistent, independent of the language the user is typing. The diacritic chording system uses chording to obtain characters modified by diacritics, i.e., diacritical characters. The key combinations in the diacritic chord are chosen in such a way to aid memorization by positional association of the keys or by logic association of the keys within the diacritic chord. The use of a combination of positional association and logical association eliminates conflicts in mapping diacritics for different languages. The diacritic chording system also provides a screen overlay as a mnemonic to illustrate which diacritic chords provide the desired diacritical character.