Abstract:
The invention has provided an asynchronous machine translation to standardize and simplify translation process as well as improve translation quality. The invention divides the process of translating from language A to language B into two steps, each of which may be independently developed. Step 1 : translating language A to values stored by multipurpose language data storage method (intermediate data) Step 2: converting intermediate data into any language B, exported data may be text, sound, image, sign... The intermediate data enables to exterminate synonyms. Division of the translation process into two independent steps helps to decrease difficulty of translation between two languages (changing from multiplication to addition) and simultaneously translate multi-languages, independently develop new languages and reduce translation on a device to various devices.
Abstract:
The invention relates to a system and method for integrating domain information into state transitions of a Finite State Transducer ("FST") for natural language processing. A system may integrate semantic parsing and information retrieval from an information domain to generate an FST parser that represents the information domain. The FST parser may include a plurality of FST paths, at least one of which may be used to generate a meaning representation from a natural language input. As such, the system may perform domain-based semantic parsing of a natural language input, generating more robust meaning representations using domain information. The system may be applied to a wide range of natural language applications that use natural language input from a user such as, for example, natural language interfaces to computing systems, communication with robots in natural language, personalized digital assistants, question-answer query systems, and/or other natural language processing applications.
Abstract:
Systems and methods are disclosed for determining whether a short amount of text is irrelevant. A social networking server (204) includes a comment extractor (512) configured to extract one or more comments associated with a selected article. Depending on the number of comments associated with the selected article, the social networking server (204) constructs a native context and/or a transferred context. A native context feature generator (516) is configured to determine a native context feature from the constructed native context and a transferred context feature generator (518) is configured to determine a transferred context feature from the constructed transferred context. The context features (530) may be leveraged by a comment classifier (520) to determine whether a given comment is irrelevant.
Abstract:
This document provides systems and methods for altering text presentation to increase reading accuracy, efficiency, and retention. This can include identification text specific attributes from machine readable text (through parsing of the text), varying the text presentation in accordance with the attributes, and creating an enhanced visual product for enhancing the reading experience. For example, a computer system can extract attributes such as parts of speech from an input sentence and display that sentence in cascading text segments down and across a display screen. The system can further use domain-specific dictionaries derived from domain-specific texts to identify domain-specific compound noun phrases and verb phrases that require specific linguistic tagging to be usable in other linguistic analysis steps.
Abstract:
System, method and apparatuses directed to a paradigm of manuscript generation and manipulation combined with contemporaneous or simultaneous visualization of the text or other media being entered by the creator. Through respective panels or interfaces the creator may manipulate a work or manuscript while visualizing the effects desired.
Abstract:
Technologies for extensible, context-aware natural language interactions include a computing device having a number of context source modules. Context source modules may be developed or installed after deployment of the computing device to a user. Each context source module includes a context capture module, a language model, one or more database query mappings, and may include one or more user interface element mappings. The context capture module interprets, generates, and stores context data. A virtual personal assistant (VPA) of the computing device indexes the language models and generates a semantic representation of a user request that associates each word of the request to a language model. The VPA translates the user request into a database query, and may generate a user interface element for the request. The VPA may execute locally on the computing device or remotely on a cloud server. Other embodiments are described and claimed.
Abstract:
The present disclosure is related to a method and an apparatus of mining synonymous phrases. The method comprises: obtaining, according to a parallel text corpus, a first phrase-alignment relationship from phrases of a current language to phrases of an intermediate language, and a second phrase-alignment relationship from the phrases of the intermediate language to the phrases of the current language; obtaining, for a target phrase of current language, a first set of aligned phrases of the intermediate language that are aligned with the target phrase of the current language based on the first phrase-alignment relationship; obtaining a second set of aligned phrases of the current language that are aligned with selected phrase(s) in the first set of aligned phrases based on the second phrase-alignment relationship; and obtaining synonymous phrases for the target phrase from the second set of aligned phrases.
Abstract:
The present disclosure is related to a method and an apparatus of mining synonymous phrases. The method comprises: obtaining, according to a parallel text corpus, a first phrase-alignment relationship from phrases of a current language to phrases of an intermediate language, and a second phrase-alignment relationship from the phrases of the intermediate language to the phrases of the current language; obtaining, for a target phrase of current language, a first set of aligned phrases of the intermediate language that are aligned with the target phrase of the current language based on the first phrase-alignment relationship; obtaining a second set of aligned phrases of the current language that are aligned with selected phrase(s) in the first set of aligned phrases based on the second phrase-alignment relationship; and obtaining synonymous phrases for the target phrase from the second set of aligned phrases.
Abstract:
An approach is provided to detect and hide spoiler information. In the approach, potential spoiler content included user text entries submitted to a collaborative environment are automatically detected. The system inhibits display of the potential spoiler content from the collaborative environment in response to the detection.
Abstract:
A computer-implemented method is performed at a device having one or more processors and memory storing programs executed by the one or more processors. The method comprises: selecting a target word in a target sentence; from the target sentence, acquiring a first sequence of words that precede the target word and a second sequence of words that succeed the target word; from a sentence database, searching and acquiring a group of words, each of which separates the first sequence of words from the second sequence of words in a sentence; creating a candidate sentence for each of the candidate words by replacing the target word in the target sentence with each of the candidate words; determining the fittest sentence among the candidate sentences according to a linguistic model; and suggesting the candidate word within the fittest sentence as a correction.