摘要:
Some embodiments of the present invention provide a system that infers activity-related context information from a message. Upon receiving the message, the system looks for activity-related keywords in the message, wherein the activity-related keywords are from a content database. If one or more activity-related keywords are found in the message, the system infers message-related context information from the one or more activity-related keywords. Next, the system uses the message-related context information to facilitate recommending an activity to a user.
摘要:
Some embodiments of the present invention provide a system that infers activity-related context information from a message. Upon receiving the message, the system looks for activity-related keywords in the message, wherein the activity-related keywords are from a content database. If one or more activity-related keywords are found in the message, the system infers message-related context information from the one or more activity-related keywords. Next, the system uses the message-related context information to facilitate recommending an activity to a user.
摘要:
One embodiment of the present invention provides a system that detects sensitive content in a document. In doing so, the system receives a document, identifies a set of terms in the document that are candidate sensitive terms, and generates a combination of terms based on the identified terms that is associated with a semantic meaning. Next, the system performs searches through a corpus based on the combination of terms and determines hit counts returned for each term in the combination and for the combination. The system then determines whether the combination of terms is sensitive based on the hit count for the combination and the hit counts for the individual terms in the combination, and generates a result that indicates portions of the document which contain sensitive combinations.
摘要:
One embodiment of the present invention provides a system that detects sensitive content in a document. In doing so, the system receives a document, identifies a set of terms in the document that are candidate sensitive terms, and generates a combination of terms based on the identified terms that is associated with a semantic meaning. Next, the system performs searches through a corpus based on the combination of terms and determines hit counts returned for each term in the combination and for the combination. The system then determines whether the combination of terms is sensitive based on the hit count for the combination and the hit counts for the individual terms in the combination, and generates a result that indicates portions of the document which contain sensitive combinations.
摘要:
One embodiment of the present invention provides a system that recommends activities. During operation, the system receives a piece of content obtained from text or converted to text from speech. The system then analyzes the received content to identify any activity type, indication of willingness to participate in any type of activities, and at least one piece of temporal information, which can be implicitly and/or explicitly stated in the content, and/or one piece of location information associated with the activity type. The system further recommends one or more activities, venues, and/or services that afford or support activities for a user based on the information extracted from the content.
摘要:
Systems and methods are presented for generating snippets from document data within the document and category taxonomies. In some embodiments, the system may receive a document comprising a set of paragraphs and sentences, identify text in the document relating to a set of categories, and score the paragraphs based on a relation between the paragraph and the set of categories to produce a section score. The system determines one or more sentences for inclusion in a snippet based in part on the section score. The system generates a snippet from the sentences determined for inclusion and associates the snippet with the document.
摘要:
A method of automatically generating personalized text for teaching a student to learn to read. Based upon inputs of the students reading ability/level, either from a self assessment or teacher input and input of personal data, the system automatically searches selected libraries and chooses appropriate text and modifies the text for vocabulary and topics of character identification of personal interest to the student. An optional function of previewing by one of the student's teacher, parent or advocate is included. The system generates a local repository of generated text associated with a particular student.
摘要:
A system and method of query expansion are disclosed. A query expansion source, a query expansion candidate, and feature data for the query expansion source and the query expansion candidate are received. The feature data comprises information for a plurality of features. A determination is made as to whether the query expansion candidate qualifies as an expansion of the query expansion source based on an analysis of the information for the plurality of features. The query expansion candidate is assigned as an expanded query of the query expansion source in a query expansion dictionary in response to a determination that the query expansion candidate qualifies as an expansion of the query expansion source.
摘要:
Techniques are provided for detecting entailment and contradiction. Packed knowledge representations for a premise and conclusion text are determined comprising facts about the relationships between concept and/or context denoting terms. Concept and context alignments are performed based on alignments scores. A union is determined. Terms are marked as to their origin and conclusion text terms replaced with by corresponding terms from the premise text. Subsumption and specificity, instantiability, spatio-temporal and relationship based packed rewrite rules are applied in conjunction with the context denoting facts to remove entailed terms and to mark contradictory facts within the union. Entailment is indicated by a lack of any facts from the packed knowledge representation of the conclusion in the union. Entailment and contradiction markers are then displayed.
摘要:
Techniques are provided for detecting entailment and contradiction. Packed knowledge representations for a premise and conclusion text are determined comprising facts about the relationships between concept and/or context denoting terms. Concept and context alignments are performed based on alignments scores. A union is determined. Terms are marked as to their origin and conclusion text terms replaced with by corresponding terms from the premise text. Subsumption and specificity, instantiability, spatio-temporal and relationship based packed rewrite rules are applied in conjunction with the context denoting facts to remove entailed terms and to mark contradictory facts within the union. Entailment is indicated by a lack of any facts from the packed knowledge representation of the conclusion in the union. Entailment and contradiction markers are then displayed.