Abstract:
Methods and apparatus are described herein for identifying tasks in messages. In various implementations, natural language processing may be performed on a received message to generate an annotated message. The annotated message may be analyzed pursuant to a grammar. A portion of the message may be classified as a user task entry based on the analysis of the annotated message.
Abstract:
A computer-implemented technique can include obtaining a training corpus including pairs of (i) documents and (ii) corresponding abstracts. The technique can include identifying a set of entity mentions in each abstract and each corresponding document based on their respective part-of-speech (POS) tags and dependency parses. The technique can include clustering the sets of entity mentions referring to a same underlying entity to obtain clusters for each document and each corresponding abstract. The technique can include aligning specific abstract entity mentions to corresponding document entity mentions to obtain a set of aligned abstract and document entities. The technique can include labeling the set of aligned entities as salient and unaligned entities as non-salient to generate a labeled corpus. The technique can also include training features of a classifier using the labeled corpus to obtain a trained classifier.
Abstract:
Implementations include systems and methods generate data for training or evaluating semantic analysis engines. For example, a method may include receiving documents from a corpus that includes an authoritative set of documents from an authoritative source. Each document in the authoritative set may be associated with an entity. A second set of documents from the corpus that do not overlap with the first set may include at least one link to a document in the authoritative set, the at least one link being associated with anchor text. For each document in the second set, the method may include identifying entity mentions in the document based on the anchor text. The method may include associating the entity mention with the entity in a graph-structured knowledge base or associating entity types with the entity mention. The method may also include training a semantic analysis engine using the identified entity mentions and associations.
Abstract:
Implementations include systems and methods for querying a data graph. An example method includes receiving a machine learning module trained to produce a model with multiple features for a query, each feature representing a path in a data graph. The method also includes receiving a search query that includes a first search term, mapping the search query to the query, and mapping the first search term to a first entity in the data graph. The method may also include identifying a second entity in the data graph using the first entity and at least one of the multiple weighted features, and providing information relating to the second entity in a response to the search query. Some implementations may also include training the machine learning module by, for example, generating positive and negative training examples from an answer to a query.
Abstract:
Methods and apparatus are described herein for identifying tasks in messages. In various implementations, natural language processing may be performed on a received message to generate an annotated message. The annotated message may be analyzed pursuant to a grammar. A portion of the message may be classified as a user task entry based on the analysis of the annotated message.
Abstract:
Methods, systems, and apparatus for obtaining a resource, identifying a first portion of text of the resource that is characterized as a question, and a second part of text of the resource that is characterized as an answer to the question, identifying an entity that is referenced by one or more terms of the text that is characterized as the question, a relationship type that is referenced by one or more other terms of the text that is characterized as the question, and an entity that is referenced by the text that is characterized as the answer to the question, and adjusting a score for a relationship of the relationship type for the entity that is referenced by the one or more terms of the text that is characterized as the question and the entity that is referenced by the text that is characterized as the answer to the question.
Abstract:
Methods and apparatus are described herein for identifying tasks in messages. In various implementations, natural language processing may be performed on a received message to generate an annotated message. The annotated message may be analyzed pursuant to a grammar. A portion of the message may be classified as a user task entry based on the analysis of the annotated message.
Abstract:
Methods and apparatus related to associating a segment of an electronic message with one or more segment addressees. One or more message addressees of an electronic message may be identified, the one or more message addressees identifying at least one recipient of the electronic message. A segment of the electronic message may be identified via one or more processors. One or more segment addressees may be determined from the at least one recipient, the one or more segment addressees identifying an addressee for the identified segment. One or more aspects of the segment may be associated with the one or more segment addressees. An indication pertaining to the one or more aspects of the segment may be provided to the one or more segment addressees.
Abstract:
A computer-implemented technique can receive, at a computing device including one or more processors, a plurality of photos. The technique can extract quality features and similarity features for each of the plurality of photos and can obtain weights for the various quality features and similarity features based on an analysis of a reference photo collection. The technique can generate a quality metric for each of the plurality of photos and can generate a similarity matrix for the plurality of photos by analyzing the various quality features and similarity features and using the obtained weights. The technique can perform joint global maximization of photo quality and photo diversity using the quality metrics and the similarity matrix in order to select a subset of the plurality of photos having a high degree of representativeness. The technique can then store the subset of the plurality of photos in a memory.
Abstract:
A computer-implemented technique can receive a plurality of photos and automatically select a subset of the plurality of photos having a high degree of representativeness by jointly maximizing both photo quality and photo diversity to obtain a photo album. The technique can determine one or more clusters for the photo album using a hierarchical clustering algorithm, and store the photo album according to the one or more clusters. The technique can control the manner in which the photo album is displayed using the one or more clusters. The technique can adjust at least one of the one or more clusters and the automatic photo album generation based on user input. The user input can include at least one of adding, deleting, and moving a photo with respect to the one or more clusters. The technique can then re-cluster, automatically generate a new photo album, and/or adjust the presentation.