Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for classifying documents. One of the methods includes obtaining a collection of training documents, the training documents including positive documents identified as being longform documents and negative documents identified as not being longform documents; extracting one or more features from the training documents, wherein the features represent lexical or textual content of the training documents; and generating a longform document classifier trained using feature instances extracted from the training documents, wherein the generated longform document classifier is trained such that input documents are classified as being longform documents or classified as not being longform documents.
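
The training flow described above can be sketched in a few lines. This is a minimal illustration, not the claimed method: the three lexical features (log word count, mean sentence length, type-token ratio) and the nearest-centroid classifier are assumptions chosen for brevity, and the names `extract_features` and `train_centroid_classifier` are hypothetical.

```python
import math
import re


def extract_features(text):
    """Return a small vector of lexical features for one document (assumed feature set)."""
    words = re.findall(r"[A-Za-z']+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    n_words = len(words)
    return [
        math.log1p(n_words),                                   # document length
        n_words / max(len(sentences), 1),                      # mean sentence length
        len({w.lower() for w in words}) / max(n_words, 1),     # type-token ratio
    ]


def train_centroid_classifier(positive_docs, negative_docs):
    """Train on labeled documents and return a function that labels new documents."""
    def centroid(docs):
        rows = [extract_features(d) for d in docs]
        return [sum(col) / len(rows) for col in zip(*rows)]

    pos_centroid = centroid(positive_docs)
    neg_centroid = centroid(negative_docs)

    def classify(text):
        # True means "longform": the document is closer to the positive centroid.
        x = extract_features(text)
        return math.dist(x, pos_centroid) < math.dist(x, neg_centroid)

    return classify


positive = [
    "The committee met for several hours to discuss the proposal in detail. " * 40,
    "Researchers spent a decade following the families across three countries. " * 50,
]
negative = ["Sale ends today.", "Call me back. Thanks."]
is_longform = train_centroid_classifier(positive, negative)
```

Any classifier that accepts feature vectors could replace the centroid rule; the point is only that positive and negative training documents are reduced to feature instances before training.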
Abstract:
Systems and methods are disclosed for using an additive context model for entity disambiguation. An example method may include receiving a span of text from a document and a phrase vector for the span. The phrase vector may have a quantity of features and represent a context for the span. The method also includes determining a quantity of candidate entities from a knowledge base that have been referred to by the span. For each of the quantity of candidate entities, the method may include determining a support score for the candidate entity for each feature in the phrase vector, combining the support scores additively, and computing a probability that the span resolves to the candidate entity given the context. The method may also include resolving the span to a candidate entity with a highest probability.
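
As a rough sketch of the scoring steps above: each candidate receives a support score per feature, the scores are summed, and the sums are turned into probabilities. The abstract does not say how the probability is computed, so the softmax normalization here is an assumption, as are the function name `resolve_span` and the layout of `support_weights`.

```python
import math


def resolve_span(phrase_vector, support_weights, candidates):
    """Resolve a span to the most probable candidate entity.

    phrase_vector: dict mapping feature name -> value (the context of the span).
    support_weights: dict mapping candidate -> {feature name -> support weight}
        (hypothetical structure; a real system would learn these).
    """
    totals = {}
    for candidate in candidates:
        weights = support_weights.get(candidate, {})
        # One support score per feature, combined additively.
        totals[candidate] = sum(
            value * weights.get(feature, 0.0)
            for feature, value in phrase_vector.items()
        )

    # Assumed normalization: softmax over the additive scores.
    top = max(totals.values())
    exps = {c: math.exp(s - top) for c, s in totals.items()}
    norm = sum(exps.values())
    probs = {c: e / norm for c, e in exps.items()}

    best = max(probs, key=probs.get)
    return best, probs


context = {"river": 1.0, "water": 1.0}
weights = {
    "Bank (river)": {"river": 2.0, "water": 1.0},
    "Bank (finance)": {"money": 2.0},
}
best, probs = resolve_span(context, weights, ["Bank (river)", "Bank (finance)"])
```

Because the per-feature scores are simply added, each feature's contribution to the final decision can be inspected on its own.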
Abstract:
Methods and apparatus are described herein for identifying tasks in messages. In various implementations, natural language processing may be performed on a received message to generate an annotated message. The annotated message may be analyzed pursuant to a grammar. A portion of the message may be classified as a user task entry based on the analysis of the annotated message.
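
The annotate-then-parse pipeline can be illustrated with a toy example. Everything here is an assumption made for illustration: the tiny verb lexicon stands in for real natural language processing, and the grammar is a single regular expression over tag sequences rather than whatever grammar the described implementations use. The names `annotate`, `TASK_GRAMMAR`, and `find_task_entries` are hypothetical.

```python
import re

# Hypothetical lexicon standing in for a real part-of-speech tagger.
ACTION_VERBS = {"send", "review", "call", "schedule", "book", "update"}
MODALS = {"can", "could", "will", "would"}


def annotate(sentence):
    """Tag each token with a coarse label, producing an annotated sentence."""
    annotated = []
    for token in re.findall(r"[A-Za-z']+", sentence.lower()):
        if token in ACTION_VERBS:
            tag = "VERB"
        elif token == "please":
            tag = "POLITE"
        elif token in MODALS:
            tag = "MODAL"
        elif token == "you":
            tag = "YOU"
        else:
            tag = "OTHER"
        annotated.append((token, tag))
    return annotated


# Toy grammar: an optional "please", an optional "could you (please)",
# then an action verb at the start of the sentence.
TASK_GRAMMAR = re.compile(r"^(POLITE )?(MODAL YOU (POLITE )?)?VERB( |$)")


def find_task_entries(message):
    """Return the sentences of a message that the grammar classifies as tasks."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", message) if s.strip()]
    tasks = []
    for sentence in sentences:
        tag_string = " ".join(tag for _, tag in annotate(sentence))
        if TASK_GRAMMAR.match(tag_string):
            tasks.append(sentence)
    return tasks


message = (
    "Hi Sam. Could you please send the report by Friday? "
    "The weather was nice. Please review my draft."
)
tasks = find_task_entries(message)
```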
Abstract:
A computer-implemented technique can receive a plurality of photos and automatically select a subset of the plurality of photos having a high degree of representativeness by jointly maximizing both photo quality and photo diversity to obtain a photo album. The technique can determine one or more clusters for the photo album using a hierarchical clustering algorithm, and store the photo album according to the one or more clusters. The technique can control the manner in which the photo album is displayed using the one or more clusters. The technique can adjust at least one of the one or more clusters and the automatic photo album generation based on user input. The user input can include at least one of adding, deleting, and moving a photo with respect to the one or more clusters. The technique can then re-cluster, automatically generate a new photo album, and/or adjust the presentation.
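
One way to picture jointly maximizing quality and diversity is a greedy selection that rewards each photo for its own quality plus its distance from photos already chosen. This is a sketch under stated assumptions: the abstract does not specify the optimization, so the greedy rule, the `diversity_weight` parameter, and the `(photo_id, quality, feature_vector)` input layout are all hypothetical, and the clustering step is not shown.

```python
import math


def select_album(photos, k, diversity_weight=1.0):
    """Greedily pick up to k photo ids, trading off quality against redundancy.

    photos: list of (photo_id, quality, feature_vector) tuples (assumed layout).
    """
    selected = []
    remaining = list(photos)

    def gain(photo):
        _, quality, vector = photo
        if not selected:
            return quality
        # Diversity term: distance to the closest photo already in the album.
        nearest = min(math.dist(vector, chosen[2]) for chosen in selected)
        return quality + diversity_weight * nearest

    while remaining and len(selected) < k:
        best = max(remaining, key=gain)
        selected.append(best)
        remaining.remove(best)

    return [photo_id for photo_id, _, _ in selected]


photos = [
    ("a", 0.90, (0.0, 0.0)),
    ("b", 0.85, (0.1, 0.0)),   # near-duplicate of "a"
    ("c", 0.60, (5.0, 5.0)),   # lower quality but visually distinct
]
album = select_album(photos, k=2)
```

With `diversity_weight=0.0` the selection reduces to picking the highest-quality photos, which would keep both near-duplicates.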
Abstract:
The disclosure includes a system and method for generating audio snippets from a subset of audio tracks. In some embodiments, an audio snippet is an audio summary of a group or collection of songs.
Abstract:
Methods and apparatus related to associating a segment of an electronic message with one or more segment addressees. One or more message addressees of an electronic message may be identified, the one or more message addressees identifying at least one recipient of the electronic message. A segment of the electronic message may be identified via one or more processors. One or more segment addressees may be determined from the at least one recipient, the one or more segment addressees identifying an addressee for the identified segment. One or more aspects of the segment may be associated with the one or more segment addressees. An indication pertaining to the one or more aspects of the segment may be provided to the one or more segment addressees.
Abstract:
A method for annotating a message executes at a computing device having one or more processors and memory. The memory stores one or more programs configured for execution by the one or more processors. A plurality of messages for a user is analyzed to determine whether a task reminder is derivable for any of the messages. In this way, task reminders are derived for at least a subset of the messages. A user interface for an electronic messaging application is provided. The interface includes a list of objects, one or more of which represents a collapsed state of a message in the message subset and comprises a summary, a task reminder, and a dismissal affordance. Responsive to selection of the dismissal affordance, the task reminder and the dismissal affordance are removed from the object.
Abstract:
Methods, systems, and apparatus for: obtaining a resource; identifying a first portion of text of the resource that is characterized as a question, and a second portion of text of the resource that is characterized as an answer to the question; identifying an entity that is referenced by one or more terms of the text that is characterized as the question, a relationship type that is referenced by one or more other terms of the text that is characterized as the question, and an entity that is referenced by the text that is characterized as the answer to the question; and adjusting a score for a relationship of the relationship type between the entity that is referenced by the one or more terms of the text that is characterized as the question and the entity that is referenced by the text that is characterized as the answer to the question.