Abstract:
A method, system, and computer program product are provided for scoring crowd-computing inputs. A group of data is provided to crowd-computing participants, and the participants are requested to provide candidate members of the group of data. The computer-implemented method includes: receiving an input from a participant, wherein the input is a candidate member; counting multiple inputs of the same candidate member by participants; validating a candidate member; rewarding the participants who input the candidate member, with a higher reward for participants who input the candidate member earlier than other participants; and supplying the rewards to participants once the candidate member has been validated.
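The reward scheme in this abstract can be illustrated with a minimal Python sketch. The abstract does not say how rewards shrink for later submitters or how validation is decided, so the `base_reward` and `decay` parameters, the class name `CandidateScorer`, and the externally triggered `validate` call are all assumptions made for illustration.

```python
from collections import defaultdict


class CandidateScorer:
    """Tracks who submitted each candidate and pays out on validation."""

    def __init__(self, base_reward=100.0, decay=0.5):
        self.base_reward = base_reward  # hypothetical reward for the first submitter
        self.decay = decay              # hypothetical factor applied per later submitter
        self.submissions = defaultdict(list)  # candidate -> participants, in arrival order
        self.balances = defaultdict(float)    # participant -> accumulated reward
        self.validated = set()

    def receive(self, participant, candidate):
        # Record each participant at most once per candidate.
        if participant not in self.submissions[candidate]:
            self.submissions[candidate].append(participant)

    def count(self, candidate):
        # Number of distinct participants who input this candidate.
        return len(self.submissions.get(candidate, []))

    def validate(self, candidate):
        # Rewards are supplied only once, when the candidate is validated.
        if candidate in self.validated:
            return
        self.validated.add(candidate)
        for rank, participant in enumerate(self.submissions.get(candidate, [])):
            # Earlier submitters (lower rank) receive a higher reward.
            self.balances[participant] += self.base_reward * self.decay ** rank
```

A geometric decay is only one way to favor earlier participants; any reward that decreases with arrival order would fit the description equally well.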
Abstract:
A method for gathering and recording temporal information for a linked entity, the method including identifying a link-related activity within a linked source entity, and recording a time stamp in association with the link-related activity.
Abstract:
A query difficulty prediction unit includes a query difficulty predictor to determine the extent of overlap between query documents received from a search engine operating on an input query and sub-query documents received from the search engine operating on sub-queries of the input query. The unit generates a query difficulty prediction from the extent of overlap.
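The overlap idea can be sketched in a few lines of Python. The abstract does not specify how sub-queries are formed or how overlap is turned into a prediction, so this sketch assumes that each sub-query is a single query term, that `search` is a hypothetical callable returning a ranked list of document IDs, and that the prediction is simply one minus the mean overlap.

```python
def overlap_scores(query, search, top_k=10):
    """Fraction of the full query's top documents also returned by each sub-query."""
    full = set(search(query)[:top_k])
    overlaps = []
    for term in query.split():  # assumption: one sub-query per term
        sub = set(search(term)[:top_k])
        overlaps.append(len(full & sub) / len(full) if full else 0.0)
    return overlaps


def predict_difficulty(query, search, top_k=10):
    """Placeholder mapping (an assumption): low overlap means a harder query."""
    overlaps = overlap_scores(query, search, top_k)
    if not overlaps:
        return 1.0
    return 1.0 - sum(overlaps) / len(overlaps)
```

In practice the overlap values would more likely serve as features for a trained predictor; the linear mapping here only keeps the example short.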
Abstract:
Disclosed is a system architecture, components and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators and various adapters. The searching technique is a two-level technique. A search query includes a search operator consisting of a plurality of search sub-expressions, each having an associated weight value. The search engine returns a document or documents having a weight value sum that exceeds a threshold weight value sum. The search operator is implemented as a Boolean predicate that functions as a Weighted AND (WAND).
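The WAND predicate itself is small enough to sketch in Python. The function name `wand` and its argument layout are hypothetical. The comparison is taken here as "meets or exceeds" the threshold, which is an assumption; with that reading, unit weights and a threshold equal to the number of sub-expressions behave like AND, and a threshold of 1 behaves like OR.

```python
def wand(matches, weights, threshold):
    """Weighted AND: true when the weights of the matching sub-expressions
    sum to at least the threshold.

    matches   -- one boolean per sub-expression (did it match the document?)
    weights   -- the weight associated with each sub-expression
    threshold -- the threshold weight value sum
    """
    total = sum(w for matched, w in zip(matches, weights) if matched)
    return total >= threshold
```

Thresholds between those two extremes give a continuum between strict conjunction and disjunction, which is what makes the operator useful as a first-level filter.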
Abstract:
A method for information retrieval with unified search between heterogeneous objects includes indexing a first object as a document in a search index; referencing a second object related to the first object in a facet of the document; and storing a relationship strength between the first and second objects in the facet of the document in the search index. Multiple heterogeneous objects can be related to the first object and referenced in multiple facets of the document, each with its relationship strength to the first object. Scoring an indirect object by indirect relation to a query object can be carried out by aggregating the relationship strengths between the indirect object and the retrieved objects multiplied by the retrieved objects' direct scores of relationship strength to the query object.
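The indirect scoring step can be written out as a short Python sketch. The data layout is an assumption: `retrieved` maps each retrieved document to its direct score for the query, and `facets` maps each document to its facets, each facet holding related objects with their relationship strengths. The function name `score_indirect` is hypothetical.

```python
from collections import defaultdict


def score_indirect(retrieved, facets, facet_name):
    """Score objects referenced in a facet of the retrieved documents.

    Each indirect object's score is the sum, over retrieved documents, of
    (direct score of the document) * (relationship strength to the object).
    """
    scores = defaultdict(float)
    for doc_id, direct_score in retrieved.items():
        related = facets.get(doc_id, {}).get(facet_name, {})
        for related_id, strength in related.items():
            scores[related_id] += direct_score * strength
    return dict(scores)
```

For example, with documents indexed as the first objects and authors stored in an `"author"` facet, a text query over documents would yield a ranked list of authors.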
Abstract:
A method and system for searching numerical terms are provided. The method includes processing a numerical term to a stem, the stem being in the form of a number, and a distance measurement of the numerical term to the stem, wherein a numerical term is a string of characters identified as a number by a numeric parser, and indexing the numerical term in a search index by the stem for search retrieval. The method includes receiving a query and processing the query to divide it into terms, including a numerical term. The numerical terms of the query are also processed to a stem and a distance measurement and document results are retrieved for the stem from an index.
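A minimal Python sketch of stem-and-distance indexing follows. The abstract does not say how the stem is chosen, so the sketch assumes the stem is the number rounded down to a multiple of a hypothetical `BUCKET` width, and the distance is the remainder. Using `float` as the numeric parser and ranking results by how close their distance is to the query's distance are also assumptions.

```python
import math
from collections import defaultdict

BUCKET = 10  # hypothetical bucket width used to derive stems


def parse_number(term):
    """Stand-in numeric parser: returns a finite float, or None if not a number."""
    try:
        value = float(term)
    except ValueError:
        return None
    return value if math.isfinite(value) else None


def stem_number(value, bucket=BUCKET):
    """Return (stem, distance of the value from the stem)."""
    stem = math.floor(value / bucket) * bucket
    return stem, value - stem


class NumericIndex:
    def __init__(self):
        self.postings = defaultdict(list)  # stem -> [(doc_id, distance)]

    def add(self, doc_id, text):
        for term in text.split():
            value = parse_number(term)
            if value is not None:
                stem, distance = stem_number(value)
                self.postings[stem].append((doc_id, distance))

    def search(self, query):
        results = []
        for term in query.split():
            value = parse_number(term)
            if value is None:
                continue
            stem, distance = stem_number(value)
            for doc_id, doc_distance in self.postings.get(stem, []):
                # Assumption: closer distance measurements rank first.
                results.append((abs(doc_distance - distance), doc_id))
        return [doc_id for _, doc_id in sorted(results)]
```

Indexing by stem lets a query for 45 retrieve documents mentioning 42 or 47, while the stored distance keeps enough information to order them.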
Abstract:
A method for searching a corpus of documents, such as the World Wide Web, includes defining a knowledge domain and identifying a set of reference documents in the corpus pertinent to the domain. Upon inputting a query, the corpus is searched using the set of reference documents to find one or more of the documents in the corpus that contain information in the domain relevant to the query. The set of reference documents is updated with the found documents that are most relevant to the domain. The updated set is used in searching the corpus for information in the domain relevant to subsequent queries.
Abstract:
A method and system are provided for conversation detection in email systems. Multiple email messages are provided and grouped as relating to a conversation. The grouping is carried out by applying a similarity function based on a similarity of the email messages' attributes, the similarity function including a similarity between the email messages' participants and at least one of a similarity between the email messages' subjects or a similarity between the email messages' contents. The similarity function may also include the similarity between the email messages' dates. The similarity function may also include weightings for the contributions of the email messages' attributes. A graphical user interface is provided in an email client which includes means for viewing email messages by conversation.
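The weighted similarity function can be sketched in Python as follows. The abstract does not name a particular similarity measure, weights, or grouping procedure, so Jaccard overlap, the equal default weights, the `threshold` value, and the greedy grouping loop are all assumptions. Content and date similarity are omitted to keep the sketch short.

```python
from dataclasses import dataclass


@dataclass
class Email:
    participants: frozenset  # sender and recipients
    subject: str


def jaccard(a, b):
    """Overlap of two collections, from 0.0 (disjoint) to 1.0 (identical)."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 0.0


def similarity(m1, m2, w_participants=0.5, w_subject=0.5):
    """Weighted sum of participant similarity and subject similarity."""
    participants = jaccard(m1.participants, m2.participants)
    subject = jaccard(m1.subject.lower().split(), m2.subject.lower().split())
    return w_participants * participants + w_subject * subject


def group_conversations(messages, threshold=0.6):
    """Greedy grouping: join the first conversation holding a similar message."""
    conversations = []
    for msg in messages:
        for conv in conversations:
            if any(similarity(msg, other) >= threshold for other in conv):
                conv.append(msg)
                break
        else:
            conversations.append([msg])
    return conversations
```

Because participants carry weight alongside the subject, a reply whose subject line has been edited can still land in the right conversation, which a subject-only grouping would miss.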