Method and system for providing relationships in search results
    1.
    发明授权
    Method and system for providing relationships in search results 有权
    在搜索结果中提供关系的方法和系统

    公开(公告)号:US08959079B2

    公开(公告)日:2015-02-17

    申请号:US12568685

    申请日:2009-09-29

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864 Y10S707/961

    摘要: A method and system for providing relationships in search results are provided. The method includes indexing an entity in a search index as an entity index entry, the entity index entry including facets providing information on the entity type and related entities. Search results are obtained by a search engine in the form of ranked result entities, wherein the result entities include multiple types of entities. The method then includes retrieving index entries to determine relationships between search result entities and providing the relationships in search results. The method further includes, for each result entity, retrieving its entity index entry and cross-checking the facets of the retrieved entity index entry for other result entities.

    摘要翻译: 提供了一种在搜索结果中提供关系的方法和系统。 该方法包括将搜索索引中的实体索引为实体索引条目,该实体索引条目包括提供关于实体类型和相关实体的信息的方面。 搜索结果由搜索引擎以排名结果实体的形式获得,其中结果实体包括多种类型的实体。 该方法然后包括检索索引条目以确定搜索结果实体之间的关系并在搜索结果中提供关系。 该方法还包括对于每个结果实体,检索其实体索引条目并交叉检查其他结果实体的检索到的实体索引条目的方面。

    METHOD AND SYSTEM FOR PROVIDING RELATIONSHIPS IN SEARCH RESULTS
    2.
    发明申请
    METHOD AND SYSTEM FOR PROVIDING RELATIONSHIPS IN SEARCH RESULTS 有权
    提供搜索结果关系的方法和系统

    公开(公告)号:US20110078136A1

    公开(公告)日:2011-03-31

    申请号:US12568685

    申请日:2009-09-29

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864 Y10S707/961

    摘要: A method and system for providing relationships in search results are provided. The method includes indexing an entity in a search index as an entity index entry, the entity index entry including facets providing information on the entity type and related entities. Search results are obtained by a search engine in the form of ranked result entities, wherein the result entities include multiple types of entities. The method then includes retrieving index entries to determine relationships between search result entities and providing the relationships in search results. The method further includes, for each result entity, retrieving its entity index entry and cross-checking the facets of the retrieved entity index entry for other result entities.

    摘要翻译: 提供了一种在搜索结果中提供关系的方法和系统。 该方法包括将搜索索引中的实体索引为实体索引条目,该实体索引条目包括提供关于实体类型和相关实体的信息的方面。 搜索结果由搜索引擎以排名结果实体的形式获得,其中结果实体包括多种类型的实体。 该方法然后包括检索索引条目以确定搜索结果实体之间的关系并在搜索结果中提供关系。 该方法还包括对于每个结果实体,检索其实体索引条目并交叉检查其他结果实体的检索到的实体索引条目的方面。

    SCORING RELATIONSHIPS BETWEEN OBJECTS IN INFORMATION RETRIEVAL
    3.
    发明申请
    SCORING RELATIONSHIPS BETWEEN OBJECTS IN INFORMATION RETRIEVAL 审中-公开
    信息检索对象之间的比较关系

    公开(公告)号:US20110282855A1

    公开(公告)日:2011-11-17

    申请号:US12778162

    申请日:2010-05-12

    IPC分类号: G06F17/30 G06F15/16

    CPC分类号: G06F16/332

    摘要: A method, system, and computer program product for scoring relationships between objects in information retrieval are provided. The method includes: receiving a query object as an input in a search, wherein the query object is a query for a searchable entity type; identifying indexed document objects associated with the query object; and identifying facet objects referenced in the indexed document objects, which facet objects share a defined relationship type with the query object. The method calculates for each relationship between a facet object and the query object a weight of relationship. Wherein a query object, document object, and facet object can represent any searchable entity. Calculating a weight of relationship calculates the weight of relationships over all document objects divided by a selected normalization.

    摘要翻译: 提供了一种用于评估信息检索中对象之间的关系的方法,系统和计算机程序产品。 该方法包括:在查询中接收查询对象作为输入,其中查询对象是可搜索实体类型的查询; 识别与查询对象相关联的索引的文档对象; 并标识在索引的文档对象中引用的facet对象,哪个facet对象与查询对象共享定义的关系类型。 该方法针对一个面对象和查询对象之间的关系的权重计算每个关系。 其中查询对象,文档对象和构面对象可以表示任何可搜索的实体。 计算关系的权重计算所有文档对象之间的关系的权重除以选择的归一化。

    Object-Oriented Twig Query Evaluation
    4.
    发明申请
    Object-Oriented Twig Query Evaluation 有权
    面向对象的Twig查询评估

    公开(公告)号:US20090164424A1

    公开(公告)日:2009-06-25

    申请号:US11964017

    申请日:2007-12-25

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30938

    摘要: A computer-implemented method for searching a corpus of documents includes defining a query as a twig including a root annotation operator having an associated tag specifying a span and having an associated expression indicative of one or more terms whose occurrence within the span will satisfy the query. An object is recursively selected from a group of objects that consists of the tag and the expression, and is used in advancing through the corpus until a candidate document is found that contains the tag and satisfies the expression. The candidate document is evaluated to determine whether the one or more terms indicated by the expression occur within the span in the candidate document so as to satisfy the annotation operator.

    摘要翻译: 用于搜索文档语料库的计算机实现的方法包括将查询定义为包括具有指定跨度的相关联的标签的根注解算子的具有树枝的查询,并且具有指示跨越中的发生将满足查询的一个或多个项的相关联表达式 。 从由标签和表达式组成的一组对象中递归地选择一个对象,并且被用于通过语料库前进直到找到包含标签并满足表达式的候选文档。 对候选文件进行评估,以确定表达式中指出的一个或多个项是否在候选文档的跨度内发生,以满足注释算子。

    Faceted search with relationships between categories
    5.
    发明授权
    Faceted search with relationships between categories 有权
    通过类别之间的关系进行面搜索

    公开(公告)号:US08510306B2

    公开(公告)日:2013-08-13

    申请号:US13118477

    申请日:2011-05-30

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30722

    摘要: Method, system, and computer program product for faceted search with relationships between categories are provided. The method includes: having a document set of multiple documents, each document having associated categories to which it belongs; grouping multiple categories associated with a document into a category set based on a relationship between the multiple categories; associating the category set with the document; and indexing the category set for retrieval of documents from categories sharing a category set. Wherein indexing the category set includes: having an index entry of a textual representations of a category, wherein the index entry includes a single occurrence for each document to which the category is attached; adding a payload to a document occurrence of a serialization of an identifier of the category sets to which the category belongs associated with the document. Indexing the category set further includes: adding an index entry for category set data, wherein the index entry includes a single occurrence for each document, wherein a document occurrence includes a payload of a serialization of an identifier of category sets associated with the document, and an identifier of the categories belonging to the category sets.

    摘要翻译: 提供了方法,系统和计算机程序产品,用于分类搜索与类别之间的关系。 该方法包括:具有多个文档的文档集合,每个文档具有其所属的相关类别; 基于多个类别之间的关系将与文档相关联的多个类别分组成类别集合; 将类别集与文档相关联; 并索引用于从共享类别集的类别中检索文档的类别集。 其中索引所述类别集包括:具有类别的文本表示的索引条目,其中所述索引条目包括所述类别附加到的每个文档的单个事件; 向文档的类别集合的标识符的序列化的文档的发生添加有效载荷。 对类别集的索引进一步包括:为类别集数据添加索引条目,其中索引条目包括每个文档的单个出现,其中文档发生包括与文档相关联的类别集合的标识符的序列化的有效载荷,以及 属于类别集的类别的标识符。

    Method and system for detection of authors
    6.
    发明授权
    Method and system for detection of authors 有权
    作者检测方法和系统

    公开(公告)号:US07752208B2

    公开(公告)日:2010-07-06

    申请号:US11733808

    申请日:2007-04-11

    IPC分类号: G06F7/00

    摘要: A method and system are provided for detection of authors across different types of information sources such as across documents on the Web. The method includes obtaining a compression signature for a document, and determining the similarity between compression signatures of two or more documents. If the similarity is greater than a threshold measure, the two or more documents are considered to be by the same author. Scored pairs of documents are clustered to provide a group of documents by the same author.The group of documents by the same author can be used for user profiling, noise reduction, contribution sizing, detecting fraudulent contributions, obtaining other search results by the same author, or mating a document with undisclosed authorship to a document of known author.

    摘要翻译: 提供了一种方法和系统,用于检测跨不同类型信息源的作者,例如跨Web上的文档。 该方法包括获得文档的压缩签名,以及确定两个或多个文档的压缩签名之间的相似性。 如果相似度大于阈值度量,则两个或多个文档被认为是由同一作者。 得分的文档对被聚集以提供同一作者的一组文档。 同一作者的一组文件可用于用户分析,降噪,贡献大小,检测欺诈性贡献,获取同一作者的其他搜索结果,或将未公开作者的文档与已知作者的文档进行交互。

    Information Retrieval with Unified Search Using Multiple Facets
    7.
    发明申请
    Information Retrieval with Unified Search Using Multiple Facets 有权
    使用多个面进行统一搜索的信息检索

    公开(公告)号:US20090327271A1

    公开(公告)日:2009-12-31

    申请号:US12164139

    申请日:2008-06-30

    IPC分类号: G06F7/06 G06F17/30

    CPC分类号: G06F17/30675

    摘要: Information retrieval with unified search between heterogeneous objects is described. The method includes: indexing a first object as a document in a search index; referencing a second object related to the first object in a facet of the document; and storing a relationship strength between the first and second objects in the facet of the document in the search index. Multiple heterogeneous objects can be related to the first object and referenced in multiple facets of the document, each with its relationship strength to the first object. Scoring an indirect object by indirect relation to a query object can be carried out by aggregating the relationship strengths between the indirect object and the retrieved objects multiplied by the retrieved objects' direct scores of relationship strength to the query object.

    摘要翻译: 描述了异构对象之间统一搜索的信息检索。 该方法包括:将第一对象作为文档索引到搜索索引中; 在所述文档的方面引用与所述第一对象相关的第二对象; 以及在所述搜索索引中存储所述文档的所述面中的所述第一和第二对象之间的关系强度。 多个异构对象可以与第一个对象相关,并在文档的多个方面被引用,每一个都具有与第一个对象的关系强度。 通过与查询对象的间接关系来计算间接对象可以通过将间接对象和检索对象之间的关系强度乘以检索到的对象的关系强度的直接得分与查询对象进行。

    Access control for entity search
    8.
    发明授权
    Access control for entity search 有权
    实体搜索的访问控制

    公开(公告)号:US09177171B2

    公开(公告)日:2015-11-03

    申请号:US13417250

    申请日:2012-03-11

    IPC分类号: G06F17/30 G06F21/62

    CPC分类号: G06F21/6227

    摘要: Method, system, and computer program product for access control for entity search are provided. The method includes: representing entity-relationship data in a conceptual model; representing entities in a search system as documents containing the entity's searchable content and metadata; defining authorization rules for searchers over entities and their relationships; and extending an entity document to include searchable tokens defining the authorization rules. Defining authorization rules may include: identifying query predicate constraints for entity search; and defining searchable tokens as paths for query predicates and permissible searchers; wherein the permissible searchers are permitted access to data based on a query that contains the predicate. Defining authorization rules may further include: defining searchable document files for a free-text predicate with a field name as a token of permissible searchers and the field value as the searchable content.

    摘要翻译: 提供了用于实体搜索的访问控制的方法,系统和计算机程序产品。 该方法包括:在概念模型中表示实体关系数据; 将搜索系统中的实体表示为包含实体的可搜索内容和元数据的文档; 确定搜索者对实体及其关系的授权规则; 并扩展实体文档以包括定义授权规则的可搜索令牌。 定义授权规则可以包括:识别实体搜索的查询谓词约束; 并将可搜索令牌定义为查询谓词和可允许的搜索者的路径; 其中允许的搜索者被允许基于包含谓词的查询访问数据。 定义授权规则还可以包括:定义具有字段名称作为允许搜索者的令牌的自由文本谓词的可搜索文档文件,并将该字段值作为可搜索内容。

    Information retrieval with unified search using multiple facets
    9.
    发明授权
    Information retrieval with unified search using multiple facets 有权
    使用多个方面进行统一搜索的信息检索

    公开(公告)号:US08024324B2

    公开(公告)日:2011-09-20

    申请号:US12164139

    申请日:2008-06-30

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30675

    摘要: A method for information retrieval with unified search between heterogeneous objects includes indexing a first object as a document in a search index; referencing a second object related to the first object in a facet of the document; and storing a relationship strength between the first and second objects in the facet of the document in the search index. Multiple heterogeneous objects can be related to the first object and referenced in multiple facets of the document, each with its relationship strength to the first object. Scoring an indirect object by indirect relation to a query object can be carried out by aggregating the relationship strengths between the indirect object and the retrieved objects multiplied by the retrieved objects' direct scores of relationship strength to the query object.

    摘要翻译: 用于异构对象之间的统一搜索的信息检索方法包括将第一对象作为搜索索引中的文档进行索引; 在所述文档的方面引用与所述第一对象相关的第二对象; 以及在所述搜索索引中存储所述文档的所述面中的所述第一和第二对象之间的关系强度。 多个异构对象可以与第一个对象相关,并在文档的多个方面被引用,每一个都具有与第一个对象的关系强度。 通过与查询对象的间接关系来计算间接对象可以通过将间接对象和检索对象之间的关系强度乘以检索到的对象的关系强度的直接得分与查询对象进行。

    Indexing and searching entity-relationship data
    10.
    发明授权
    Indexing and searching entity-relationship data 有权
    索引和搜索实体关系数据

    公开(公告)号:US08751505B2

    公开(公告)日:2014-06-10

    申请号:US13417248

    申请日:2012-03-11

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30604

    摘要: Method, system, and computer program product for indexing and searching entity-relationship data are provided. The method includes: defining a logical document model for entity-relationship data including: representing an entity as a document containing the entity's searchable content and metadata; dually representing the entity as a document and as a category; and representing each relationship instance for the entity as a category set that contains categories of all participating entities in the relationship. The method also includes: translating entity-relationship data into the logical document model; and indexing the entity-relationship data of the populated logical document model as an inverted index. The method may include searching indexed entity-relationship data using a faceted search, wherein the categories are all categories required for supporting faceted navigation.

    摘要翻译: 提供了索引和搜索实体关系数据的方法,系统和计算机程序产品。 该方法包括:定义用于实体关系数据的逻辑文档模型,包括:将实体表示为包含该实体的可搜索内容和元数据的文档; 将实体双重表示为文件和类别; 并将实体的每个关系实例表示为包含关系中所有参与实体的类别的类别集合。 该方法还包括:将实体关系数据转换为逻辑文档模型; 并将填充的逻辑文档模型的实体关系数据索引为反向索引。 该方法可以包括使用分面搜索搜索索引的实体关系数据,其中类别是支持分面导航所需的所有类别。