Query indexing with permissions integration and federated search

    公开(公告)号:US11341257B1

    公开(公告)日:2022-05-24

    申请号:US16362383

    申请日:2019-03-22

    摘要: In an embodiment, a computer-implemented method comprises receiving a plurality of identifiers of first versions of electronic documents that have been stored in a distributed data storage system and queuing the identifiers in a queue table of a database; obtaining, from the queue table, a set of the identifiers of the first versions of the electronic documents; invoking, using the set of the identifiers, a query indexing function of a search engine system and receiving a set of matching identifiers of electronic documents, from among the set of identifiers, that match queries that have been registered using the query indexing function; storing the matching identifiers in a last-processed table; processing second versions of the electronic documents associated with the matching identifiers, using the query indexing function; updating a feed associated with a user account to show messages referring to the first versions of the electronic documents in response to determining that the second versions of the electronic documents do not match the queries.

    Federated search of multiple sources with conflict resolution
    2.
    发明授权
    Federated search of multiple sources with conflict resolution 有权
    联合搜索具有冲突解决的多个来源

    公开(公告)号:US09348880B1

    公开(公告)日:2016-05-24

    申请号:US14676621

    申请日:2015-04-01

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30566 G06F17/30575

    摘要: Methods and apparatuses related to federated search of multiple sources with conflict resolution are disclosed. A method may comprise obtaining a set of data ontologies (e.g., types, properties, and links) associated with a plurality of heterogeneous data sources; receiving a selection of a graph comprising a plurality of graph nodes connected by one or more graph edges; and transforming the graph into one or more search queries across the plurality of heterogeneous data sources. A method may comprise obtaining a first data object as a result of executing a first search query across a plurality of heterogeneous data sources; resolving, based on one or more resolution rules, at least the first data object with a repository data object; deduplicating data associated with at least the first data object and the repository data object prior to storing the deduplicated data in a repository that has a particular data model.

    摘要翻译: 公开了与联合搜索具有冲突解决的多个源相关的方法和设备。 一种方法可以包括获得与多个异构数据源相关联的一组数据本体(例如,类型,属性和链接); 接收包括通过一个或多个图形边缘连接的多个图形节点的图形的选择; 以及将所述图形转换成跨所述多个异构数据源的一个或多个搜索查询。 一种方法可以包括:通过跨多个异构数据源执行第一搜索查询的结果来获得第一数据对象; 基于一个或多个解决规则,至少解决具有存储库数据对象的第一数据对象; 在将重复数据删除的数据存储在具有特定数据模型的存储库之前,将至少与第一数据对象和存储库数据对象相关联的数据重复数据删除。

    Efficient access marking approach for efficient retrieval of document access data

    公开(公告)号:US11921767B1

    公开(公告)日:2024-03-05

    申请号:US16258451

    申请日:2019-01-25

    摘要: A computer-implemented method comprises creating and storing a plurality of different access group identifiers each associated with one or more user account identifiers, and a plurality of different classification markings each representing a different access restriction for an electronic document, and associating each of the user account identifiers with one or more of the classification markings; indexing each particular electronic document among a plurality of different electronic documents in association with values of one or more of the access group identifiers and with an inverse list of values of the classification markings that apply to the particular electronic document in a classification index; receiving a search query that specifies one or more attributes of electronic documents; obtaining one or more first classification markings, among the plurality of classification markings, which are associated with a particular user account identifier that is associated with the search query; executing a search of the classification index based on the search query using a covering query that requires a specified minimum number of matches between the one or more first classification markings and one or more second classification markings that are associated with a particular electronic document, and adding the particular electronic document to a result set of the search only when the covering query is satisfied; providing the result set in response to the search query.