Combination of datasets with use of a dataset filter

    Publication No.: US09934281B2

    Publication date: 2018-04-03

    Application No.: US15126248

    Filing date: 2015-10-10

    IPC classification: G06F17/30

    Abstract: Technologies are generally described for methods and systems to join a first source dataset with a second source dataset. In an example, a processor may receive a request to join the first and second source datasets based on a condition. The processor may determine a first range of attribute values from the first source dataset, and a second range of attribute values from the second source dataset. The processor may produce a filtered dataset based on a comparison of the first and second ranges. A map unit may transform the filtered dataset and the second source dataset into first and second mapped datasets based on the condition. A reduce unit may transform the first and second mapped datasets into first and second reduced datasets based on the condition. A combiner unit may combine the first and second reduced datasets to join the first and second source datasets.
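
The filter-then-join flow in this abstract can be illustrated with a small in-memory sketch. It is not the patented implementation: it assumes an equality join on a single attribute, assumes the "comparison of the ranges" means keeping only rows of the first dataset that fall in the overlap of the two value ranges, and uses hypothetical function names (`value_range`, `map_rows`, `reduce_pairs`, `filtered_join`).

```python
from collections import defaultdict


def value_range(rows, key):
    """Return (min, max) of the join attribute over a non-empty dataset."""
    values = [row[key] for row in rows]
    return min(values), max(values)


def map_rows(rows, key):
    """Map step (assumed form): emit (join key, row) pairs."""
    return [(row[key], row) for row in rows]


def reduce_pairs(pairs):
    """Reduce step (assumed form): group rows that share a join key."""
    groups = defaultdict(list)
    for k, row in pairs:
        groups[k].append(row)
    return groups


def filtered_join(first, second, key):
    """Equi-join two lists of dicts, filtering the first by range overlap."""
    if not first or not second:
        return []
    lo1, hi1 = value_range(first, key)
    lo2, hi2 = value_range(second, key)
    lo, hi = max(lo1, lo2), min(hi1, hi2)  # overlap of the two ranges
    # Rows outside the overlap cannot match, so drop them before mapping.
    filtered = [row for row in first if lo <= row[key] <= hi]
    reduced_first = reduce_pairs(map_rows(filtered, key))
    reduced_second = reduce_pairs(map_rows(second, key))
    # Combine step: pair up rows from both sides that share a key.
    joined = []
    for k, left_rows in reduced_first.items():
        for left in left_rows:
            for right in reduced_second.get(k, []):
                joined.append({**left, **right})
    return joined
```

The filter only reduces the amount of data passed to the map step; the join result is the same as it would be without it.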

    CROSS-TENANT DATA LEAKAGE ISOLATION
    Type: Invention application

    Publication No.: US20180063089A1

    Publication date: 2018-03-01

    Application No.: US15249554

    Filing date: 2016-08-29

    IPC classification: H04L29/06 G06F21/60 G06F17/30

    Abstract: A method and proxy device for cross-tenant data leakage isolation in a multi-tenant database are provided. The method includes monitoring, by a proxy device, traffic flows between a server executing at least one cloud-based application and the multi-tenant database, wherein the proxy device is communicatively connected between the server and the multi-tenant database; capturing, by the proxy device, at least a request to access the multi-tenant database, wherein the request is communicated using a database-specific protocol; analyzing the request to determine if the request is legitimate; upon determining that the request is not legitimate, modifying the request to point to a global-tenant table and to designate a unique tenant identifier, wherein the unique tenant identifier corresponds to a tenant-specific table name designated in the global-tenant table; and sending the modified request to the multi-tenant database using the database-specific protocol.
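
The analyze-and-modify step can be pictured with a toy model. Everything below is an assumption made for illustration: the abstract does not say how legitimacy is decided or how requests are represented, so the sketch treats a request as a small record, treats "legitimate" as "targets the table registered for that tenant", and uses hypothetical names (`Request`, `GLOBAL_TABLE`, `is_legitimate`, `isolate`). A real proxy would parse the database-specific wire protocol instead.

```python
from dataclasses import dataclass, replace
from typing import Optional

GLOBAL_TABLE = "global_tenant"  # hypothetical name for the global-tenant table


@dataclass(frozen=True)
class Request:
    tenant_id: str                       # tenant on whose behalf the request runs
    table: str                           # table the request targets
    tenant_filter: Optional[str] = None  # tenant identifier designated by the proxy


def is_legitimate(req, tenant_tables):
    """Assumed rule: a request may only target its own tenant's table."""
    return tenant_tables.get(req.tenant_id) == req.table


def isolate(req, tenant_tables):
    """Pass legitimate requests through; redirect the rest to the global table."""
    if is_legitimate(req, tenant_tables):
        return req
    # Point the request at the global-tenant table and pin it to the tenant's id.
    return replace(req, table=GLOBAL_TABLE, tenant_filter=req.tenant_id)
```

For example, with `tenant_tables = {"t1": "orders_t1", "t2": "orders_t2"}`, a request from `t1` against `orders_t2` would be rewritten to target `global_tenant` with `tenant_filter="t1"`, while a request against `orders_t1` would pass through unchanged.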

    Parallel streaming of external data

    Publication No.: US09898469B1

    Publication date: 2018-02-20

    Application No.: US14468089

    Filing date: 2014-08-25

    IPC classification: G06F17/30 H04L29/08

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for streaming external data in parallel from a second distributed system to a first distributed system. One of the methods includes receiving a query that requests a join of first rows of a first table in a first distributed system with second rows of an external table, the external table representing data in a second distributed system. Each of the segment nodes communicates with a respective extension service that obtains fragments from one or more data nodes of the second distributed system according to location information for the respective fragments, and provides to the segment node a stream of data corresponding to second rows of the external table. Each of the segment nodes computes joined rows between the first rows of the first table and the stream of data corresponding to second rows of the external table.

    Priority Queuing for Updates in a Database System

    Publication No.: US20180046661A1

    Publication date: 2018-02-15

    Application No.: US15336535

    Filing date: 2016-10-27

    Applicant: SAP SE

    Inventor: Brendon Atkins

    IPC classification: G06F17/30

    Abstract: In one embodiment, a method receives a list of subscribers at a set interval time. Each subscriber includes one or more updates in which an update to the data stored in a database is required. A queue process in a plurality of queue processes is selected based on a first selection process where different queue processes are used over multiple set interval times. The queue process places subscribers in the list of subscribers in a plurality of queues using a placement process that is based on a characteristic of the subscribers. Also, the queue processes in the plurality of queue processes use different placement processes to place subscribers in the plurality of queues. The method then retrieves subscribers from the plurality of queues based on a second selection process, the retrieved subscribers being provided to the plurality of processors for processing of the respective updates for the subscribers.
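
The two-level selection described in this abstract can be sketched as follows. The concrete policies are assumptions chosen only to make the example concrete: the first selection process rotates through the queue processes by interval index, the two placement processes use the update count and the subscriber id as the "characteristic", and the second selection process drains the queues round-robin. All function names are hypothetical.

```python
from collections import deque


def place_by_update_count(subscribers, n_queues):
    """Placement process A (assumed): queue chosen from the number of updates."""
    queues = [deque() for _ in range(n_queues)]
    for sub in subscribers:
        queues[len(sub["updates"]) % n_queues].append(sub)
    return queues


def place_by_id(subscribers, n_queues):
    """Placement process B (assumed): queue chosen from the subscriber id."""
    queues = [deque() for _ in range(n_queues)]
    for sub in subscribers:
        queues[sub["id"] % n_queues].append(sub)
    return queues


QUEUE_PROCESSES = [place_by_update_count, place_by_id]


def select_queue_process(interval_index):
    """First selection process (assumed): rotate processes across intervals."""
    return QUEUE_PROCESSES[interval_index % len(QUEUE_PROCESSES)]


def drain_round_robin(queues):
    """Second selection process (assumed): take one subscriber per queue in turn."""
    order = []
    while any(queues):  # an empty deque is falsy
        for q in queues:
            if q:
                order.append(q.popleft())
    return order


def run_interval(interval_index, subscribers, n_queues=2):
    """Return the order in which subscribers would be handed to processors."""
    place = select_queue_process(interval_index)
    return drain_round_robin(place(subscribers, n_queues))
```

Because the placement process changes from one interval to the next, the same list of subscribers can be served in a different order in consecutive intervals, which keeps any one subscriber from being consistently last.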

    Data analysis apparatus and method

    Publication No.: US09842159B2

    Publication date: 2017-12-12

    Application No.: US15239199

    Filing date: 2016-08-17

    IPC classification: G06F17/30

    CPC classification: G06F17/30598 G06F17/30498

    Abstract: The present invention relates to a heterogeneous data cluster generation apparatus and method and a data clustering method and apparatus, and more particularly, to a data clustering method and apparatus which cluster data measured by different sensors into a number of groups. Aspects of the present invention provide an apparatus and method for generating clusters by putting together heterogeneous data which are values measured by different types of sensors. Aspects of the present invention also provide an apparatus and method for generating clusters by setting indices in order to effectively cluster multi-dimensional data, massive data, or scattered data.