摘要:
Embodiments relate to storing sparse matrices in an in-memory column-oriented database system. Specifically, recent hardware shifts of primary storage from disc into memory, allow execution of linear algebra queries directly in the database engine. Dynamic matrix manipulation operations (like online insertion or deletion of elements) are not covered by most linear algebra frameworks. Therefore a hybrid architecture comprises a read-optimized main structure, and a write-optimized delta structure. The resulting system layout derived from the Compressed Sparse Row (CSR) representation, integrates well with a columnar database design. Moreover, the resulting architecture is amenable to a wide range of non-numerical use cases when dictionary encoding is used. Performance in specific examples is evaluated for dynamic sparse matrix workloads, by applying work flows of nuclear science and network graphs. Embodiments allow performing linear algebra operations on large, sparse matrices commonly associated with scientific computations and analytical business applications.
摘要:
Embodiments relate to storing sparse matrices in an in-memory column-oriented database system. Specifically, recent hardware shifts of primary storage from disc into memory, allow execution of linear algebra queries directly in the database engine. Dynamic matrix manipulation operations (like online insertion or deletion of elements) are not covered by most linear algebra frameworks. Therefore a hybrid architecture comprises a read-optimized main structure, and a write-optimized delta structure. The resulting system layout derived from the Compressed Sparse Row (CSR) representation, integrates well with a columnar database design. Moreover, the resulting architecture is amenable to a wide range of non-numerical use cases when dictionary encoding is used. Performance in specific examples is evaluated for dynamic sparse matrix workloads, by applying work flows of nuclear science and network graphs. Embodiments allow performing linear algebra operations on large, sparse matrices commonly associated with scientific computations and analytical business applications.
摘要:
A data recovery system and method are disclosed. Primary data is stored a database in byte-addressable NVRAM, where the database includes one or more persistent tables of data in a byte-addressable, RAM format, and a persistent memory allocator that maps persistent memory pointers of the persistent memory to virtual memory pointers of a virtual memory associated with the database. Secondary data is stored in volatile DRAM. A failure recovery includes recovering the persistent memory allocator, mapping the persistent memory to the virtual memory to recover primary data using their persistent memory pointers, translating the persistent memory pointers to virtual memory pointers, undoing changes to the primary data made by unfinished transactions of the query execution at the time of failure of one of the one or more queries, and reconstructing the secondary data from the primary data.
摘要:
A system is described for processing schema updated in a zero-downtime environment. A technique includes establishing an application session to access a database, receiving a schema update, converting the database to an updated database according to the schema update after establishing the application session, generating a temporary compensation view from the schema update, the temporary compensation view containing compensation logic to locate database objects belonging to the database, receiving a database transaction from the application session to access a database object in the database; and processing the compensation logic to locate the database object.
摘要:
Methods, systems, and computer-readable storage media for providing at least one parameter for use with a forecast model. Implementations include actions of receiving a first context vector, the first context vector including a plurality of context attributes that describe a first context, retrieving a first parameter vector from a repository based on the first context vector, the repository electronically storing a plurality of parameter vector, each parameter vector being associated with a respective context and including one or more parameters, parameterizing the forecast model based on parameters provided in the first parameter vector to provide a parameterized forecast model, optimizing the parameterized forecast model to provide an optimized forecast model, and forecasting one or more values using the optimized forecast model.
摘要:
One embodiment of the present invention provides a method for incrementally maintaining a Bernoulli sample S with sampling rate q over a multiset R in the presence of update, delete, and insert transactions. The method includes processing items inserted into R using Bernoulli sampling and augmenting S with tracking counters during this processing. Items deleted from R are processed by using the tracking counters and by removing newly deleted items from S using a calculated probability while maintaining a degree of uniformity in S.
摘要:
In a fuel injection system for an internal-combustion engine which includes a dual-fluid nozzle which is supplied with fuel by a high-pressure pump and with water by a feed pump providing a substantially lower pressure than the high-pressure pump and which has disposed in a nozzle body a nozzle needle which is spring biased into a closing position where one end of the needle is seated on a valve seat, a fuel supply passage extends to an annular space around the valve needle and a branch passage leads to a control chamber formed at the other end of the valve needle and includes a solenoid valve for controlling the application of pressurized fluid to, and the release thereof from, the other end of the nozzle needle, an additional fluid line extends from the feed pump to the annular space and includes a check valve for permitting flow of additional fluid only toward the annular space and a pressure relief line in communication with the fuel supply passage via a control valve, and a control device is provided for operating the solenoid valve for timed relief of fluid pressure from the control chamber for unseating the nozzle needle and for operating the control valve to either supply fuel under pressure to the annular chamber or releasing pressure from the annular space so as to permit feeding of water into the annular space for subsequent injection, together with the fuel, from the dual-fluid nozzle.
摘要:
A system, computer-implemented method, and computer-readable storage medium for generating a block-based index, are provided. A block index is generated where the block index comprises a plurality of blocks and a block corresponds to a section of a graph column that stores a value. A block range vector is also generated for the index where the block range vector includes range information for the block that corresponds to the section of the graph and where the block-based index facilitates traversal of the graph column that searches for the value by constraining the traversal to the section of the graph.
摘要:
Technologies are disclosed for generating query execution plans optimized for parallel execution for programs having both core database relational functions and user-defined functions. A variety of optimization strategies can be employed to improve performance in a parallel execution scenarios. A flexible range of permitted partition arrangements can be specified as acceptable to parallelized instances of the user-defined function. The optimizer can leverage such information when constructing an optimized query execution plan. Partitioning arrangements or other properties can be leveraged to avoid additional or unnecessary processing.
摘要:
A system, computer-implemented method, and a computer-readable storage medium for a traversal of a property graph, are provided. The edge table of the property graph is divided into a plurality of fragments. A first fragment is selected for traversal using a set of selected vertices, where the traversal identifies a set of edges. Based on the set of edges, a set of adjacent vertices is determined during the traversal. A set of discovered vertices in the property graph is determined based on the set of selected vertices and the set of adjacent vertices.