Inverted indexing
    1.
    Granted invention patent

    Publication No.: US09971770B2

    Publication date: 2018-05-15

    Application No.: US14552897

    Filing date: 2014-11-25

    IPC classification: G06F17/30

    CPC classification: G06F17/30011 G06F17/30622

    Abstract: Disclosed herein are system, method, and computer program product embodiments for inverted indexing. An embodiment operates by receiving a request to add a new document identifier (doc ID) to an inverted index. A memory block corresponding to where the doc ID is to be added is determined or located, and the available space remaining in the memory block is determined. If the memory block has available space, the doc ID is added to the memory block.
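
The flow in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the names `MemoryBlock`, `InvertedIndex`, and `BLOCK_CAPACITY` are hypothetical, and the behavior when a block is full (allocating a new block) is an assumption, since the abstract only covers the case where space is available.

```python
from dataclasses import dataclass, field
from typing import Dict, List

BLOCK_CAPACITY = 4  # hypothetical number of doc IDs that fit in one memory block


@dataclass
class MemoryBlock:
    doc_ids: List[int] = field(default_factory=list)

    def available(self) -> int:
        """Remaining space in this block, counted in doc IDs."""
        return BLOCK_CAPACITY - len(self.doc_ids)


class InvertedIndex:
    def __init__(self) -> None:
        # term -> chain of memory blocks holding that term's posting list
        self.postings: Dict[str, List[MemoryBlock]] = {}

    def add(self, term: str, doc_id: int) -> None:
        blocks = self.postings.setdefault(term, [MemoryBlock()])
        block = blocks[-1]          # locate the block where the doc ID would go
        if block.available() == 0:  # assumption: a full block leads to a new block
            block = MemoryBlock()
            blocks.append(block)
        block.doc_ids.append(doc_id)  # block has space, so the doc ID is added
```

Adding five doc IDs for one term with this capacity fills the first block and places the fifth ID in a second block.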

    Applying a database transaction log record directly to a database table container

    Publication No.: US09811549B2

    Publication date: 2017-11-07

    Application No.: US14552914

    Filing date: 2014-11-25

    IPC classification: G06F17/30

    CPC classification: G06F17/30371 G06F17/30368

    Abstract: Disclosed herein are system, method, and computer program product embodiments for directly restoring a database from a log volume. An embodiment operates by reading one or more database transaction log records from a log volume. The embodiment then loads from a database table persistent storage one or more pages containing rows in the database table that will be read from or written to by the one or more database transaction log records. The embodiment then loads a dictionary for each column associated with the one or more database transaction log records. The embodiment then applies the one or more database transaction log records directly into the one or more loaded pages and the one or more dictionaries. Each dictionary and page modified in memory is then written to the database table persistent storage when a savepoint operation is performed.
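
A rough sketch of the replay steps described above, under stated assumptions: `TableStore`, `replay`, and `savepoint` are hypothetical names, log records are modeled as plain dicts, and pages store dictionary-encoded value IDs. None of these details come from the patent text.

```python
class TableStore:
    """Hypothetical stand-in for the database table persistent storage."""

    def __init__(self, pages, dictionaries):
        self.pages = pages                # page_id -> {row_id: {column: value_id}}
        self.dictionaries = dictionaries  # column -> list of distinct values


def replay(log_records, store):
    """Apply log records to in-memory copies of only the pages and dictionaries they touch."""
    loaded_pages, loaded_dicts = {}, {}
    for rec in log_records:
        page_id, column = rec["page"], rec["column"]
        if page_id not in loaded_pages:   # load the page from persistent storage
            stored = store.pages.get(page_id, {})
            loaded_pages[page_id] = {row: dict(cols) for row, cols in stored.items()}
        if column not in loaded_dicts:    # load the dictionary for this column
            loaded_dicts[column] = list(store.dictionaries.get(column, []))
        dictionary = loaded_dicts[column]
        if rec["value"] not in dictionary:
            dictionary.append(rec["value"])
        row = loaded_pages[page_id].setdefault(rec["row"], {})
        row[column] = dictionary.index(rec["value"])  # store the encoded value ID
    return loaded_pages, loaded_dicts


def savepoint(store, loaded_pages, loaded_dicts):
    """Write every modified page and dictionary back to persistent storage."""
    store.pages.update(loaded_pages)
    store.dictionaries.update(loaded_dicts)
```

In this sketch the persistent store is left unchanged until `savepoint` is called, mirroring the last sentence of the abstract.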

    Garbage collection of multi-version concurrency control (MVCC) data blocks

    Publication No.: US09665609B2

    Publication date: 2017-05-30

    Application No.: US14553901

    Filing date: 2014-11-25

    IPC classification: G06F17/30

    CPC classification: G06F17/30348

    Abstract: Disclosed herein are system, method, and computer program product embodiments for performing garbage collection in a database management system with multi-version concurrency control. An embodiment operates by qualifying a multi-version concurrency control (MVCC) block for garbage collection, where the MVCC block includes multiple cells, each cell corresponding to a row of a table that was acted on by a transaction. The embodiment then determines that the MVCC block can be garbage collected based on MVCC information in the MVCC block, where the MVCC information includes information that determines whether changes made to rows in the multiple cells are visible in a database management system. Based on that determination, the MVCC block is garbage collected.
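
One way to picture the qualification check is the sketch below. The visibility rule used here (every cell committed at or before the oldest active snapshot timestamp) is an assumption chosen for illustration; the abstract does not state the actual criterion. `Cell`, `MVCCBlock`, `can_collect`, and `garbage_collect` are hypothetical names.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Cell:
    row_id: int
    commit_ts: Optional[int]  # None while the writing transaction is still open


@dataclass
class MVCCBlock:
    cells: List[Cell]


def can_collect(block: MVCCBlock, oldest_active_snapshot_ts: int) -> bool:
    """Assumed rule: collectable once every cell's change is visible to all readers."""
    return all(
        cell.commit_ts is not None and cell.commit_ts <= oldest_active_snapshot_ts
        for cell in block.cells
    )


def garbage_collect(blocks: List[MVCCBlock], oldest_active_snapshot_ts: int) -> List[MVCCBlock]:
    """Return the blocks that must be kept; collectable blocks are dropped."""
    return [b for b in blocks if not can_collect(b, oldest_active_snapshot_ts)]
```

A block containing a cell from an uncommitted transaction is kept, while a block whose cells all committed before the oldest snapshot is dropped.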

    Lock-free, scalable read access to shared data structures using garbage collection

    Publication No.: US09626291B2

    Publication date: 2017-04-18

    Application No.: US14643755

    Filing date: 2015-03-10

    Applicant: Ivan Schreter

    Inventor: Ivan Schreter

    IPC classification: G06F17/30 G06F12/02

    Abstract: At least one read operation of at least one object of a data container is initiated. The data container includes an anchor object, a first internal data object and a first garbage collection object, the anchor object comprising a pointer to a versioned structure tree. Thereafter, in response to the at least one incompatible write operation, a second internal data object and a second garbage collection object are created for the data container. The second garbage collection object has a reference to the second internal data object. Subsequently, the second internal data object is installed in the anchor object and the first garbage collection object is passed to a garbage collection process so that space used by the first garbage collection object in a database can be reused. Related apparatus, systems, techniques and articles are also described.

    Shadow paging based log segment directory
    7.
    Granted invention patent
    Legal status: in force

    Publication No.: US09542279B2

    Publication date: 2017-01-10

    Application No.: US13290726

    Filing date: 2011-11-07

    Applicant: Ivan Schreter

    Inventor: Ivan Schreter

    IPC classification: G06F17/30 G06F17/40 G06F11/14

    Abstract: Replay of data transactions is initiated in a data storage application. Pages of a log segment directory characterizing metadata for a plurality of log segments are loaded into memory. Thereafter, redundant pages within the log segment directory are removed. It is then determined, based on the log segment directory, which log segments need to be accessed. These log segments are accessed to execute the log replay. Related apparatus, systems, techniques and articles are also described.

    Management of I/O and log size for columnar database
    8.
    Granted invention patent
    Legal status: in force

    Publication No.: US09519673B2

    Publication date: 2016-12-13

    Application No.: US13215726

    Filing date: 2011-08-23

    Applicant: Ivan Schreter

    Inventor: Ivan Schreter

    IPC classification: G06F17/30 G06F11/14

    Abstract: A system includes reception of a transaction to change a columnar table, recordation of the transaction within a delta structure in a volatile memory, recordation of the transaction within a log of a persisted memory, reception of an instruction to merge the delta structure in the volatile memory with a first instantiation of the columnar table in the volatile memory, and in response to the instruction, merging of the delta structure in the volatile memory with the first instantiation of the columnar table in the volatile memory to create a merged columnar table in the volatile memory, and not storing the merged columnar table in the persisted memory.
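
The sequence in this abstract can be sketched as follows. This is an illustration only: `ColumnarTable` and its attributes are hypothetical, rows are modeled as plain tuples, and the persisted log is a Python list standing in for real storage.

```python
class ColumnarTable:
    def __init__(self, main_rows):
        self.main = list(main_rows)  # first instantiation of the table, in volatile memory
        self.delta = []              # delta structure, in volatile memory
        self.log = []                # stand-in for the log in persisted memory
        self.persisted_table_writes = 0  # counts writes of the table itself to persisted memory

    def apply(self, row):
        """Record a change in the in-memory delta and in the persisted log."""
        self.delta.append(row)
        self.log.append(("insert", row))

    def merge(self):
        """Merge the delta into the main table in memory only."""
        self.main = self.main + self.delta  # merged table is created in volatile memory
        self.delta = []
        # The merged table is deliberately not written to persisted memory here.
```

After a merge in this sketch, the only persisted record of the changes is the log, which is what keeps merge-time I/O low.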

    Unified table query processing
    9.
    Granted invention patent
    Legal status: in force

    Publication No.: US09465844B2

    Publication date: 2016-10-11

    Application No.: US13843898

    Filing date: 2013-03-15

    IPC classification: G06F7/00 G06F17/00 G06F17/30

    Abstract: A system and method of query processing in a multi-level storage system having a unified table architecture. A query is received by a common query execution engine connected with the unified table architecture, the query specifying a data record. The common query execution engine performs a look-up for the data record based on the query at the first level storage structure. If the data record is not present at the first level storage structure, the common query execution engine performs separate look-ups in each of the second level storage structure and the main store.
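
The look-up order described above might be sketched like this. The three stores are modeled as plain dicts, `lookup` is a hypothetical name, and preferring a second-level hit over a main-store hit is an assumption that the abstract does not state.

```python
def lookup(key, first_level, second_level, main_store):
    """Look up a record across the three levels of a unified table."""
    if key in first_level:               # first-level storage structure is checked first
        return first_level[key]
    # Not found at the first level: perform separate look-ups in both remaining stores.
    second_hit = second_level.get(key)
    main_hit = main_store.get(key)
    # Assumption: the second level holds newer data than the main store.
    return second_hit if second_hit is not None else main_hit
```

A key that is absent from all three stores yields `None` in this sketch.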

    Address generation in distributed systems using tree method

    Publication No.: US09444732B2

    Publication date: 2016-09-13

    Application No.: US13953557

    Filing date: 2013-07-29

    Applicant: Ivan Schreter

    Inventor: Ivan Schreter

    Abstract: Methods and apparatus, including computer program products, for managing a cluster of servers organized into nodes. A method of one aspect includes establishing a cluster; establishing a set of ultimate identifiers for nodes resulting from splitting in the cluster; and storing every new data object on a node that has a node identifier that identifies a subset of the set of ultimate identifiers, and providing for the object a universal identifier that combines (i) an object identifier that is unique on the node and (ii) a server identifier that is one of the ultimate identifiers in the subset. A method of another aspect includes generating for a new data object a universal identifier that has a node identifier part that uniquely identifies a node, a reserve part generated at least in part as a pseudo-random value, and an object identifier part that uniquely identifies the object in the node.
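
The three-part universal identifier from the second aspect can be illustrated with a small bit-packing sketch. The field widths (`NODE_BITS`, `RESERVE_BITS`, `OBJECT_BITS`), the field order, and the function names are all hypothetical; the abstract does not specify a layout.

```python
import random

# Hypothetical field widths; the abstract does not specify any.
NODE_BITS, RESERVE_BITS, OBJECT_BITS = 16, 16, 32


def make_universal_id(node_id: int, object_id: int, rng: random.Random) -> int:
    """Pack node part, pseudo-random reserve part, and object part into one integer."""
    if not 0 <= node_id < (1 << NODE_BITS):
        raise ValueError("node_id out of range")
    if not 0 <= object_id < (1 << OBJECT_BITS):
        raise ValueError("object_id out of range")
    reserve = rng.getrandbits(RESERVE_BITS)  # reserve part drawn pseudo-randomly
    return (node_id << (RESERVE_BITS + OBJECT_BITS)) | (reserve << OBJECT_BITS) | object_id


def split_universal_id(uid: int):
    """Recover (node_id, reserve, object_id) from a packed identifier."""
    object_id = uid & ((1 << OBJECT_BITS) - 1)
    reserve = (uid >> OBJECT_BITS) & ((1 << RESERVE_BITS) - 1)
    node_id = uid >> (RESERVE_BITS + OBJECT_BITS)
    return node_id, reserve, object_id
```

Packing and then splitting an identifier returns the original node and object parts, with the reserve part falling somewhere in its 16-bit range.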