-
公开(公告)号:US20220414015A1
公开(公告)日:2022-12-29
申请号:US17359087
申请日:2021-06-25
Applicant: Microsoft Technology Licensing, LLC
Inventor: Krystyna Ewa REISTETER , Cristian DIACONU , Rogério RAMOS , Sarika R. IYER , Siddharth Deepak MEHTA , Huanhui HU
IPC: G06F12/0882 , G06F12/0871 , G06F12/02 , G06F3/06
Abstract: Data base performance is improved using write-behind optimization of covering cache. Non-volatile memory data cache includes a full copy of stored data file(s). Data cache and storage writes, checkpoints, and recovery may be decoupled (e.g., with separate writes, checkpoints and recoveries). A covering data cache supports improved performance by supporting database operation during storage delays or outages and/or by supporting reduced I/O operations using aggregate writes of contiguous data pages (e.g., clean and dirty pages) to stored data file(s). Aggregate writes reduce data file fragmentation and reduce the cost of snapshots. Performing write-behind operations in a background process with optimistic concurrency control may support improved database performance, for example, by not interfering with write operations to data cache. Data cache may store (e.g., in metadata) data cache checkpoint information and storage checkpoint information. A stored data file may store storage checkpoint information (e.g., in a file header).
-
公开(公告)号:US20200097566A1
公开(公告)日:2020-03-26
申请号:US16138103
申请日:2018-09-21
Applicant: Microsoft Technology Licensing, LLC
Inventor: Cristian DIACONU , Naveen PRAKASH , Alejandro Hernandez SAENZ , Alexander BUDOVSKI
Abstract: The bifurcation of a log into a fixed-size log portion and a growable log portion. Log records are first written to the fixed-size log portion. At some point, the older log records within the fixed-size log portion are destaged to the growable log portion. Destaging may involve copying the log records from the fixed-size log portion and tacking the log record onto the end of the growable log portion. This destaging might occur one sub-portion (e.g., one virtual log file) at a time. The more recent log records are within the fixed-size log portion. Accordingly, new writes to the log occur to the fixed-size log portion. Furthermore, a large portion of the reads are of the more recent log records within the fixed-size log portion. Thus, optimizing performance within the fixed-size log portion results in significant improvement in the performance of the overall log.
-
公开(公告)号:US20200099745A1
公开(公告)日:2020-03-26
申请号:US16138340
申请日:2018-09-21
Applicant: Microsoft Technology Licensing, LLC
Inventor: Cristian DIACONU , Vikram WAKADE , Krystyna REISTETER
IPC: H04L29/08
Abstract: A storage segment server environment that includes storage segment server(s) that cover a contiguous address space of data. The storage segment server has storage segments that are contiguous and that matches the address space of a respective of storage segments in the cloud store service. Thus, a read of the contiguous address spaces of the data from the cloud store service may instead be handled as a single read of the contiguous address space of the storage segment server. Each storage segment server(s) may be assigned to a range within the total contiguous address space of the data. Each storage segment server(s) includes a respective storage segment write component that is configured to perform write operations to storage segments in the respective storage segment server. Each storage segment server(s) also includes a respective cloud storage write component that initiates corresponding write operations to the cloud store service.
-
公开(公告)号:US20200097578A1
公开(公告)日:2020-03-26
申请号:US16138238
申请日:2018-09-21
Applicant: Microsoft Technology Licensing, LLC
Inventor: Cristian DIACONU , Naveen PRAKASH , Alexander BUDOVSKI , Huanhui HU , Alejandro Hernandez SAENZ
IPC: G06F17/30
Abstract: Brokering log records so as to prevent log records that are not yet persisted in a persistent log from being disseminated. The log records may be generated as a primary compute system performs operations. Upon receiving a request for a log record, the broker component determines whether the requested log record has been persisted in a persistent log. If the broker component determines that the log record has been persisted in the persistent log, the broker component responds to the request by causing the requested log record to be provided to the requesting entity (e.g., a secondary compute system). On the other hand, if the log record cannot yet determine that the log record has been persisted in the persistent log, the broker component prevents the log record from being provided to the requesting entity. This prevents data from being inconsistent during recovery.
-
公开(公告)号:US20240427792A1
公开(公告)日:2024-12-26
申请号:US18825671
申请日:2024-09-05
Applicant: Microsoft Technology Licensing, LLC
Inventor: Alejandro Hernandez SAENZ , Cristian DIACONU , Krystyna Ewa REISTETER , Naveen PRAKASH , Sheetal SHROTRI , Rogério RAMOS , Alexander BUDOVSKI , Hanumantha Rao KODAVALLA
IPC: G06F16/25 , G06F16/22 , G06F16/2455 , G06F16/27
Abstract: Distributed database systems including compute nodes and page servers are described herein that enable separating logical and physical storage of database files in a distributed database system. A distributed database system includes a page server and a compute node, and is configured to store a logical database file that includes data and is associated with a file identifier. Each page server is configurable to store slices (i.e., subportions) of the logical database file. The compute node is coupled to the plurality of page servers and configured to store the logical database file responsive to a received command. In an aspect, such storage may comprise slicing the data comprising the logical database file into a set of slices with each being associated with a respective page server, maintaining an endpoint mapping for each slice of the first set of slices, and transmitting each slice to the associated for storage thereby.
-
公开(公告)号:US20200097366A1
公开(公告)日:2020-03-26
申请号:US16138683
申请日:2018-09-21
Applicant: Microsoft Technology Licensing, LLC
Inventor: Cristian DIACONU , Huanhui HU , Sheetal SHROTRI
Abstract: Assembly of a state of user data as of a particular point in time in the context of the data being spread across multiple storage segment servers. Furthermore, the assembly occurs despite the storage segment servers not needing to coordinate a timing of any snapshots. Rather, the storage segment servers perform a snapshot of the portion of the data that it individually stores without having to coordinate with any other storage segment server. The particular point of time may be any arbitrary time, such as a time selected by a user.
-
公开(公告)号:US20190205295A1
公开(公告)日:2019-07-04
申请号:US16298498
申请日:2019-03-11
Applicant: Microsoft Technology Licensing, LLC
Inventor: Cristian DIACONU , Craig Steven FREEDMAN , Per-Ake LARSON , Michael James ZWILLING
IPC: G06F16/23 , G06F16/2455 , G06F16/21 , G06F12/02 , G06F16/27
CPC classification number: G06F16/2379 , G06F12/0253 , G06F16/219 , G06F16/24561 , G06F16/27 , G06F2212/1044
Abstract: An in-memory database system stores an entirety of a database within main memory of the computer system. The database includes a table comprising a plurality of rows including a particular row. A first transaction updating the particular row is executed to create a first version of the particular row. A second transaction updating the particular row is executed to create a second version of the particular row. The first version is visible only to the first transaction, and the second version is visible only to the second transaction. The in-memory database system may associate an index with the database table, the index referencing the plurality of rows, including referencing the first version of the particular row and the second version of the particular row. The in-memory database system may initiate garbage collection of the first version of the particular row and/or the second version of the particular row.
-
公开(公告)号:US20150254273A1
公开(公告)日:2015-09-10
申请号:US14712755
申请日:2015-05-14
Applicant: Microsoft Technology Licensing, LLC
Inventor: Per-Ake LARSON , Michael James ZWILLING , Cristian DIACONU
IPC: G06F17/30
CPC classification number: G06F17/30227 , G06F17/30353 , G06F17/30371 , G06F17/30575 , G06F17/30581 , G06F17/30864
Abstract: The subject disclosure relates to a distributed transaction management technique that ensures synchronization between participating nodes in a global or distributed transaction. The technique leverages a commit protocol that uses local clocks at the respective participating nodes. Participants in a global transaction are configured to utilize the same commit timestamp and logical read time and can advance their respective local clocks to establish this synchronization. In one embodiment, distributed commit utilizes a modified version of two-phase commit that includes an extra phase to collect commit timestamp votes from participants. Additionally, a heartbeat mechanism can be used to establish loose synchronization between nodes. In another embodiment, a node can respond to a remote transaction request by returning a list of nodes involved in generating the result of the transaction and the types of access used by such nodes in addition to the transaction result itself.
Abstract translation: 本公开涉及分布式事务管理技术,其确保全局或分布式事务中的参与节点之间的同步。 该技术利用在相应参与节点处使用本地时钟的提交协议。 全局事务中的参与者被配置为利用相同的提交时间戳和逻辑读取时间,并且可以推进其各自的本地时钟以建立该同步。 在一个实施例中,分布式提交利用两阶段提交的修改版本,其包括从参与者收集提交时间戳投票的额外阶段。 另外,可以使用心跳机制来建立节点之间的松散同步。 在另一个实施例中,节点可以通过返回生成事务结果所涉及的节点列表以及除了事务结果本身之外由这些节点使用的访问类型来响应远程事务请求。
-
公开(公告)号:US20230350912A1
公开(公告)日:2023-11-02
申请号:US18351258
申请日:2023-07-12
Applicant: Microsoft Technology Licensing, LLC
Inventor: Alejandro Hernandez SAENZ , Cristian DIACONU , Krystyna Ewa REISTETER , Naveen PRAKASH , Sheetal SHROTRI , Rogério RAMOS , Alexander BUDOVSKI , Hanumantha Rao KODAVALLA
CPC classification number: G06F16/256 , G06F16/278 , G06F16/2272
Abstract: Distributed database systems including compute nodes and page servers are described herein that enable separating logical and physical storage of database files in a distributed database system. A distributed database system includes a page server and a compute node, and is configured to store a logical database file that includes data and is associated with a file identifier. Each page server is configurable to store slices (i.e., subportions) of the logical database file. The compute node is coupled to the plurality of page servers and configured to store the logical database file responsive to a received command. In an aspect, such storage may comprise slicing the data comprising the logical database file into a set of slices with each being associated with a respective page server, maintaining an endpoint mapping for each slice of the first set of slices, and transmitting each slice to the associated for storage thereby.
-
公开(公告)号:US20230297510A1
公开(公告)日:2023-09-21
申请号:US18311699
申请日:2023-05-03
Applicant: Microsoft Technology Licensing, LLC
Inventor: Krystyna Ewa REISTETER , Cristian DIACONU , Rogério RAMOS , Sarika R. IYER , Siddharth Deepak MEHTA , Huanhui HU
IPC: G06F12/0882 , G06F3/06 , G06F12/02 , G06F12/0871
CPC classification number: G06F12/0882 , G06F3/0604 , G06F3/0656 , G06F3/0679 , G06F12/0246 , G06F12/0871
Abstract: Data base performance is improved using write-behind optimization of covering cache. Non-volatile memory data cache includes a full copy of stored data file(s). Data cache and storage writes, checkpoints, and recovery may be decoupled (e.g., with separate writes, checkpoints and recoveries). A covering data cache supports improved performance by supporting database operation during storage delays or outages and/or by supporting reduced I/O operations using aggregate writes of contiguous data pages (e.g., clean and dirty pages) to stored data file(s). Aggregate writes reduce data file fragmentation and reduce the cost of snapshots. Performing write-behind operations in a background process with optimistic concurrency control may support improved database performance, for example, by not interfering with write operations to data cache. Data cache may store (e.g., in metadata) data cache checkpoint information and storage checkpoint information. A stored data file may store storage checkpoint information (e.g., in a file header).
-
-
-
-
-
-
-
-
-