-
公开(公告)号:US10802766B2
公开(公告)日:2020-10-13
申请号:US15720959
申请日:2017-09-29
Applicant: Oracle International Corporation
Inventor: Nilesh Choudhury , Kothanda Umamageswaran , Krishnan Meiyyappan , Jia Shi , Vijay Sridharan , Zuoyu Tao , Kai Zhang , Semen Ustimenko , Salini Selvaraj Kowsalya , Somayeh Sardashti
IPC: G06F16/23 , G06F3/06 , G06F16/18 , G06F16/903
Abstract: A shared storage architecture persistently stores database files in non-volatile random access memories (NVRAMs) of computing nodes of a multi-node DBMS. The computing nodes of the multi-node DBMS not only collectively store database data on NVRAMs of the computing nodes, but also host database server instances that process queries in parallel, host database sessions and database processes, and together manage access to a database stored on the NVRAMs of the computing nodes. To perform a data block read operation from persistent storage, a data block may be transferred directly over a network between NVRAM of a computing node that persistently stores the data block to a database buffer in non-volatile RAM of another computing node that requests the data block. The transfer is accomplished using remote direct memory access (“RDMA).
-
公开(公告)号:US10250510B2
公开(公告)日:2019-04-02
申请号:US14488161
申请日:2014-09-16
Applicant: Oracle International Corporation
Inventor: Vijay Sridharan , Richard Frank , Jia Shi , Kothanda Umamageswaran , Juan Loaiza , Alexander Tsukerman , Bang Nguyen , Sumanta Chatterjee
IPC: G06F15/16 , H04L12/851 , H04L29/08
Abstract: A method and apparatus for intelligent network resource manager for distributed computing systems is provided. A first priority is assigned to a first virtual channel set that includes at least two virtual channels of a plurality of virtual channels associated with a physical communication channel. A second priority is assigned to a second virtual channel set that includes at least one virtual channel of the plurality of virtual channels. The first virtual channel set has more virtual channels than the second virtual channel set. Outbound messages of the first priority are directed to virtual channels of the first virtual channel set. Outbound messages of the second priority are directed to virtual channels of the second virtual channel set. The virtual channels are processed in a round-robin order, where processing includes sending the outbound messages over the physical communication channel.
-
公开(公告)号:US10229161B2
公开(公告)日:2019-03-12
申请号:US14489221
申请日:2014-09-17
Applicant: Oracle International Corporation
Inventor: Sarat B. Kakarla , Jia Shi , Selcuk Aya , Kothanda Umamageswaran , Juan R. Loaiza
IPC: G06F17/30
Abstract: Approaches, techniques, and mechanisms are disclosed for improved caching in database systems that deal with multiple data access patterns, such as in database systems that interface with both OLTP and Data Warehouse clients. A cache is deployed between a database server and a storage system that stores data units. Some of the data units accessed by the database server are buffered within the cache. The data units may be associated with data access patterns, such as a random data access pattern or a scan data access pattern, in accordance with which the database server is or appears to be accessing the data units. A processor selects when to cache data units accessed by the database server, based at least on the associated data access patterns. Recent access counts may also be stored for the data units, and may further be utilized to select when to cache data units.
-
24.
公开(公告)号:US20140281272A1
公开(公告)日:2014-09-18
申请号:US13801319
申请日:2013-03-13
Applicant: ORACLE INTERNATIONAL CORPORATION
Inventor: Juan R. Loaiza , Kothanda Umamageswaran , David Friedman , Jia Shi , Zuoyu Tao , Alex Tsukerman
IPC: G06F12/08
CPC classification number: G06F12/0868 , G06F11/2058 , G06F11/2082 , G06F12/0804 , G06F12/12 , G06F2212/222 , G06F2212/262 , G06F2212/286
Abstract: No-loss rapid recovery performs resynchronization efficiently while concurrently allowing availability to mirrored data on the storage device. No-loss rapid recovery has two stages and involves storage devices that have both a non-volatile cache and primary storage and that operate as mirror buddies. The first stage is referred to herein as the buddy-retention stage. During the buddy-retention stage, writes to mirrored data are not performed on the offline mirror buddy but are performed on the online mirror buddy. The mirrored data changed in the online mirrored buddy is retained in the non-volatile cache of the retention buddy. The next stage is referred to herein as the rapid resynchronization stage. In this stage, the changed mirrored data retained by the retention buddy for no-loss rapid recovery is used to resynchronize the offline buddy. The storage device is resynchronized using the changed mirrored data retained in the cache of the mirror buddy.
Abstract translation: 无损失快速恢复有效地执行重新同步,同时允许可用性在存储设备上镜像数据。 无损快速恢复有两个阶段,涉及具有非易失性缓存和主存储并且作为镜像伙伴操作的存储设备。 第一阶段在本文中称为保留阶段。 在好友保留期间,对镜像数据的写入不会在离线镜像伙伴上执行,而是在在线镜像伙伴上执行。 在线镜像伙伴中更改的镜像数据保留在保留伙伴的非易失性缓存中。 下一阶段在本文中被称为快速再同步化阶段。 在这个阶段,由保留伙伴保留的无损失快速恢复所更改的镜像数据用于重新同步脱机伙伴。 使用保留在镜像伙伴缓存中的已更改的镜像数据重新同步存储设备。
-
公开(公告)号:US10956335B2
公开(公告)日:2021-03-23
申请号:US15720972
申请日:2017-09-29
Applicant: Oracle International Corporation
Inventor: Zuoyu Tao , Jia Shi , Kothanda Umamageswaran , Juan R. Loaiza
IPC: G06F12/0873 , G06F12/0864 , G06F16/22 , G06F16/2455 , G06F12/02 , G06F12/0868 , G06F12/0871 , G06F15/173
Abstract: Data blocks are cached in a persistent cache (“NV cache”) allocated from as non-volatile RAM (“NVRAM”). The data blocks may be accessed in place in the NV cache of a “source” computing element by another “remote” computing element over a network using remote direct memory access (“RMDA”). In order for a remote computing element to access the data block in NV cache on a source computing element, the remote computing element needs the memory address of the data block within the NV cache. For this purpose, a hash table is stored and maintained in RAM on the source computing element. The hash table identifies the data blocks in the NV cache and specifies a location of the cached data block within the NV cache.
-
公开(公告)号:US10380021B2
公开(公告)日:2019-08-13
申请号:US13801319
申请日:2013-03-13
Applicant: Oracle International Corporation
Inventor: Juan R. Loaiza , Kothanda Umamageswaran , David Friedman , Jia Shi , Zuoyu Tao , Alex Tsukerman
IPC: G06F12/0868 , G06F12/0804 , G06F11/20 , G06F12/12
Abstract: No-loss rapid recovery performs resynchronization efficiently while concurrently allowing availability to mirrored data on the storage device. No-loss rapid recovery has two stages and involves storage devices that have both a non-volatile cache and primary storage and that operate as mirror buddies. The first stage is referred to herein as the buddy-retention stage. During the buddy-retention stage, writes to mirrored data are not performed on the offline mirror buddy but are performed on the online mirror buddy. The mirrored data changed in the online mirrored buddy is retained in the non-volatile cache of the retention buddy. The next stage is referred to herein as the rapid resynchronization stage. In this stage, the changed mirrored data retained by the retention buddy for no-loss rapid recovery is used to resynchronize the offline buddy. The storage device is resynchronized using the changed mirrored data retained in the cache of the mirror buddy.
-
公开(公告)号:US20190079864A1
公开(公告)日:2019-03-14
申请号:US15705166
申请日:2017-09-14
Applicant: Oracle International Corporation
Inventor: Jia Shi , Wei Zhang , Kothanda Umamageswaran , Neil J.S. MacNaughton , Vijayakrishnan Nagarajan
IPC: G06F12/0808 , G06F13/28 , G06F17/30
Abstract: Techniques related to a server-side extension of client-side caches are provided. A storage server computer receives, from a database server computer, an eviction notification indicating that a data block has been evicted from the database server computer's cache. The storage server computer comprises a memory hierarchy including a volatile cache and a persistent cache. Upon receiving the eviction notification, the storage server computer retrieves the data block from the persistent cache and stores it in the volatile cache. When the storage server computer receives, from the database server computer, a request for the data block, the storage server computer retrieves the data block from the volatile cache. Furthermore, the storage server computer sends the data block to the database server computer, thereby causing the data block to be stored in the database server computer's cache. Still further, the storage server computer evicts the data block from the volatile cache.
-
公开(公告)号:US10031855B2
公开(公告)日:2018-07-24
申请号:US15217136
申请日:2016-07-22
Applicant: Oracle International Corporation
Inventor: Zuoyu Tao , Jia Shi , Kothanda Umamageswaran , Selcuk Aya
IPC: G06F12/08 , G06F12/0893 , G06F12/0806 , G06F3/06 , G06F17/30 , G06F12/02 , G06F12/0804 , G06F12/0866
Abstract: A method and system for fast file initialization is provided. An initialization request to create or extend a file is received. The initialization request comprises or identifies file template metadata. A set of allocation units are allocated, the set of allocation units comprising at least one allocation unit for the file on a primary storage medium without initializing at least a portion of the file on the primary storage medium. The file template metadata is stored in a cache. The cache resides in at least one of volatile memory and persistent flash storage. A second request is received corresponding to a particular allocation unit of the set of allocation units. Particular file template metadata associated with the particular allocation unit is obtained. In response to the second request, at least a portion of a new allocation unit is generated.
-
公开(公告)号:US09772793B2
公开(公告)日:2017-09-26
申请号:US14490367
申请日:2014-09-18
Applicant: Oracle International Corporation
Inventor: Vijay Sridharan , Alexander Tsukerman , Jia Shi , Kothanda Umamageswaran
IPC: G06F12/00 , G06F3/06 , G06F12/0875
CPC classification number: G06F3/065 , G06F3/061 , G06F3/0619 , G06F3/0659 , G06F3/067 , G06F3/0689 , G06F12/0875 , G06F2212/313
Abstract: In a write by-peer-reference, a storage device client writes a data block to a target storage device in the storage system by sending a write request to the target storage device, the write request specifying information used to obtain the data block from a source storage device in the storage system. The target storage device sends a read request to the source storage device for the data block. The source storage device sends the data block to the target storage device, which then writes the data block to the target storage device. The data block is thus written to the target storage device without the storage device client transmitting the data block itself to the target storage device.
-
公开(公告)号:US20160328331A1
公开(公告)日:2016-11-10
申请号:US15217136
申请日:2016-07-22
Applicant: Oracle International Corporation
Inventor: Zuoyu Tao , Jia Shi , Kothanda Umamageswaran , Selcuk Aya
IPC: G06F12/0893 , G06F3/06
CPC classification number: G06F12/0893 , G06F3/06 , G06F3/0611 , G06F3/0632 , G06F3/0643 , G06F3/0685 , G06F12/0246 , G06F12/0804 , G06F12/0806 , G06F12/0866 , G06F17/30115 , G06F2212/1008 , G06F2212/1024 , G06F2212/225 , G06F2212/46 , G06F2212/604 , G06F2212/608 , G06F2212/7207 , G06F2212/7208
Abstract: A method and system for fast file initialization is provided. An initialization request to create or extend a file is received. The initialization request comprises or identifies file template metadata. A set of allocation units are allocated, the set of allocation units comprising at least one allocation unit for the file on a primary storage medium without initializing at least a portion of the file on the primary storage medium. The file template metadata is stored in a cache. The cache resides in at least one of volatile memory and persistent flash storage. A second request is received corresponding to a particular allocation unit of the set of allocation units. Particular file template metadata associated with the particular allocation unit is obtained. In response to the second request, at least a portion of a new allocation unit is generated.
-
-
-
-
-
-
-
-
-