-
公开(公告)号:US20130254612A1
公开(公告)日:2013-09-26
申请号:US13893465
申请日:2013-05-14
Applicant: VMware, Inc.
Inventor: Carl A. WALDSPURGER , Dilpreet BINDRA , Gregory HARM , Patrick TULLMANN
IPC: G06F11/08
CPC classification number: G06F11/08 , G06F11/141
Abstract: A method is provided for recovering from an uncorrected memory error located at a memory address as identified by a memory device. A stored hash value for a memory page corresponding to the identified memory address is used to determine the correct data. Because the memory device specifies the location of the corrupted data, and the size of the window where the corruption occurred, the stored hash can be used to verify memory page reconstruction. With the known good part of the data in hand, the hashes of the pages using possible values in place of the corrupted data are calculated. It is expected that there will be a match between the previously stored hash and one of the computed hashes. As long as there is one and only one match, then that value, used in the place of the corrupted data, is the correct value. The corrupt data, once replaced, allows operation of the memory device to continue without needing to interrupt or otherwise affect a system's operation.
Abstract translation: 提供一种用于从由存储器件识别的存储器地址处的未校正的存储器错误中恢复的方法。 用于与所识别的存储器地址相对应的存储器页的存储的散列值用于确定正确的数据。 由于内存设备指定损坏的数据的位置以及发生损坏的窗口的大小,因此可以使用存储的散列来验证内存页重建。 利用手头已知的很好的部分数据,可以计算使用可能值代替已损坏数据的页面散列。 预期在先前存储的散列和所计算的散列之一将存在匹配。 只要有一个只有一个匹配,那么在损坏的数据的位置使用的值是正确的值。 损坏的数据一旦被更换,就允许存储设备的操作继续进行,而不需要中断或以其他方式影响系统的操作。