Failed temperature sensor detection and mitigation within data storage devices

    公开(公告)号:US12013749B2

    公开(公告)日:2024-06-18

    申请号:US17853510

    申请日:2022-06-29

    IPC分类号: G06F11/00 G06F11/07 G06F11/30

    摘要: Methods and apparatus for detecting a failed temperature sensor within a data storage device and for mitigating the loss of the sensor are provided. One such data storage device includes a non-volatile memory (NVM), a set of temperature sensors, and a processor coupled to the NVM and the temperature sensors. The processor is configured to detect failure of one of the temperature sensors and obtain temperature data from the other temperature sensors. The processor is further configured to estimate, based on the obtained temperature data, the temperature at the failed sensor, and then control at least one function of the data storage device based on the estimated temperature, such as controlling entry into a Read Only mode. In some examples, the processor estimates the temperature at the failed sensor or at various virtual sensor locations using pre-determined formulas having offsets and coefficients determined during an initial machine learning calibration procedure.

    Custom patching automation with machine learning integration

    公开(公告)号:US12008114B2

    公开(公告)日:2024-06-11

    申请号:US18139455

    申请日:2023-04-26

    摘要: A machine learning computing system identifies a vulnerability associated with a server. Based on information associated with the server and a knowledge base, the computing system schedules an interval for patching the server in a centralized tracking module. Based on the knowledge base and the vulnerability, the computing system creates, validates, and deploys the patch job. During patch job execution, the computing system monitors the status of the patch job at the server and transmits status updates to a user interface module. After expiration of the interval, the computing system generates an assessment report for the executed patch job. The computing system updates the knowledge base based on the assessment report to improve future decisioning processes. Based on the success or failure of the patch job, the computing system, upon a failure indication, automatically reschedules an interval for patching the server.

    Restoring a system by load switching to an alternative cloud instance and self healing

    公开(公告)号:US12007832B2

    公开(公告)日:2024-06-11

    申请号:US17652637

    申请日:2022-02-25

    摘要: A system includes a production computing environment including a plurality of components, a centralized data repository that receives and stores data feeds relating to one or more components as a data log, a plurality of cloud infrastructures and at least one processor configured to receive the data feed relating to the components, predict an anomaly associated with a component based on the data feed relating to the component, identify a system configuration needed to run a current workload associated with the component, search each cloud infrastructure for a cloud instance that can support the identified system configuration, identify a cloud instance based on the search, initiate the identified cloud instance and switch the workload from an original system to the initiated cloud instance.

    Wide stripe data storage and constructing, repairing and updating method thereof

    公开(公告)号:US12001278B2

    公开(公告)日:2024-06-04

    申请号:US17865313

    申请日:2022-07-14

    IPC分类号: G06F11/00 G06F11/10

    CPC分类号: G06F11/1004 G06F11/1076

    摘要: A wide stripe data storage and a constructing, repairing and updating method thereof are provided. In the wide stripe data storage, a stripe is (n, k, r, z) and comprises local groups and a global group comprising a rack for storing global parity blocks; each local group comprises





    r
    +
    1

    c




    racks, each rack has different storage nodes for storing data blocks or local parity blocks, the total number of the data blocks and the local parity blocks in each rack is c; one local parity block and r data blocks are stored in each local group; the number of the storage node in each rack is greater than or equal to c; c is smaller than or equal to the node fault-tolerant capability f of the wide stripe based on fault-tolerant attribute of the rack; the number of the global parity blocks is f−1; z is the total number of the racks.