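Method and system for maintaining server stability

    Publication No.: WO2018076801A1

    Publication date: 2018-05-03

    Application No.: PCT/CN2017/092963

    Filing date: 2017-07-14

    Inventor: 彭仁诚

    IPC classification: G06F11/07

    CPC classification: G06F11/079 G06F11/0793

    Abstract: A method and system for maintaining server stability. The method includes an internal determination step that determines whether at least one of an interface, a process, or data is abnormal. If so, an exception handling step is performed; if not, an external fault handling step is performed. The exception handling step handles the exception through data correction, killing some or all processes and restarting the killed processes, and recording and notification. The external fault handling step monitors the server's external interfaces in real time and notifies the user to readjust and handle the interface. The server is thereby monitored in real time; when an exception occurs, exceptions and faults are cleared automatically, which keeps the server stable and the business operating normally.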
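
The branching described in this abstract can be sketched as a small dispatcher. This is a minimal illustration only: the class `Stabilizer`, the enum `Source`, and the mapping of each anomaly type to one specific action are assumptions made for the example and are not taken from the application.

```python
from dataclasses import dataclass, field
from enum import Enum


class Source(Enum):
    INTERFACE = "interface"
    PROCESS = "process"
    DATA = "data"
    EXTERNAL = "external"


@dataclass
class Stabilizer:
    """Hypothetical dispatcher; all names here are illustrative."""
    log: list = field(default_factory=list)

    def handle(self, source: Source, detail: str) -> str:
        # Internal determination step: is an interface, process, or data abnormal?
        if source in (Source.INTERFACE, Source.PROCESS, Source.DATA):
            return self._handle_exception(source, detail)
        # Otherwise fall through to the external fault handling step.
        return self._handle_external_fault(detail)

    def _handle_exception(self, source: Source, detail: str) -> str:
        # Assumed mapping of anomaly type to action (not stated in the abstract).
        if source is Source.DATA:
            action = "correct data"
        elif source is Source.PROCESS:
            action = "kill and restart process"
        else:
            action = "record and notify"
        self.log.append(f"{source.value}: {detail} -> {action}")
        return action

    def _handle_external_fault(self, detail: str) -> str:
        self.log.append(f"external: {detail} -> notify user")
        return "notify user to readjust interface"
```

Every call appends one entry to `log`, which stands in for the recording part of the exception handling step.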

    3. STORAGE ANOMALY DETECTION
    Type: Invention application
    Status: Under examination (published)

    Publication No.: WO2018022183A1

    Publication date: 2018-02-01

    Application No.: PCT/US2017/034857

    Filing date: 2017-05-26

    IPC classification: G06F11/30 G06F11/34

    Abstract: The technology described in this document is, among other things, capable of efficiently monitoring storage device signal data for anomalies. In an example method, signal data for a plurality of non-transitory storage devices is collected. The method determines a hyper feature representation from the collected signal data and computes, using the hyper feature representation, scores for statistics associated with the non-transitory storage devices. The method further determines a reduced hyper feature representation aggregating the scores for each of the statistics associated with each of the non-transitory storage devices; generates, using the reduced hyper feature representation, storage device scores for the non-transitory storage devices of the plurality, respectively; and identifies one or more non-transitory storage devices from among the plurality of non-transitory storage devices exhibiting anomalous storage device behavior using the storage device scores.

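The score-then-aggregate pipeline in this abstract can be illustrated with a much simpler stand-in. The sketch below does not implement the hyper feature representation; it assumes a median/MAD score per statistic and a max aggregation per device, both chosen only for illustration, and it assumes every device reports every statistic. The function names and the threshold are hypothetical.

```python
from statistics import median


def robust_scores(values):
    """Score each value by its distance from the median, in MAD units."""
    med = median(values)
    mad = median(abs(v - med) for v in values) or 1.0  # avoid division by zero
    return [abs(v - med) / mad for v in values]


def anomalous_devices(signals, threshold=5.0):
    """signals maps device -> {statistic: value}; returns the flagged devices."""
    devices = sorted(signals)
    stats = sorted({s for d in devices for s in signals[d]})
    # Step 1: per-statistic scores computed across all devices.
    per_stat = {
        s: robust_scores([signals[d][s] for d in devices]) for s in stats
    }
    # Step 2: aggregate each device's per-statistic scores into one device score.
    device_score = {
        d: max(per_stat[s][i] for s in stats) for i, d in enumerate(devices)
    }
    # Step 3: flag devices whose score exceeds the (assumed) threshold.
    return [d for d in devices if device_score[d] > threshold]
```

For example, given five devices where only one reports a latency far from the rest, only that device is returned.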

    4. AUTOMATED ORDERING OF COMPUTER SYSTEM REPAIR
    Type: Invention application
    Status: Under examination (published)

    Publication No.: WO2017142791A1

    Publication date: 2017-08-24

    Application No.: PCT/US2017/017274

    Filing date: 2017-02-09

    IPC classification: G06F11/07 G06F11/30

    Abstract: Monitoring the health of a computer system and suggesting an order of repair when problems within the computer system have been identified. Problem(s) and problem entity(s) within the computer system are identified during monitoring. Relationship(s) of the problem entities with other entities in the computer system are identified. A relationship type for each of the identified relationship(s) is determined. A combination of the identified problem(s), the identified problem entity(s), and the determined relationship type(s) is analyzed to determine an order in which repairs of one or more user-visible entities of the computing system should occur in order to address the identified problem(s). An alert comprising the determined order of the repairs is then presented to a user.

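One plausible way to derive a repair order from entity relationships is a topological sort. The sketch below assumes a single relationship type, "depends on", and repairs dependencies first; the abstract does not name concrete relationship types or an ordering algorithm, so both are assumptions. It uses `graphlib.TopologicalSorter` from the Python standard library (3.9+).

```python
from graphlib import TopologicalSorter


def repair_order(problem_entities, depends_on):
    """Order repairs so each entity is fixed after the entities it depends on.

    depends_on maps entity -> set of entities it depends on. This is a
    hypothetical relationship type used only for illustration.
    """
    problems = set(problem_entities)
    # Keep only edges between entities that actually have problems.
    graph = {
        e: {d for d in depends_on.get(e, ()) if d in problems}
        for e in problems
    }
    # static_order() yields predecessors (dependencies) before dependents.
    return list(TopologicalSorter(graph).static_order())
```

With a web service that depends on a database that depends on a disk, the suggested order is disk, then database, then web service.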

    5. HARDWARE APPARATUSES AND METHODS FOR MEMORY CORRUPTION DETECTION
    Type: Invention application
    Status: Under examination (published)

    Publication No.: WO2017112234A1

    Publication date: 2017-06-29

    Application No.: PCT/US2016/063211

    Filing date: 2016-11-22

    Applicant: INTEL CORPORATION

    IPC classification: G06F12/02 G06F11/07

    Abstract: Methods and apparatuses relating to memory corruption detection are described. In one embodiment, a hardware processor includes an execution unit to execute an instruction to request access to a block of a memory through a pointer to the block of the memory, and a memory management unit to allow access to the block of the memory when a memory corruption detection value in the pointer is validated with a memory corruption detection value in the memory for the block, wherein a position of the memory corruption detection value in the pointer is selectable between a first location and a second, different location.

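The check described here, comparing a value carried in the pointer with a value stored for the memory block, can be modeled in software. This is a toy model, not the hardware mechanism: the 4-bit tag width, the 64-byte block size, the bit positions, and all function names are assumptions for illustration.

```python
TAG_BITS = 4                      # hypothetical tag width
TAG_MASK = (1 << TAG_BITS) - 1


def make_pointer(address, tag, tag_shift):
    """Embed a tag into an address at bit position tag_shift."""
    return address | ((tag & TAG_MASK) << tag_shift)


def check_access(pointer, tag_shift, block_tags, block_size=64):
    """Allow access only if the pointer's tag matches the tag stored for the block."""
    tag = (pointer >> tag_shift) & TAG_MASK
    # Strip the tag bits to recover the plain address.
    address = pointer & ~(TAG_MASK << tag_shift)
    return block_tags.get(address // block_size) == tag
```

Passing `tag_shift` explicitly mirrors the selectable position of the detection value in the pointer: the same block can be checked with the tag placed at bit 56 or at bit 48, as long as both sides agree on the position.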

    6. STORAGE ERROR TYPE DETERMINATION
    Type: Invention application
    Status: Under examination (published)

    Publication No.: WO2017079454A1

    Publication date: 2017-05-11

    Application No.: PCT/US2016/060358

    Filing date: 2016-11-03

    Inventor: DONLIN, Pat

    IPC classification: G06F11/00

    Abstract: The present disclosure relates to an apparatus and a method for collecting failure/error history lists to identify and categorize erring memory locations in randomly accessible memory of a computer system. Methods and apparatus consistent with the present disclosure may identify whether particular memory cells, rows of memory cells, or columns of memory cells within a memory device are associated with transient or persistent errors. These methods and apparatus may also avoid using portions of memory that have been associated with persistent errors or failures.

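A minimal sketch of categorizing locations from an error history follows. The rule used, "three or more logged errors at the same location means persistent", is an assumption for illustration; the abstract does not state how transient and persistent errors are distinguished. Function names are hypothetical.

```python
from collections import Counter


def classify_errors(history, persistent_after=3):
    """history is a list of (row, column) error locations, one per logged error."""
    counts = Counter(history)
    return {
        loc: "persistent" if n >= persistent_after else "transient"
        for loc, n in counts.items()
    }


def usable(location, classification):
    """Avoid locations already classified as persistent."""
    return classification.get(location) != "persistent"
```

The same counting could be applied to whole rows or columns by keying the history on the row or column index alone.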

    7. SYSTEM AND METHOD FOR GENERATING A GRAPHICAL DISPLAY REGION INDICATIVE OF CONDITIONS OF A COMPUTING INFRASTRUCTURE
    Type: Invention application
    Status: Under examination (published)

    Publication No.: WO2017079118A1

    Publication date: 2017-05-11

    Application No.: PCT/US2016/059839

    Filing date: 2016-11-01

    Applicant: SERVICENOW, INC.

    IPC classification: G06F11/34

    Abstract: Generating a graphical display region including a synchronized display of alert data and impact data indicative of conditions of a computing infrastructure is described. Alerts are identified where each alert has a timestamp indicative of a first time at which it was identified. An impact calculation is performed to generate the impact data based on alerts valid as of a second time proximate to an impact calculation start time. The generated graphical display region includes impact data valid as of a display time and alert data indicative of the alerts valid as of the second time.


    8. METHOD AND SYSTEM FOR ALLOCATING JOBS TO A SET OF COMPUTING NODES BASED ON A HARDWARE HEALTH CHECK
    Type: Invention application
    Status: Under examination (published)

    Publication No.: WO2017074506A1

    Publication date: 2017-05-04

    Application No.: PCT/US2016/029956

    Filing date: 2016-04-29

    IPC classification: G06F11/07 G06F9/48

    Abstract: Example computer-implemented methods, computer-readable media, and computer systems are described for performing a computing node health check. In some aspects, a routine health check of a plurality of computing nodes of a computer system is performed. A computing job is assessed. A first set of computing nodes are allocated from the plurality of computing nodes to the computing job. A prior-job-execution diagnosis is performed on the first set of computing nodes. Whether the first set of computing nodes are all healthy is determined. In response to determining that the first set of computing nodes are healthy, the job is executed. The job is monitored while the job is running. Whether the job fails or succeeds is determined. In response to determining that the job fails, a post-job-execution diagnosis is performed on an exit code of the job. A result of the post-job-execution diagnosis is output via a user interface of the computer system.

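The sequence of checks in this abstract can be sketched as one function. This is an illustrative outline only: `run_job`, its parameters, the returned dictionary shape, and the use of the same `is_healthy` callback for both the routine check and the prior-job-execution diagnosis are all assumptions, and job monitoring is reduced to reading the exit code.

```python
def run_job(job, nodes, is_healthy, needed):
    """Allocate healthy nodes, run the job, and diagnose a failure from its exit code."""
    # Routine health check over all nodes.
    candidates = [n for n in nodes if is_healthy(n)]
    if len(candidates) < needed:
        return {"status": "not scheduled", "reason": "too few healthy nodes"}
    allocated = candidates[:needed]
    # Prior-job-execution diagnosis on the allocated set only.
    if not all(is_healthy(n) for n in allocated):
        return {"status": "not scheduled", "reason": "allocated node unhealthy"}
    # The job is a callable that takes the allocated nodes and returns an exit code.
    exit_code = job(allocated)
    if exit_code == 0:
        return {"status": "succeeded", "nodes": allocated}
    # Post-job-execution diagnosis based on the exit code.
    return {
        "status": "failed",
        "nodes": allocated,
        "diagnosis": f"exit code {exit_code}",
    }
```

A real scheduler would map exit codes to likely causes in the post-job step; here the code is simply reported back.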

    9. CONTINGENT LOAD SUPPRESSION
    Type: Invention application
    Status: Under examination (published)

    Publication No.: WO2017021679A1

    Publication date: 2017-02-09

    Application No.: PCT/GB2016/051856

    Filing date: 2016-06-21

    Applicant: ARM LIMITED

    IPC classification: G06F9/30 G06F9/38

    Abstract: A data processing system (2) supports non-speculative execution of vector load instructions that perform at least one contingent load of a data value. Fault detection circuitry (26) serves to detect whether a contingent load is a fault-generating contingent load or a fault-free contingent load. Contingent load suppression circuitry (28) detects and suppresses a fault-free contingent load that matches predetermined criteria that may result in an undesired change of architectural state (undesired side-effect). Examples of such predetermined criteria are that the contingent load is to a non-memory device or that the contingent load will trigger a diagnostic response such as entry into a halting debug mode or triggering of a debug exception.


    10. MEMRISTIVE NEUROMORPHIC CIRCUIT AND METHOD FOR TRAINING THE MEMRISTIVE NEUROMORPHIC CIRCUIT
    Type: Invention application
    Status: Under examination (published)

    Publication No.: WO2017010049A1

    Publication date: 2017-01-19

    Application No.: PCT/JP2016/003001

    Filing date: 2016-06-22

    IPC classification: G06N3/063

    Abstract: A neural network (10) is implemented as a memristive neuromorphic circuit that includes a neuron circuit (112, 114) and a memristive device (113) connected to the neuron circuit (112, 114). An input voltage is sensed at a first terminal of the memristive device (113) during a feedforward operation of the neural network (10). An error voltage is sensed at a second terminal of the memristive device (113) during an error backpropagation operation of the neural network (10). In accordance with a training rule, a desired conductance change for the memristive device (113) is computed based on the sensed input voltage and the sensed error voltage. Then a training voltage is applied to the memristive device (113). Here, the training voltage is proportional to a logarithmic value of the desired conductance change.

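The relationship stated in the last two sentences of this abstract can be written compactly. The symbols are introduced here for illustration and do not come from the application: $f$ stands for the unspecified training rule, $V_{\mathrm{in}}$ and $V_{\mathrm{err}}$ for the sensed input and error voltages, and $k$ for an assumed proportionality constant. How the sign and units of $\Delta G$ are handled is not stated in the abstract and is left open.

```latex
\Delta G = f\!\left(V_{\mathrm{in}},\, V_{\mathrm{err}}\right), \qquad
V_{\mathrm{train}} = k \,\log\left(\Delta G\right)
```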