Hardware/software based indirect time stamping methodology for proactive hardware/software event detection and control
    1.
    发明申请
    Hardware/software based indirect time stamping methodology for proactive hardware/software event detection and control 失效
    用于主动硬件/软件事件检测和控制的基于硬件/软件的间接时间戳方法

    公开(公告)号:US20050144532A1

    公开(公告)日:2005-06-30

    申请号:US10735412

    申请日:2003-12-12

    IPC分类号: G06F11/00

    摘要: An improved method and apparatus for time stamping events occurring on a large scale distributed network uses a local counter associated with each processor of the distributed network. Each counter resets at the same time globally so that all events are recorded with respect to a particular time. The counter is stopped when a critical event is detected. The events are masked or filtered in an online or offline fashion to eliminate non-critical events from triggering a collection by the system monitor or service/host processor. The masking can be done dynamically through the use of an event history logger. The central system may poll the remote processor periodically to receive the accurate counter value from the local counter and device control register. Remedial action can be taken when conditional probability calculations performed on the historical information indicate that a critical event is about to occur.

    摘要翻译: 用于在大规模分布式网络上发生的时间戳事件的改进的方法和装置使用与分布式网络的每个处理器相关联的本地计数器。 每个计数器在全局同时重置,以便在特定时间内记录所有事件。 当检测到关键事件时,计数器停止。 这些事件以在线或离线方式被屏蔽或过滤,以消除系统监视器或服务/主机处理器触发集合的非关键事件。 可以通过使用事件历史记录器来动态地完成掩蔽。 中央系统可以周期性地轮询远程处理器以从本地计数器和设备控制寄存器接收准确的计数器值。 对历史信息进行条件概率计算时可以采取补救措施,表明将会发生重大事件。

    System for discovering business processes from noisy activities logs
    2.
    发明授权
    System for discovering business processes from noisy activities logs 失效
    从嘈杂的活动日志中发现业务流程的系统

    公开(公告)号:US08352414B2

    公开(公告)日:2013-01-08

    申请号:US12575294

    申请日:2009-10-07

    IPC分类号: G06F7/00 G06F17/00 G06Q10/00

    CPC分类号: G06Q10/06 G06Q10/063

    摘要: A system for discovering business processes from noisy activities logs from various activities performed during the execution of the process. Activities are observed from the noisy activity logs that may include text from manually entered activity logs, chat scripts, emails, voice transcripts, desktop captures, and tool logs, wherein the noisy activity logs are received from multiple person/tool actors with each of the actors performing one or more activities related to one/more business tasks. Extracting information from the noisy activity logs to capture activity based information, and then analyzing similar activities and finding possible paths in the similar activities. The results are used to build a process graph based on the similar activities and the possible paths in the similar activities.

    摘要翻译: 从执行过程中执行的各种活动中发现来自嘈杂活动日志的业务流程的系统。 从嘈杂的活动日志观察到活动,其中可能包括手动输入的活动日志,聊天脚本,电子邮件,语音记录,桌面捕获和工具日志的文本,其中从多个人/工具演员接收嘈杂活动日志,其中每个 执行与一个/多个业务任务相关的一个或多个活动的演员。 从嘈杂的活动日志中提取信息以捕获基于活动的信息,然后分析类似活动并在类似活动中查找可能的路径。 结果用于根据类似活动和类似活动中可能的路径构建流程图。

    SCALABLE METHOD OF CONTINUOUS MONITORING THE REMOTELY ACCESSIBLE RESOURCES AGAINST THE NODE FAILURES FOR VERY LARGE CLUSTERS
    3.
    发明申请
    SCALABLE METHOD OF CONTINUOUS MONITORING THE REMOTELY ACCESSIBLE RESOURCES AGAINST THE NODE FAILURES FOR VERY LARGE CLUSTERS 有权
    连续监测远程可用资源的可扩展方法对于非常大的群集的节点失败

    公开(公告)号:US20070277058A1

    公开(公告)日:2007-11-29

    申请号:US11835892

    申请日:2007-08-08

    IPC分类号: G06F11/00

    摘要: The notion of controlling, using and monitoring remote resources in a distributed data processing system through the use of proxy resource managers and agents is extended to provide failover capability so that resource coverage is preserved and maintained even in the event of either temporary or longer duration node failure. Mechanisms are provided for consistent determination of resource status. Mechanisms are also provided which facilitate the joining of nodes to a group of nodes while still preserving remote resource operations. Additional mechanisms are also provided for the return of remote resource management to the control of a previously failed, but now recovered node, even if the failure had resulted in a node reset.

    摘要翻译: 扩展了通过使用代理资源管理器和代理来控制,使用和监视分布式数据处理系统中的远程资源的概念,以提供故障切换功能,以便即使在临时或更长持续时间节点的情况下也可以保留和维护资源覆盖 失败。 为资源状况的一致确定提供了机制。 还提供了机制,其有助于将节点连接到一组节点,同时仍保留远程资源操作。 还提供了附加机制,用于将远程资源管理返回到先前故障但现在恢复的节点的控制,即使故障导致节点重置。

    Hybrid method for event prediction and system control
    4.
    发明申请
    Hybrid method for event prediction and system control 失效
    用于事件预测和系统控制的混合方法

    公开(公告)号:US20050114739A1

    公开(公告)日:2005-05-26

    申请号:US10720300

    申请日:2003-11-24

    IPC分类号: G06F11/00 G06F11/34

    摘要: A hybrid method of predicting the occurrence of future critical events in a computer cluster having a series of nodes records system performance parameters and the occurrence of past critical events. A data filter filters the logged to data to eliminate redundancies and decrease the data storage requirements of the system. Time-series models and rule based classification schemes are used to associate various system parameters with the past occurrence of critical events and predict the occurrence of future critical events. Ongoing processing jobs are migrated to nodes for which no critical events are predicted and future jobs are routed to more robust nodes.

    摘要翻译: 在具有一系列节点的计算机集群中预测未来关键事件的发生的混合方法记录系统性能参数和过去关键事件的发生。 数据过滤器将记录到数据进行过滤,以消除冗余并减少系统的数据存储要求。 时间序列模型和基于规则的分类方案用于将各种系统参数与过去发生的关键事件相关联,并预测未来关键事件的发生。 正在进行的处理作业将迁移到不预测到关键事件的节点,并且将来的作业路由到更健壮的节点。

    Method and system for deciding when to checkpoint an application based on risk analysis
    5.
    发明申请
    Method and system for deciding when to checkpoint an application based on risk analysis 失效
    基于风险分析决定何时检查应用程序的方法和系统

    公开(公告)号:US20060168473A1

    公开(公告)日:2006-07-27

    申请号:US11042611

    申请日:2005-01-25

    IPC分类号: G06F11/00

    CPC分类号: G06F11/1471

    摘要: Briefly, according to the invention in an information processing system including a plurality of information processing nodes, a request for checkpointing by an application includes node health criteria (or parameters). The system has the authority to grant or deny the checkpointing request depending on the system health or availability. This scheme significantly improves not only the system performance, but also the application running time as the system. By skipping a checkpoint the application can use the same time to run the application instead of spending extra time for checkpointing.

    摘要翻译: 简而言之,根据本发明,在包括多个信息处理节点的信息处理系统中,由应用程序检查点的请求包括节点健康标准(或参数)。 系统有权根据系统运行状况或可用性来授予或拒绝检查点请求。 该方案不仅显着提高了系统性能,而且显着提高了作为系统的应用运行时间。 通过跳过检查点,应用程序可以使用相同的时间运行应用程序,而不是花费额外的时间进行检查点。

    SCALABLE METHOD OF CONTINUOUS MONITORING THE REMOTELY ACCESSIBLE RESOURCES AGAINST THE NODE FAILURES FOR VERY LARGE CLUSTERS
    6.
    发明申请
    SCALABLE METHOD OF CONTINUOUS MONITORING THE REMOTELY ACCESSIBLE RESOURCES AGAINST THE NODE FAILURES FOR VERY LARGE CLUSTERS 有权
    连续监测远程可用资源的可扩展方法对于非常大的群集的节点故障

    公开(公告)号:US20060242454A1

    公开(公告)日:2006-10-26

    申请号:US11456585

    申请日:2006-07-11

    IPC分类号: G06F11/00

    摘要: The notion of controlling, using and monitoring remote resources in a distributed data processing system through the use of proxy resource managers and agents is extended to provide failover capability so that resource coverage is preserved and maintained even in the event of either temporary or longer duration node failure. Mechanisms are provided for consistent determination of resource status. Mechanisms are also provided which facilitate the joining of nodes to a group of nodes while still preserving remote resource operations. Additional mechanisms are also provided for the return of remote resource management to the control of a previously failed, but now recovered node, even if the failure had resulted in a node reset.

    摘要翻译: 扩展了通过使用代理资源管理器和代理来控制,使用和监视分布式数据处理系统中的远程资源的概念,以提供故障切换功能,以便即使在临时或更长持续时间节点的情况下也可以保留和维护资源覆盖 失败。 为资源状况的一致确定提供了机制。 还提供了机制,其有助于将节点连接到一组节点,同时仍保留远程资源操作。 还提供了附加机制,用于将远程资源管理返回到先前故障但现在恢复的节点的控制,即使故障导致节点重置。

    Method for using a priority queue to perform job scheduling on a cluster based on node rank and performance
    7.
    发明申请
    Method for using a priority queue to perform job scheduling on a cluster based on node rank and performance 有权
    基于节点等级和性能使用优先级队列对集群执行作业调度的方法

    公开(公告)号:US20060184939A1

    公开(公告)日:2006-08-17

    申请号:US11057969

    申请日:2005-02-15

    IPC分类号: G06F9/46

    CPC分类号: G06F9/505 G06F2209/508

    摘要: In a multi node information processing system, a method for scheduling jobs, includes steps of: determining node-related performance parameters for a plurality of nodes; determining a ranking for each node based on the node related performance parameters for each node; and ordering each nodes by its ranking for job scheduling.

    摘要翻译: 在多节点信息处理系统中,调度作业的方法包括以下步骤:确定多个节点的节点相关性能参数; 基于每个节点的与节点相关的性能参数来确定每个节点的排名; 并通过其对作业调度的排名来排序每个节点。