Distributed multi-phase batch job processing
    1.
    Type: Granted invention patent
    Status: In force

    Publication number: US08966486B2

    Publication date: 2015-02-24

    Application number: US13099814

    Filing date: 2011-05-03

    IPC classification: G06F9/46 G06F9/48

    CPC classification: G06F9/4843

    Abstract: A distributed job-processing environment including a server, or servers, capable of receiving and processing user-submitted job queries for data sets on backend storage servers. The server identifies computational tasks to be completed on the job as well as a time frame to complete some of the computational tasks. Computational tasks may include, without limitation, preprocessing, parsing, importing, verifying dependencies, retrieving relevant metadata, checking syntax and semantics, optimizing, compiling, and running. The server performs the computational tasks, and once the time frame expires, a message is transmitted to the user indicating which tasks have been completed. The rest of the computational tasks are subsequently performed, and eventually, job results are transmitted to the user.
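
    The time-frame behavior described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function name `process_job`, the `notify` callback, the injectable `clock`, and the choice to check the deadline only between tasks are all assumptions made for illustration. If every task finishes before the time frame expires, this sketch sends the progress message at the end, which is also an assumption.

```python
import time
from typing import Callable, List, Sequence, Tuple

# A task is a (name, callable) pair; names such as "parse" or "compile"
# echo the abstract, the callables themselves are placeholders.
Task = Tuple[str, Callable[[], None]]


def process_job(
    tasks: Sequence[Task],
    time_frame: float,
    notify: Callable[[List[str]], None],
    clock: Callable[[], float] = time.monotonic,
) -> List[str]:
    """Run tasks in order and report progress once when the time frame expires.

    Assumption: the deadline is checked between tasks, so a long task is
    never interrupted part-way through.
    """
    start = clock()
    completed: List[str] = []
    reported = False
    for name, task in tasks:
        if not reported and clock() - start >= time_frame:
            notify(list(completed))  # message: which tasks are done so far
            reported = True
        task()
        completed.append(name)
    if not reported:
        notify(list(completed))  # everything finished inside the time frame
    return completed
```

    Passing the clock as a parameter keeps the sketch testable with simulated time; a real server would more likely run the remaining tasks in the background after sending the message.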


    AUTOMATED AND DYNAMIC MANAGEMENT OF QUERY VIEWS FOR DATABASE WORKLOADS

    Publication number: US20080177700A1

    Publication date: 2008-07-24

    Application number: US11624876

    Filing date: 2007-01-19

    IPC classification: G06F7/06

    Abstract: The embodiments of the invention provide a method, program storage device, etc. for automated and dynamic management of query views for database workloads. More specifically, a method begins by executing queries, which includes accessing a set of data tables for each of the queries. During the executing of the queries, the method accesses a required data table from a cache if the required data table is present in the cache and creates the required data table if the required data table is not present in the cache. The accessing of the required data table from the cache has a lower processing cost than accessing the required data table from a base table. Also during the executing of the queries, created data tables are stored in the cache, wherein one or more of the created data tables are removed from the cache when the cache becomes full.
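
    The cache behavior in this abstract (use a cached table if present, otherwise create and store it, and remove tables when the cache is full) can be sketched as follows. The abstract does not say which tables are removed, so the least-recently-used policy here is an assumption, as are the names `ViewCache` and `build_view`.

```python
from collections import OrderedDict
from typing import Callable, Generic, TypeVar

V = TypeVar("V")


class ViewCache(Generic[V]):
    """Cache of derived data tables (query views) with a fixed capacity.

    Assumption: least-recently-used eviction; the abstract only says that
    tables are removed when the cache becomes full.
    """

    def __init__(self, capacity: int, build_view: Callable[[str], V]) -> None:
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        self.build_view = build_view  # hypothetical: computes a view from base tables
        self._views: "OrderedDict[str, V]" = OrderedDict()
        self.hits = 0
        self.misses = 0

    def get(self, name: str) -> V:
        if name in self._views:
            self.hits += 1
            self._views.move_to_end(name)  # mark as most recently used
            return self._views[name]
        self.misses += 1
        view = self.build_view(name)  # the expensive path: go to the base tables
        if len(self._views) >= self.capacity:
            self._views.popitem(last=False)  # evict the least recently used view
        self._views[name] = view
        return view
```

    A hit avoids the call to `build_view`, which stands in for the higher processing cost of reading from a base table.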

    SYSTEM FOR PARTITIONING AND PRUNING OF ADVERTISEMENTS
    3.
    Type: Patent application
    Status: Under examination (published)

    Publication number: US20090259540A1

    Publication date: 2009-10-15

    Application number: US12103636

    Filing date: 2008-04-15

    Applicant: Thomas Phan

    Inventor: Thomas Phan

    IPC classification: G06Q30/00

    Abstract: A system is disclosed for selecting advertisements for delivery. The system may be configured to assign the advertisements to categories. The system also may be configured to deliver the advertisements according to a frequency assigned to each category.
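
    One way to read "deliver the advertisements according to a frequency assigned to each category" is a repeating delivery cycle in which each category receives a fixed number of slots. The sketch below follows that reading; the function name `delivery_schedule`, the integer slots-per-cycle interpretation of "frequency", and the rotation of ads within a category are assumptions, not details from the application.

```python
from itertools import cycle, islice
from typing import Dict, List


def delivery_schedule(
    categories: Dict[str, List[str]],
    frequency: Dict[str, int],
    slots: int,
) -> List[str]:
    """Return the ads to deliver for the next `slots` delivery opportunities.

    Assumption: frequency[name] is the number of slots a category gets in
    each cycle, and ads inside a category are rotated in order.
    """
    # One endless rotation per non-empty category.
    rotations = {name: cycle(ads) for name, ads in categories.items() if ads}
    # The cycle pattern repeats each category name `frequency` times.
    pattern = [
        name for name in rotations for _ in range(frequency.get(name, 0))
    ]
    if not pattern:
        return []
    return [next(rotations[name]) for name in islice(cycle(pattern), slots)]
```

    With two sports ads at frequency 2 and one travel ad at frequency 1, two of every three deliveries come from the sports category.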


    METHOD AND MEANS FOR CO-SCHEDULING JOB ASSIGNMENTS AND DATA REPLICATION IN WIDE-AREA DISTRIBUTED SYSTEMS
    4.
    Type: Patent application
    Status: Under examination (published)

    Publication number: US20080049254A1

    Publication date: 2008-02-28

    Application number: US11466778

    Filing date: 2006-08-24

    IPC classification: G06K15/00

    Abstract: The embodiments of the invention provide a method, service, computer program product, etc. of co-scheduling job assignments and data replication in wide-area systems using a genetic method. A method begins by co-scheduling assignment of jobs and replication of data objects based on job ordering within a scheduler queue, job-to-compute node assignments, and object-to-local data store assignments. More specifically, the job ordering is determined according to an order in which the jobs are assigned from the scheduler to the compute nodes. Further, the job-to-compute node assignments are determined according to which of the jobs are assigned to which of the compute nodes; and, the object-to-local data store assignments are determined according to which of the data objects are replicated to which of the local data stores.
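
    The three decisions named in this abstract (job ordering, job-to-node assignment, object-to-store assignment) can be encoded together as one candidate solution and improved by an evolutionary search. The sketch below is a toy illustration only: it uses mutation and truncation selection without crossover, assumes one local data store per compute node, and uses an invented cost (sum of job completion times plus a remote-access penalty). None of these choices come from the application.

```python
import random
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class Schedule:
    job_order: Tuple[int, ...]  # order in which jobs leave the scheduler queue
    job_node: Tuple[int, ...]   # job_node[j]: compute node that runs job j
    obj_store: Tuple[int, ...]  # obj_store[o]: local store holding object o


def random_schedule(rng: random.Random, n_jobs: int, n_nodes: int, n_objects: int) -> Schedule:
    order = list(range(n_jobs))
    rng.shuffle(order)
    return Schedule(
        tuple(order),
        tuple(rng.randrange(n_nodes) for _ in range(n_jobs)),
        tuple(rng.randrange(n_nodes) for _ in range(n_objects)),
    )


def cost(s: Schedule, durations: Sequence[float], needs: Sequence[int],
         remote_cost: float = 5.0) -> float:
    """Toy cost: sum of job completion times; remote data adds a penalty."""
    node_clock = {}
    total = 0.0
    for j in s.job_order:
        node = s.job_node[j]
        local = s.obj_store[needs[j]] == node
        node_clock[node] = node_clock.get(node, 0.0) + durations[j] + (0.0 if local else remote_cost)
        total += node_clock[node]
    return total


def mutate(s: Schedule, rng: random.Random, n_nodes: int) -> Schedule:
    """Change one of the three parts of the encoding at random."""
    order, nodes, stores = list(s.job_order), list(s.job_node), list(s.obj_store)
    choice = rng.randrange(3)
    if choice == 0 and len(order) >= 2:
        i, k = rng.sample(range(len(order)), 2)
        order[i], order[k] = order[k], order[i]  # swap two queue positions
    elif choice == 1:
        nodes[rng.randrange(len(nodes))] = rng.randrange(n_nodes)
    else:
        stores[rng.randrange(len(stores))] = rng.randrange(n_nodes)
    return Schedule(tuple(order), tuple(nodes), tuple(stores))


def evolve(durations: Sequence[float], needs: Sequence[int], n_nodes: int,
           n_objects: int, generations: int = 50, pop_size: int = 20,
           seed: int = 0) -> Schedule:
    rng = random.Random(seed)

    def score(s: Schedule) -> float:
        return cost(s, durations, needs)

    pop: List[Schedule] = [
        random_schedule(rng, len(durations), n_nodes, n_objects) for _ in range(pop_size)
    ]
    for _ in range(generations):
        pop.sort(key=score)
        survivors = pop[: pop_size // 2]  # keep the better half unchanged
        children = [mutate(rng.choice(survivors), rng, n_nodes)
                    for _ in range(pop_size - len(survivors))]
        pop = survivors + children
    return min(pop, key=score)
```

    Because the better half of each generation is carried over unchanged, the best cost never gets worse from one generation to the next.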


    Near real-time analysis of dynamic social and sensor data to interpret user situation
    5.
    Type: Granted invention patent
    Status: In force

    Publication number: US08838516B2

    Publication date: 2014-09-16

    Application number: US13413546

    Filing date: 2012-03-06

    IPC classification: G06F17/00 G06N3/00

    CPC classification: G06N3/00 G06F17/30867

    Abstract: In a first embodiment of the present invention, a method of interpreting a situation of a user of an electronic device is provided, comprising: gathering social data related to the user; monitoring one or more physical sensors on the electronic device in order to gather physical data related to the user; mapping structured data in the social data and the physical data to internal data types; extracting features from unstructured social data; identifying attributes related to the features; obtaining values for the identified attributes; interpreting movement of the user by examining the physical data; and detecting a situation of the user by fusing the movement of the user with the identified attributes of the features of the unstructured social data and features of structured social data.


    HEURISTICS-BASED SCHEDULING FOR DATA ANALYTICS
    6.
    Type: Patent application
    Status: In force

    Publication number: US20130117752A1

    Publication date: 2013-05-09

    Application number: US13353109

    Filing date: 2012-01-18

    IPC classification: G06F9/46

    Abstract: A scheduler may receive a plurality of jobs for scheduling of execution thereof on a plurality of computing nodes. An evaluation module may provide a common interface for each of a plurality of scheduling algorithms. An algorithm selector may utilize the evaluation module in conjunction with benchmark data for a plurality of jobs of varying types to associate one of the plurality of scheduling algorithms with each job type. A job comparator may compare a current job for scheduling against the benchmark data to determine a current job type of the current job. The evaluation module may further schedule the current job for execution on the plurality of computing nodes, based on the current job type and the associated scheduling algorithm.
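
    The flow in this abstract (common interface, benchmark-driven algorithm selection per job type, job-type matching, then scheduling) can be sketched in a few functions. Everything concrete below is an assumption chosen for illustration: the two example algorithms, makespan as the evaluation metric, and mean task duration as the feature used to match a job to a benchmark type.

```python
from typing import Callable, Dict, List, Sequence

# Common interface: an algorithm maps (task durations, node count)
# to one list of task durations per node.
Algorithm = Callable[[Sequence[float], int], List[List[float]]]


def round_robin(tasks: Sequence[float], n_nodes: int) -> List[List[float]]:
    nodes: List[List[float]] = [[] for _ in range(n_nodes)]
    for i, t in enumerate(tasks):
        nodes[i % n_nodes].append(t)
    return nodes


def longest_first(tasks: Sequence[float], n_nodes: int) -> List[List[float]]:
    """Greedy: place the longest remaining task on the least-loaded node."""
    nodes: List[List[float]] = [[] for _ in range(n_nodes)]
    for t in sorted(tasks, reverse=True):
        min(nodes, key=sum).append(t)
    return nodes


ALGORITHMS: Dict[str, Algorithm] = {
    "round_robin": round_robin,
    "longest_first": longest_first,
}


def makespan(assignment: List[List[float]]) -> float:
    """Evaluation metric (assumed): finish time of the busiest node."""
    return max(sum(node) for node in assignment)


def select_algorithms(benchmarks: Dict[str, Sequence[float]], n_nodes: int) -> Dict[str, str]:
    """Associate each benchmark job type with its best-scoring algorithm."""
    return {
        job_type: min(
            ALGORITHMS,
            key=lambda name: makespan(ALGORITHMS[name](tasks, n_nodes)),
        )
        for job_type, tasks in benchmarks.items()
    }


def classify_job(tasks: Sequence[float], benchmarks: Dict[str, Sequence[float]]) -> str:
    """Match a job to the benchmark type with the closest mean task duration."""
    def mean(xs: Sequence[float]) -> float:
        return sum(xs) / len(xs)

    return min(benchmarks, key=lambda jt: abs(mean(benchmarks[jt]) - mean(tasks)))


def schedule(tasks: Sequence[float], benchmarks: Dict[str, Sequence[float]],
             choice: Dict[str, str], n_nodes: int) -> List[List[float]]:
    job_type = classify_job(tasks, benchmarks)
    return ALGORITHMS[choice[job_type]](tasks, n_nodes)
```

    Because every algorithm shares one signature, adding another heuristic only requires a new entry in `ALGORITHMS`; the selection and scheduling code stays unchanged.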


    Automated Content-Based Adjustment of Formatting and Application Behavior
    7.
    Type: Patent application
    Status: Under examination (published)

    Publication number: US20090164888A1

    Publication date: 2009-06-25

    Application number: US11960483

    Filing date: 2007-12-19

    Applicant: Thomas Phan

    Inventor: Thomas Phan

    IPC classification: G06F17/21 G06F17/30 G06F15/16

    CPC classification: G06F17/2785 G06F17/211

    Abstract: Systems, including clients and servers, and methods relate to dynamically formatting content in a style based in part on the content. In particular, content generated by one user or group of users and presented over a communication medium is formatted based on features of the content. Embodiments relating to web, email and messaging content are described.


    SYSTEM AND METHOD FOR AUTOMATING AND SCHEDULING REMOTE DATA TRANSFER AND COMPUTATION FOR HIGH PERFORMANCE COMPUTING
    8.
    Type: Patent application
    Status: In force

    Publication number: US20080178179A1

    Publication date: 2008-07-24

    Application number: US11624253

    Filing date: 2007-01-18

    IPC classification: G06F9/44

    CPC classification: G06F9/5027

    Abstract: The invention pertains to a system and method for a set of middleware components for supporting the execution of computational applications on a high-performance computing platform. A specific embodiment of this invention was used to deploy a financial risk application on the Blue Gene/L parallel supercomputer. The invention is relevant to any application where the input and output data are stored in external sources, such as SQL databases, where the automatic pre-staging and post-staging of the data between the external data sources and the computational platform is desirable. This middleware provides a number of core features to support these applications including, for example, an automated data extraction and staging gateway, a standardized high-level job specification schema, a well-defined web services (SOAP) API for interoperability with other applications, and a secure HTML/JSP web-based interface suitable for non-expert and non-privileged users.


    Heuristics-based scheduling for data analytics
    9.
    Type: Granted invention patent
    Status: In force

    Publication number: US09183058B2

    Publication date: 2015-11-10

    Application number: US13353109

    Filing date: 2012-01-18

    IPC classification: G06F9/50 G06F11/34

    Abstract: A scheduler may receive a plurality of jobs for scheduling of execution thereof on a plurality of computing nodes. An evaluation module may provide a common interface for each of a plurality of scheduling algorithms. An algorithm selector may utilize the evaluation module in conjunction with benchmark data for a plurality of jobs of varying types to associate one of the plurality of scheduling algorithms with each job type. A job comparator may compare a current job for scheduling against the benchmark data to determine a current job type of the current job. The evaluation module may further schedule the current job for execution on the plurality of computing nodes, based on the current job type and the associated scheduling algorithm.


    System and method for automating and scheduling remote data transfer and computation for high performance computing
    10.
    Type: Granted invention patent
    Status: In force

    Publication number: US09104483B2

    Publication date: 2015-08-11

    Application number: US11624253

    Filing date: 2007-01-18

    IPC classification: G06F9/50

    CPC classification: G06F9/5027

    Abstract: The invention pertains to a system and method for a set of middleware components for supporting the execution of computational applications on a high-performance computing platform. A specific embodiment of this invention was used to deploy a financial risk application on the Blue Gene/L parallel supercomputer. The invention is relevant to any application where the input and output data are stored in external sources, such as SQL databases, where the automatic pre-staging and post-staging of the data between the external data sources and the computational platform is desirable. This middleware provides a number of core features to support these applications including, for example, an automated data extraction and staging gateway, a standardized high-level job specification schema, a well-defined web services (SOAP) API for interoperability with other applications, and a secure HTML/JSP web-based interface suitable for non-expert and non-privileged users.
