Creating NoSQL database index for semi-structured data

    公开(公告)号:US09953102B2

    公开(公告)日:2018-04-24

    申请号:US14599296

    申请日:2015-01-16

    IPC分类号: G06F17/30

    摘要: Semi-structured source data is preprocessed to obtain text partitions to be stored into a data table with a first combined primary key including a structure thread primary key and a sequence value primary key. The structure thread primary key identifies a structure thread that is segmented into several consecutive intervals according to a determined or predetermined sequence. An inverted index table, created for the preprocessed text partitions, includes a second combined primary key including the structure thread primary key and a keyword primary key. Corresponding to values of the primary keys in the second combined primary key, related text partition sequence IDs are recorded as index values of the inverted index table. Index values having a same keyword primary key value but different structure thread primary key values are located in different rows in the inverted index table. The present techniques improve query efficiency of database index and facilitate updating.

    METHOD AND SYSTEM FOR MONITORING EXECUTION OF USER REQUEST IN DISTRIBUTED SYSTEM
    2.
    发明申请
    METHOD AND SYSTEM FOR MONITORING EXECUTION OF USER REQUEST IN DISTRIBUTED SYSTEM 有权
    用于分布式系统中用户请求执行的方法和系统

    公开(公告)号:US20140082184A1

    公开(公告)日:2014-03-20

    申请号:US14023014

    申请日:2013-09-10

    IPC分类号: H04L29/08

    摘要: Embodiments of the present application relate to a method, a system and a computer program product for monitoring execution of a user request on a distributed system. A method for monitoring execution of user requests on a distributed system is provided. The method includes sampling a user request received by a plurality of servers, generating an identifier corresponding to the user request sampled by the plurality of servers, injecting tracking interfaces in advance into processes executed by the plurality of servers, determining whether the current process contain the identifier corresponding to the user request, in the event that the current process contains the identifier corresponding to the user request when the process has executed the tracking interface, recording information corresponding to the identifier relating to the function indicated by the tracking interface and generating a log, and individually collecting the generated log according to the corresponding identifier.

    摘要翻译: 本申请的实施例涉及用于监视在分布式系统上的用户请求的执行的方法,系统和计算机程序产品。 提供了一种在分布式系统上监视用户请求的执行方法。 该方法包括对由多个服务器接收到的用户请求进行采样,生成与多个服务器采样的用户请求对应的标识符,预先将跟踪接口注入到由多个服务器执行的进程中,确定当前进程是否包含 当处理已经执行跟踪接口时,当前进程包含与用户请求对应的标识符的情况下,对应于用户请求的标识符,记录与跟踪接口所指示的功能相关的标识符对应的信息,并生成日志 ,并根据相应的标识符单独收集生成的日志。

    Code coverage rate determination method and system
    3.
    发明授权
    Code coverage rate determination method and system 有权
    代码覆盖率确定方法和系统

    公开(公告)号:US09317400B2

    公开(公告)日:2016-04-19

    申请号:US13763497

    申请日:2013-02-08

    IPC分类号: G06F9/44 G06F11/36

    CPC分类号: G06F11/3676

    摘要: Embodiments of the present application relate to a code coverage rate determination method, a code coverage rate determination system, and a computer program product for determining code coverage rate. A code coverage rate determination method is provided. The method includes retrieving source code of a program, determining theoretical number of log file output points included in the source code and location information of the log file output points, retrieving log files actually outputted during the execution of the program, determining an actual number of log file output points of the actual outputted log files based on the location information of the corresponding log file output points recorded in each log file, and determining a code coverage rate of the program based on the theoretical number of log file output points and the actual number of the log file output points of the actual outputted log files.

    摘要翻译: 本申请的实施例涉及用于确定代码覆盖率的代码覆盖率确定方法,代码覆盖率确定系统和计算机程序产品。 提供了代码覆盖率确定方法。 该方法包括检索程序的源代码,确定源代码中包括的日志文件输出点的理论数量和日志文件输出点的位置信息,检索在程序执行期间实际输出的日志文件,确定实际数量 根据记录在每个日志文件中的相应日志文件输出点的位置信息,根据日志文件输出点的理论数量和实际输出的日志文件输出点的理论数量确定代码覆盖率,实际输出的日志文件的日志文件输出点 实际输出的日志文件的日志文件输出点数。

    CREATING NoSQL DATABASE INDEX FOR SEMI-STRUCTURED DATA
    4.
    发明申请
    CREATING NoSQL DATABASE INDEX FOR SEMI-STRUCTURED DATA 有权
    创建用于半结构数据的NoSQL数据库索引

    公开(公告)号:US20150205885A1

    公开(公告)日:2015-07-23

    申请号:US14599296

    申请日:2015-01-16

    IPC分类号: G06F17/30

    摘要: Semi-structured source data is preprocessed to obtain text partitions to be stored into a data table with a first combined primary key including a structure thread primary key and a sequence value primary key. The structure thread primary key identifies a structure thread that is segmented into several consecutive intervals according to a determined or predetermined sequence. An inverted index table, created for the preprocessed text partitions, includes a second combined primary key including the structure thread primary key and a keyword primary key. Corresponding to values of the primary keys in the second combined primary key, related text partition sequence IDs are recorded as index values of the inverted index table. Index values having a same keyword primary key value but different structure thread primary key values are located in different rows in the inverted index table. The present techniques improve query efficiency of database index and facilitate updating.

    摘要翻译: 半结构化源数据被预处理以获得要被存储到具有包括结构线程主键和序列值主键的第一组合主键的数据表中的文本分区。 结构线程主键标识根据确定的或预定的顺序被分段成几个连续间隔的结构线程。 为预处理的文本分区创建的反向索引表包括包含结构线程主键和关键字主键的第二组合主键。 对应于第二组合主键中的主键的值,相关的文本分区序列ID被记录为反向索引表的索引值。 具有相同关键字主键值但不同结构线程主键值的索引值位于反向索引表中的不同行中。 本技术提高了数据库索引的查询效率,促进了更新。

    Method and system for monitoring execution of user request in distributed system
    5.
    发明授权
    Method and system for monitoring execution of user request in distributed system 有权
    分布式系统中用户请求的执行监控方法和系统

    公开(公告)号:US09485317B2

    公开(公告)日:2016-11-01

    申请号:US14023014

    申请日:2013-09-10

    摘要: Embodiments of the present application relate to a method, a system and a computer program product for monitoring execution of a user request on a distributed system. A method for monitoring execution of user requests on a distributed system is provided. The method includes sampling a user request received by a plurality of servers, generating an identifier corresponding to the user request sampled by the plurality of servers, injecting tracking interfaces in advance into processes executed by the plurality of servers, determining whether the current process contain the identifier corresponding to the user request, in the event that the current process contains the identifier corresponding to the user request when the process has executed the tracking interface, recording information corresponding to the identifier relating to the function indicated by the tracking interface and generating a log, and individually collecting the generated log according to the corresponding identifier.

    摘要翻译: 本申请的实施例涉及用于监视在分布式系统上的用户请求的执行的方法,系统和计算机程序产品。 提供了一种在分布式系统上监视用户请求的执行方法。 该方法包括对由多个服务器接收到的用户请求进行采样,生成与多个服务器采样的用户请求对应的标识符,预先将跟踪接口注入到由多个服务器执行的进程中,确定当前进程是否包含 当处理已经执行跟踪接口时,当前进程包含与用户请求对应的标识符的情况下,对应于用户请求的标识符,记录与跟踪接口所指示的功能相关的标识符对应的信息并生成日志 ,并根据相应的标识符单独收集生成的日志。

    CODE COVERAGE RATE DETERMINATION METHOD AND SYSTEM

    公开(公告)号:US20160292068A1

    公开(公告)日:2016-10-06

    申请号:US15071060

    申请日:2016-03-15

    IPC分类号: G06F11/36

    CPC分类号: G06F11/3676

    摘要: Embodiments of the present application relate to a code coverage rate determination method, a code coverage rate determination system, and a computer program product for determining code coverage rate. A code coverage rate determination method is provided. The method includes retrieving source code of a program, determining theoretical number of log file output points included in the source code and location information of the log file output points, retrieving log files actually outputted during the execution of the program, determining an actual number of log file output points of the actual outputted log files based on the location information of the corresponding log file output points recorded in each log file, and determining a code coverage rate of the program based on the theoretical number of log file output points and the actual number of the log file output points of the actual outputted log files.

    CODE COVERAGE RATE DETERMINATION METHOD AND SYSTEM
    7.
    发明申请
    CODE COVERAGE RATE DETERMINATION METHOD AND SYSTEM 有权
    代码覆盖率测定方法与系统

    公开(公告)号:US20130212564A1

    公开(公告)日:2013-08-15

    申请号:US13763497

    申请日:2013-02-08

    IPC分类号: G06F11/36

    CPC分类号: G06F11/3676

    摘要: Embodiments of the present application relate to a code coverage rate determination method, a code coverage rate determination system, and a computer program product for determining code coverage rate. A code coverage rate determination method is provided. The method includes retrieving source code of a program, determining theoretical number of log file output points included in the source code and location information of the log file output points, retrieving log files actually outputted during the execution of the program, determining an actual number of log file output points of the actual outputted log files based on the location information of the corresponding log file output points recorded in each log file, and determining a code coverage rate of the program based on the theoretical number of log file output points and the actual number of the log file output points of the actual outputted log files.

    摘要翻译: 本申请的实施例涉及用于确定代码覆盖率的代码覆盖率确定方法,代码覆盖率确定系统和计算机程序产品。 提供了代码覆盖率确定方法。 该方法包括检索程序的源代码,确定源代码中包括的日志文件输出点的理论数量和日志文件输出点的位置信息,检索在程序执行期间实际输出的日志文件,确定实际数量 根据记录在每个日志文件中的相应日志文件输出点的位置信息,根据日志文件输出点的理论数量和实际输出的日志文件输出点的理论数量确定代码覆盖率,实际输出的日志文件的日志文件输出点 实际输出的日志文件的日志文件输出点数。

    Job scheduling method, device, and distributed system

    公开(公告)号:US10521268B2

    公开(公告)日:2019-12-31

    申请号:US15852786

    申请日:2017-12-22

    发明人: Hua Cai

    IPC分类号: G06F9/48 G06F9/50 G06F9/52

    摘要: A job scheduling method including notifying, by a first control node scheduling a first task, a second control node scheduling a second task to obtain execution data generated by at least one task instance of the first task when the at least one task instance of the first task completes execution; obtaining, by the second control node, the execution data generated by the at least one task instance of the first task and assigning the execution data to each task instance of the second task; and scheduling, by the second control node, an execution of at least one task instance of the second task and processing the execution data. The techniques of the present disclosure improve scheduling efficiency and resource utilization rate.

    Job Scheduling Method, Device, and Distributed System

    公开(公告)号:US20180121240A1

    公开(公告)日:2018-05-03

    申请号:US15852786

    申请日:2017-12-22

    发明人: Hua Cai

    IPC分类号: G06F9/48

    摘要: A job scheduling method including notifying, by a first control node scheduling a first task, a second control node scheduling a second task to obtain execution data generated by at least one task instance of the first task when the at least one task instance of the first task completes execution; obtaining, by the second control node, the execution data generated by the at least one task instance of the first task and assigning the execution data to each task instance of the second task; and scheduling, by the second control node, an execution of at least one task instance of the second task and processing the execution data. The techniques of the present disclosure improve scheduling efficiency and resource utilization rate.