PROPORTION BASED REAL TIME DISPLAY OF STATISTICS AND VALUES FOR SELECTED REGULAR EXPRESSIONS
    136.
    发明申请
    PROPORTION BASED REAL TIME DISPLAY OF STATISTICS AND VALUES FOR SELECTED REGULAR EXPRESSIONS 审中-公开
    基于比例的实时统计显示和选定的常规表达值

    公开(公告)号:US20150339357A1

    公开(公告)日:2015-11-26

    申请号:US14816036

    申请日:2015-08-02

    Applicant: Splunk Inc.

    Abstract: Embodiments are directed towards real time display of event records and extracted values based on at least one extraction rule, such as a regular expression. A user interface may be employed to enable a user to have an extraction rule automatically generate and/or to manually enter an extraction rule. The user may be enabled to manually edit a previously provided extraction rule, which may result in real time display of updated extracted values. The extraction rule may be utilized to extract values from each of a plurality of records, including event records of unstructured machine data. Statistics may be determined for each unique extracted value, and may be displayed to the user in real time. The user interface may also enable the user to select at least one unique extracted value to display those event records that include an extracted value that matches the selected value.

    Abstract translation: 实施例涉及基于诸如正则表达式的至少一个提取规则来实时显示事件记录和提取的值。 可以使用用户界面来使用户能够自动生成提取规则和/或手动输入提取规则。 可以使用户手动编辑先前提供的提取规则,这可以导致更新的提取值的实时显示。 提取规则可以用于从多个记录中的每一个提取值,包括非结构化机器数据的事件记录。 可以针对每个唯一提取的值确定统计量,并且可以实时地向用户显示。 用户界面还可以使用户能够选择至少一个唯一的提取值来显示包括与所选择的值匹配的提取值的那些事件记录。

    Sampling Events for Rule Creation with Process Selection
    138.
    发明申请
    Sampling Events for Rule Creation with Process Selection 有权
    用过程选择进行规则创建的抽样事件

    公开(公告)号:US20150234905A1

    公开(公告)日:2015-08-20

    申请号:US14700006

    申请日:2015-04-29

    Applicant: SPLUNK INC.

    Abstract: Embodiments are directed towards generating a representative sampling as a subset from a larger dataset that includes unstructured data. A graphical user interface enables a user to provide various data selection parameters, including specifying a data source and one or more subset types desired, including one or more of latest records, earliest records, diverse records, outlier records, and/or random records. Diverse and/or outlier subset types may be obtained by generating clusters from an initial selection of records obtained from the larger dataset. An iteration analysis is performed to determine whether a sufficient number of clusters and/or cluster types have been generated that exceed at least one threshold and when not exceeded, additional clustering is performed on additional records. From the resultant clusters, and/or other subtype results, a subset of records is obtained as the representative sampling subset.

    Abstract translation: 实施例旨在从包括非结构化数据的较大数据集生成代表性采样作为子集。 图形用户界面使得用户能够提供各种数据选择参数,包括指定数据源和期望的一个或多个子集类型,包括最新记录,最早记录,不同记录,离群记录和/或随机记录中的一个或多个。 可以通过从从较大数据集获得的记录的初始选择生成聚类来获得不同的和/或离群子集类型。 执行迭代分析以确定是否已经生成了超过至少一个阈值的足够数量的集群和/或集群类型,并且当不超过时,对附加记录执行附加集群。 从所得到的集群和/或其他子类型结果中,获得记录的子集作为代表性抽样子集。

Patent Agency Ranking