Sampling of events to use for developing a field-extraction rule for a field to use in event searching
    22.
    发明授权
    Sampling of events to use for developing a field-extraction rule for a field to use in event searching 有权
    对事件进行抽样以用于开发用于事件搜索的字段的字段提取规则

    公开(公告)号:US09031955B2

    公开(公告)日:2015-05-12

    申请号:US14168888

    申请日:2014-01-30

    Applicant: Splunk Inc.

    Abstract: Embodiments are directed towards generating a representative sampling as a subset from a larger dataset that includes unstructured data. A graphical user interface enables a user to provide various data selection parameters, including specifying a data source and one or more subset types desired, including one or more of latest records, earliest records, diverse records, outlier records, and/or random records. Diverse and/or outlier subset types may be obtained by generating clusters from an initial selection of records obtained from the larger dataset. An iteration analysis is performed to determine whether a sufficient number of clusters and/or cluster types have been generated that exceed at least one threshold and when not exceeded, additional clustering is performed on additional records. From the resultant clusters, and/or other subtype results, a subset of records is obtained as the representative sampling subset.

    Abstract translation: 实施例旨在从包括非结构化数据的较大数据集生成代表性采样作为子集。 图形用户界面使得用户能够提供各种数据选择参数,包括指定数据源和期望的一个或多个子集类型,包括最新记录,最早记录,不同记录,离群记录和/或随机记录中的一个或多个。 可以通过从从较大数据集获得的记录的初始选择生成聚类来获得不同的和/或离群子集类型。 执行迭代分析以确定是否已经生成了超过至少一个阈值的足够数量的集群和/或集群类型,并且当不超过时,对附加记录执行附加集群。 从所得到的集群和/或其他子类型结果中,获得记录的子集作为代表性抽样子集。

    Variable representative sampling under resource constraints
    23.
    发明授权
    Variable representative sampling under resource constraints 有权
    资源约束下的可变代数抽样

    公开(公告)号:US08751499B1

    公开(公告)日:2014-06-10

    申请号:US13747153

    申请日:2013-01-22

    Applicant: Splunk Inc.

    Abstract: Embodiments are directed towards generating a representative sampling as a subset from a larger dataset that includes unstructured data. A graphical user interface enables a user to provide various data selection parameters, including specifying a data source and one or more subset types desired, including one or more of latest records, earliest records, diverse records, outlier records, and/or random records. Diverse and/or outlier subset types may be obtained by generating clusters from an initial selection of records obtained from the larger dataset. An iteration analysis is performed to determine whether a sufficient number of clusters and/or cluster types have been generated that exceed at least one threshold and when not exceeded, additional clustering is performed on additional records. From the resultant clusters, and/or other subtype results, a subset of records is obtained as the representative sampling subset.

    Abstract translation: 实施例旨在从包括非结构化数据的较大数据集生成代表性采样作为子集。 图形用户界面使得用户能够提供各种数据选择参数,包括指定数据源和期望的一个或多个子集类型,包括最新记录,最早记录,不同记录,离群记录和/或随机记录中的一个或多个。 可以通过从从较大数据集获得的记录的初始选择生成聚类来获得不同的和/或离群子集类型。 执行迭代分析以确定是否已经生成了超过至少一个阈值的足够数量的集群和/或集群类型,并且当不超过时,对附加记录执行附加集群。 从所得到的集群和/或其他子类型结果中,获得记录的子集作为代表性抽样子集。

    Filtering event records based on selected extracted value

    公开(公告)号:US11520816B2

    公开(公告)日:2022-12-06

    申请号:US17028722

    申请日:2020-09-22

    Applicant: SPLUNK INC.

    Abstract: Embodiments are directed towards real time display of event records and extracted values based on at least one extraction rule, such as a regular expression. A user interface may be employed to enable a user to have an extraction rule automatically generate and/or to manually enter an extraction rule. The user may be enabled to manually edit a previously provided extraction rule, which may result in real time display of updated extracted values. The extraction rule may be utilized to extract values from each of a plurality of records, including event records of unstructured machine data. Statistics may be determined for each unique extracted value, and may be displayed to the user in real time. The user interface may also enable the user to select at least one unique extracted value to display those event records that include an extracted value that matches the selected value.

    Automatic rule modification
    30.
    发明授权

    公开(公告)号:US11210325B2

    公开(公告)日:2021-12-28

    申请号:US15582670

    申请日:2017-04-29

    Applicant: SPLUNK INC.

    Abstract: Embodiments are directed towards real time display of event records and extracted values based on at least one extraction rule, such as a regular expression. A user interface may be employed to enable a user to have an extraction rule automatically generate and/or to manually enter an extraction rule. The user may be enabled to manually edit a previously provided extraction rule, which may result in real time display of updated extracted values. The extraction rule may be utilized to extract values from each of a plurality of records, including event records of unstructured machine data. Statistics may be determined for each unique extracted value, and may be displayed to the user in real time. The user interface may also enable the user to select at least one unique extracted value to display those event records that include an extracted value that matches the selected value.

Patent Agency Ranking