USING NETWORK LOCATIONS OBTAINED FROM MULTIPLE THREAT LISTS TO EVALUATE NETWORK DATA OR MACHINE DATA
    531.
    发明申请
    USING NETWORK LOCATIONS OBTAINED FROM MULTIPLE THREAT LISTS TO EVALUATE NETWORK DATA OR MACHINE DATA 审中-公开
    使用从多个威胁级别获取的网络位置来评估网络数据或机器数据

    公开(公告)号:US20150180891A1

    公开(公告)日:2015-06-25

    申请号:US14135427

    申请日:2013-12-19

    Applicant: Splunk Inc.

    CPC classification number: H04L63/1416 G06F16/212 G06F16/951 H04L63/1425

    Abstract: Systems and methods are provided for identifying network addresses and/or IDs of a deduplicated list among network data, machine data, and/or events derived from network data and/or machine data, and for identifying notable events by searching for the presence of network addresses and/or network IDs that are deduplicated across lists received from multiple external sources. One method includes receiving a plurality of lists of network locations, wherein each list is received from over a network, wherein each of the network locations includes a domain name or an IP address, and wherein at least two of the plurality of lists each include a same network location; aggregating the plurality of lists of network locations into a deduplicated list of unique network locations; and searching network data or machine data for a network location included in the deduplicated list of unique network locations.

    Abstract translation: 系统和方法被提供用于识别网络数据,机器数据和/或从网络数据和/或机器数据导出的事件中的重复数据删除的列表的网络地址和/或ID,并且通过搜索网络的存在来识别显着的事件 在从多个外部源接收到的列表中进行重复数据删除的地址和/或网络ID。 一种方法包括接收多个网络位置列表,其中每个列表通过网络接收,其中每个网络位置包括域名或IP地址,并且其中多个列表中的至少两个列表包括 同一网络位置; 将多个网络位置列表聚合成唯一网络位置的重复数据删除列表; 以及搜索包含在唯一网络位置的重复数据删除列表中的网络位置的网络数据或机器数据。

    ADVANCED FIELD EXTRACTOR WITH MULTIPLE POSITIVE EXAMPLES
    532.
    发明申请
    ADVANCED FIELD EXTRACTOR WITH MULTIPLE POSITIVE EXAMPLES 有权
    具有多个积极实例的先进场提取器

    公开(公告)号:US20150149879A1

    公开(公告)日:2015-05-28

    申请号:US14610668

    申请日:2015-01-30

    Applicant: Splunk Inc.

    CPC classification number: G06F17/243 G06F17/30551

    Abstract: The technology disclosed relates to formulating and refining field extraction rules that are used at query time on raw data with a late-binding schema. The field extraction rules identify portions of the raw data, as well as their data types and hierarchical relationships. These extraction rules are executed against very large data sets not organized into relational structures that have not been processed by standard extraction or transformation methods. By using sample events, a focus on primary and secondary example events help formulate either a single extraction rule spanning multiple data formats, or multiple rules directed to distinct formats. Selection tools mark up the example events to indicate positive examples for the extraction rules, and to identify negative examples to avoid mistaken value selection. The extraction rules can be saved for query-time use, and can be incorporated into a data model for sets and subsets of event data.

    Abstract translation: 所公开的技术涉及制定和提炼在查询时使用具有后期绑定模式的原始数据的字段提取规则。 字段提取规则识别原始数据的部分,以及它们的数据类型和层次关系。 这些提取规则是针对未组织成尚未通过标准提取或转换方法处理的关系结构的非常大的数据集执行的。 通过使用示例事件,关注主要和次要示例事件有助于制定跨多个数据格式的单个提取规则,或者针对不同格式的多个规则。 选择工具标记示例事件以指示提取规则的正例,并确定负面示例以避免错误的值选择。 提取规则可以保存以供查询时间使用,并且可以被并入事件数据的集合和子集的数据模型中。

    PREVIEWING AN EXTRACTION RULE FOR RAW MACHINE DATA AND MODIFYING THE RULE THROUGH COUNTER-EXAMPLE
    535.
    发明申请
    PREVIEWING AN EXTRACTION RULE FOR RAW MACHINE DATA AND MODIFYING THE RULE THROUGH COUNTER-EXAMPLE 审中-公开
    检查原始机器数据的提取规则并通过反例来修改规则

    公开(公告)号:US20150143220A1

    公开(公告)日:2015-05-21

    申请号:US14611093

    申请日:2015-01-30

    Applicant: Splunk Inc.

    Abstract: Embodiments are directed towards real time display of event records and extracted values based on at least one extraction rule, such as a regular expression. A user interface may be employed to enable a user to have an extraction rule automatically generate and/or to manually enter an extraction rule. The user may be enabled to manually edit a previously provided extraction rule, which may result in real time display of updated extracted values. The extraction rule may be utilized to extract values from each of a plurality of records, including event records of unstructured machine data. Statistics may be determined for each unique extracted value, and may be displayed to the user in real time. The user interface may also enable the user to select at least one unique extracted value to display those event records that include an extracted value that matches the selected value.

    Abstract translation: 实施例涉及基于诸如正则表达式的至少一个提取规则来实时显示事件记录和提取的值。 可以使用用户界面来使用户能够自动生成提取规则和/或手动输入提取规则。 可以使用户手动编辑先前提供的提取规则,这可以导致更新的提取值的实时显示。 提取规则可以用于从多个记录中的每一个提取值,包括非结构化机器数据的事件记录。 可以针对每个唯一提取的值确定统计量,并且可以实时地向用户显示。 用户界面还可以使用户能够选择至少一个唯一的提取值来显示包括与所选择的值匹配的提取值的那些事件记录。

    GENERATION OF A DATA MODEL APPLIED TO QUERIES
    536.
    发明申请
    GENERATION OF A DATA MODEL APPLIED TO QUERIES 审中-公开
    适用于查询的数据模型的生成

    公开(公告)号:US20150142847A1

    公开(公告)日:2015-05-21

    申请号:US14611232

    申请日:2015-01-31

    Applicant: Splunk Inc.

    Abstract: Embodiments include generating data models that may give semantic meaning for unstructured or structured data that may include data generated and/or received by search engines, including a time series engine. A method includes generating a data model for data stored in a repository. Generating the data model includes generating an initial query string, executing the initial query string on the data, generating an initial result set based on the initial query string being executed on the data, determining one or more candidate fields from one or results of the initial result set, generating a candidate data model based on the one or more candidate fields, iteratively modifying the candidate data model until the candidate data model models the data, and using the candidate data model as the data model.

    Abstract translation: 实施例包括生成可以给非结构化或结构化数据赋予语义意义的数据模型,其可以包括由搜索引擎(包括时间序列引擎)生成和/或接收的数据。 一种方法包括为存储在存储库中的数据生成数据模型。 生成数据模型包括生成初始查询字符串,对数据执行初始查询字符串,基于对数据执行的初始查询字符串生成初始结果集,从一个或多个初始查询字符串的结果确定一个或多个候选字段 生成基于一个或多个候选字段的候选数据模型,迭代地修改候选数据模型,直到候选数据模型对数据建模,并使用候选数据模型作为数据模型。

    Sampling of events to use for developing a field-extraction rule for a field to use in event searching
    539.
    发明授权
    Sampling of events to use for developing a field-extraction rule for a field to use in event searching 有权
    对事件进行抽样以用于开发用于事件搜索的字段的字段提取规则

    公开(公告)号:US09031955B2

    公开(公告)日:2015-05-12

    申请号:US14168888

    申请日:2014-01-30

    Applicant: Splunk Inc.

    Abstract: Embodiments are directed towards generating a representative sampling as a subset from a larger dataset that includes unstructured data. A graphical user interface enables a user to provide various data selection parameters, including specifying a data source and one or more subset types desired, including one or more of latest records, earliest records, diverse records, outlier records, and/or random records. Diverse and/or outlier subset types may be obtained by generating clusters from an initial selection of records obtained from the larger dataset. An iteration analysis is performed to determine whether a sufficient number of clusters and/or cluster types have been generated that exceed at least one threshold and when not exceeded, additional clustering is performed on additional records. From the resultant clusters, and/or other subtype results, a subset of records is obtained as the representative sampling subset.

    Abstract translation: 实施例旨在从包括非结构化数据的较大数据集生成代表性采样作为子集。 图形用户界面使得用户能够提供各种数据选择参数,包括指定数据源和期望的一个或多个子集类型,包括最新记录,最早记录,不同记录,离群记录和/或随机记录中的一个或多个。 可以通过从从较大数据集获得的记录的初始选择生成聚类来获得不同的和/或离群子集类型。 执行迭代分析以确定是否已经生成了超过至少一个阈值的足够数量的集群和/或集群类型,并且当不超过时,对附加记录执行附加集群。 从所得到的集群和/或其他子类型结果中,获得记录的子集作为代表性抽样子集。

    DISPLAYING STATE INFORMATION FOR COMPUTING NODES IN A HIERARCHICAL COMPUTING ENVIROMENT
    540.
    发明申请
    DISPLAYING STATE INFORMATION FOR COMPUTING NODES IN A HIERARCHICAL COMPUTING ENVIROMENT 审中-公开
    显示在分层计算环境中计算节点的状态信息

    公开(公告)号:US20150089503A1

    公开(公告)日:2015-03-26

    申请号:US14529019

    申请日:2014-10-30

    Applicant: Splunk Inc.

    Abstract: The disclosed embodiments relate to a system for monitoring a virtual-machine environment. During operation, the system identifies a parent and a set of two or more child components that are related to the parent component in the virtual-machine environment. Next, the system determines a performance metric for each child component in the set of two or more child components. The system then determines a child-component performance state for each child component in the set of two or more child components based on the performance metric for the child component and a child-component state criterion. Finally, the system determines a parent state for the parent component based on the child-component performance state for each child component in the set of two or more child components and a parent-component state criterion, wherein the parent-component state criterion includes a threshold percentage or number of child components that have a specified state.

    Abstract translation: 所公开的实施例涉及用于监视虚拟机环境的系统。 在操作期间,系统识别与虚拟机环境中的父组件相关联的父级和一组两个或多个子组件。 接下来,系统确定两个或多个子组件集合中每个子组件的性能度量。 然后,系统基于子组件的性能度量和子组件状态标准,确定两组或多组子组件中每个子组件的子组件性能状态。 最后,系统基于两组或多组子组件中的每个子组件的子组件性能状态和父组件状态标准来确定父组件的父状态,其中父组件状态标准包括 阈值百分比或具有指定状态的子组件的数量。

Patent Agency Ranking